Research Article
Interactive Q-Learning Approach for Pick-and-Place Optimization of the Die Attach Process in the Semiconductor Industry
Algorithm 1
Routing algorithm of the interactive Q model.
Input: , , , , , | Procedure: Initialize and | Initialize and | Initialize # : current state of the pick agent | Initialize | Until do | Increase by 1 | If do# If a strip is full, the strip is replaced with a new one | | # Pick agent procedure | # Select an action of the pick agent | | # Update the current state of the place agent | # Update the feasible action space of the pick agent | # Place agent procedure | Increase by 1 | # Select an action of the place agent | | | # Update the feasible action space of the place agent | | Output: |
|