Research Article

Reinforcement Learning Guided by Double Replay Memory

Table 1

The scores from CartPole simulations.

Max scoreAverage score

DQN373.80229.30
478.65210.58
500259.23
500285.49
PER500237.63