Research Article

Deep Q-Network with Predictive State Models in Partially Observable Domains

Table 1

The best reward of three methods.

CartPole-v1Swimmer-v1Reacher-v1

DRQN20056−1.15
DQN-1frame5440.58−6.43
RPSR-DQN20059.52−0.02
RPSP15838.96−57.78