Research Article

End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation

Table 3

Experiment results of learning exploration from scratch.

EnvironmentMethodRewardMER (%)IQRE

Maze-1TRPO327.3655.29N/A
VIME321.1453.58N/A
EX2489.2782.43N/A
ICM584.59100.007.93
Ours586.32100.004.72
Maze-2TRPO232.4741.02N/A
VIME228.3439.98N/A
EX2425.7374.56N/A
ICM567.28100.008.07
Ours571.87100.005.15
Maze-3TRPO243.4941.73N/A
VIME276.5447.82N/A
EX2339.6258.35N/A
ICM532.2791.64N/A
Ours579.65100.006.54