Research Article

End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation

Table 5

Experimental results of learning exploration with the fine-tuning method (no extrinsic reward).

Environment  Method              Reward   MER (%)  IQRE

Maze-1       ICM + scratch       584.59   100.00   7.93
Maze-1       Ours + scratch      586.32   100.00   4.72
Maze-1       ICM + fine-tuning   585.16   100.00   7.58
Maze-1       Ours + fine-tuning  585.45   100.00   5.14
Maze-2       ICM + scratch       567.28   100.00   8.07
Maze-2       Ours + scratch      571.87   100.00   5.15
Maze-2       ICM + fine-tuning   566.34   100.00   6.49
Maze-2       Ours + fine-tuning  568.25   100.00   4.81
Maze-3       ICM + scratch       532.27    91.64   N/A
Maze-3       Ours + scratch      579.65   100.00   6.54
Maze-3       ICM + fine-tuning   573.49   100.00   7.23
Maze-3       Ours + fine-tuning  572.86   100.00   4.73