Research Article

End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation

Table 7

Experimental results of the “noisy-TV” experiment.

Environment  Method              Reward   MER (%)  IQRE
Maze-1       ICM + scratch       315.62   53.86    N/A
             Ours + scratch      582.74   100      7.58
             ICM + fine-tuning   374.52   64.05    N/A
             Ours + fine-tuning  586.43   100      8.67
Maze-2       ICM + scratch       279.68   48.71    N/A
             Ours + scratch      565.32   100      6.93
             ICM + fine-tuning   317.54   56.18    N/A
             Ours + fine-tuning  566.73   100      7.75
Maze-3       ICM + scratch       362.49   63.28    N/A
             Ours + scratch      577.86   100      7.69
             ICM + fine-tuning   305.47   54.72    N/A
             Ours + fine-tuning  572.63   100      8.12