Research Article

Visual Navigation with Asynchronous Proximal Policy Optimization in Artificial Agents

Table 2

Standard deviation of the reward in stairway_to_melon.

Algorithm    Standard deviation

a3cNav       30.16
appoNav      27.24
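
For readers who want to reproduce a statistic of this kind, the following is a minimal sketch of how a standard deviation over per-episode rewards could be computed. It is an illustration only: the function name `reward_std` and the example rewards are hypothetical, and the source does not state whether the population or the sample form was used, so the population form here is an assumption.

```python
import statistics


def reward_std(episode_rewards):
    """Return the population standard deviation of per-episode rewards.

    Assumption: population form (divide by N). Swap in statistics.stdev
    for the sample form (divide by N - 1) if that is what is needed.
    """
    return statistics.pstdev(episode_rewards)


# Hypothetical rewards for illustration; these are NOT values from the paper.
example_rewards = [10.0, 20.0, 30.0, 40.0, 50.0]
print(round(reward_std(example_rewards), 2))  # prints 14.14
```

A lower value indicates that the rewards obtained across episodes vary less around their mean.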