Research Article
Visual Navigation with Asynchronous Proximal Policy Optimization in Artificial Agents
Table 3
The states that the artificial agent sees in stairway_to_melon.
| Episode / Time | 1000 | 1200 | 1400 | 1600 | 1800 | 2000 | 2200 | 2400 | 2600 | 2800 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| The first episode | | | | | | | | | | |
| The second episode | | | | | | | | | | |
| The third episode | | | | | | | | | | |

| Episode / Time | 3000 | 3200 | 3400 | 3600 | 3800 | 4000 | 4200 | 4400 | 4600 | 4800 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| The first episode | | | | | | | | | | |
| The second episode | | | | | | | | | | |
| The third episode | | | | | | | | | | |