Journal of Robotics / 2018 / Article / Fig 5

Research Article

Dynamic Path Planning of Unknown Environment Based on Deep Reinforcement Learning

Figure 5

The average cumulative reward curves. Each point is the average cumulative reward achieved per hundred episodes. The y-axis denotes the average cumulative reward and the x-axis denotes the iteration epoch.
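
The per-hundred-episode averaging described in the caption can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes the averages are taken over consecutive, non-overlapping blocks of 100 episodes (the caption does not say whether a sliding window was used), and the names `average_per_block`, `episode_rewards`, and `block_size` are hypothetical.

```python
def average_per_block(episode_rewards, block_size=100):
    """Average per-episode cumulative rewards over consecutive blocks.

    Assumption: blocks do not overlap, and a trailing block with fewer
    than `block_size` episodes is dropped rather than averaged.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")

    averages = []
    # Step through the reward list one full block at a time.
    last_start = len(episode_rewards) - block_size
    for start in range(0, last_start + 1, block_size):
        block = episode_rewards[start:start + block_size]
        averages.append(sum(block) / block_size)
    return averages


if __name__ == "__main__":
    # Synthetic rewards for illustration only; not data from the article.
    rewards = [1.0] * 100 + [3.0] * 100 + [5.0] * 50
    print(average_per_block(rewards))
```

Each returned value corresponds to one plotted point, so a run of N episodes yields N // 100 points under this assumption.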
