Research Article
A UAV Pursuit-Evasion Strategy Based on DDPG and Imitation Learning
Figure 9
Average rewards of IL-DDPG and DDPG algorithm (the thin line of the background is the real-time curve of the total reward per episode, and the thick line is the average reward curve per episode).