Research Article

A UAV Pursuit-Evasion Strategy Based on DDPG and Imitation Learning

Figure 9

Average rewards of IL-DDPG and DDPG algorithm (the thin line of the background is the real-time curve of the total reward per episode, and the thick line is the average reward curve per episode).