Research Article

A UAV Pursuit-Evasion Strategy Based on DDPG and Imitation Learning

Table 1

Table of training hyperparameter.

Training hyperparameterSymbolValue

Discounting factor0.9
Inertial update rate0.01
Memory size30000
Size of batch experienceBatch size64
Simulation time step0.1
Learning rate of Critic network0.002
Learning rate of Actor network0.001
Number of episodesMaxEpisode4000
Number of steps in one episodeMaxStep300