Research Article

Performance Evaluation of Multiagent Reinforcement Learning Based Training Methods for Swarm Fighting

Table 5

Hyperparameters used for this experiment.

TypeMARLMARL-BC

HyperparametersBatch size10241024
Buffer size2048020480
Learning rate0.00010.0001
Entropy bonus0.0050.005
Num epoch33

Network settingsHidden units512512
Num layers33

Reward signalsDiscount factor0.990.99
Strength1.01.0

Behavior cloningSteps/100 M
Strength0.50.5