Research Article

AIBPO: Combine the Intrinsic Reward and Auxiliary Task for 3D Strategy Game

Table 2

Experimental results of the IBPO (the bold is the best).

Evaluation criteriaIBPODFPDRQN

Average reward value0.920.860.79
Average action steps61.869.375.7