Research Article

DQfD-AIPT: An Intelligent Penetration Testing Framework Incorporating Expert Demonstration Data

Table 6

Special hyperparameter setting of the DQfD.

HyperparameterValue

Pretrain step1000
N-step return weight 1.0
Supervised loss weight 1.0
L 2 regularisation weight 1.0
Expert margin 0.8
N of N-step return10
Prioritized replay exponent 0.4
Prioritized replay constants 0.001
Prioritized replay constants 1.0
Prioritized replay importance sampling exponent 0.6