Research Article

Anti-Attack Scheme for Edge Devices Based on Deep Reinforcement Learning

Table 2

Player’s mean payoff comparison.

Value                Data 1     Data 2    Data 3    Data 5

Mean payoff          21.251     20.448    20.775    15.849
Weight mean payoff   21.9937    21.219    21.5435   16.1693
Error                0.742695   0.771     0.7685    0.3203
Action               1          1         1         0
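
The rows of Table 2 appear to be related by simple arithmetic: in each column, the Error value matches the weight mean payoff minus the mean payoff. The table does not state this definition, so it is an assumption inferred from the numbers. The short Python sketch below checks it against the tabulated values; the variable names and the rounding tolerance are illustrative choices, not part of the original scheme.

```python
# Values transcribed from Table 2 (columns: Data 1, Data 2, Data 3, Data 5).
labels = ["Data 1", "Data 2", "Data 3", "Data 5"]
mean_payoff = [21.251, 20.448, 20.775, 15.849]
weight_mean_payoff = [21.9937, 21.219, 21.5435, 16.1693]
reported_error = [0.742695, 0.771, 0.7685, 0.3203]

# Assumption: Error = weight mean payoff - mean payoff.
# This is inferred from the numbers; the table itself does not define Error.
computed_error = [w - m for w, m in zip(weight_mean_payoff, mean_payoff)]

# Illustrative tolerance that absorbs rounding in the published figures.
TOLERANCE = 1e-4
matches = [abs(c - r) < TOLERANCE for c, r in zip(computed_error, reported_error)]

for label, c, r, ok in zip(labels, computed_error, reported_error, matches):
    status = "consistent" if ok else "inconsistent"
    print(f"{label}: computed {c:.6f}, reported {r:.6f} ({status})")
```

Under this assumption all four columns agree with the reported Error row to within rounding. The sketch does not attempt to reproduce the Action row, because the rule that maps payoffs to an action is not given in the table.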