Network Security Defense Decision-Making Method Based on Stochastic Game and Deep Reinforcement Learning

<div>The learning mechanism of the Q-learning algorithm in network attack and defense events. When learning or making decisions, the attacker and the defender must each consider not only the network environment but also the behavior of the other party.</div>
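To make the caption's point concrete, the following is a minimal sketch of a tabular Q-learning step in which the defender's estimate is indexed by both parties' actions, so the opponent's behavior enters the update alongside the environment state. It is an illustration only, not the method of the paper: the table layout `Q[state, defender_action, attacker_action]`, the function names, the toy sizes, and the values of `alpha` and `gamma` are all assumptions, and the pure-strategy maximin used for the next-state value is a simplification (minimax-Q proper solves for a mixed strategy).

```python
import numpy as np


def maximin_value(q_state):
    """Pure-strategy maximin value of one state's payoff table.

    q_state[d, a] is the defender's estimated return when the defender
    plays action d and the attacker plays action a. The defender assumes
    the attacker picks the worst reply to each defender action, then
    chooses the defender action with the best worst case.
    """
    return q_state.min(axis=1).max()


def q_update(Q, s, d, a, reward, s_next, alpha=0.1, gamma=0.9):
    """One joint-action Q-learning step from the defender's point of view.

    Q has shape (n_states, n_defender_actions, n_attacker_actions).
    alpha (learning rate) and gamma (discount) are assumed values.
    """
    target = reward + gamma * maximin_value(Q[s_next])
    Q[s, d, a] += alpha * (target - Q[s, d, a])
    return Q


# Hypothetical toy problem: 2 states, 2 defender actions, 2 attacker actions.
Q = np.zeros((2, 2, 2))
Q = q_update(Q, s=0, d=1, a=0, reward=1.0, s_next=1)
```

Only the entry for the observed state and joint action changes. The attacker's learner would keep a mirror-image table with its own reward signal.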

Security and Communication Networks

Figure 3
