Research Article

Network Security Defense Decision-Making Method Based on Stochastic Game and Deep Reinforcement Learning

Figure 3

The learning mechanism of Q-learning algorithm in network attack and defense events. The attacker/defender must consider not only the network environment but also the behavior of the other party when learning or making decisions.