Research Article

A Dynamic Hidden Forwarding Path Planning Method Based on Improved Q-Learning in SDN Environments

Figure 1

Illustration of policy iteration in reinforcement learning.