Research Article
Reinforcement Learning-Based Service-Oriented Dynamic Multipath Routing in SDN
Figure 15
Performance of three path distribution schemes in the three-service scenario. (a) Reward gained by SPD; (b) reward gained by DQN; (c) reward gained by RED-STAR; (d) average reward of three services of each scheme; (e) maximum bandwidth utilization of each scheme.
(a) |
(b) |
(c) |
(d) |
(e) |