Research Article

NCDN: A Node-Failure Resilient CDN Solution with Reinforcement Learning Optimization

Figure 4

RL environment example. By observing the state of the environment, the RL agent makes actions (i.e., increase or decrease duplications) for which he receives rewards as revenues. The agent’s goal is to find the optimal number of duplicates.