Research Article

Double Deep Recurrent Reinforcement Learning for Centralized Dynamic Multichannel Access

Figure 8

The standard derivation of the average reward in real data trace.