Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Table 12

RL model for the DCS scheme [27].

Action ; each subaction represents the presence of PU activities. Specifically, if a PU agent cannot transmit in channel and so it becomes white space, and if the PU agent can transmit in channel .

Reward