Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Table 14

RL model for the channel hopping scheme [14].

State ; substate indicates an idle or busy channel; specifically, if PU activity does not exist, and if PU activity exists; substate represents gain, while and represent the numbers of control and data channels that get jammed, respectively

Action ; subaction , where action (or ) indicates that the agent will transmit control (or data) packets in (or ) channels uniformly selected from the previously unjammed channels, while action (or ) indicates that the agent will transmit control (or data) packets in (or ) channels uniformly selected from the previously jammed channels

Reward represents the channel gain