The Scientific World Journal

Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

RL model for the channel hopping scheme [14].


State	; substate indicates an idle or busy channel; specifically, if PU activity does not exist, and if PU activity exists; substate represents gain, while and represent the numbers of control and data channels that get jammed, respectively

Action	; subaction , where action (or ) indicates that the agent will transmit control (or data) packets in (or ) channels uniformly selected from the previously unjammed channels, while action (or ) indicates that the agent will transmit control (or data) packets in (or ) channels uniformly selected from the previously jammed channels

Reward	represents the channel gain