Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Table 13

RL model for the channel sensing scheme [20].

State ; each substate indicates SU ’s belief about channel , and it has a value of 0 (busy) or 1 (idle)

Action ; each action represents a single channel chosen for channel sensing purpose

Reward represents the number of channels identified as being idle by SU node