The Scientific World Journal

Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

RL model for the channel sensing scheme [20].


State	; each substate indicates SU ’s belief about channel , and it has a value of 0 (busy) or 1 (idle)

Action	; each action represents a single channel chosen for channel sensing purpose

Reward	represents the number of channels identified as being idle by SU node