Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Algorithm 6

RL algorithm for the channel sensing scheme [20].
Repeat
(a) Take action
(b) Exchange collaboration message with SU neighbor agents // First round of collaboration
(c) Determine delayed reward
(d) Exchange collaboration message with SU neighbor agents // Second round of collaboration
(e) Choose action
(f) Update (see (9))
(g) Update -value,