Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Algorithm 5: RL algorithm for RL-DCS [27].
Repeat
(a) Choose action
(b) Receive delayed reward from ECE
(c) Update Q-value:
   For each
     
(d) Update the probability of taking each action:
   For each
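
The four steps of the loop can be illustrated with a minimal Python sketch. The exact update rules of Algorithm 5 are not recoverable from the listing above, so everything specific here is an assumption made for illustration: the Q-value step uses a common incremental average with a hypothetical learning rate `alpha`, the probability step uses a Boltzmann (softmax) rule with a hypothetical `temperature`, and `reward_fn` is a stand-in for the delayed reward delivered by the ECE. It should not be read as the method of [27].

```python
import math
import random


def softmax(q_values, temperature=1.0):
    """Map Q-values to action probabilities (assumed Boltzmann rule)."""
    m = max(q_values)  # subtract the maximum for numerical stability
    exps = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def run_rl_loop(reward_fn, num_actions, steps,
                alpha=0.1, temperature=1.0, seed=0):
    """Stateless RL loop mirroring steps (a) to (d) of the listing.

    reward_fn, alpha, and temperature are hypothetical names and
    parameters, not taken from the source.
    """
    rng = random.Random(seed)
    q = [0.0] * num_actions                # one Q-value per action
    p = [1.0 / num_actions] * num_actions  # start from a uniform policy

    for _ in range(steps):
        # (a) Choose an action according to the current probabilities.
        a = rng.choices(range(num_actions), weights=p)[0]
        # (b) Receive the reward for that action (stand-in for the ECE).
        r = reward_fn(a)
        # (c) Update the Q-value: for each action, only the chosen one moves.
        for i in range(num_actions):
            if i == a:
                q[i] += alpha * (r - q[i])
        # (d) Update the probability of taking each action.
        p = softmax(q, temperature)
    return q, p
```

For instance, calling `run_rl_loop(lambda a: 1.0 if a == 2 else 0.0, num_actions=3, steps=500)` rewards only the third action, so its Q-value and selection probability end up the largest of the three.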