Review Article
Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms
Algorithm 5
RL algorithm for RL-DCS [27].
Repeat
  (a) Choose action
  (b) Receive delayed reward from ECE
  (c) Update Q-value:
      For each […]
  (d) Update probability, which is the probability of taking action:
      For each […]
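The four steps of Algorithm 5 can be illustrated with a minimal Python sketch. It is not the update rule of [27]: it assumes a stateless Q-value per action, an exponential moving-average update with a hypothetical learning rate `alpha` applied only to the chosen action, and a Boltzmann (softmax) rule with a hypothetical `temperature` for the action probabilities. The callable `reward_fn` is a hypothetical stand-in for the delayed reward supplied by the ECE.

```python
import math
import random


def update_q(q, action, reward, alpha=0.1):
    """Step (c), assumed form: move the chosen action's Q-value toward the reward."""
    q = list(q)  # copy so the caller's list is not mutated
    q[action] = (1 - alpha) * q[action] + alpha * reward
    return q


def action_probabilities(q, temperature=1.0):
    """Step (d), assumed form: Boltzmann (softmax) probabilities over Q-values."""
    m = max(q)  # subtract the maximum for numerical stability
    weights = [math.exp((v - m) / temperature) for v in q]
    total = sum(weights)
    return [w / total for w in weights]


def run(reward_fn, n_actions, steps, alpha=0.1, temperature=1.0, seed=0):
    """Repeat steps (a) to (d) for a fixed number of iterations."""
    rng = random.Random(seed)
    q = [0.0] * n_actions
    probs = action_probabilities(q, temperature)
    for _ in range(steps):
        # (a) choose an action according to the current probabilities
        action = rng.choices(range(n_actions), weights=probs)[0]
        # (b) receive the reward (hypothetical stand-in for the ECE)
        reward = reward_fn(action)
        # (c) update the Q-value of the chosen action
        q = update_q(q, action, reward, alpha)
        # (d) update the probability of taking each action
        probs = action_probabilities(q, temperature)
    return q, probs
```

For example, `run(lambda a: 1.0 if a == 1 else 0.0, n_actions=2, steps=200)` rewards only the second action, so its Q-value and selection probability grow relative to the first. A lower `temperature` makes the selection greedier, while a higher one keeps it closer to uniform exploration.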