Review Article
Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms
Table 5
RL model for the routing scheme [
33].
| State | ; each state represents a SU destination node . represents the number of SUs in the entire network |
| Action | ; each action represents the selection of a next-hop SU node . represents the number of SU node ’s neighbor SUs |
| Reward | represents the number of available common channels among nodes and |
|
|