Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Table 5

RL model for the routing scheme [33].

State ; each state represents a SU destination node . represents the number of SUs in the entire network

Action ; each action represents the selection of a next-hop SU node . represents the number of SU node ’s neighbor SUs

Reward represents the number of available common channels among nodes and