Review Article

Application of Reinforcement Learning in Cognitive Radio Networks: Models and Algorithms

Table 10

RL model for the channel auction scheme [37].

State ; each state represents a two-tuple information composed of the fullness of the buffer state and channel states , where represents the state of channel in terms of SNR

Action ; each action represents the amount of a bid for white spaces in channel . represents the number of available channels

Reward represents the sum of the number of lost packets and the channel cost that SU must pay for using the channel. Note that the packet loss and channel cost depend on the global state , available channels , and bidding actions of all competing SUs