; each state represents a two-tuple information composed of the fullness of the buffer state and channel states , where represents the state of channel in terms of SNR
Action
; each action represents the amount of a bid for white spaces in channel . represents the number of available channels
Reward
represents the sum of the number of lost packets and the channel cost that SU must pay for using the channel. Note that the packet loss and channel cost depend on the global state , available channels , and bidding actions of all competing SUs