Research Article

Joint Channel Allocation and Power Control Based on Long Short-Term Memory Deep Q Network in Cognitive Radio Networks

Figure 8

Relationship between the number of iterations and reward function (mix-policy).