Research Article
Dynamic Resource Allocation with Integrated Reinforcement Learning for a D2D-Enabled LTE-A Network with Access to Unlicensed Band
Figure 2
The average number of RL iterations (slots) necessary for convergence of strategies in JRA with different values of and fixed .