Research Article

Dynamic Resource Allocation with Integrated Reinforcement Learning for a D2D-Enabled LTE-A Network with Access to Unlicensed Band

Figure 2

The average number of RL iterations (slots) necessary for convergence of strategies in JRA with different values of and fixed .