Research Article
Dynamic Resource Allocation with Integrated Reinforcement Learning for a D2D-Enabled LTE-A Network with Access to Unlicensed Band
Figure 3
The average number of RL iterations (slots) necessary for convergence of utilities in JRA with different values of and fixed .