Research Article
Dynamic Resource Allocation with Integrated Reinforcement Learning for a D2D-Enabled LTE-A Network with Access to Unlicensed Band
Figure 8
The average number of FP iterations (per slot) necessary for the convergence of the algorithms with fixed collected during T slots.