Research Article

Dynamic Resource Allocation with Integrated Reinforcement Learning for a D2D-Enabled LTE-A Network with Access to Unlicensed Band

Figure 5

The absolute error of utility estimation in JRA calculated upon the algorithm termination with different values of and fixed .