Research Article

Dynamic Resource Allocation with Integrated Reinforcement Learning for a D2D-Enabled LTE-A Network with Access to Unlicensed Band

Algorithm 2

FP algorithm for inband resource allocation.
Initialization:
(1) Input , ;
(2) While () do
 (3) Input , , ;
 (4) Solve (20a)–(20f) to find the optimal ;
Main Loop:
 (5) While () do
  Rounding:
  (6) Solve (21a)–(21f) to find the optimal ;
  (7) If () then break;
  Projection:
  (8) Solve (22a)–(22f) to find the optimal ;
  (9) Set ;
(10) End.