Research Article

Dynamically Resource Allocation in Beyond 5G (B5G) Network RAN Slicing Using Deep Deterministic Policy Gradient

Table 2

RL-based resource allocation.

Ref.AlgorithmFocusOptimization objectiveUse case/vertical appTrainingDevelopment

[17]DQNRANImprove resource consumption and slice isolationContinuous bit rate and lowest bit rateCentralizedSimulation
[18]-learningMaximization of resource use while assembling the elements of successful communicationHapticCentralizedSimulation
[19]-learning, SARSA, and Monte CarloRANAssurance of efficient resource use while fulfilling the demands for low latencyInternet of ThingsCentralizedSimulation
[20, 21]DDQN and duelling DQNRANMaximize long-term profits while offering the services that different multitenant customers requireManufacturing, automotive, and utilitiesCentralizedEmulation (TensorFlow)
[22]DQNRANMaximize the utilisation of radio resources while preserving QoSeMBB, mIoT, and uRLLCCentralizedSimulation
[23]DQNE2E (RAN, TN, CN, edge)SFC traffic variations should be accommodated when VNF placement is optimisedeMBBCentralizedEmulation (OpenAI gym)
[24, 25]LSTMRANIt is necessary to maximize spectrum efficiency and the SLA satisfaction ratioVoLTE, eMBB, and uRLLCCentralizedSimulation
[8]A3CRANMaking the most of resources while preserving slice separationUndeclaredDistributedEmulation (TensorFlow)