Research Article

Three-Tier Computing Platform Optimization: A Deep Reinforcement Learning Approach

Table 3

Actor-critic DRL parameters.


Entropy weight0.005
Clip value of the gradient clipping40.0
Buffer size10,000
Minibatch size of DNN64
Maximum episode1000
Maximum number of steps in each episode1000
Actor learning rate 0.0001
Critic learning rate 0.001
Number of iterations X300
Activation function of DNNReLu
Number of hidden layers of DNN2
Number of neurons in the hidden layers300