Research Article

Joint Optimization for MEC Computation Offloading and Resource Allocation in IoV Based on Deep Reinforcement Learning

Table 2

Main hyperparameters of the De-DDPG.

ParametersValue

Size of the first hidden layer for actor and critic300
Size of the second hidden layer for actor and critic300
Learning rate of actor and critic /0.0001/0.001
Size of experienced memory 20000
Parameters for OU noise 0.15, 0.15, 0.10
Discount factor 0.95
Penalty for failed task execution 8
Total number of all episodes 1000
Total time periods of one episode 110