Research Article
Joint Optimization for MEC Computation Offloading and Resource Allocation in IoV Based on Deep Reinforcement Learning
Table 2
Main hyperparameters of the De-DDPG.
| Parameters | Value |
| Size of the first hidden layer for actor and critic | 300 | Size of the second hidden layer for actor and critic | 300 | Learning rate of actor and critic / | 0.0001/0.001 | Size of experienced memory | 20000 | Parameters for OU noise | 0.15, 0.15, 0.10 | Discount factor | 0.95 | Penalty for failed task execution | 8 | Total number of all episodes | 1000 | Total time periods of one episode | 110 |
|
|