Research Article
Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection
Table 3
Hyperparameters of experiment.
| Parameter | Value |
| Discounted factor | 0.80 | Minibatch size T | 128 | Soft update factor | 0.998 | Epoch U | 300 | Learning rate-actor | | Learning rate-critic | | Hidden layers number | 2 | Hidden units number | 64 | Optimizer | Adam |
|
|