Research Article
Decentralized and Dynamic Band Selection in Uplink Enhanced Licensed-Assisted Access: Deep Reinforcement Learning Approach
| Parameter | Value |
| --- | --- |
| Discount factor | 0.9 |
| Learning rate | 0.01 |
| Exploration rate ε in ε-greedy policy | 0.05 to 0.01 |
| Target network update frequency | 300 |
| Mini batch size | 32 |
| Replay buffer size | 1000 |
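As an illustration of how the listed hyperparameters might be collected for a deep reinforcement learning agent with a replay buffer and target network, the following is a minimal sketch rather than the authors' implementation. The names `AgentConfig`, `epsilon_at`, and `should_sync_target` are hypothetical. The table gives only the exploration range (0.05 to 0.01), so the linear annealing schedule and the `epsilon_decay_steps` value are assumptions, as is the reading of the target network update frequency as a count of training steps.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentConfig:
    """Hyperparameters taken from the table; the names are illustrative."""
    discount_factor: float = 0.9
    learning_rate: float = 0.01
    epsilon_start: float = 0.05
    epsilon_end: float = 0.01
    target_update_frequency: int = 300  # assumed to be in training steps
    mini_batch_size: int = 32
    replay_buffer_size: int = 1000
    # Assumption: the table does not state how long the annealing lasts.
    epsilon_decay_steps: int = 10_000


def epsilon_at(step: int, cfg: AgentConfig) -> float:
    """Linearly anneal epsilon from start to end (an assumed schedule)."""
    fraction = min(max(step, 0) / cfg.epsilon_decay_steps, 1.0)
    return cfg.epsilon_start + fraction * (cfg.epsilon_end - cfg.epsilon_start)


def should_sync_target(step: int, cfg: AgentConfig) -> bool:
    """Return True on steps where the target network would be refreshed."""
    return step > 0 and step % cfg.target_update_frequency == 0


cfg = AgentConfig()
print(epsilon_at(0, cfg), should_sync_target(300, cfg))
```

Freezing the dataclass keeps the values from being changed by accident during training. A different decay shape (for example exponential) could be substituted in `epsilon_at` without touching the rest of the configuration.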