Research Article

Decentralized and Dynamic Band Selection in Uplink Enhanced Licensed-Assisted Access: Deep Reinforcement Learning Approach

Table 2

Hyperparameters.

ParameterValue

Discount factor 0.9
Learning rate 0.01
Exploration in -greedy policy0.05 to 0.01
Target network update frequency 300
Mini batch size32
Replay buffer size1000