Wireless Communications and Mobile Computing

Research Article

Resource Scheduling in URLLC and eMBB Coexistence Based on Dynamic Selection Numerology

DQN-based resource allocation algorithm.

1 Initialize replay memory $D$ with capacity $N$
2 Initialize action-value function $Q$ with random weights $\theta$ and target action-value function $\hat{Q}$ with weights $\theta^{-} = \theta$
3 For episode = 1, ..., $M$ do
4 repeat
5 With probability $\varepsilon$ select a random action $a_t$ and update the numerology value; with probability $1 - \varepsilon$ select $a_t = \arg\max_a Q(s_t, a; \theta)$
6 Execute action $a_t$, observe reward $r_t$ and new state $s_{t+1}$
7 Store $(s_t, a_t, r_t, s_{t+1})$ in replay memory $D$
8 Sample a random batch of data from replay memory $D$
9 Update action-value function $Q$ by minimizing the loss on the sampled data
10 Every $C$ steps reset $\hat{Q} = Q$
11 until $s$ is a terminal state
12 End For
13 Return the optimal strategy
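
The steps above can be sketched in a few lines of Python. This is only an illustrative sketch under several assumptions, not the authors' implementation: `ToyEnv` is a hypothetical chain environment standing in for the URLLC/eMBB scheduling model, a linear Q-function over one-hot states replaces the deep network, and all names and hyperparameter values (`train_dqn`, `capacity`, `target_every`, `max_steps`, and so on) are chosen for illustration. The comments map each part of the loop to the numbered steps of the listing.

```python
import random
from collections import deque

import numpy as np


class ToyEnv:
    """Hypothetical stand-in environment: a short chain of states.

    Action 1 moves right, action 0 moves left; reaching the last state
    ends the episode with reward 1. It is NOT the paper's URLLC/eMBB model.
    """

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.n_actions = 2
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.n_states - 1)
        done = self.state == self.n_states - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def train_dqn(env, episodes=200, capacity=500, batch_size=16, gamma=0.9,
              epsilon=0.2, lr=0.1, target_every=20, max_steps=50, seed=0):
    """DQN-style loop with a linear Q-function (assumed, for brevity)."""
    rng = random.Random(seed)
    np_rng = np.random.default_rng(seed)
    memory = deque(maxlen=capacity)                     # step 1: replay memory
    theta = np_rng.normal(scale=0.01,                   # step 2: random weights
                          size=(env.n_states, env.n_actions))
    theta_target = theta.copy()                         # step 2: target weights
    step_count = 0
    for _ in range(episodes):                           # step 3
        s = env.reset()
        for _ in range(max_steps):                      # steps 4-11 (capped)
            if rng.random() < epsilon:                  # step 5: explore
                a = rng.randrange(env.n_actions)
            else:                                       # step 5: exploit
                a = int(np.argmax(theta[s]))
            s_next, r, done = env.step(a)               # step 6
            memory.append((s, a, r, s_next, done))      # step 7
            k = min(batch_size, len(memory))
            batch = rng.sample(list(memory), k)         # step 8
            for bs, ba, br, bs_next, bdone in batch:    # step 9: TD update
                future = 0.0 if bdone else gamma * np.max(theta_target[bs_next])
                theta[bs, ba] += lr * (br + future - theta[bs, ba])
            step_count += 1
            if step_count % target_every == 0:          # step 10: sync target
                theta_target = theta.copy()
            s = s_next
            if done:                                    # step 11
                break
    policy = np.argmax(theta, axis=1)                   # step 13: greedy policy
    return policy, theta, memory


if __name__ == "__main__":
    learned_policy, q_values, _ = train_dqn(ToyEnv())
    print("greedy action per state:", learned_policy)
```

In the scheduling setting described by the listing, the action would additionally carry the numerology choice made in step 5; the toy action space here has no such component and would need to be replaced by the paper's state, action, and reward definitions.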