Research Article

Edge Caching for D2D Enabled Hierarchical Wireless Networks with Deep Reinforcement Learning

Algorithm 1

Q-Learning-based content caching algorithm.
Initialization: Q-Table
Iteration:
1: for each episode
2: Initialize
3: for each step of episode
4: Generate at random
5: if
6: randomly select an action
7: else
8: choose using policy derived from
9: Take action
10: Obtain   and  
11: Update Q-Table:
12:
13: end for
14: end for