Research Article
Edge Caching for D2D Enabled Hierarchical Wireless Networks with Deep Reinforcement Learning
Algorithm 1
Q-Learning-based content caching algorithm.
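Initialization: Q-table $Q(s, a)$ for all states $s$ and actions $a$
Iteration:
1: for each episode do
2:   Initialize the state $s$
3:   for each step of the episode do
4:     Generate a number $x \in [0, 1]$ at random
5:     if $x < \varepsilon$ then
6:       randomly select an action $a$
7:     else
8:       choose $a$ using the policy derived from $Q$
9:     Take action $a$
10:    Obtain the reward $r$ and the next state $s'$
11:    Update the Q-table:
12:    $Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$
13:   end for
14: end for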
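The loop in Algorithm 1 can be illustrated with a minimal tabular sketch. Everything specific here is an assumption made for illustration and is not taken from the article: a cache holding a single content item, a state equal to the currently cached item, an action that picks the item to cache next, a reward of 1 on a cache hit, a Zipf-like request popularity, and the hypothetical names `train_q_table`, `alpha`, `gamma`, and `epsilon`.

```python
import random


def train_q_table(num_contents=4, episodes=200, steps=50,
                  alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration on a toy cache.

    Assumed toy model (not the article's system model): the cache holds one
    item, the state is the cached item, the action is the item to cache next,
    and the reward is 1.0 when the next request hits the cached item.
    """
    rng = random.Random(seed)
    # Hypothetical Zipf-like popularity: item k is requested with weight 1/(k+1).
    weights = [1.0 / (k + 1) for k in range(num_contents)]
    # Initialization: Q-table with one row per state and one column per action.
    q = [[0.0] * num_contents for _ in range(num_contents)]

    for _ in range(episodes):
        state = rng.randrange(num_contents)            # step 2: initialize state
        for _ in range(steps):
            if rng.random() < epsilon:                 # steps 4-6: explore
                action = rng.randrange(num_contents)
            else:                                      # steps 7-8: greedy w.r.t. Q
                action = max(range(num_contents), key=lambda a: q[state][a])
            # Steps 9-10: take the action, observe reward and next state.
            request = rng.choices(range(num_contents), weights=weights)[0]
            reward = 1.0 if request == action else 0.0
            next_state = action
            # Steps 11-12: standard Q-learning update.
            td_target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (td_target - q[state][action])
            state = next_state
    return q


if __name__ == "__main__":
    table = train_q_table()
    for s, row in enumerate(table):
        print(s, [round(v, 2) for v in row])
```

A greedy caching policy can be read off the returned table by taking, for each state, the action with the largest Q-value. The deep variant named in the article title would replace the table with a function approximator; that part is not sketched here.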