Research on the Difficulty of Mobile Node Deployment’s Self-Play in Wireless Ad Hoc Networks Based on Deep Reinforcement Learning

<div>Schematic diagram of the realization process of “exploration-utilization” heuristic search.</div>

Wireless Communications and Mobile Computing

Figure 6: Research on the Difficulty of Mobile Node Deployment’s Self-Play in Wireless Ad Hoc Networks Based on Deep Reinforcement Learning