Minimizing the Cost of Spatiotemporal Searches Based on Reinforcement Learning with Probabilistic States

<div>Overall Process of Quasi-Dynamic Programming (QDP).</div>

Wireless Communications and Mobile Computing

Figure 2: Minimizing the Cost of Spatiotemporal Searches Based on Reinforcement Learning with Probabilistic States