Research Article

Minimizing the Cost of Spatiotemporal Searches Based on Reinforcement Learning with Probabilistic States

Figure 2

Overall Process of Quasi-Dynamic Programming (QDP).