Research Article

Minimizing the Cost of Spatiotemporal Searches Based on Reinforcement Learning with Probabilistic States

Table 3

Accumulative search cost of QDP with different discount rates.

Start moment8 : 0010 : 0012 : 0014 : 0016 : 0018 : 0020 : 00

α = 1,γ = 0.9734.3835.0536.5134.8635.5732.4436.61
α = 1,γ = 0.9933.2533.4235.0133.2734.3130.4635.11
α = 1,γ =1.0032.8132.9034.8532.7234.0630.1534.97
α = 1,γ =1.0133.5932.9534.9133.0833.7230.1835.14
α = 1,γ =1.0333.1533.1935.7933.2434.1030.3635.13
α = 1,γ =1.0533.6833.6436.1733.9234.9631.2135.79