Research Article

Decomposition Methods for Solving Finite-Horizon Large MDPs

Table 1

Characteristics of the three racetracks.

MDP’s horizonNumber of possible statesNumber of SCCs

Race-1311327505120
Race-2181062005103
Race-3481233003554

Note. The value iteration (VI) algorithm under the infinite-horizon discounted MDP is used in [30] in order to solve racetrack problems. presents the comparison between VI, BI, and HBI algorithms. As it can be seen, the BI algorithm outperforms the VI algorithm, but the proposed HBI algorithm is more efficient than the BI algorithm.