Research Article
Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning
Figure 7
Comparisons of cumulative rewards and time steps for reaching goal.
(a) Comparisons of cumulative rewards |
(b) Comparisons of time steps for reaching goal |