Research Article

Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning

Figure 1

Pole balancing problem.