A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning
Figure 2
Performance comparison between hierarchical reinforcement learning based on action subrewards (without the divide-and-rule policy) and basic reinforcement learning.