Research Article
A Novel Reinforcement Learning Architecture for Continuous State and Action Spaces
Figure 10
Accumulated frequency: Comparison of the reliability of the policies found with the SARSA Actor-Actor-Critic algorithm and the Q-learning algorithm.