Research Article

A Novel Reinforcement Learning Architecture for Continuous State and Action Spaces

Figure 10

Accumulated frequency: Comparison of the reliability of the policies found with the SARSA Actor-Actor-Critic algorithm and the Q-learning algorithm.