A Novel Reinforcement Learning Architecture for Continuous State and Action Spaces
Figure 8
Learning curves of the SARSA algorithm using three different function approximators: radial basis functions, multilayer perceptrons, and multilayer perceptrons with a layer of radial basis functions.