Research Article

A Sarsa(λ) Algorithm Based on Double-Layer Fuzzy Reasoning

Table 2

Performance comparison of the two algorithms on the Cart-pole Balancing problem.

Algorithm       Minimum episodes    Average episodes    Average time within an iterative step
DFR-Sarsa(λ)    135                 155                 100%
GD-Sarsa(λ)     179                 204                 46%