Research Article

Learning Diverse Policies with Soft Self-Generated Guidance

Figure 3

A collection of environments with continuous state-action spaces that we use. (a) Swimmer in the maze. (b) Ant in the maze.
(a)
(b)