Research Article
Learning from Demonstrations and Human Evaluative Feedbacks: Handling Sparsity and Imperfection Using Inverse Reinforcement Learning Approach
Figure 2
(a) Grid world navigation domain. The white, blue, green, gray, and red cells depict the ground, puddle, grass, obstacle, and goal, respectively. The black circle represents the learner robot. (b) A snapshot of our highway car driving simulator.
(a) |
(b) |