Research Article

Learning from Demonstrations and Human Evaluative Feedbacks: Handling Sparsity and Imperfection Using Inverse Reinforcement Learning Approach

Figure 6

(a) The relation between the sparsity level of demonstrations in stage-one and the number of feedbacks needed to reach “” score equal to 1.17 using . (b)’s stage-two performance in the face of optimal and different demonstration sparsity levels in stage-one (points “B1”, …, “B10” in Figure 3) and the number of evaluative feedbacks. The black curve corresponds to no initial demonstration (point “C” in Figure 3).