Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network

<table>Change in the <math id="M68" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>Q</mi></mrow></math>-value for “catch” action for three velocities. The timing for the increase in value differs, but the position of the object at that timing is almost the same in all cases.</table>

Journal of Robotics

fig6

Figure 6

Figure 6: Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network