Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network

<table>Sample object and agent trajectories for the object moving at 35° with a velocity of 0.50/step, and the object cannot be seen at <math id="M57" xmlns="http://www.w3.org/1998/Math/MathML"><mi>x</mi><mo>&gt;</mo><mn>3.0</mn></math>. The agent does not move in the <i>x</i> direction actually, but for easy understanding, it is shown to be moving in the <math id="M58" xmlns="http://www.w3.org/1998/Math/MathML"><mrow><mi>x</mi></mrow></math> direction along with the object.</table>

Journal of Robotics

fig4

Figure 4

Figure 4: Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network