Emergence of Prediction by Reinforcement Learning Using a Recurrent Neural Network
Figure 4
Sample object and agent trajectories for an object moving at 35° with a velocity of 0.50 per step, where the object cannot be seen at . The agent does not actually move in the x direction, but for ease of understanding it is drawn as moving in that direction along with the object.