Research Article

Context Transfer in Reinforcement Learning Using Action-Value Functions

Figure 7

Comparison of the regret of learning with and without transfer for the crossroad traffic controller.