Research Article

The Study of Reinforcement Learning for Traffic Self-Adaptive Control under Multiagent Markov Game Environment

Table 1

The learned Q-values of TSCA 5 in specified state.

Local state of TSCA 51, 1, 2, 11, 1, 2, 21, 1, 2, 31, 1, 3, 1

Max Q 231.4278.9297.4211.8
Timing60302525