Research Article

Adaptive Traffic Signal Control Model on Intersections Based on Deep Reinforcement Learning

Table 3

Performance in single-intersection case. Travel time: the lower the better; other measures: the higher the better.

SL no.MethodRewardAverage travel time (s)Average speed (m/s)

1DQN (ours)1.93222.512.29
Q-learning1.90228.422.26
LQF1.85230.662.24
Webster1.66240.961.84

2DQN (ours)5.0289.314.83
Q-learning4.6897.554.52
LQF0.08170.932.63
Webster2.42166.612.24

3DQN (ours)3.96113.573.51
Q-learning3.54118.983.37
LQF2.39154.852.43
Webster2.54163.892.33