Research Article

Adaptive Traffic Signal Control Model on Intersections Based on Deep Reinforcement Learning

Table 5

Performance in multi-intersection case. Travel time: the lower the better; other measures: the higher the better.

SL no.MethodRewardAverage travel time (s)Average speed (m/s)

1DQN (ours)2.54438.262.49
DQN (base)2.42486.952.21
Q-learning1.49752.171.28
LQF2.37496.801.93
Webster2.05528.131.88

2DQN (ours)2.74418.112.57
DQN (base)2.51498.692.16
Q-learning1.68701.821.38
LQFāˆ’0.02816.291.18
Webster1.71644.271.57

3DQN (ours)2.78391.012.64
DQN (base)2.24573.161.83
Q-learning1.71681.121.44
LQF1.28827.841.15
Webster1.75588.221.71