Research Article

Traffic Status Prediction of Arterial Roads Based on the Deep Recurrent Q-Learning

Table 9

Parameter combinations for the third.

GroupLearning rateReward delayGreedyOptimal loss coefficient

c0.010.90.30.067
f0.030.90.30.060
g0.050.90.30.061