Research Article

Rebalancing Docked Bicycle Sharing System with Approximate Dynamic Programming and Reinforcement Learning

Table 2

Key performance indicators using different benchmark strategies.

| Strategy | Average unmet demand (person) | Average travel time (min) | Average delivery amount (bike) |
|---|---|---|---|
| No rebalance | 10.6 | - | - |
| STR | 8.5 | 19.8 | 2.7 |
| SLA | 9.6 | 40.2 | 11.7 |
| RTDP (1.00) | 3.8 | 37.1 | 16.8 |
| RTDP (1.65) | 3.5 | 37.2 | 21.9 |
| RTDP (2.33) | 2.3 | 38.3 | 21.4 |