Abstract

To facilitate the intelligent and automated provision of mobility services by autonomous transportation systems, bike-sharing can be a supplement to public transport for addressing their point-to-point issue, namely, “last mile” service. However, according to the different nature of land use, the uneven spatio-temporal distribution of travel demand can directly lead to difficult access to bikes with high travel costs for users and operating costs for operators. Based on this, this paper analyzes the user behavior patterns within different areas by using GeoHash encoding and proposes a hierarchical autonomous vehicle scheduling model based on tide bicycle-sharing traffic, namely, HATB. It minimizes operating costs and maximizes user satisfaction to dynamically optimize scheduling routes and required vehicles within each layered zone. As for extracting historical orders of Mobike in Beijing, an example analysis through the genetic algorithm of HATB is conducted to support effective and efficient scheduling. Compared to existing scheduling methods, HATB has faster convergence and lower time complexity, which improves bike turnaround efficiency and practical application ability, thus making it more convenient for the public to travel during peak hours.

1. Introduction

Nowadays, as the modern transportation systems (TSs) develop, problems among mobility services (MSs), e.g., congestion, route adjustment, user dispersion, and peak-time conflicts have become commonplace [13] in terms of the adjustment of urban planning and the year-on-year increase in car ownership. Since the MS are fundamental in propelling current intelligent TS (ITS) evolving towards autonomous TS (ATS), they are also being renovated to assist the public on a daily basis and explore the advancement of ATS. Some theoretical development has revealed that the creation of shared transport, called bicycle-sharing, has effectively alleviated the “last mile” service (LMS), which is the most predominant pain point in the current MS [4] in terms of the most immediate interaction with users, and some emerging companies, e.g., Mobike have also taken this trend to a new level. However, bikes must be parked in GPS-identified areas to address ill-posed problems in the case of illegal parking, vandalism, or theft. Since then, to avoid constant billing, users never consider the capacity of the parking area, resulting in an uneven spatial distribution of bikes, namely, some areas suffer a severe accumulation of bikes while others are “one bike is hard to find” [5]. Therefore, scientific and reasonable scheduling strategies are required to overcome the imbalance between the supply and demand of bikes and improve resource utilization.

Rebalancing and optimizing bicycle-sharing distribution constitutes the vehicle routing problem (VRP), and most current research is based on this theory. For instance, Caggiani et al. [6] proposed a decision support system for the reallocation problem by forecasting the demand for spatio-temporal bikes. Similar research can, accordingly, be divided into static and dynamic scheduling to optimize VRP models with different objectives. Specifically, in static scheduling, Kadri et al. [7] and Dell’Amico et al. [8] developed optimization models for user satisfaction and operating cost, respectively. Yan et al. [9] investigated the deterministic and stochastic demand for bicycle-sharing in dynamic scheduling. In general, these methods assume that the overall supply and demand within a scheduling station are in equilibrium without supporting the mobility of bikes between zones. Moreover, limited open literature has reported that current research focuses too much on mathematical modeling, neglecting the analysis of actual demands. Therefore, these methods have encountered three challenges in practice, namely, slow convergence, high time complexity, and problematic application, from perspectives of fulfilling the actual demands in real-time and adjusting scheduling as needed.

As current diversified mobility demands tend to be managed and fulfilled by more intelligent and automated systems with fewer human intervention [10], it is an urgent need to collaborate corresponding functions to renovate the conventional MS of ITS in the context of ATS [11], i.e., update the ability to sense user demands and rearrange system supplies [12, 13]. Hence, to promote the MS provided by ATS, this paper proposes a hierarchical autonomous vehicle scheduling model based on tide bicycle-sharing traffic, namely, HATB. This model uses GeoHash coding to divide the scheduling into three layers, i.e., top, middle, and bottom, corresponding to the scheduling terminus, area, and point. Based on the genetic algorithm (GA), the model can achieve hierarchical and dynamic scheduling of vehicles and routes to maximize user satisfaction, while minimizing operating costs.

Furthermore, in contrast to current studies on scheduling bikes in ITS, the HATB makes three main contributions to optimizing convergence speed, time complexity, and application difficulties of actual scheduling. The scheduling results based on actual orders ultimately demonstrate HATB can provide a rational reference for LMS in ATS and guide the development of bicycle-sharing regulation and operation.

The overall structure of this paper is divided into five sections. Section 2 introduces related solutions and emerging challenges. The methodology relevant to HATB is described in Section 3. Section 4 elaborates on the scheduling results and superiority of the model. Finally, Section 5 summarizes this study and sketches future research directions.

1.1. Related Works

In the transport domain, LMS refers to the direct interaction between the end of public transport and users, which often suffers from scattered users, peak-time conflicts, and uneven distribution. As an effective way to cope with the LMS problems, bicycle-sharing has become a non-negligible component of urban transport. For example, Cheng et al. [14] have demonstrated that bicycle-sharing increases the proportion of green transport in cities and solves the low efficiency at the end of the travel chain.

In general, current research on bike-sharing mainly focuses on its development status and travel characteristics, but few on its scheduling. Researchers like Soriguera and Jiménez-Meroño [15], Gimon [16], and Lu et al. [17] concur that even while bicycle-sharing has considerable quantities, the spatio-temporal differences in user demands, no fixed parking area, and fewer available bikes may lead to a more significant overall imbalance. Therefore, it is vital to take effective scheduling strategies to rebalance and optimize the distribution of bikes, thereby addressing difficulties in management and operation. In this context, scheduling bicycle-sharing can be regarded as a heuristic algorithm-based, e.g., GA, ant colony algorithm (ACO), and vehicle routing problem (VRP) [1820], which can be classified as static or dynamic scheduling according to different strategies and objectives.

Dynamic scheduling mainly focuses on peak time and relies on user demands. For instance, a mathematical model for dynamic scheduling is created by Zhang et al. [21] based on the parking area’s actual capacity and users’ predicted arrival times. Shui and Szeto [22] partition peak time to optimize scheduling routes by regarding scheduling in each time interval as static scheduling. Chiariotti et al. [23] propose that scheduling bikes can dynamically determine the scheduling time through historical orders. In general, dynamic scheduling lessens operating costs’ impact on operators by prescheduling bikes to avoid a shortage occurs. However, based on the uncertain use of bikes, frequent scheduling with complex constraints is necessary, which may lead to higher operating costs and slower convergence, thus making it challenging to fulfill user demands in real-time.

Another more common scheduling strategy is static scheduling during off-peak time. For example, Lang [24] provides a multiwarehouse model based on the Tabu search algorithm to minimize scheduling distance and improve scheduling efficiency and robustness. Bae and Moon [25] use a dual time window with customer service levels to reduce total transport and labor costs. Since static scheduling only considers the predicted demands of stations, increasing more bikes for stations to guarantee user demands means that the time complexity of the heuristic algorithm grows exponentially. Moreover, to fulfill the actual demands, the allocated bikes by these studies may exceed the station’s capacity.

Besides, such above-given studies are mainly applied to typical scenarios, as presented in Figure 1, where a single scheduling station serves one zone and only the routes within the scheduling zone are considered. It is often limited in actual scheduling by the service range of the station, which needs to frequently adjust the boundaries of this scheduling zone, thus leading to some research on hierarchical scheduling strategies. By defining scheduling priorities based on demand intensity, Sakakibara et al. [26] and Ni et al. [27] highlight the feasibility and reliability of hierarchical scheduling. In order to illustrate the flexibility, Zhang [28] and Ma et al. [29] set stations with similar demands in the same layer in accordance with the spatio-temporal characteristics of bikes. However, the definition of hierarchies in these methods is too subjective and not clear, making it difficult to implement in practice.

In summary, whilst a considerable body of research has been carried out on VRP, much less fits the spatial-temporal and cross-regional mobility characteristics of bicycle-sharing. In addition, it seems to be a common problem that existing studies focus more on mathematical modeling but neglect the analysis of actual demands. When scheduling according to the above-given methods, three challenges in terms of the problematic application in practice, namely, the high time complexity of models and slow convergence for algorithms, are increasingly apparent.

Particularly driven by diverse and emerging technologies and demands, ITS is evolving into ATS, which illustrates that MS should be autonomously fulfilled and managed by more intelligent systems with fewer human intervention [30]. Therefore, it is an urgent need to study and improve the monolithic strategies and rationalize the actual demands to achieve hierarchical and autonomous scheduling of bicycle-sharing, thus rationally guiding the provision of the LMS by ATS.

2. The HATB Methodology

This section uses three subsections to present the framework, hypotheses, and construction of the proposed HATB.

2.1. Model Framework

As existing scheduling methods, in general, require frequently adjusting boundaries and increasing vehicles to alleviate the difference between bikes’ supply and demand, which may inevitably increase transport time and costs; the scheduling framework can be set up as a hierarchical scheduling structure, i.e., a top-middle-bottom hierarchy of scheduling terminus-area-point.

However, since current hierarchical methods have a subjective definition of their hierarchies, geocode can ensure the objective; e.g., what3words [31] uses fixed squares to divide the earth, and pluscode [32] represents each latitude and longitude level by 2-bit code, whose length range in levels 1 to 3 jumps from and to . These geocodes, accordingly, have good accuracy but loss flexibility; they may not meet the actual scheduling requirements.

Therefore, GeoHash encoding, proposed by Morton [33], can be used to better support effective and efficient scheduling. Its maximum length of 12 bits can represent a geographic location with arbitrary precision. For example, the GeoHash strings WX4ER and WX4G2 represent two regions of Beijing (China), where each character is a certain rectangular area. Moreover, the order information (Data Sources: https://biendata.xyz/competition/mobike/) on bicycle-sharing, as extracted in Table 1, also indicates the feasibility of dividing the scheduling layer via GeoHash.

The coding definition, as described in Table 1, illustrates that the 7-bit string matches the characteristics of actual bike stops, namely, area size, and the 5-bit string suits for vehicles to dispatch bikes in light of their loading capacity, i.e., 400 bikes. Therefore, the overall framework of the proposed hierarchical scheduling model, called HATB, can be obtained as presented in Figure 2. In general, this framework is characterized by a number of scheduling areas in each of the three layers, namely, bike stops consist of the bottom layer of scheduling, while the top and middle layers likewise have demands and capacity restrictions for bikes, and hence, the scheduling within the same layers is regional scheduling for seeking optimization.

2.2. Model Hypotheses

Considering the complexity of actual scheduling, the proposed HATB in this paper defines the following hypotheses and the frequency of scheduling as once in the morning peak and once in the evening peak, respectively.(1)All scheduling vehicles own the same attributes(2)In each scheduling route, the vehicle departs from one scheduling terminus (area) and returns to this place after deploying bikes to corresponding areas (points) contained(3)Fuel consumption and vehicle loss should be considered(4)Each scheduling area can only be served once(5)The actual orders determine the scheduling demand(6)All scheduling tasks are required to be completed within the specified scheduling cycle(7)The scheduling areas and points have sufficient space to accommodate the bikes deployed in or out during a scheduling cycle

2.3. Model Construction

Based on the above-given hypotheses and the actual operations of bike-sharing, considering only the operating costs will gradually lose customers, and weighing only user satisfaction runs counter to the essence of business profitability. Hence combining these two factors, this paper constructs a regional scheduling model for bikes to minimize operating costs and maximize user satisfaction .

The parameters ∂ and μ indicate that bike-sharing operators need to adjust the weight coefficients of operating and penalty costs according to their own emphasis.

2.3.1. : The Objective of Minimizing Operating Costs

The actual operation costs need to consider both fixed and flexible costs, as summarized in equation (2), which is determined jointly by the value of scheduling vehicles [34], the unit transport cost (i.e., vehicle loss: 1 , fuel consumption: 1 , and labor cost: 100 ), and the scheduling distance.

Equation (3) indicates that this scheduling vehicle starts and ends at the terminus; equation (4) suggests that each area can only be served once; equation (5) shows that the transport distance must not exceed the maximum scheduling distance; equation (6) points that the number of bikes loaded by the vehicle must not exceed its maximum capacity, namely, 400; equation (7) means as 0-1 variable; equation (8) proves that the number of bikes deployed by the vehicle is a non-negative integer.

2.3.2. : The Objective of Maximizing User Satisfaction

User satisfaction can be improved by adding time window constraints, as described in equation (9), which means maximizing user satisfaction can equivalently transfer into minimizing the penalty cost of scheduling timeout.

Equation (10) indicates that the vehicle departs from the terminus at time zero; equation (11) means that the time calculation for a vehicle to arrive in an area; equation (12) demonstrates that the scheduling cannot arrive in area from area before ; equation (13) represents that vehicle needs to arrive in an area within the time window.

The meaning of the parameters in the above-given equationsare shown in Table 2.

3. Case Study

The highlights of HATB solving are illustrated in terms of algorithm settings, scheduling results, and model evaluation in this section.

3.1. Algorithm Settings

The experiment data comes from the 2017 Mobike cup algorithm challenge, which involves 3,214,096 orders and 485,465 bikes (10th May 2017–23rd May 2017). The characterized user travel, as presented in Table 3, reflects that the data are consistent with the “last mile” definition [16], and hence, shows its reasonable usability according to prominent tidal characteristics.

This paper proposes a GA with natural number encoding (NGA), as defined in Algorithm 1, to optimize the scheduling. In general, the first and the last 0 s represent the scheduling terminus or area, [1, N] represents the zones that need to be scheduled, and other 0 s separate the routes of different vehicles, e.g., a chromosome example might be 0-3-0-1-2-5-7-0-4-8-6-0, namely, three vehicles serving eight zones.

Since the crossover and mutated sub chromosomes may lead to transport overload and overtime and the time window is more likely to be violated, the penalty factors for the two constraints are set to 10 and 500, respectively.

Input: SchedulingArea SA, DistanceMatrix DM
Output: SchedulingRoute SR, SchedulingVehicle SV, DeployedBikes DR
Initialization: Generation Gen= 100, CrossoverRate CR= 0.8, MutationRate MR= 0.2, Population Pop ← ∅, Chromosome CH ← ∅, Fitness Fit=Null,
(1)For all the SA do
(2)Pop ← CH.generate (SA.Encoding)//Randomly Generation for Population (100)
(3)End for
(4)Fit ← Fit.Calculation//Calculate the Fitness
(5)if Fit.change or Gen < 100 then
(6)For all the CH do
(7)  CH.select//Chromosome Selection
(8)  CH.crossover//Chromosome Crossover
(9)  CH.mutate//Chromosome Mutation
(10)  Pop ← CH.NewGenerate//New Population
(11)  Fit ← Fit.NewCalculation
(12)End for
(13)End if
(14)SR ← CH.Decoding
(15)Return SR, SV, DR
3.2. Scheduling Results

A total of 220 scheduling areas in the morning peak are used as HATB test cases to obtain the optimal hierarchical scheduling routes, as shown in Figure 3 and Tables 4 and 5.

As described in the previous section, regional scheduling is applied for each layer. Note that, each scheduling area has a positive or negative raw demand that reflects the redundancy or scarcity of bikes. Furthermore, scheduling prioritizes self-satisfaction in the route, namely, redundancy supports scarcity, and hence, route demand indicates the self-satisfaction gap for corresponding regional scheduling.

Therefore, the experimental results can be summarized as the total scheduling distance for optimal routes in the entire Beijing is around 719.5 km. As for satisfying the demands, 17 vehicles are required to participate in the scheduling to deploy bikes. Moreover, using route one in Table 4 as an example, the scheduling can be summarized as a vehicle departing from the scheduling terminus and returning to the terminus after completing regional scheduling sequentially and autonomously in accordance with the area-point (middle-bottom) hierarchy. In addition, the routes’ information in reality for Table 5 is mapped to Table 6 via GeoHash.

3.3. Model Evaluation

To further verify the reliability of this model, Figure 4 shows the comparison for iteration between the HATB and other models proposed in the literature with similar objectives. Specifically, based on the GA, Gao et al. [35] provided a promising perspective on improving operation efficiency by reducing operating costs and service quality during peak times to minimize the total operating costs. Angeloudis et al. [36] achieved user appeal increase by offering a new method of planning bike routes and distributions. Moreover, Zhao et al. [37] optimized the total scheduling distance to accommodate large-scale scheduling via an ACO.

The proposed HATB converged at the 64th generation, and the total time cost is 148.9 s, with an average running time of 15 s per generation. Due to the different objectives, only the convergence speed of the above models is compared. It can be seen from Figure 4 that the HATB significantly outperforms the model proposed in the existing literature. Such a result indicates that this model is more proper for practical scheduling applications since it optimizes with higher convergence speed and lower time complexity.

4. Conclusion and Future Works

Even though the emerging and diversified technologies and demands are driving TS to renovate conventional MS to be self-actuating, the current bicycle-sharing scheduling and maintenance rely on manual experience, which lacks scientific guidance and efficiency. Therefore, to achieve sustainable development, operators urgently need to develop a rational scheduling strategy to balance the distribution conflicts and supply user demands in time.

Therefore, this paper proposes a hierarchical scheduling model, called HATB, to address the unsolved issues by current studies in terms of slow convergence, high time complexity, and problematic application, and hence, to support the rational and autonomous provision of LMS. In summary, according to bicycle-sharing properties, namely, spatio-temporal characteristics, cross-regional mobilities, and actual demands, HATB takes 220-morning peak areas as tests to validate its improved validity, feasibility, and efficiency for practical application.

As compared to the similarly used methods, HATB can, accordingly, obtain the following improvements. A hierarchical framework is first designed through GeoHash encoding to solve the cross-regional mobility of bikes and reduce the time complexity of global optimization. Next, a GA for regional scheduling is built by combining the tidal characteristics of bicycle-sharing to minimize operating costs and maximize user satisfaction, which significantly accelerates the algorithm’s convergence. Last, the use of actual orders considerably enhances the ability in the practical application of instant response to any regional scheduling demand.

This work was carried out as a preliminary to obtain the present results. However, there are still problems such as the inability to adapt scheduling throughout 24 hours or the lack of comprehensive constraints. As the closure of this study, one recommendation for further research is to use a form of “GA + Tabu” algorithm to exploit its global search capability and thus improve the big data processing capability. Another research direction is adding weather and road characteristics to optimize the model reliability further.

Data Availability

The data used to support the findings of this study have been deposited in the 2017 Mobike Cup Algorithm Challenge repository (https://biendata.xyz/competition/mobike/).

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was supported by National Key Research and Development Program of China (Grant no. 2020YFB1600400) and Collaborative Innovation Center for Transportation of Guangzhou (Grant no. 202206010056).