Abstract
In this work, we investigate a new paradigm for dockless bike sharing. Recently, it has become essential to accommodate connected and free-floating bicycles in modern bike-sharing operations. This change comes with an increase in coordination cost, as bicycles are no longer checked in and out at bike-sharing stations that are fully equipped to handle the volume of requests; instead, bicycles can be checked in and out virtually anywhere. In this paper, we propose a new framework for combining traditional bike stations with locations that can serve as free-floating bike-sharing stations. The framework focuses on identifying highly centralized $k$-clubs (i.e., connected subgraphs of restricted diameter). The restricted diameter reduces coordination costs, as dockless bicycles can only be found in specific locations. In addition, we use closeness centrality, as this metric allows for quick access to dockless bike sharing while, at the same time, optimizing the reach of service to bikers/customers. For the proposed problem, we first derive its computational complexity and show that it is NP-hard (by reduction from the 3-SATISFIABILITY problem), and then provide an integer programming formulation. Due to its computational complexity, the problem cannot be solved exactly in a large-scale setting, such as that of an urban area. Hence, we provide a greedy heuristic approach that is shown to run in reasonable computational time. We also present and analyze a case study in two cities of the state of North Dakota: Casselton and Fargo. Our work concludes with a cost-benefit analysis of both models (docked vs. dockless) to suggest the potential advantages of the proposed model.
1. Introduction
Bike-sharing systems (BSSs) have become a prominent mode of transportation around the world, especially in urban areas. BSSs bring a number of advantages to existing transportation networks. Among them, we note increased personal mobility, reduced transportation costs, reduced traffic congestion, decreased use of and dependence on fossil fuels, increased public transit visibility, enhancement of downtown areas along with the economic development that follows, health benefits, and increased environmental awareness [1–3].
Since their introduction in Europe in the 1960s, BSSs have undergone a series of developments. The most recent of these developments is referred to as the dockless or free-floating BSS. In a dockless BSS, residents who are interested in using a bicycle can check bicycles out and in throughout an urban area using nothing more than their smartphones. The bicycles are equipped with a global positioning system (GPS) receiver, thus enabling users to locate the nearest available bicycle and to unlock it with an app. Riders are then allowed to drop off (check back in) the bicycle anywhere within a geographic area (referred to as the geofenced area). Within that area, bicycles are allowed to be parked legally. The trip ends as soon as the checked-out bicycle is parked and securely locked anywhere in the geofenced area.
As the description makes clear, dockless or free-floating bicycles offer enhanced convenience and improved accessibility, which in turn translate to increased personal mobility compared to conventional bike sharing. The enhanced convenience stems from the fact that users no longer have to wait for a parking spot to become available in a bicycle dock in order to return their bicycle after a trip (especially in heavily trafficked areas). However, as with many other technologies, dockless BSSs also present new challenges. The one we deal with in this work is the fact that bicycles can now be left unattended in improper positions [4, 5]. Piles of bicycles from bike-sharing companies are increasingly becoming a public nuisance, and they degrade the urban environment with clogged sidewalks that are no longer walkable. Hence, developing a dockless BSS without proper control and regulation mechanisms can result in blocked rights of way, increased trash, vandalism, and other negative effects.
The framework we develop here aims to address this issue of control while at the same time advocating for dockless bike sharing. We propose a framework that will both (i) allow users the increased benefits of a dockless system (easy and fast access to bicycles, reduced parking space needs) and (ii) reduce the coordination costs for controlling the sprawl of dockless bike-sharing operations by restricting the size of the geofenced area.
Continuing with our motivation, a critical component of the success of every BSS is that users can check out a bicycle within a convenient walking distance [6]. This simple, yet powerful, principle guides BSSs to offer the right number of bikes in the right locations at the right time so as to accommodate daily commuting demands. In other words, station location and density are key factors in any BSS [7]. In the traditional dock-based system, BSS operators are expected to rebalance bike inventory between different stations so as to meet (asymmetric) demands. As an example of such rebalancing operations, Chiariotti et al. proposed a dynamic model to address the fluctuations in demand of a BSS in New York City [8]. On a similar note, Wang et al. applied a data-driven approach to define a safe rebalancing range and provide rebalancing operators with the next targeted station and the number of bikes to move [9]. Rebalancing bicycle inventory imposes extra costs, associated with human and physical capital, on bike-sharing programs. These costs can grow to be considerable if the system is large. Moreover, the rebalancing problem is even more pronounced in dockless BSSs because of unrestrained parking locations [10]. Finding the right locations for stations and rebalancing are closely related problems. According to the Department of Transportation, a dockless program should be initiated where demand is highest, and designated bike parking areas, referred to as bike hubs, should be used to maintain some order. This policy would help mitigate the hodgepodge of problems that can result from adopting a dockless system [11].
As our framework optimizes the reach of dockless bike-sharing operations while also restricting the size of the system, our model also alleviates some of the problems involved with rebalancing. To further elaborate on our model, we offer Figure 1. On the right, we present a conventional dock-based BSS. The transportation network is presented with nodes (representing intersections) and edges (representing streets), with the bicycle docks marked as blue rectangular nodes: observe that docks are not necessarily located at nodes only, but can also be located along the edges of the network. On the other hand, the figure on the left shows our proposed framework. We now allow for a geofenced area (represented by the shaded area) where users can check bicycles out and in from anywhere. This gives more people fast access to bicycles and reduces the need for docks within that area. As a result, these docks could be moved to other areas, outside the geofenced area, to extend bike-sharing use to other residents. In addition, the area where bicycles can be dropped off anywhere is significantly smaller, making it easier for operators to find and collect bicycles so as to rebalance their inventory. Last, we note that the shaded area of the network on the left forms a $k$-club with $k = 2$ (i.e., a subgraph of diameter equal to 2).
[Figure 1: panels (a) and (b).]
We can summarize our contributions in the following three components:
(i) First, we use the $k$-club concept, combined with closeness centrality, so as to identify candidate locations that could be geofenced. We also allow for a weight at each node of the network: this modification enhances the speed of the club formation through the heuristic algorithm devised.
(ii) Then, we turn our attention to a real-world application. We present an experimental study on the cities of Fargo and Casselton. In the study, we analyze the exact optimization model and the heuristic devised and compare them in terms of computational time and solution obtained. In each club obtained for varying values of $k$, riders (commuters) are able to reach any other neighboring site within a fixed distance (controlled by $k$), implying that the virtual locations provide better accessibility to demand points.
(iii) Last but not least, we present potential strategies for operators to further manage the inventory by applying incentives and making bicycle collection and rebalancing more cost-effective.
The remainder of the paper is organized as follows. The next section reviews related literature on BSS design and discusses how it relates to the objectives of this work. Then, we provide the necessary mathematical background, define all notation used, and derive the computational complexity of the problem studied. The next section presents a mathematical formulation that can be solved using a commercial optimization solver and also proposes a greedy heuristic. In the following section, we discuss two computational experiments on real-world transportation networks: namely, the smaller city of Casselton, ND, and the larger city of Fargo, ND. Due to the size of the network in Fargo, we only test and present the results of the heuristic approach there. The last section of the paper is devoted to our conclusions and a brief overview of future plans.
2. Related Works
There is a plethora of studies on bike-sharing systems. These studies generally fall into three major areas:
(1) General quantitative analysis;
(2) Facility location problems;
(3) Redistribution problems.
The first body of literature focuses on the quantitative analysis of existing BSSs, analyzing their characteristics and examining empirical evidence of usage patterns in places including Dublin [12], Beijing [13], Montreal [14], Brisbane [15], Helsinki [16], Paris [17], Switzerland [18], and New York [19]. Nair et al. examine several aspects of the Vélib' BSS in Paris, France [17]. Their findings show that integrating transit and BSS can yield higher utilization. Bachand-Marleau et al. surveyed residents of Montréal, Quebec, in Canada to determine the factors leading to BSS use as well as the frequency of use [14]. Campbell and Brakewood quantify the impact that BSSs have on bus ridership in New York City [19]. They conclude that either bike-sharing members substitute bike sharing for bus trips or the implemented BSS led to travel behavior changes among nonmembers. Audikana et al. studied the impact of a BSS in a small city (fewer than 100,000 residents) in Switzerland [18]. They suggested that BSS network density, along with the partnerships developed, plays a critical role in its success.
The second stream of literature focuses on the strategic design of BSSs, where the ultimate goal is to find the locations, capacity, and coverage areas of BSSs [20]. These studies try to determine the number and location of stations, fleet size, and network structure of the underlying BSSs. They consider various objectives, including the maximization of demand coverage, the minimization of transportation cost, and the minimization of the overall cost. Lin et al. address the strategic design problem by formulating it as a hub location inventory model [21]. In their work, they consider both total costs (travel cost of users, bike inventory costs, facility costs) and service level (bicycle lanes) in their model. The authors then propose a heuristic method to find high-quality solutions. In a similar study, Lin and Yang propose a nonlinear integer model to determine the optimal location, bike lanes, and routes [22]. Their model assumes a penalty for uncovered demand but does not consider relocation (rebalancing) of bikes. Martinez et al. present a mixed integer linear program to maximize the net revenue by simultaneously optimizing the locations of stations, the fleet size, and bike relocation activities for a regular operation day [23]. Nair and Miller-Hooks formulate an equilibrium network design model to address the same objective as the previous study [24]. They propose a metaheuristic solution approach to overcome the intractability of the exact solution for real-life, large-scale networks. In another study, Reijsbergen identifies alternative locations with the aid of spatial data and simulation techniques: more specifically, a data-driven approach to determine how attractive city areas are for station placement is presented [25]. The literature also offers other methodologies, not based on facility location models, to define the location of the stations. Garcia-Palomares et al. develop a GIS-based model to calculate the spatial distribution of the potential demand for trips and find the locations of bike stations using the location-allocation modeling approach [26].
Finally, a third group of the literature is associated with the relocation of bicycles in a BSS. The problem arises from demand imbalance leading to accumulation of bicycles at some stations (and, consequently, limited bicycle availability at other stations). Vogel and Mattfeld apply a system dynamics method to model the effect of dynamic repositioning on the service level [27]. Shu et al. develop a stochastic network flow model with proportionality constraints to determine bike flow in a bike-sharing network. They also present a numerical analysis on the Singapore BSS and find that period distribution is the most effective for system performance [28]. Forma et al. develop a 3-step heuristic and mixed integer linear programming model for repositioning [29]. The first step involves clustering the stations based on geographic location and inventory levels using a heuristic method. In the second and third steps, they employ a mixed integer linear program to find the best routes for repositioning vehicles. Alvarez-Valdes et al. address the static repositioning problem using simulation techniques in two stages [30]. In the first stage, they estimate the levels of unsatisfied demand for a set of stations in a given period. In the second stage, they use the estimate as an input to their redistribution algorithm. Schuijbroek et al. combine service level requirements and vehicle routes to rebalance the inventory [31]. They propose a “cluster-first route-second” heuristic considering the service level feasibility and approximate routing costs simultaneously. Yan et al. develop four planning models for leisure-based BSSs given deterministic and stochastic demands [32]. They apply nonlinear time-space network models to integrate bike repositioning and vehicle routing with user dissatisfaction estimations. In a recent study, Celebi et al. propose a hybrid approach jointly considering location decisions and capacity allocation [33]. Their goal is to find the optimal configurations of a BSS by combining set-covering and queuing models to determine service levels.
One of the gaps in the current state of the art is that most studies focus on either user accessibility or rebalancing strategies to manage supply and demand within an urban area, but not on both. As described in the Introduction, our contribution is to fill exactly that gap and propose a framework that allows for both high accessibility for users and reliable, cost-effective rebalancing and coordination for BSS operators. Our proposed model relies on the notion of a $k$-club from graph theory, whose definition and related literature are offered in the next paragraphs.
Given a simple undirected graph, a $k$-club is a subset of vertices inducing a subgraph of diameter at most $k$. These structures represent cohesive subgroups in social network analysis, with common applications in network-based data mining and clustering. Several authors have discussed mathematical formulations for identifying $k$-clubs of maximum cardinality, as well as various methods to locate $k$-clubs within a network [34–36]. In addition to using $k$-clubs, our work also focuses on the centrality of a group with a specific structure. Group centrality, introduced by Everett and Borgatti, aims to identify groups or classes of high centrality [37]. Centrality measures aim to characterize the importance of an element in a network. They typically fall into three main classes [38], referred to as degree (i.e., the number of connections of a specific element in the network), closeness (i.e., how close an element is to every other element in the network), and betweenness centrality (i.e., how important an element is in the communications between any two other elements in the network, assuming all such communications take place using the shortest path between the elements).
More recently, researchers have focused on highest betweenness groups [39]. Finally, another extension of identifying highly centralized groups has to do with the added restriction that the group induces a subgraph “motif”, such as being a complete subgraph/clique [40, 41], or inducing a star [42].
In this paper, we propose an integer programming formulation and a heuristic algorithm to find the most centralized $k$-club in a transportation network based on closeness centrality. The resultant club consists of a set of nodes in which the maximum traversing distance is $k$ hops (by definition), and the total population-weighted distance to a node in the club is minimized (as it will be the club with maximum closeness centrality). Based on this result, a BSS operator could then designate the area covered by the club as the geofenced area where dockless bike sharing is allowed and satisfy the following objectives:
(1) Maximize demand coverage (as the area obtained is the most centralized, with respect to closeness centrality);
(2) Minimize distances traversed for rebalancing operations (as the geofenced area is of restricted diameter);
(3) Offer a large, convenient geographical area for checking in/out the available bikes without the need for physical stations. As the success of a BSS heavily depends on the network of bike paths and bike stations in the community, this is an important objective facilitated by our framework.
Most of the previous work that addresses physical bike station location problems treats issues including station capacity decisions and demand predictions, among others. To the best of our knowledge, this paper is the first to use $k$-clubs to address problems that have arisen from the emergence of dockless bike-sharing systems. The ultimate goal is to locate potential hubs in a city, referred to as clubs, by geofencing a suitably small area of the city.
3. Definitions and Notation
Let $G = (V, E)$ be an undirected network, with $V$ symbolizing the vertices (intersections of the transportation/biking network) and $E$ the edges (streets in the transportation/biking network). Every node $i \in V$ is assumed to be assigned a nonnegative parameter, $w_i$, referred to as the weight at this specific location. This weight parameter can be used to capture different aspects of the problem at hand, depending on the application. For example, the weight of a node could capture socioeconomic attributes like population, points of interest in the vicinity, number of jobs, etc. Another possible way to model and use the weight parameter is through the interactions between different pairs of origin and destination, like traffic flows (outgoing traffic from an origin node, incoming traffic to a destination node, or simply a summation of outgoing and incoming traffic at a specific node). Either way, we assume a distinct, nonnegative number expressing the level of attraction of that node.
We say that two nodes $i$ and $j$ are adjacent if there exists an edge starting from node $i$ and ending in node $j$, in which case we write that $(i, j) \in E$. We also denote with $N(i)$ the open neighborhood of node $i$. We write that the diameter of graph $G$ is $\operatorname{diam}(G) = \delta$ if the maximum shortest path distance between two nodes in the graph is $\delta$. Clearly, all pairs of nodes in the graph will be located at a distance $d$ from one another with $d \leq \delta$. Let $d_{ij}$ be the distance between two nodes $i$ and $j$, and $d(i, S) = \min_{j \in S} d_{ij}$ the distance of a node $i$ to a set of nodes $S$. Then, for any set of nodes $S \subseteq V$, we define a function $f(S)$ as

$$f(S) = \sum_{i \in V} w_i \, d(i, S). \tag{1}$$

Last, we use $P_k$ to denote all paths of length less than or equal to $k$. Similarly, $P_k^{ij}$ is the set of all paths of length at most $k$ connecting two nodes $i$ and $j$ ($i, j \in V$). Clearly, we have that $P_k^{ij} \subseteq P_k$.
The decision version of the problem we are trying to solve is provided in Definition 2. Before that, we need to provide the definition of a club.
Definition 1. A set of nodes $S \subseteq V$ is said to form a $k$-club if the subgraph induced by $S$, $G[S]$, has a diameter $\operatorname{diam}(G[S]) \leq k$.
Definition 2. Given a graph $G = (V, E)$ with a nonnegative weight $w_i$ assigned to every node $i \in V$, an integer number $k$, and a real number $\ell$, does there exist a $k$-club $S$ such that $f(S) \leq \ell$?
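The club condition of Definition 1 is easy to misread: distances are measured inside the induced subgraph, not in the whole network. The following minimal Python sketch illustrates the check with a breadth-first search. The function names, the adjacency-dictionary representation, and the 5-cycle toy graph are assumptions made for illustration and are not taken from the paper.

```python
from collections import deque


def induced_distances(adj, members, source):
    """Hop distances from `source`, walking only through vertices in `members`."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v in members and v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def is_k_club(adj, members, k):
    """True if the subgraph induced by `members` has diameter at most k."""
    members = set(members)
    for source in members:
        dist = induced_distances(adj, members, source)
        # Fail if some member is unreachable inside the set or farther than k hops.
        if len(dist) < len(members) or max(dist.values()) > k:
            return False
    return True


# Toy instance (hypothetical): the 5-cycle 0-1-2-3-4-0.
cycle = {0: [1, 4], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 0]}
print(is_k_club(cycle, {0, 1, 2}, 2))     # True: induced path 0-1-2
print(is_k_club(cycle, {0, 1, 2, 3}, 2))  # False: 0 and 3 are 3 hops apart inside the set
```

In the second call, nodes 0 and 3 are only two hops apart in the full cycle (through node 4), yet the set is rejected because node 4 is not part of it.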
Detecting a $k$-club of maximum cardinality is a well-known NP-hard problem [34, 43]. Hence, it is expected that our problem, as described in Definition 2, will also be NP-complete, rendering the optimization version NP-hard. This is exactly what we show in Theorem 1. Before we do that, we define 3-SATISFIABILITY (3SAT), a famous NP-complete problem.
Definition 3 [3SAT]. Given $m$ clauses and $n$ literals $x_1, \ldots, x_n$ and their complements $\bar{x}_1, \ldots, \bar{x}_n$, does there exist an assignment such that a formula in conjunctive normal form is true, when every clause consists of exactly 3 literals?
Theorem 1. The decision version of our problem, as described in Definition 2, is NP-complete.
Proof. The problem can be shown to be in NP, as both verifying that a subset of nodes forms a $k$-club and checking the bound of Definition 2 can be done in polynomial time.
Now consider an instance of 3SAT with clauses on literals. We will reduce it to a version of our problem using the following gadget/transformation. First, create two nodes for every literal and its complement (); we connect every node by a chain of nodes () to every other node, but its complement (this forms edge set ). Moreover, create one node for every clause (); connect each node in by a chain of nodes () to the literals that the corresponding clause consists of (), where . Finally, assume that all nodes in have a weight of , while all other nodes in have a weight of 0. We will show that the 3SAT instance has a feasible assignment if and only if the constructed graph with and has a club such that . The gadget is also shown in Figure 2.
Assume that the 3SAT instance has a feasible assignment . Then, it is easy to see that by construction, the nodes corresponding to the literals in form a club (let them be ). Moreover, satisfies all clauses, hence there exists at least one node in that is at a distance of from each node in . Hence, we have that .
For the other direction of the proof, assume there exists a $k$-club satisfying the bound of Definition 2; yet, there exists no feasible assignment of literals to satisfy the 3SAT instance. We distinguish between four cases:
(1) The club consists of exactly one literal-node and chain nodes in as many as all chains connecting it to all other literals (but its complement).
(2) The club consists of exactly one clause-node and chain nodes in as many as all chains connecting it to the literals that the clause contains.
(3) The club consists of only chain nodes in exactly one chain connecting a literal-node to a clause-node.
(4) The club consists of several literal-nodes, along with the chain nodes in all chains necessary to connect them within $k$ hops.
Case 1. Let be the literalnode in . From the nodes in the chains connecting to the other literals (but the node corresponding to its complement), one chain can have at most nodes in and the remaining chains can have at most , where . Now, at best, this literal can satisfy at most clauses (since by assumption there exists no satisfiable assignment) whereas the literal that satisfies the remaining clause is located within a distance of from . Hence, we have:This contradicts the assumption that is a club with .
Case 2. Let be the clausenode in . Since we have a 3SAT instance, has exactly 3 chains around it, and contains at most nodes from one chain with the remaining chains having at most nodes in . The three literalnodes connected through the chains to clausenode can satisfy at most other clauses (apart from ). Hence, at best, we have: By assumption, though, we have that , which, combined with inequality (3), leads to:which is a contradiction.
Case 3. A similar contradiction to Case 2 is obtained when club consists only of nodes in . Let the club be at a distance of from the clausenode and at a distance of from the literalnode of that chain. We then have one clause at a distance of , at most clauses (as, otherwise, literal satisfies all clauses, a contradiction) at a distance of , and at least 1 clause at a distance of, at best, , leading to: This leads to the same contradiction as in Case 2.
Case 4. Finally, in the last case, the club is built so as to contain a series of nodes corresponding to literals. At best, those literals satisfy clauses (as, again, the 3SAT instance is assumed to be without a solution). Hence, we have that:This last contradiction finishes the proof.
4. Mathematical Formulation
In this section, we present our mathematical formulation and a greedy heuristic algorithm to solve larger-scale instances. We also present some computational results on generated and real-life instances for smaller $k$-clubs (i.e., small values of $k$).
4.1. Formulation
We begin this section with the definition of our variables. We will use two sets of binary variables, defined as follows.
We can now proceed to describe the mathematical formulation, shown in (8). It is based on the maximum $k$-club chain formulation presented in [34]. Newer formulations for identifying $k$-clubs (as in, e.g., [44]) can also be employed, but are not explored here.
The objective function in (8a) aims to minimize the total weighted distance every node outside the club needs to traverse until it accesses a node in the club. The constraint family in (8b) ensures that a path can only be within the club if every node that belongs to it belongs to the club. Constraints (8c) enforce that every node in the graph is at some distance $d$ from a node in the club. The following constraints, shown in (8d), recursively enforce that a node can be at a distance of $d$ from the club only if it neighbors a node that is itself located at a distance of $d - 1$. The constraint family in (8e) ensures that two nodes cannot both belong to the club unless there exists at least one path connecting them within $k$ hops that is in the club. Finally, the binary nature of all variables involved is enforced with (8f) and (8g).
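For very small instances, the optimum that the formulation targets can be cross-checked by plain enumeration. The sketch below is not the integer program itself: it simply tries every vertex subset, keeps those that induce a subgraph of diameter at most `k`, and returns the one with the smallest total weighted distance, following the verbal description of objective (8a). The function names, the toy path graph, and its weights are assumptions for illustration, and the graph is assumed to be connected.

```python
from collections import deque
from itertools import combinations


def bfs(adj, sources, allowed=None):
    """Hop distance to the nearest source; optionally restricted to `allowed` vertices."""
    dist = {s: 0 for s in sources}
    queue = deque(sources)
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist and (allowed is None or v in allowed):
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def is_k_club(adj, club, k):
    """True if the subgraph induced by `club` has diameter at most k."""
    for s in club:
        dist = bfs(adj, [s], allowed=club)
        if len(dist) < len(club) or max(dist.values()) > k:
            return False
    return True


def weighted_distance(adj, weight, club):
    """Total weight-times-distance from every node to its nearest club node."""
    dist = bfs(adj, club)  # assumes a connected graph, so every node is reached
    return sum(weight[v] * dist[v] for v in adj)


def best_k_club(adj, weight, k):
    """Exhaustive search; exponential in the number of nodes, so toy sizes only."""
    best, best_cost = None, float("inf")
    for size in range(1, len(adj) + 1):
        for subset in combinations(adj, size):
            club = set(subset)
            if is_k_club(adj, club, k):
                cost = weighted_distance(adj, weight, club)
                if cost < best_cost:
                    best, best_cost = club, cost
    return best, best_cost


# Toy instance (hypothetical): a path 0-1-2-3-4 with one heavy end node.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
weight = {0: 1, 1: 1, 2: 1, 3: 1, 4: 5}
print(best_k_club(path, weight, k=1))  # ({3, 4}, 6)
print(best_k_club(path, weight, k=2))  # ({2, 3, 4}, 3)
```

Enumeration of this kind is only a sanity check; it says nothing about how a solver handles constraints (8b) through (8g).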
4.2. Greedy Heuristic
The above formulation is difficult to solve, as the underlying problem was shown to be NP-hard (with its decision version being NP-complete per Theorem 1). Hence, along with solving the formulation using a commercial solver, we also devise a practical heuristic. In our case, we opted for a greedy heuristic that always chooses to grow the club at hand by adding a node with a maximum weight-to-distance ratio: that is, if a node is located near many nodes with large weights, it is more likely to be selected. This approach is shown in Algorithm 1.

The backbone of the heuristic method is the spatial interaction model known as the gravity model (as it is similar to Newton’s law of gravity). Its basic formula is as follows:

$$I_{ij} = \frac{w_i \, w_j}{d_{ij}}, \tag{9}$$

where $w_i$ and $w_j$ are the weight parameters (or importance) of the origin and destination locations and $d_{ij}$ is (as defined earlier) the distance between the origin $i$ and destination $j$. In this work, we slightly change the interaction term in the numerator given in (9). Starting from some origin $i$, we search all adjacent (nearby) locations $j$ so as to add one of them to the club being built. Since the term $w_i$ is the same for all considered locations (as the origin $i$ is fixed), we drop it from consideration and hence are left with the ratio of the importance of candidate location $j$ (given by the weight parameter $w_j$) to the distance $d_{ij}$.
The algorithm is initialized with all nodes in the node set $V$ being in the candidate list, $C$, while the starting club, $S$, is empty. Then, for every node $i$ in the candidate list, we tentatively “add” it to $S$ and calculate the shortest path distance from every node to its nearest node inside $S \cup \{i\}$. The ratio for candidate $i$ then becomes the summation, over all nodes, of the weight-to-distance fractions. The node with the maximum ratio is added to $S$, and the candidate list is updated to contain only neighboring nodes that satisfy the $k$-club criterion. A pictorial example and its calculations are provided in Example 1.
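The loop just described can be sketched in a few lines of Python. This is a hedged reading of the prose, not a transcription of Algorithm 1. In particular, the text does not say how nodes at distance zero from the club enter the ratio, so the sketch scores each node as `weight / (1 + distance)`; that smoothing, the tie-breaking by lowest label, the function names, and the toy path graph are all assumptions.

```python
from collections import deque


def bfs(adj, sources, allowed=None):
    """Hop distance to the nearest source; optionally restricted to `allowed` vertices."""
    dist = {s: 0 for s in sources}
    queue = deque(sources)
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist and (allowed is None or v in allowed):
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def is_k_club(adj, club, k):
    """True if the subgraph induced by `club` has diameter at most k."""
    for s in club:
        dist = bfs(adj, [s], allowed=club)
        if len(dist) < len(club) or max(dist.values()) > k:
            return False
    return True


def score(adj, weight, club):
    """Assumed scoring: sum of weight / (1 + distance to the club) over reachable nodes."""
    dist = bfs(adj, club)
    return sum(weight[v] / (1 + d) for v, d in dist.items())


def greedy_k_club(adj, weight, k):
    club, candidates = set(), set(adj)  # empty club, every node is a candidate
    while candidates:
        # sorted() makes ties deterministic: the lowest label wins.
        best = max(sorted(candidates), key=lambda v: score(adj, weight, club | {v}))
        club.add(best)
        # Keep only neighbors of the club whose addition preserves the diameter bound.
        candidates = {
            v for u in club for v in adj[u]
            if v not in club and is_k_club(adj, club | {v}, k)
        }
    return club


# Toy instance (hypothetical): a path 0-1-2-3-4 with one heavy end node.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
weight = {0: 1, 1: 1, 2: 1, 3: 1, 4: 5}
print(greedy_k_club(path, weight, k=2))  # {2, 3, 4}
```

On this toy path the club starts at the heavy node 4 and grows toward the rest of the network until a fourth node would push the induced diameter past 2.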
Example 1. Assume that we have the graph of Figure 3 with weights , , and we are looking for a club. Initially, contains all 7 nodes and is empty.
Starting from node , we see that it is located at a distance of 0 from itself, a distance of 1 from nodes 2 and 3, a distance of 2 from node 4, a distance of 3 from node 5, and a distance of 4 from nodes 6 and 7. Hence, we have that In the example, it is easy to see that exactly the same is true for nodes , , and .
Similarly, for nodes and , we have . Finally, for node , we have that . Hence, we update to include () and the candidate list to include all nodes in the open neighborhood of , such that their distance to is less than or equal to . . We are now ready to start the second iteration.
For node we now have the following distances from to : nodes and are located one hop away, nodes and are zero hops away, node is also one hop away, and nodes and are two hops away. Hence, we have that . The key realization here is that the distances are no longer between the candidate node and every other node in the graph, but instead between including the candidate node and every other node in the graph. We also note that node will have exactly the same ratio, by construction of the example. Let us add node to (hence, ), and
For the third iteration, we have: and Hence, is added leading to . Now, observe that , but adding any of those nodes leads to a distance of hops within : hence, , and Algorithm 1 terminates with .
5. Computational Results
The developed algorithm and optimization model were implemented in Python and all numerical experiments were conducted on a Lenovo laptop with an Intel GHz quadcore processor and GB of RAM. To diversify the experiments and fully explore the behavior of the proposed algorithm as well as the optimization approach, two different sets of instances were considered. The first set of instances consists of Watts–Strogatz smallworld graphs with a varying number of nodes, edges, and diameter (stylized as –). The second group are three cities (Sioux Falls, Eastern Massachusetts/EMA, and Berlin) from a networks repository for transportation research [45]. In Table 1, we present the computational times as well as information for each network (such as the number of nodes, the number of edges, and the diameter).
Although the computational time expectedly grows for both the commercial solver and the heuristic as the number of nodes increases, the growth rate is much slower for the heuristic algorithm. This is verified by Table 1 for the two club sizes considered. Note that, with the exception of the Berlin graph, the heuristic approach is on average 3 and 7 times faster than the exact optimization model for the two values of $k$, respectively. The case of the Berlin network is particularly instructive. In this transportation network, the exact optimization fails to find a solution within reasonable computational time, and instead spends hours trying to prove optimality. This happens because the diameter of the graph is large, and the number of binary variables in model (8) becomes prohibitively large.
6. Case Study
In this section, we investigate two case studies from the state of North Dakota, in the cities of Fargo and Casselton. Case studies and real-world visualization are necessary to put the problem in context and understand its implications. However, due to the computational complexity of our problem, reaching a solution within reasonable computing time is challenging. Hence, the exact optimization model (8) was only solved for the (smaller) city of Casselton, whereas for the (larger) city of Fargo, we only present the results of the heuristic (Algorithm 1).
6.1. Data Description
Casselton is a city in the state of North Dakota, with a population of 2,329 in the 2010 census. To the best of our knowledge, there is no bikesharing program planned for deployment in the near future. Figure 4 illustrates the overall geography of the city and the population distribution in proportionally graduated circles.
The network for the city of Casselton was built from TIGER/Line® road data and block population data with ArcGIS 5.0. All roads were converted to sets of vertices and edges representing intersections and road segments, respectively. There are vertices and edges in the resulting graph. The block population polygons were converted to point features for weighting the graph vertices. According to a National Association of City Transportation Officials (NACTO) report [6], to achieve an increase in ridership as well as in overall system utility, bikesharing kiosks should be located no more than 1000 feet apart from one another. Therefore, every single vertex has the potential to become a dockless bike station within 1000 feet. Each vertex is then weighted based on its closeness to the population points.
For the city of Fargo, due to its size, only the greedy heuristic of Algorithm 1 was put to the test. The population of Fargo is 105,545. At the moment, a bikesharing system is in place, with 11 stations at the locations marked with triangles in Figure 5. The same figure also presents the geography of the city and the population in proportional circles. The network for the city of Fargo was obtained in the same way as the one for Casselton. The final graph contains vertices and edges, which is indeed large-scale for the exact optimization solver.
All code for solving the problem, for both the exact optimization model and the greedy heuristic, was written in Python. For solving the optimization model, we used Gurobi 7.5 [46]. We present our findings in the next section.
7. Results
We investigate three measures obtained by both the heuristic and the exact optimization: (1) number of nodes selected in the club (cardinality); (2) population located in the selected nodes (immediate access); (3) distance-weighted cost from all nodes to the club (general accessibility).
The number of nodes in the club represents the desirable, potentially geofenced, sites where a rider could check in/out a bike. The population measure represents the number of residents within the club: they are the ones with immediate access to a location with bicycles. Finally, the distance-weighted cost describes the total distance a commuter (from any location in the network) would have to walk to reach some node in the club to get access to a bike. Therefore, as was also shown in our optimization model, lower distance-weighted costs are preferable.
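The three measures can be computed for any candidate club with a single multi-source breadth-first search. The sketch below is illustrative only. It assumes a connected graph stored as an adjacency dictionary, measures distance in hops, and weights each node's distance by its population. The exact weighting of the objective in model (6) is not reproduced here, and the names `hops_to_club` and `club_measures` are hypothetical.

```python
from collections import deque


def hops_to_club(adj, club):
    """Hop distance from every node to its nearest club node (multi-source BFS)."""
    dist = {v: 0 for v in club}
    queue = deque(club)
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist


def club_measures(adj, population, club):
    """Return (cardinality, immediate-access population, distance-weighted cost).

    Assumes the graph is connected, so every node can reach the club.
    """
    dist = hops_to_club(adj, club)
    cardinality = len(club)
    immediate = sum(population[v] for v in club)
    # Assumption: each node's hop distance is weighted by its population.
    cost = sum(population[v] * dist[v] for v in adj)
    return cardinality, immediate, cost


# Toy path graph 0-1-2-3-4 with made-up populations.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
population = {0: 10, 1: 20, 2: 30, 3: 40, 4: 50}
print(club_measures(adj, population, {2, 3}))
```

Nodes inside the club contribute zero to the cost, so a club with a larger immediate-access population does not necessarily have a lower distance-weighted cost.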
Table 2 summarizes the results for in Casselton. The population represents the number of residents living in the club. The distance-weighted cost is the actual objective function of our optimization model. Finally, time shows the computational time required to solve the problem.
Starting with the population, the population covered by the exact optimization is consistently smaller than the population covered by the heuristic approach. On the other hand, the distance-weighted cost represents the distance that the residents living outside the club must travel to access the designated geofenced areas. As expected, the optimization model offers better results than the heuristic for all clubs obtained. Finally, when looking at the computational time, it becomes clear that even in a small city like Casselton, the exact optimization approach is prohibitively expensive, with taking a little less than 10 hours, and requiring more than 24 hours of computation before it terminates upon reporting a suboptimal solution and an optimality gap of 56.8%. The heuristic, in contrast, is significantly and consistently faster, with only a small, roughly linear increase in computational time as the value of increases.
Figures 6 and 7 present the solutions within the city, and show the sets of nodes selected. Both the heuristic and the optimization approaches suggest groups of vertices located near one another, since the resulting set of nodes forms a club. However, the heuristic approach starts with the most populated point in the city, and expands the set of nodes around that same point as the diameter of the set () increases. On the other hand, the optimization model is more dynamic, as it tries to minimize the overall distance-weighted cost.
[Figure 6, panels (a)–(h): sets of nodes selected in Casselton]
[Figure 7, panels (a)–(h): sets of nodes selected in Casselton]
We note that the heuristic is also inconsistent, as there are cases (see, e.g., vs. ) where the distance-weighted cost of a solution worsens as increases. This happens because the heuristic of Algorithm 1 myopically chooses the “best” candidate node to add so long as it respects the club diameter requirement. Because of this, the population immediately covered is bigger in the solution from the heuristic than in the one from the optimization model. We note though that this is not necessarily good, as it might result in locations where a high number of residents have immediate access to dockless bike sharing, but other residents have to travel very far to access it.
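The myopic behavior described above can be made concrete with a small sketch. This is not Algorithm 1 itself. It is a minimal greedy expansion written under two assumptions: the seed is the most populated node, and the "best" candidate is the most populated neighboring node whose addition keeps the set connected with diameter at most k inside the set. All function names are hypothetical.

```python
from collections import deque


def bfs_hops(adj, source, allowed):
    """Hop distances from source, using only nodes in `allowed`."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w in allowed and w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist


def is_k_club(adj, nodes, k):
    """True if the subgraph induced by `nodes` is connected with diameter <= k."""
    nodes = set(nodes)
    for s in nodes:
        dist = bfs_hops(adj, s, nodes)
        if len(dist) < len(nodes) or max(dist.values()) > k:
            return False
    return True


def greedy_club(adj, population, k):
    """Grow a k-club from the most populated node, one node at a time."""
    seed = max(adj, key=lambda v: population[v])
    club = {seed}
    while True:
        frontier = {w for u in club for w in adj[u]} - club
        feasible = [w for w in frontier if is_k_club(adj, club | {w}, k)]
        if not feasible:
            return club
        # Myopic step (assumption): take the most populated feasible node.
        club.add(max(feasible, key=lambda v: population[v]))


# Toy path graph 0-1-2-3-4 with made-up populations.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
population = {0: 10, 1: 20, 2: 30, 3: 40, 4: 50}
print(greedy_club(adj, population, 2))
```

Because each step looks only at the next node to add, the sketch never revisits its seed. This is one way a larger k can produce a club with high immediate coverage but a worse distance-weighted cost than a club centered elsewhere.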
In the case of Fargo, we only applied the heuristic algorithm to validate our model, as optimizing for the values of that would be meaningful resulted in running out of memory. Figure 8 illustrates the club heuristic solutions for Fargo, for . The potential sites are located in a highly populated area next to the university campus. The existing bike stations already in operation in Fargo are only blocks away from the suggested club. Table 3 summarizes the numerical results. Because Fargo has a larger overall population per block, the corresponding numbers in the table are much larger than the ones for Casselton.
[Figure 8, panels (a)–(d): club heuristic solutions for Fargo]
8. Cost-Benefit Analysis
Equipment, installation, and maintenance are three significant costs involved in implementing a bikesharing program. The main drawback of physical bike station systems (known as kiosk systems) is their high acquisition and operating costs. Stations cost tens of thousands of dollars to manufacture and install, along with several thousand dollars to acquire customized bikes. Moreover, kiosk systems mandate constant bike rebalancing. This happens because every bike needs to be returned to a kiosk: if the kiosk is full, the riders must find another location with available spots, resulting in higher operational cost and a decrease in customer satisfaction.
The cost of each bike is estimated at $1,234 [47]. Assuming a cost of $1,000 on average for each bike, the cost for a typical kiosk with docks will range from $29,000 to $34,000, excluding operating costs. Figure 9 shows the relationship between the cost and the number of docks. These figures are even higher at the planning stage ($55,000 per station) [48]. The optimal number of docks is another critical factor in a bikesharing program. Increasing the number of docks leads to higher customer satisfaction, but it also leads to higher costs and a pileup of bikes in one location, which consequently results in higher rebalancing costs. The dockless option would at least avoid the initial capital investment and pave the way for introducing bikesharing programs to cities, without sacrificing customer satisfaction with the program.
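To give a sense of scale, the per-station figures quoted above can be multiplied out for a given number of stations. The snippet below is a back-of-the-envelope sketch only. Applying the per-kiosk range to the 11 stations currently operating in Fargo is an assumption made for illustration, not a result of this study, and operating and rebalancing costs are excluded.

```python
# Per-station figures quoted in the text (USD), excluding operating costs.
KIOSK_COST_LOW = 29_000
KIOSK_COST_HIGH = 34_000
PLANNING_COST_PER_STATION = 55_000


def capital_cost_range(num_stations):
    """Return (low, high, planning-stage) capital cost for `num_stations` kiosks."""
    return (
        num_stations * KIOSK_COST_LOW,
        num_stations * KIOSK_COST_HIGH,
        num_stations * PLANNING_COST_PER_STATION,
    )


# Illustrative assumption: 11 stations, the number currently in Fargo.
low, high, planning = capital_cost_range(11)
print(f"kiosk range: ${low:,} to ${high:,}; planning-stage estimate: ${planning:,}")
```

Under these assumptions, the docked capital outlay scales linearly with the number of stations. It is this outlay that a geofenced dockless area would largely avoid.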
9. Conclusions
In this work, we discussed a new paradigm for selecting where a dockless (geofenced) bikesharing system should be enabled within an urban area. This paradigm addresses the disadvantages of kiosk-based bikesharing programs, such as high equipment costs and costs associated with customer dissatisfaction due to a lack of bikes/docks at the desired location. The proposed model also offers a better solution to existing dockless problems.
We modeled our problem as one of detecting a connected set of nodes of restricted diameter (that is, where any two nodes are reachable within hops using nodes inside the set), or a club. The goal was to find a club of maximum closeness, so as to make sure that all other nodes in the transportation network are close enough to the bikesharing locations. We showed that, as expected, the problem is hard, and provided an integer programming formulation to solve it. We also proposed a greedy heuristic, which is computationally inexpensive. As increases for the obtained club, we should expect the coordination costs to increase alongside it, as greater values of imply larger geofenced areas. From a practical perspective, BSS operators would have to trade off the size of the geofenced area (the larger, the more easily accessible and more convenient to users) against the rebalancing costs (the smaller, the more easily coordinated and cheaper for BSS operators).
We also used our methods to study the resulting setup in two cities of the state of North Dakota, Casselton (of smaller population) and Fargo (of bigger population). The potential cost savings of the dockless approach could decrease the initial capital investment for introducing a bikesharing program in a city. The approach also increases the number of virtual docks (capacity) without blocking streets or pedestrian walkways. One might say that dockless bike sharing brings chaos to cities, due to the freedom of allowing bike check in/out anywhere in a geofenced area. Our approach could mitigate this situation and leverage the dockless alternative by enabling this capability only in selected areas. At the moment, the model uses population as the only location weight.
Future directions for our work include the following. First, we could investigate the identification of multiple clubs of varying sizes within a city. This would allow BSS operators to have multiple smaller geofenced areas or fewer larger geofenced areas to cover all bikesharing demands. As a second direction, we should consider more ways to build the weight parameter in our framework. For example, we plan to investigate how club formation and the geofenced areas change as we consider city points of interest, distance to nearby transit points, and origin-destination demands throughout the day, among others. Finally, another avenue for future research would be to investigate more closely the interactions between different operators (e.g., dockless bike sharing and scooter sharing, or dockless bike sharing and public transit) with respect to different geofenced areas.
Data Availability
The geospatial data used to support this study are made available from the North Dakota GIS Hub Data Portal at https://gishubdata.nd.gov. The processed data resulting in the graphs produced are available upon request by AliRahim Taleqani at ali.rahimtaleqani@ndsu.edu. Previously reported networks were also used to support this study and are available at https://github.com/bstabler/TransportationNetworks. The dataset is cited at a relevant place within the text as reference [45].
Conflicts of Interest
The authors declare that they have no conflicts of interest.