#### Abstract

A good timetable is required to not only be efficient, but also yield effectiveness in preventing and counteracting delays. When travelling via urban rail transit networks, transferring passengers may miss their scheduled connecting train because of a feeder train delay that results in them experiencing increased travel costs. Considering that running time supplements and transfer buffer times yield different effects on the travel plans of transferring and nontransferring passengers, we formulate an expected extra travel cost (EETC) function to appropriately balance efficiency and robustness, which is then implemented in the construction of a robust transfer optimization model with the objective of minimizing the total EETC. Next, to improve the computational efficiency, we propose an approximate linearization approach for the EETC function and introduce two types of binary variables and auxiliary substitution variables to convert the nonlinear model to a mixed-integer linear model. Experimental results show that our proposed method can yield practically applicable solutions with significant reductions in both EETC and probability of missing a transfer.

#### 1. Introduction

Concurrently optimizing train timetables and enhancing robustness is the main problem on the operational level of railway systems; in general, optimization aims to prevent and counteract delays such that the deviations of the original timetable are as small as possible. Currently, there are four general methods to improve the robustness of train timetables: adding recovery time, decreasing capacity for a given traffic pattern, decreasing heterogeneity, and decreasing average speed [1]. For an urban rail transit system with fixed capacity, the most widely used method is adding recovery time that includes a running time supplement and dwell time supplement, as shown in Figure 1. The standard running time is the ideal running time required for a specific train to travel from one station to another. The running time supplement is primarily utilized to offset train delays occurring during operation. The standard dwell time is the dwell time necessary for passengers to board and alight. The dwell time supplement is primarily purposed to manage fluctuations in passenger volume at stations and to ensure that trains depart on schedule. Besides, the headway buffer between two consecutive trains, the difference between the headway and minimum headway, enables the second train to run at unrestricted speed, which has a direct influence on the capacity of urban rail transit system. Landex [2] held that the more trains operated on a given railway line, the higher the risk to encounter delays. The capacity issue is a systematic and complex problem on the crowded urban rail line.

Because trains are operated within a network, a single train delay can affect other lines via transfers; consequently, this delay may postpone the departure of scheduled connections. Missing a transfer is a crucial measure in the evaluation of network timetable performance, as has been mentioned in many research studies, such as Dollevoet [3], Corman, D’Ariano, and Hansen [4], and Parbo, Nielsen, and Prato [5]. Furthermore, because the headway between trains on suburban rail transit lines is typically longer for the majority of the day, missing a transfer implies a long mean waiting time for transferring passengers that significantly affects the level of service. Therefore, with the expansion of suburban rail transit networks in many large cities, it has become essential to optimize the robustness of transfer timetables in order to provide high-level service and obtain satisfaction with operations.

A running time supplement and transfer buffer time can each yield improvements in the robustness of the train timetabling problem with respect to transfer connections. However, in terms of a connection, for nontransferring passengers, their travels are only affected by the running time supplement while the travels of transferring passengers are affected by both the running time supplement and transfer buffer time, a phenomenon that is herein referred to as the “double-buffer effect.” The double-buffer effect is illustrated in Figure 2. Most researchers are dedicated to optimizing timetable robustness by adjusting only the running time supplement. It seems unfair for nontransferring passengers because they potentially need to wait more time that is in effect the buffer for transferring passengers. However, transfer buffer time is specially used to buffer for transferring passengers and would not hold up the travels of nontransferring passengers. Alternatively, based on the notion of the double-buffer effect, the first contribution of this study is optimizing timetable robustness via simultaneous adjustment of the running time supplement and transfer buffer time (we do not consider the dwell time supplement addressing the demand fluctuations at station in this paper). Accordingly, depending on the value of time (VOT) and distribution of train delays, an expected extra travel cost (EETC) function of all passengers (transferring, passing, departing, and arriving passengers) can be formulated as a nonlinear function of the running time supplement and transfer buffer time.

The EETC is a probabilistic weighted function of all cases of delay to occur that incorporates the cases of successful and missed transfers with respect to transferring passengers and the cases of early-arriving and late-arriving with respect to nontransferring passengers. It is clear that both the efficiency and robustness have an effect on EETC in practice. Therefore, a robust transfer optimization model is subsequently developed by minimizing the total EETC throughout a network, which is the second contribution of this study. Implementing the total EETC yields the following three advantages: a more accurate description of the effects of buffer time, an appropriate compromise of efficiency and robustness to avoid too small or too large buffer times, and a coordination of all of the relevant buffer times throughout a network as a whole.

However the nonlinearization of the objective function makes solving the optimized model complex and difficult. Hence, the third contribution of this study is putting forward an improved approximate linearization approach that, respectively, linearizes the EETC function for transferring and nontransferring passengers. In addition, a further linearization of the objective via introduction of binary variables and auxiliary substitution variables is proposed to entirely convert the optimized model to a mixed integer linear programming model. The improved approach takes into account both the difference between the effects of buffer time on transferring and nontransferring passengers and the fact that the running time supplement and transfer buffer time yield equivalent effects on transferring passengers. Finally, an experimental analysis is conducted to demonstrate the performance of our proposed methodology.

The remainder of this paper is organized as follows: Section 2 provides literature review; Section 3 provides the EETC function formulation; Section 4 establishes a robust transfer optimization model; Section 5 proposes an approximate linearization approach with respect to the characteristics of local EETC and demonstrates the secondary linearization; Section 6 provides experimental analyses; and Section 7 provides the conclusions and future research directions.

#### 2. Literature Review

Generally, optimization of train timetabling can be performed from the perspective of robust train timetabling or nominal train timetabling. However, in many cases, the latter, and most efficient, solution yields worse performance in actual operation [6]. For example, although synchronization timetabling aims to generate timetable solutions to reduce passenger transfer waiting time as much as possible, in the case of synchronization, once a delay occurs, transferring passengers will miss their scheduled connecting train. In fact, transfer waiting time can be used as a buffer in robust level. When required to choose one among several efficient solutions, operators prefer to implement a more robust solution rather than the most efficient solution. Furthermore, a discrepancy exists between how train-oriented operations are planned with the main focus being on the trains and how passengers actually perceive and respond to railway performances. Hence, researches on robust train timetabling can be conducted with the train-oriented and passenger-oriented focus [5].

##### 2.1. Train-Oriented Robust Train Timetabling

Traditionally, the focus in robust train timetabling is train-oriented, aiming to ensure the on-time performance of the trains. Kroon et al. [7] proposed a two-stage stochastic programming approach to minimize the average weighted delay of all trains by allocating a time supplement and buffer time. Jovanović et al. [8] modelled the allocation of slack times as a knapsack problem, where the profit for inserting a buffer corresponded to the importance of reducing delay propagation. Alternatively, Fischetti and Monaci [9] introduced the concept of light robustness by allowing deterioration of the nominal timetable objective function value. A set of slack variables was used to measure the robustness of solution, and their sum was minimized in the objective function. D’Ariano, Pacciarelli, and Pranzo [10] referred to the flexible timetable in which fewer details are fixed in schedule and dynamic traffic management was herein to solve the conflicts during operations, and it allowed more freedom to the real-time control. Liebchen et al. [11] presented the concept of recoverable robustness that a solution was recovery robust if it could be recovered by limited means in all likely scenarios. D'Angelo et al. [12] and Goerigk and Schöbel [13] also applied this concept of robustness. Besides, when very frequently delays arise, the delay management should be taken into account at an operational level. In consideration of the efficiency in real-time rescheduling, Samà et al. [14] were dedicated to reducing the related train delays and developing efficient algorithms to fast rescheduling trains. Andersson, Peterson, and Krasemann [15] presented an optimization approach to reduce the propagation of delays by introducing a more efficient margin time allocation in the timetable.

The above research mainly focused on the robustness, and the efficiency of timetable was generally addressed as a constraint on the maximum efficiency loss; or the robustness optimization aimed at enhancing the ability to absorb minor disruptions and the recoverability of the schedule; however, they would get trouble in describing the size of “minor” delays. Some researches were conducted with the motivation of finding robust yet efficient solutions. Schlechte and Borndörfer [16] proposed a bicriteria optimization approach by considering the robustness and efficiency to be incomparable entities. Cacchiani, Caprara, and Fischetti [17] approximately solved a relaxation of the timetable through an iterative Lagrangian optimization scheme capable of dealing with robustness. During the optimization process, a set of, roughly speaking, “Pareto optimal” heuristic solutions could be collected. Lee et al. [18] developed a simulation-based timetable robustness and efficiency heuristic approach by changing the penalization threshold for the total time supplement. Generally, the efficiency depended on the amount of time supplement, and the robustness was related to the unscheduled delay. Although these studies provided the Pareto optimal solutions to numerically establish the relationship between robustness and efficiency, they still failed to find a compromise and balance between the robustness and efficiency, because the train-related measurements are of different dimensions for the robustness and efficiency. Some researchers introduced the weighted value for the different objective. However, it is too arbitrary and subjective in this way.

##### 2.2. Passenger-Oriented Robust Train Timetabling

Without knowing or monitoring the exact impact on the passengers’ travel experience, train-oriented robust train timetabling would not take the passengers explicitly into account. However, the optimal timetabling for the operators could be far from optimal for the passengers with the reasons being cancelled trains, missed transfers, or route choice adaptations [19]. Recent studies have emphasized that exhaustive passenger-oriented measurements are needed, and maintaining transfers is often the main concern.

Many researchers conducted delay management studies that required operators to make a decision on whether or not to keep a connection when trains were delayed. In this regard, Kanai [20] combined the simulation and optimization to minimize passengers’ dissatisfaction. The decision of keeping or dropping connections was solved by a tabu search algorithm and evaluated by a passenger simulation model. Corman et al. [21] developed a biobjective delay management strategy to minimize the number of missed connections and avoid train conflicts when rescheduling. Nesheli and Ceder [22] took holding, skip-stops, and short-turn control statics into account to minimize total passenger travel time and maximize direct (without waiting) transfers real-time control action.

The delay management problem is for a fixed plan and a fixed scenario of delays, while the delay resistant timetabling optimizes with respect to all scenarios of delays [23]. Liebchen and Stiller [24] optimized the allocation of a certain buffer budget to achieve the maximum resistance against delays, which simplified the distributions of delay on each arc to two values, a fixed delay with a certain probability* p* and no delay with the probability* 1-p*. In this case, the whole system had billions of different delay scenarios. Schöbel and Kratz [25] established a bicriteria model built on a set of determined delays and proposed a Pareto optimization technique for the balance of the efficiency and robustness. The timetable robustness was defined as the largest possible passenger delay such that all transfers were maintained under some given strategy. Liebchen et al. [23] modelled the periodic event-activity networks with an expansion of a driving and waiting activity to (three) different scenarios where the weight of transfer arcs was relative to probabilities of the scenarios. Even though the above research attempted to allow for all scenarios of delays, the delay distribution was in fact assumed as discretized, finite, and determined beforehand, and the optimized objective was weighted by a set of scenarios where the delay of driving and waiting activities had several possibilities. Alternatively, Vansteenwegen and Van Oudheusden [26, 27] introduced an expected passenger waiting time function of running time supplement as the objective to weigh total minutes spent waiting across all passengers for purpose of the appropriate allocation of running time supplement, in which the transfer buffer time was assumed to be zero. The function emphasized the effects of missed transfer by setting a larger value of time. Dewilde [28] and Sels et al. [29] also applied this concept of robustness, which allowed for all cases of delay. However, the researches failed to recognize the effects of transfer buffer time and the double-buffer effect between running time supplement and transfer buffer time. Conversely, Goverde [30] focused on the effects of the transfer buffer time to formulate the expected waiting time for transferring passengers while omitting the effects of the running time supplement.

In summary, the previous research in the field of robust train timetabling is typically done by allocating the time supplements. Even when it comes to reducing the risk of missing transfer connections, the effects of transfer buffer time are often overlooked. Recent researches attempt to cope with the issues but are too rough. The delay distributions are assumed as several possibilities and determined beforehand in regard to the practicability of model. Alternatively, the EETC is weighted by the probabilities of all cases of delay and can be embedded in many robustness researches. To the best of our knowledge, our paper is the first attempt to explicitly describe and distinguish the effects of the running time supplement and transfer buffer time to formulate an EETC. To optimize one of them or regard them as single combined buffer times would make the measured objective unfair when applied to all connections within a network in terms of the double-buffer effect. The travels of nontransferring passengers only necessitate consideration of the running time supplement and are thus exclusive of transfer buffer time. Based on the difference, an improved approximate linearization approach is proposed.

#### 3. Expected Extra Travel Cost (EETC) Function

Weston et al. [31] held that, because of missed transfers, minimizing train delay did not necessarily minimize passenger delays. Therefore, from the perspective of passengers, a guarantee of reliable transfers is crucial for urban rail transit systems. In this paper, we formulate the expected extra travel cost (EETC) function to explicitly describe the passengers’ perception of trip.

##### 3.1. Distribution of Train Delay

Note that the train delay time is defined as the difference between actual running time and standard running time in this paper. Generally, the negative exponential distribution has often been used to model the train primary delay in the literature [26, 27, 29, 32]. Moreover, several statistical analyses of empirical data have also confirmed this hypothesis [33–35]. The probability density function and distribution function can be expressed aswhere is the primary delay time and is the mean delay time.

Here, we denote the train delay time as , the running time supplement as , the transfer buffer time as , and the headway between trains on connection line as . Three distinct cases of actual running time are defined in accordance with the respective time horizon, as illustrated in Figure 3.

*Case 1 (). *In this case, the connecting train should remain at the station for a period of time to depart at the scheduled departure time.

*Case 2 (). *In this case, the transfer buffer time affords transferring passengers time to catch their scheduled connecting train.

*Case 3 (). *In this case, because the delay time exceeds the running time supplement and the buffer time, transferring passengers will miss the scheduled connection and must wait for the next connecting trains.

From the perspective of train operation, Case 1 is the early-arriving circumstance, and Cases 2 and 3 are the late-arriving circumstances, which result in scheduled delays. However, in terms of transfers, Cases 1 and 2 result in successful transfers, while Case 3 results in a missed transfer.

##### 3.2. Expected Extra Travel Cost Function

Passengers at a single station are categorized as follows according to their type of travel relative to the station: transferring, passing, departing, and arriving passengers. The EETC function for passengers of each type is a probabilistic weighted value depending on the VOT and distribution of train delays in practice. In this paper, the standard travel time equals standard in-vehicle time that includes standard running time and standard dwell time. The extra travel time includes the extra in-vehicle time (the difference between actual and standard in-vehicle time) and the extra waiting time on platform. A substantial amount of research has been carried out to determine the value of time (VOT) in various circumstances [36–39]. Given that each passenger has different feelings concerning the scheduled and unscheduled travel time, we refer to these research results and, respectively, set the VOT for cases of scheduled and unscheduled, as indicated in Table 1.

Here, we define the VOT for the standard in-vehicle time as equal to one. The VOT for the scheduled extra in-vehicle time is equal to 1.5, which means that passengers value 1 min. of the scheduled extra in-vehicle time as 1.5 min. of the standard in-vehicle time. As an unscheduled extra waiting yields undesirable effects on passenger travel, the VOT for the unscheduled extra in-vehicle time is set to 2.0. The VOT for the scheduled and unscheduled waiting time are similarly set to 2.0 and 2.5, respectively. Because missing a transfer yields more significant adverse effects, the VOT for the waiting time due to a missed transfer is 2.7, which is a value greater than those previously mentioned. This increased VOT is justified by the extended waiting time that would be attributed to the long headway between connections.

###### 3.2.1. Transferring Passenger

The total buffer time for a transferring passenger is the sum of the running time supplement and transfer buffer time. If delay time is less than the total buffer time , transferring passengers are able to board the scheduled connecting train, which departs according to schedule. In this case, the extra travel time utilized by transferring passengers consists of the scheduled extra in-vehicle time and the scheduled extra waiting time . Then, the EETC for each transferring passenger can be calculated as

Conversely, when delay time is greater than the total buffer time , transferring passengers miss the scheduled connection and must wait for the next connecting train. Thus, in this case, transferring passenger extra travel time comprises the scheduled extra in-vehicle time , the unscheduled extra in-vehicle time , and the waiting time because of the missed transfer . Then, the EETC for each transferring passenger can be calculated as

###### 3.2.2. Passing Passenger

The trips made by passing passengers only necessitate consideration of the running time supplement and are thus exclusive for transfer buffer time. When delay time is less than the running time supplement and the feeder train departs according to the scheduled timetable, the extra travel time of passing passengers is always equal to the running time supplement . Then, the EETC for each passing passenger can be calculated as

Accordingly, when the delay time is greater than the running time supplement , the extra travel time of passing passengers comprises the scheduled extra in-vehicle time and the unscheduled extra in-vehicle time . Then, the EETC for each passing passenger can be calculated as

###### 3.2.3. Arriving Passenger

For arriving passengers, when the delay time is less than the running time supplement , the extra travel time is equal to the delay time . Thus, the EETC for each arriving passenger can be calculated as

Additionally, in the case of a late-arriving train, the extra time of arriving passengers is equivalent to that of passing passengers; the EETC in this case can be calculated as

Besides, in urban rail transit network, there are some specific stations that are accessed by feeder bus, and the arriving passengers can transfer to bus network to go to their destination. In this case, the EETC is supposed to take into account the feeder bus timetable. In terms of transfers involving bus transit lines, passengers are also likely to encounter a missed transfer, thus, and there are two circumstances: catching and missing the scheduled feeder bus. The calculation of EETC can be implemented similar to transferring passengers.

###### 3.2.4. Departing Passenger

In the case of an early arrival, as the feeder train departs according to the scheduled timetable, there is no extra travel time for departing passengers; therefore, . However, if delay time is greater than the running time supplement , then the extra travel time of departing passengers equates to the unscheduled extra waiting time . Accordingly, the EETC for each departing passenger can be calculated as

###### 3.2.5. Expected Extra Travel Cost

The EETC is the expected extra travel cost of all cases of delay with the specific probability distribution. For a connection, the local EETC of passengers is the weighted sum of the EETC of all types of passengers, (3)-(9), as expressed via (10). The local EETC is a nonlinear and probability function of the running time supplement and transfer buffer time corresponding to the connection and implies the probability distributions of delay, which is expressed as follows:where ,,, and represent the numbers of transferring, passing, arriving, and departing passengers, respectively.

#### 4. Robust Transfer Optimization Model

Ideally, passengers without any risk-preference expect to minimize their transfer waiting time. However, in case of short transfer time, missing a transfer may occur due to train delays. The focus of robust transfer optimization is to guarantee the transfer reliability at a certain efficient level of transferring. In more detailed ways, the concept of robustness requires the compromise and balance of the efficiency and robustness; in this case, the buffer times should be large enough to compensate for typical delays and not be too large to avoid needless extra waiting or travelling time for all passengers. In this consideration, the EETC function is a probabilistic weighted measurement of all cases in practice.

For a connection, the most desirable solution requires minimization of the local EETC; however, this is infeasible and impractical because of a variety of factors, such as resources, operation safety, and level of service, that affect the scheduled timetable. To overcome this problem, a robust transfer optimization model is developed by coordinating all of the local EETCs within a network as a whole such that practically applicable robust timetables can be obtained. The framework of the model is constructed depending on the topology of urban rail transit network, where the transfer arcs are directed and connected between the feeder trains nodes and the connection trains nodes, as illustrated via Figure 4. The double buffer effect between running time supplement and transfer buffer time on each connection and the interactions between transfer buffer times on all connections would cause the interactions between running time supplements within network, and all the interaction effects determine the optimization and feasibility of the solution and play an essential and crucial role in coordinating all of the connections within the network.

##### 4.1. Assumptions

Here, several assumptions are made throughout the paper for simplicity in the model formulation.

*Assumption 1. *The transfer walking time is assumed constant and fixed for all transferring passengers without allowing for the difference of passengers’ age, sex, packages, etc., which mainly depends on the walking distances at transfer station and can be obtained by actual surveys.

*Assumption 2. *The path choices of passengers are known and fixed without considering the slight adjustment of the timetable plan. Although there may be alternative lines in the system to complete the travel, we do not consider them in this paper.

*Assumption 3. *The passenger flows are evenly distributed within a short time period. It is rather difficult to count the exact number of passengers getting on a train due to the randomness of the arrival time of passenger and the differences of headway between trains.

*Assumption 4. *The train person capacity is sufficient to accommodate all passengers, and passengers will always ride the first arriving train after reaching the connection platform. Although passengers may potentially encounter congestion delays at transfer stations in some specific cases, it is troublesome to formulate the passenger flows and timetable information, and thus we ignore the congestion effects.

##### 4.2. Nomenclature

Model parameters are defined as follows: : the set of lines in a network, , where is the total number of lines : the set of trains on line , , where is the total number of trains on line : the set of stations on line , , where is the total number of stations on line : the set of connections in a network, , where denotes train on line at station , and train on line at station is a connection : the set of connection origins, : the minimum and maximum headway on line , respectively : standard running time of train on line from station to station : transfer walking time on connection

Variables of the proposed model are defined as follows: : arrival time of train on line at station : departure time of train on line at station : dwell time of train on line at station : running time supplement of train on line from station to station : transfer buffer time for connection

##### 4.3. Constraints

###### 4.3.1. Resources Constraints

Generally, train operation follows a cyclic pattern of events: departure, running, arrival, and dwelling. Because of the continuity of these events, the departure time of a train at a station is the sum of the arrival time and dwell time at this station, as modeled via (11); meanwhile, the arrival time of a train at a station is the sum of the departure time at the first station, the total running time of the train, and dwell time at the previous station, as modeled via (12).

Accordingly, modeling transfers requires consideration of the arrival time of the corresponding feeder train, transferring passenger walking time at transfer station, transferring passenger waiting time, and departure time of the connecting train. The transfer waiting time can be used as a buffer to prevent and counteract the effects of feeder train delays; therefore, the transfer buffer time is the difference between the departure time of the connecting train and the sum of the arrival time of the feeder train and transfer walking time, as modeled via

###### 4.3.2. Operational Constraints

The constraint defined via (14) imposes a minimum headway between departures of consecutive trains to guarantee operational safety, and a maximum headway to maintain the standard level of service. Similarly, the constraint defined via (15) ensures that the dwell time is between a lower bound and an upper bound . Besides, the departure times of the first train and the last train at first station on each line should be covered by the related planning horizon of each line, as demonstrated via (16).

While the running time supplement is able to prevent delay propagation, an excessive supplement will reduce passenger satisfaction and the efficiency of rolling-stock usage. Thus, the acceptable range for the running time supplement is delimited below in (17); in addition, the constraint presented via (18) limits the total travel time of a train defined as the time travelling from the originating terminal station to the destination terminal station, which avoids inserting too much running time supplements in a line.

##### 4.4. Objective Function

The EETC function is weighted with the probabilities of all cases of delay to occur that incorporate the cases of successful and missed transfers with respect to transferring passengers and the cases of early-arriving and late-arriving with respect to nontransferring passengers, which should be a compromise between travel time extension and delay resistance risk. It is formulated based on the probability distributions of delay. The aim of robust transfer optimization is to decrease the EETC as much as possible by adjusting the running time supplement and transfer buffer time within the boundaries of the constraints described above. For this purpose, we implement the total EETC within a network that can be calculated by the summation of each connection’s EETC (see (10)) as the objective to establish the robust transfer optimization model. Therefore, the objective function is given as follows:

#### 5. Approximate Linearization of Expected Extra Travel Cost

As the objective function is nonlinear, solving the optimized model appears as a complex and difficult task. Thus, it is essential to determine an approach that is able to simplify the problem. Because linear systems typically exhibit features and properties that are considerably simpler, linearization of nonlinear functions is the most commonly used and effective method.

In this section, we first conduct a common and simple piecewise linearization of the local EETC and then discuss its feasibility and analyze the drawback that the approximated error of test points around the minimum local EETC is large and irregularly distributed. Then, to overcome the drawback, an improved piecewise linearization that, respectively, linearizes the EETC function for transferring and nontransferring passengers is proposed. The improved approach takes into account the characteristic of EETC and yields a higher goodness of fit. Finally, a further linearization of the objective via introduction of binary variables and auxiliary substitution variables is implemented to entirely convert the optimized model to a mixed integer linear programming model.

##### 5.1. Piecewise Linearization

The common approximate linearization of a function is conducted via implementing the segmentation according to the general up or down trend of the function at first and then linearizing the function of each segment. A series of numerical tests shows that the local EETC can be approximated as a two-segment piecewise linear function, which yields higher goodness of fit and smaller errors than other types of approximation functions (e.g., continuous quadratic and cubic fitting function). As an example, we assume the following: , with the function of the local EETC and its approximation shown in Figure 5. The red line located near the bottom of the left plot in Figure 5 is the three-dimensional contour of the absolute value of the approximate error. The approximation function comprises two planes and is the approximate reflection of the general trend of the local EETC that is a continuous concave surface. Although clear advantages of piecewise linearization can be observed via the figure, the errors of test points around the minimum local EETC are significantly large.

The reason for this large error is that this approximation approach does not consider the double-buffer effect and thus regards it as the effect of proportional substitution for a certain proportion. More specifically, the effects of a 1 min. running time supplement on local EETC are equivalent to that of a 1.2 min. transfer buffer time. To correct this problem and achieve a higher goodness of fit, an improved linearization technique that takes into account all factors affecting the EETC is proposed.

##### 5.2. Improvement of Piecewise Linearization

As previously mentioned, while both the transfer buffer time and running time supplement affect the EETC of transferring passengers, the EETC of nontransferring passengers is only affected by the running time supplement. More importantly, the effects of each type of buffer time are equivalent for transferring passengers; this suggests that an alternative variable can be used to represent the double-buffer effect. Thus, the local EETC can be divided into two categories: the EETC of transferring passengers and the EETC of nontransferring passengers. These two categories can then be approximately linearized independently of each other, as expressed via (20) and (21) for transferring and nontransferring passengers, respectively.

Figure 6 illustrates the contour plots of the EETC and the improved EETC approximation, which is calculated by performing separate linearization approximations for each of the two categories of EETC; the assumptions applied in the previous example in Section 5.1 were applied (). The red line located near the bottom of the left plot in Figure 6 is the three-dimensional contour of the absolute value of the approximate error. From the figure, it can be determined that the independent linearization technique yields a better goodness of fit, which is evidenced by decreased error about the minimum local EETC. The contour of the absolute value of the error is observed as constrained, regular, and ordered, thereby suggesting that the improved piecewise linearization technique can provide a more accurate description of the up or down trends of the local EETC. These results demonstrate that the linearization technique presented in this paper effectively represents and explains the characteristics of the EETC function.

In summary, the ordinary approximation approach simplifies the relationship between the effects of the running time supplement and transfer buffer time, which causes the large and irregularly distributed errors of test points around the minimum local EETC. However, the improved approach takes into account both the difference between the effects of buffer time on transferring and nontransferring passengers and the fact that the running time supplement and transfer buffer time yield equivalent effects on transferring passengers. In this way, even though the errors do exist, the trend, up or down, of the local EETC can be captured with reasonable accuracy.

##### 5.3. Objective Conversion

Although the linearization approximation technique proposed here yields improvements, the EETC of nontransferring passengers still pose a problem, as it may be included in computations more than once, as demonstrated in Figure 7. More specifically, Figure 7 illustrates Train A as having two connections at the transfer station: Connections 1 and 2. In this case, the EETCs of nontransferring passengers are identical for the two connections. Considering this, if the total EETC is a simple sum of local EETCs, although the EETCs are identical and therefore need to only be considered once as a single EETC, they will receive duplicate consideration in computation. This concept is also applicable in the cases of nontransferring passengers on Trains B, C, and D. When three or more lines intersect, it could be coped with in the same way and the EETC of nontransferring passengers only needs to be counted once.

Therefore, to avoid duplicated consideration, we define a set of transfer connection points of origin , , to calculate the EETC of transferring and nontransferring passengers separately. In this consideration, the objective function (19) can then be expressed by formula (22).

Although the function has been linearized, because remains to be a nonlinear function, the optimized model is still nonlinear. Considering this, several binary variables and auxiliary substitution variables are introduced to fully linearize the objective function (22), which can facilitate the optimized model solving.

###### 5.3.1. Approximation Function

The constraint presented as (23) activates the binary variable . If , is equal to 0; otherwise it is equal to 1. Then, the approximation function for nontransferring passengers can be expressed via a piecewise function, as presented in (24), where* M* is a large positive integer.

Because (23) is a nonlinear function with a cross product of decision variables, we introduce auxiliary substitution variables to construct (25) and (26) for subsequent substitution into (23). In addition, the approximation function can be substituted with .

###### 5.3.2. Approximation Function

Although is a function with two variables, can be taken as a new and alternative variable because they each yield identical effects. Similarly, we can formulate (27)–(29) by introducing binary variables and auxiliary substitution variables , . The approximation function can then be substituted with .

###### 5.3.3. Approximation of Objective Function

Subsequently, the objective function can be expressed as (30), which then enables conversion of the proposed model to a mixed-integer linear programming model that can be solved via any linear programming software, such as Cplex or Gurobi. The variables of the optimized model are separated into three different types: timetable variables , , , , and ; binary variables and ; and auxiliary substitution variables , , , and .

#### 6. Experimental Simulation

##### 6.1. Description

In this section, a test network comprising four bidirectional subway lines (8 unidirectional subway lines) and 5 transfer stations is created and used to validate the performance of our proposed model and the improved linearization approach (Figure 8). In addition, the primary train delays are assumed to yield a negative exponential distribution, and the mean delay times are approximately one-fifth of the standard running times. The mean headways for Lines 1, 2, 7, and 8 are 15 min., while those for Lines 3, 4, 5, and 6 are 8 min.

##### 6.2. Comparison Schemes

In order to demonstrate the advantages of the proposed method, of which the resulting optimized timetable is designated as RobT, we design two other types of buffer schemes for the purpose of comparison: the proportional buffer scheme (k%-ProT) and the optimal buffer scheme that consider only the effects of the running time supplement (RobT-S).(1)k%-ProT: for actual operation, the commonly implemented method to improve timetable robustness and absorb delay is to define the running time supplement as a certain proportion (e.g., 4%–10%) of the standard running time. For the k%-ProT scheme, the running time supplement is* k* percent of its standard running time. This is a simple method.(2)RobT-S: this RobT-S scheme is based on the commonly applied concept of adjusting only the running time supplement and is subsequently solved by minimizing the EETC, which is determined by assuming the transfer buffer time is zero in the objective function.

##### 6.3. Optimized Results

The optimal scheme and comparison schemes are all generated via C#, and computational results are presented by using the integer linear solver Cplex 12.5 on a desktop computer (Intel Pentium CPU G3240 3.1 GHz, 4 GB RAM). The EETC and App_EETC are the actual and approximate EETC of all passengers within the network, respectively. Besides, 1000 simulation scenarios are randomly generated based on the delay probability distributions. The proportion of missed transfer passengers (PMTP) for each buffer scheme is computed via discrete-event simulation and can be used as a measure to evaluate the timetable. The optimized results are shown in Table 2.

Among these buffer schemes, the RobT scheme is the most efficient, as it yields the minimum EETC and a small and acceptable PMTP. By comparison, the RobT-S scheme exhibits inferior performance with respect to both EETC and PMTP; this indicates that the efficiency and robustness of the timetable can be improved by appropriately allocating the running time supplement and transfer buffer time. On the other hand, the RobT scheme yields a fine approximate error (3.601-3.432)÷3.432=4.92%, while the RobT-S scheme does badly in approximation with an approximate error (5.364-4.458)÷4.458=20.32%. This noticeable error stems from two reasons: ignorance of the effect of transfer buffer time that generates fundamental deviation from actual EETC and further limits in approximation method that produces approximate error. On the whole, even though the approximate error of RobT scheme is not small enough, it is still enough to provide superior description of the trends of optimization objective.

In addition, the robustness of the k%-ProT schemes increases with increased proportion k%. When k = 0%, the PMTP is substantially high and the EETC is also relatively high. However, although the PMTP is reduced to the smallest value for k = 20%, the EETC still remains high; this is indicative of the fact that a tiny and an excess of buffer time are both uneconomical.

In summary, because the k%-ProT schemes disregard passenger characteristics and the effects of buffer time, it is unable to appropriately balance efficiency and robustness. Moreover, the RobT-S scheme does not consider the double-buffer effect, thereby yielding a less practical EETC. Furthermore, our proposed model can significantly improve the robustness of the timetable by concurrently adjusting the running time supplement and transfer buffer time, while yielding only a small PMTP.

###### 6.3.1. EETC of Nontransferring and Transferring Passengers

Figure 9 shows the EETC histogram for transferring and nontransferring passengers. As can be seen, for k%-ProT schemes, with respect to increasing proportion* k*, the EETC of nontransferring passengers exhibits a trend of initial decrease and subsequent increase. This trend occurs because a moderate running time supplement can improve the timetable flexibility that allows more freedom at the operational level to arrive or depart flexibly and compensate for typical delays. As there is a higher VOT under the delay circumstances, the timetable flexibility can bring about a reduction in the EETC of nontransferring passengers. However, when the proportional value of the standard running time exceeds a certain value, it will become excessively and needless redundant and reduce the efficiency of the timetable, thereby leading to an increase in the EETC of nontransferring passengers.

In contrast to that of nontransferring passengers, the EETC of transferring passengers continuously decreases with increased proportion* k*. This is because the timetable for the k%-ProT scheme is a synchronization timetable aiming to minimize the total passenger transfer waiting time and the possibilities of delay are not considered. Hence, the transfer buffer times tend to be as small as possible for the sake of efficiency (for example, zero if there are no conflicts). Furthermore, because of the long headway on the connecting line, the running time supplement can be used as a buffer to avoid missing transfers, as well as to reduce the EETC of transferring passengers.

In addition, as compared to the RobT-S scheme, although the increased EETC of transferring passengers with respect to the RobT scheme is not significant, the increase in EETC of nontransferring passengers is relatively large. Thus, from the perspective of passengers within a network, because disregarding the double-buffer effect is demonstrated to increase the EETC of nontransferring passengers, the RobT-S scheme is shown to be an inefficient scheme. As a result, it is of great significance to consider the double-buffer effect by concurrently adjusting the running time supplement and transfer buffer time to generate a robust, yet efficient, timetable.

###### 6.3.2. Missed-Transfer Passengers

Figure 10 illustrates the distribution of the missed-transfer passengers for three different buffer schemes under 1000 random simulation scenarios. In general, a small mean and fluctuation of the missed-transfer passengers are indicative of a fine recoverable robustness such that the scheduled timetables are easier to recover by taking a set of recoverable actions. The concept of recoverable robustness combines the optimization of the scheduled timetable and the strategy of recovery. As can be seen, when there is no running time supplement, the 0%-ProT scheme exhibits poor performance with respect to both the mean and fluctuation of the missed-transfer passengers. Conversely, the RobT scheme results in a small number of passengers missing their transfer, thereby also implying small fluctuations in this amount, which is comparable with the 20%-ProT scheme. This implies that the timetable generated via the proposed method is more robust and stable in the event of a delay.

#### 7. Conclusions

To date, the primary objective when solving train timetabling optimization problems has been to improve the robustness by only adjusting the running time supplement. However, in this paper, we not only consider both the running time supplement and the transfer buffer time and concurrently adjust them to increase efficiency. The main contributions of this paper can be summarized as follows:

(1) This paper demonstrates analysis of the effects of the two types of buffer times on train delays and formulates the nonlinear function for expected extra travel cost (EETC) that includes the running time supplement and transfer buffer time.

(2) To coordinate all the connections within a network as a whole, a robust transfer optimization model is constructed by minimizing the total EETC. The resulting objective function that is a compromise between travel time extension and delay resistance risk can be used to appropriately balance efficiency and robustness in practice.

(3) To simplify and enhance the efficiency of the model, we propose an improved approximate linearization approach that is based on characteristics of the EETC; we then introduce several binary variables and auxiliary substitution variables to fully linearize the EETC, enabling the proposed model to be solved via linear programming software.

Furthermore, experimental simulations are conducted to demonstrate that our proposed method can generate robust yet efficient solutions. It is shown that the proportional buffer schemes do not consider the characteristics associated with each type of passenger and that the buffer scheme only considers the effects of running time supplement; these oversights each yield impractical expectations of overall extra travel cost. Because the performances of both are shown to be inferior to our optimized scheme, we have proven that it is of vital importance to optimize timetable robustness by considering both the running time supplement and transfer buffer time.

Some extension is suggested for future research. First, the approximate linearization approach can be improved by introducing a multisegment, more than two segments, piecewise linear function to obtain a higher goodness of fit. However, the computational times may be affected. Second, although our proposed approach is based on fixed transfer pairs, the main idea can be implemented to integrate connection precedence coordination and transfer robustness optimization. Further research might also address applicable and efficient ways to generate the integrated solutions.

#### Data Availability

The data used to support the findings of this study are included within the article.

#### Conflicts of Interest

The authors declare that they have no conflicts of interest.

#### Acknowledgments

This work was supported by the Fundamental Research Funds for the Central Universities (grant number 2018YJS074), the National Natural Science Foundation of China (grant number 51478036), and the Talent Fund Project of Beijing Jiaotong University (grant number 2017RC036).