Abstract

The importance of travel time reliability in traffic management, control, and network design has received a lot of attention in the past decade. In this paper, a network travel time distribution model based on the Johnson curve system is proposed. The model is applied to field travel time data collected by Automated Number Plate Recognition (ANPR) cameras. We further investigate the network-level travel time reliability by connecting the network reliability measures such as the weighted standard deviation of travel time rate and the weighted skewness of travel time rate distributions with network traffic characteristics (e.g., the network density). The weighting is done with respect to the number of signalized intersections on a trip. A clear linear relation between the weighted average travel time rate and the weighted standard deviation of travel time rate can be observed for different time periods with time-varying demand. Furthermore, both the weighted average travel time rate and the weighted standard deviation of travel time rate increase monotonically with network density. The empirical findings of the relation between network travel time reliability and network traffic characteristics can be possibly applied to assess traffic management and control measures to improve network travel time reliability.

1. Introduction

Urban travel times are well recognized to be variable due to time-varying demand, stochastic capacity, different control measures (e.g., signal control at intersections), and external factors (e.g., weather conditions). The most widely used travel time uncertainty modelling philosophy is based on the concept of probability. With the probability distributions, the reliability analysis of link/route/network travel times can be conducted. In the literature, travel time distribution estimation methods in the urban context can be categorized into three groups: analytical approach, statistical (or numerical) approximation, and simulation.

For the analytical approach, Zheng et al. [1, 2] proposed an analytical link and trip travel time distribution model for urban signalized arterials. The model explicitly accounts for the influence of stochastic traffic processes, variation of traffic demand, queue dynamics, and different traffic signal control schemes (including cycle time, green time, and offset) on travel time variability. The “spillback” phenomenon (queued traffic from one intersection affecting operations at adjacent intersections) is also taken into account. The advantage of the analytical model lies in its transferability and applicability in different traffic conditions. However, the main drawback is the complexity of the modelling process for network-wide travel time distribution estimation.

For the statistical (or numerical) methods, normal, lognormal, truncated normal/lognormal, Gamma, and Weibull distributions [35] are widely applied to describe travel time reliability (variability). Due to the complexity of urban traffic conditions, a single distribution model could not well represent the travel time distribution of an urban road. A mixture of statistical distribution models [6, 7] has been proposed to capture multistate traffic conditions and the influence of signal control at intersections for urban roads. Clark and Watling [8] employed the method of moments to calculate various travel time reliability measures (expected travel time, variance, skewness, and kurtosis). The values of these four measures were then fitted to a family of probability densities called “Johnson” curves. In order to capture both the vehicle-to-vehicle and day-to-day variability in travel time data, Kim and Mahmassani [9] proposed a Gamma-Gamma mixture distribution model. Based on their proposed model, the heterogeneity within these two variability types under different weather conditions can be described as well.

For the simulation method, Kim et al. [10] employed traffic simulation models to derive travel time distributions under different scenarios considering various demand and supply uncertainty factors, such as weather, traffic incidents, work zones, and traffic control. Chen and Zhou [11] proposed a simulation approach to derive network travel times by solving a traffic equilibrium problem considering demand uncertainty. The network travel time samples were further used to estimate the network travel time distribution and its statistics. The main advantage of the simulation approach lies in its flexibility. However, intensive computation is required to conduct the simulation.

Although the concept of travel time reliability is relatively new in the field of transportation compared to some other engineering fields, various travel time reliability measures have been proposed by researchers, for example, standard deviation, statistical range [12], buffer index [13], 90th or 95th percentile [14], skewness-width [15], travel time budget or total travel time budget [16], and mean-excess total travel time [17]. The choice of the most suitable reliability model depends on the context and purpose for which reliability is evaluated and on the characteristic of the travel time distribution [15]. Skewness is sometimes a relevant characteristic because the more extreme long travel times may deter travelers to choose a certain route and the economic value of extreme long travel times is important [18]. In cases for which skewness in the travel time probability distribution is not an issue, the variance of the travel time is a logical reliability measure. Total travel time budget (TTTB) and mean-excess total travel time (METTT) are two risk measures proposed for assessing transportation network performance under uncertainty [11, 17]. TTTB is defined as the minimum travel time threshold that satisfies a certain reliability requirement given by decision-makers at certain confidence level. Xu et al. [17] define the METT as the conditional expectation of travel times exceeding the corresponding TTTB at a given confidence level. The numerical example shows that applying METTT in the network design problem under uncertainty lowers the construction cost and reduces the unacceptable risk in the distribution tail substantially while incurring a minor TTTB increase.

Up to now, most research focuses on modelling or measuring link/segment level or route level travel time reliability. Research on network wide travel time reliability is rather limited and mainly based on simulation. Mahmassani et al. [19] used both simulated data and field GPS trajectory data to model travel time reliability. Their investigation with field data is rather limited. Moreover, GPS data is only a sample of the total traffic on the network and no flow data is available in their study area. The question is to what extent the derived relationship between the travel time variability and the network flow or density deduced from GPS sample data can represent the real situation.

The network travel time reliability has been considered as an important aspect in network management, control, and design [20]. In [1, 2], the authors have developed an analytical travel time distribution model for link and route levels. When it comes to the network level, the previously developed “pure analytical model” is unable to describe the travel time variability. Therefore, in this paper, a network travel time distribution model based on flexible probability densities known as Johnson curves is introduced in Section 2. Different from previous research on travel time reliability modelling with Johnson curves estimated based on the method of moments, we apply a percentile-point method to estimate Johnson curves. The capabilities of Johnson curves are demonstrated with field travel time data. In Section 3, we discuss the calculation of travel time reliability measures for urban networks and their connection with network traffic characteristics (e.g., network traffic density and flow). An illustration of travel time reliability based on empirical data for a real urban network of a city in China using ANPR data is provided as well. Finally, some discussion and conclusions are provided in the last section.

2. Network Travel Time Distribution

2.1. Vehicle-to-Vehicle Travel Time Variability from Empirical Observations

Travel times that vehicles experience in the urban network can be influenced by many factors, such as fluctuations in traffic demand and supply, signal control at intersections, turning vehicles from cross streets, bus maneuvers at bus stops, parking vehicles along the roadside, crossing pedestrians and cyclists, and, of course, intermediate stops of the vehicles themselves, for example, for certain activities of the driver along the trip. As a result, urban travel times are rather variable given known traffic condition (traffic flow). Figure 1(a) illustrates the empirical travel time-flow relationship derived from local 5 min aggregated measurements of time-mean flow and median travel time for each 5 min on an urban arterial road “Renmin Road” in April 2015 in Changsha City, China. The region “A” illustrates the low demand condition and region “B” shows the high demand condition. It can be clearly seen in Figure 1(a) that there is no one-one correspondence relationship between travel time and flow over the whole range of traffic flow. Even in the condition of low traffic volume (region “A”), for a certain traffic flow, there is a big range of travel time corresponding to it. In the high demand condition, a large variation of travel times can be observed. Figure 1(b) illustrates travel times experienced by vehicles departing within a short time period (e.g., 1 min). The difference between the maximum travel time and minimum travel time can be as large as 221 seconds even within a small departure time period of 1 min.

2.2. Modelling Network Travel Time Distribution Using Johnson Curves

Due to a lot of uncertainties in urban transportation systems, travel times on both the link level and the route level are rather variable, which aggregately results in the variability of travel times for the network level. As shown in some empirical studies, the travel time distribution is unlikely to have a normal distribution but is heavily skewed with a long fat tail [15, 21, 22]. In this case, flexible probability densities known as Johnson curves [23] can be applied to model travel time data. The Johnson curves have been widely applied in the field of agriculture and environmental engineering and proved to be a powerful tool to model nonnormal and multimodality data [24, 25]. One important property of Johnson curves is that parameters in the model can be estimated analytically or explicitly, which facilitate the generalization to different situations and data sets. In the following, the Johnson curves approach is discussed.

Let denote the variable of intersection-normalized travel time which is defined as the travel time trip divided by the number of intersections on the trip. Jones et al. [26] and Mahmassani et al. [19] suggested that the use of distance-normalized travel time can help exclude the source of variability coming from trip distance and only captures variability due to speed variation. This is the case with trips without intersections, when the variance of travel time depends only on the variation in speed and congestion at bottlenecks. However, on urban arterials, a major part of travel time variance comes from signalized intersections. Therefore, we propose to normalize travel time on the number of intersections for a certain trip and focus on investigating the intersection-normalized travel time distribution for urban networks. This normalized travel time will be denoted by “travel time rate.” In the Johnson system, three types of probability density functions are used:(1)The unbounded system (or ): (2)The bounded system (or ): (3)The lognormal system (or ): where is a standard normal variable and , , , and are parameters of the Johnson curves. In the curves, the domain is the entire real line. By contrast, the domain of curves is defined over the open interval between ε and . curves can be further simplified by eliminating the parameter as . In the literature, Johnson curves were usually estimated by the method of moments [8, 27, 28], which requires calculating four moments (mean, standard deviation, skewness, and kurtosis) of a distribution and sometimes it is not straightforward to derive these moments. Slifker and Shapiro [24] proposed a method to estimate parameters of Johnson curves based on percentiles. Their method has shown to be an easier and more reliable procedure compared with methods based on moments [25]. The procedure to estimate parameters is as follows.

Step 1. Estimate the percentiles in the travel time data corresponding to the cumulative probabilities , , , and associated with four numbers , , , and The value of should be chosen such that the percentiles can be reliably estimated. Slifker and Shapiro [24] suggested a choice of , which provides reliable estimates of four percentiles and works well for a variety of data sets.

Step 2. Estimate the percentile travel times at , , , and based on the probabilities , , , and and calculate the parameters , , and as

Slifker and Shapiro [24] showed that the expression can be used to determine the appropriate Johnson curve. The Johnson corresponds to , the Johnson corresponds to , and Johnson corresponds to . It should be noted that when using the value of to select among Johnson curve types, and predominate since only in rare cases will be exactly equal to 1.

Step 3. Estimate the Johnson curve parameters , , , and . For the detailed calculation of these parameters, readers can refer to literature [24, 25].

Based on the above estimated parameters, travel time rate distributions using the Johnson curves can be derived straightforwardly. It is worthwhile noting that the circumstances in which exactly equals 1 are rather rare. Therefore, and are more frequently chosen when the value of is used to determine the type of Johnson curves. However, could be less than or larger than 1 when samples are extracted from an distribution in some data sets [29]. In this case, examination of the third (skewness) and fourth (kurtosis) moments of data in relation to the moment plane could help determine the proper type of Johnson curves when is close to 1. For instance, as suggested by Mage [29], if the (third and fourth) moments lie within the region and is larger than 1, the distribution is an appropriate choice.

2.3. Illustration with Field ANPR Data
2.3.1. Study Area and Data Description

In order to investigate how the Johnson curve system performs in terms of describing network travel time distribution (variability), we selected a study network located in the CBD of Changsha City, China. Figure 2 shows the selected network with 73 signalized intersections and 21 intersections with ANPR cameras. The data were collected for three days from 20 April 2015 to 22 April 2015. The time period of analysis considered in this study is between 7:00 a.m. and 7:00 p.m. Each day was divided into 30 min time intervals. OD travel times between intersections with ANPR can be estimated from camera records. Outliers such as double counting and mismatching were eliminated from the data set. Apart from these outliers, by comparing travel times between vehicles consecutively passing the intersection, abnormal link travel times between two adjacent cameras (e.g., parking along the road for a coffee) were identified and eliminated as well. The process can be described as follows.

Vehicles sequentially pass intersections . Let us denote as the time instant when vehicle passes the upstream intersection as the time instant when the immediate preceding vehicle passes the upstream intersection as the time instant when the immediate following vehicle passes the upstream intersection as the time instant when vehicle passes the adjacent downstream intersection as the time instant when vehicle passes the adjacent downstream intersection , and as the time instant when vehicle passes the adjacent downstream intersection ; then the link travel time between intersections and for vehicles , and can be calculated as

The link travel time differences between vehicle and and vehicle and are calculated as

If and , vehicle is considered as “abnormal” vehicle whose travel time is much longer compared with “normal” preceding vehicle and following vehicle . Then the link travel time of vehicle is eliminated, where is the predefined threshold (the average cycle time of the intersections is about 3 min; therefore, “3 min” is chosen in our case).

The vehicle travel time rates were then calculated with individual travel times divided by their travel distance.

2.3.2. Results

We calculated the value of parameters , , , and in the Johnson curve system according to the steps described in the previous subsection. The value of is smaller than 1 for all cases, which indicates that the Johnson curves are appropriate to describe travel time rate distributions for our selected network.

Figures 3(a)3(c) illustrate the comparison between the network travel time rate distributions estimated from ANPR data and the Johnson fitted and lognormal fitted curves during peak time period of 7:00–7:30 a.m. for three days. The Johnson curves show a better fit to the ANPR travel time data compared with the lognormal fit. Similar results can be identified in Figures 3(d)3(f), where ANPR travel time rate distributions are presented along with Johnson and lognormal fitted probability curves during off-peak period of 10:00–10:30 a.m. We further applied the one-sample Kolmogorov–Smirnov (KS) test with four time periods each day (in total, 12 time periods). Table 1 presents the K-S test results. From Table 1, it can be concluded that the null hypothesis that the travel time rates follow the Johnson curve (SB) distribution cannot be rejected for most of the time periods (except time periods 5:00 p.m.–5:30 p.m. on day 2015-04-21) at the significance level of 5% since the value is larger than 0.05. The better fitting performance of the Johnson curve model is due to the fact that the Johnson system allows describing a wide spectrum of traffic conditions as usually existing in the urban traffic system. The Johnson model can flexibly adapt its shape to the observed travel time data. We calculated the Akaike information criterion (AIC) for both the Johnson curve model and the lognormal model as well. AIC is a measure of the relative quality of statistical models for a given set of data. It is defined aswhere is the number of parameters in the model; is the maximized value of the likelihood function for the model. From (4), it can be seen that AIC rewards goodness of fit (as assessed by the likelihood function), but it also includes a penalty that is an increasing function of the number of estimated parameters.

Given a set of candidate models for the data, the preferred model is the one with the minimum AIC value. For more detailed information about AIC, readers can refer to Akaike’s work [30]. Table 2 shows the AIC values for both models in different time periods. The lower AIC values of the Johnson curve model further confirm that the Johnson curve model outperforms the lognormal distribution model.

Besides, as can be seen from Figure 3, network travel time distributions are wide and skewed with a long fat right tail during different time periods, which suggest that the network travel time rate is rather variable (unreliable). In the following section, we will look at network travel time reliability measures and how these measures vary under different traffic conditions.

3. Network Travel Time Reliability Metrics and Their Connection with Network Traffic Characteristics

3.1. Network Travel Time Reliability Measures

Based on travel time distributions as proposed in the previous section, any travel time reliability metrics (e.g., standard deviation, percentiles, skewness, and buffer index) can be derived. In the past decades, travel time reliability metrics are commonly used to quantify link or route travel time reliability, whereas very little research has been dedicated to investigate the network travel time reliability. In this section, we investigate the network travel time reliability (in terms of intersection-weighted standard deviation and intersection-weighted skewness of travel time rate) and its connection with network traffic characteristics under different traffic conditions.

A method that is frequently used in literature to characterize traffic states in a network is the macroscopic fundamental diagram (MFD) [3133]. In the MFD, the traffic state is characterized by production (sum of distance-weighted traffic flow) and accumulation (sum of distance-weighted traffic density). Let us denote the length of a link by , the outflow of link by , the traffic density (vehicles per length of the road) of link by , and the total number of links in the network by . Then, the weighted average network density during time interval is defined as

Since we use ANPR data, the traffic density cannot be derived directly. We calculate the traffic density from the total vehicle travel time on link and the link length . We use the methodology proposed by Edie [34] applied on single links, based on trajectory analysis [35]. The average traffic density of link during time interval is estimated as the total time spent on the link during the time interval.

There is a methodological problem in the choice of the frame in space and time for the definition of the traffic density. The time the vehicle spent on the link for the first few vehicles passing the end of the link after is not entirely spent during the time period as shown in Figure 4. The situation is represented in a rather extreme way to show the problem. The first four vehicles registered by the camera have a travel time which has not been spent completely in the time frame . The next four vehicles are not registered in time frame although they spend a part of the time in the space-time frame . Although the detailed trajectories are not known, we can apply the Three-Dimensional Trajectory method (Saberi et al. [35]). To be accurate we have to correct the travel time of the first vehicles passing in , that is, the vehicles for which the travel time is longer than the time in at which they are registered. Similarly, we have to add at the end of the timeframe the times spend by the vehicles that entered but did not reach the end of the road section within the timeframe.

The travel time of vehicle on link is given by the difference in passing time at the stop line of the link and the link as . The part of the travel time that should be allocated to the time interval is calculated as

In an analogue way, the vehicles that are not counted at the end of the time frame can be detected in the next time frame and partly allocated to . The travel time that has to be added to the time spent in the period is calculated as

In practice, if the time intervals are chosen longer than the travel time on the link the boundary problem gives only a marginal influence of the calculation of the time spent. Furthermore, is chosen long enough to have little influence of the signal cycle time (which was not a common cycle time for all intersections). This avoids the time-space window problems as discussed by Courbon and Leclercq [36]where is the corrected travel time of vehicle on link passing the stop line at the end of link between and ; is the total number of vehicles leaving link during time interval united with the vehicles that entered the link in ; is the total number of links in the network; and are the start and end of the time interval.

The weighted (average) network flow during time interval is defined aswhere is determined as the number of vehicles passing the end of link during . Apart from the network flow and density, three reliability related quantities which are intersection-weighted mean travel time rate, intersection-weighted standard deviation of travel time rate, and intersection-weighted skewness of travel time rate distributions are used in this study.

As we proposed in the previous section, the travel time rate of trip can be calculated as the trip travel time divided by the number of intersections during this trip:

The intersection-weighted mean travel time rate during time interval is defined aswhere is the number of intersections during trip , is the travel time rate of trip , and is the total number of trips observed in the network during time period .

The intersection-weighted standard deviation of travel time rate during time interval is defined as

Skewness is a measure of the asymmetry of the probability distribution of a random variable about its mean. In terms of reliability, the more skewed the travel time distribution is, the less reliable travel times become. If the value of skewness is zero, the travel time is normally distributed. If the value of skewness is larger than zero, the travel time distribution is right-skewed with a longer tail to the right. Otherwise, the travel time distribution is left-skewed with a longer tail to the left. The weighted skewness of travel time rate for a network during time interval is defined aswhere is the mean travel time rate during time interval ; is the standard deviation of travel time rate during time interval .

3.2. Case Study with Field ANPR Data

The selected study area consisting of 54 links is shown in Figure 2 in the previous section. Again we use the three-day (between 7:00 a.m. and 7:00 p.m. from 20 April until 22 April 2015) ANPR data from Changsha City, China. The traffic flow can be derived directly since each ANPR camera records the plates number of all vehicles on lanes for straight-on and left turning movements passing the stop line. The traffic density however cannot be obtained directly from ANPR data. We apply (6) to estimate traffic density for each link. Figure 5 illustrates the macroscopic fundamental diagram for the selected CBD network based on three days’ observations with time intervals of 5 min (Figure 5(a)) and 15 min (Figure 5(b)), respectively. This macroscopic fundamental diagram reveals that the observed maximum weighted traffic volume in this network is around 400 veh/h/lane for the weighted traffic density around 30 veh/km/lane. No gridlock happened in this region from the observed data. That does not mean that the network operates below capacity, but it is an indication that the traffic police actively takes measures to reduce inflow and increase outflow on links at the moment that spillback threatens to occur. Of course, this form of active traffic management has an influence on travel times and travel time reliability.

Figure 6 illustrates the relation between the weighted average travel time rate and the weighted standard deviation of travel time rate for every 5 min. Figures 6(a)6(c) are intersection-weighted travel times and Figures 6(d)6(f) are distance-weighted travel times. A clear linear trend can be observed with both intersection-normalized and distance-normalized approaches, though the travel time data appear to be more scattered with a slightly lower correlation coefficient ( = 0.7167 and 0.5938) on 21 April. Higher correlations can be observed with intersection-weighted approach compared with the distance-weighted method for all three days’ travel times. The magnitude of the slope which reflects how much the weighted standard deviation will increase when the weighted average travel time rate increases is between 0.46 and 0.65 for intersection-weighted approach and between 0.35 and 0.62 for distance-weighted approach within these three days’ observations. The larger this value is, the more vulnerable to congestion induced reliability deterioration the network becomes.

Figure 7 shows the relation between the weighted average travel time rate and the network density, where the upper three figures (Figures 7(a)7(c)) are intersection-weighted travel time rates and the lower three ones (Figures 7(d)7(f)) are distance-weighted travel time rates. A relatively strong positive correlation exists between the weighted average travel time rate and the network density for all three days with > 0.55 for intersection-weighted method and > 0.48 for distance-weighted approach. The comparison between the intersection-weighted method and the distance-normalized method illustrates that closer correlations can be observed with the former than the later. Lower correlation coefficients can be observed for the weighted standard deviation versus the network density as shown in Figure 8. Nevertheless, the positive linear trend still exists, which indicates that the travel time variability increases when the network density increases. This is consistent with the argument in literature that the network travel time variability has a linear relation with the network density with simulated data [37].

Apart from investigating network travel time variability in terms of the standard deviation, the intersection-weighted skewness of ANPR travel time data is calculated as well according to (15). Figure 9 shows the relation between the weighted skewness and the network density. There is no strong correlation between the network density and skewness, though a weak trend, indicating that the skewness decreases with the increase of network density, can be observed. Nevertheless, we can observe that most of the values of skewness between 7:00 a.m. and 7:00 p.m. are larger than zero, which indicates that the travel time distribution is right-skewed with a longer right tail. This is consistent with travel time distributions derived in Section 2 with Johnson curves.

The relations between travel time reliability and network flow are illustrated in Figure 10. It is observed that the relationship between travel time reliability (e.g., the intersection-weighted standard deviation of travel time rate) and network flow rate is not monotonic anymore. The intersection-weighted standard deviation of travel time rate increases with network flow rate in low traffic demand condition until the maximum flow rate, beyond which the network flow rate drops while the standard deviation continues to increase. The results shown in Figure 10 are quite consistent with those derived from simulated data in [37].

4. Discussion and Conclusions

This paper proposes a network-level travel time distribution model by applying a flexible system of Johnson curves. The proposed model is able to describe a variety of travel time distributions with properly estimating model parameters based on the observed percentiles in the travel time data. We validate the performance of the proposed model using ANPR data collected in the CBD area of Changsha City, China. The results indicate that the travel time data can be better modelled with the proposed model than the widely used lognormal model in the transportation field. The reason behind is that the Johnson curve model is able to better capture the skewness in the data set with parameters describing the tail of the distribution (e.g., with the third and fourth moments or four percentile points), while in the lognormal distribution, only the mean and standard deviation of the logarithm are used to characterize the distribution of the data set.

We further investigated the network travel time reliability measures and their connection with network traffic characteristics in terms of network density and network flow, which are known as the macroscopic fundamental diagram. Three network travel time reliability measures which are the weighted average travel time rate, the weighted standard deviation of travel time rate, and the weighted skewness of travel time distributions are discussed, and the relation between these reliability measures and the network density is illustrated using the field ANPR data. The weighting is done with respect to both the number of intersections on a trip and the trip distance. Results show that the weighted average travel time rate has a positive linear relation with the weighted standard deviation of travel time rate for different time periods under time-varying demand. A positive linear trend can be clearly observed between the weighted standard deviation (or the weighted average travel time rate) and the network density. The correlations between reliability measures and network states are higher with the intersection-weighted approach than those using the distance-weighted approach. The investigation of the relation between the weighted skewness and the network density shows that there is a weak negative correlation between the former and the latter based on the available ANPR data. The value of skewness is larger than 0 for most observation periods between 7:00 a.m. and 7:00 p.m., which indicates that travel time distributions are right-skewed with a longer tail to the right.

The findings in this paper provide some insights into the network travel time reliability. The empirical relation between the network travel time reliability and network traffic characteristics (e.g., network density) can be possibly applied to assess the traffic management and control measures for reliability purposes; for example, by applying the relations between network flow and network density, as well as network reliability (network standard deviation) and network density, network control strategies can be optimized to improve network reliability while maintaining required network throughput.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work is supported by the National Science Foundation of China (NSFC) under Projects 51308475, 61673321, and 71671147, the Fundamental Research Funds for the Central Universities under Contract 2682016CY07, Chengdu Science and Technology Commission under Contract 2015-RK00-00190-ZF, and Changsha Science and Technology Commission under Projects K1001010-11 and K1106004-11.