Abstract

Identifying the influential factors in incident duration is important for traffic management agency to mitigate the impact of traffic incidents on freeway operation. Previous studies have proposed a variety of approaches to determine the significant factors for traffic incident clearance time. These methods commonly select a single “true” model among a majority of alternative models based on some model selection criteria. However, the conventional methods generally neglect the uncertainty related to the choice of models. This paper proposes a Bayesian Model Averaging (BMA) model to account for model uncertainty by averaging all plausible models using posterior probability as the weight. The BMA model is used to analyze the 2,584 freeway incident records obtained from I-5 corridor in Seattle, WA, USA. The results show that the BMA approach has the capability of interpreting the causal relationship between explanatory variables and clearance time. In addition, the BMA approach can provide better prediction performance than the Cox proportional hazards model and the accelerated failure time models. Overall, the findings in this study can be useful for traffic emergency management agency to apply an alternative methodology for predicting traffic incident clearance time when model uncertainty is considered.

1. Introduction

Improving the efficiency of the traffic incident management is a common measure to alleviate traffic congestion [13]. Although the definition and components of traffic incident duration vary in different studies, clearance time is regarded as the most uncontrollable component [4]. As a phase of incident duration, it depends on the unique factors of each individual incident [5]. Thus, understanding the effects of influential factors and accurate incident clearance time prediction are essential to evaluate traffic incident management strategy [6].

A variety of approaches have been utilized to predict incident clearance time and analyze the effects of influential factors during the past several decades. In general, these approaches can be classified into statistical methods [7] and machine learning methods [8, 9]. Statistical methods generally can explain the mechanism between independent and dependent variables based on the rigorous mathematical formula [10, 11]. From a methodological perspective, regression analysis and hazard-based methods are the two main statistical methods to analyze incident duration data. Regression methods were widely used for incident duration prediction in previous studies, such as linear regression [12, 13]. To overcome the simple linear assumption between incident clearance time and explanatory variables, researchers proposed the hazard-based duration models (HBDM) to predict the incident duration precisely and explore the influence of significant factors on the incident duration, such as the Cox Proportional Hazards (PH) model and the Accelerated Failure Time (AFT) model [14]. Nam and Mannering [5] applied different distributions in HBDMs to analyze incident duration based on 681 incidents from Washington State’s incident response team program. Lee and Fazio [15] used a proportional hazard-based Cox-regression model to analyze the effect of explanatory variables on response time and clearance time, respectively. Li and Shang [16] selected the best-fit model from 17 candidate different AFT models with different distributions according to the Bayesian Information Criterion (BIC) values, to investigate the effective factors of each incident and predict the time of each incident phase. Ghosh et al. [17] pointed out that the AFT model with a generalized F distribution outperformed other five parametric hazard-based duration models when analyzing the incident clearance time data collected in Michigan. Haule et al. [18] investigated the effects of significant factors on incident clearance time and selected the best-fit model from three AFT models with different distributions in terms of the Akaike information criterion (AIC).

Recently, large quantities of alternative methods have been introduced under the framework of the HBDMs in recent years. For example, a 2-component log-logistic finite mixture model was used by Zou et al. [19] to analyze the incident clearance time data obtained from the Washington Incident Tracking System (WITS), and the model could be better in accounting for the heterogeneities compared to the standard survival model. Different from conventional conditional-mean HBDMs, the quantile regression (QR) model can estimate the changing influence of explanatory variables on incident clearance time as each quantile of the incident duration distribution varied [20, 21]. And a copula-based approach was proposed to accommodate the correlation between incident clearance and other phases of incident duration, and the results showed that the copula models outperformed the log-normal AFT model in predicting the clearance time [22, 23].

Compared with the statistical methods, machine learning approaches can be employed to overcome complex nonlinear relationships between incident clearance time and explanatory variables without prior assumptions about input data [24, 25]. In the previous studies, K-Nearest Neighbor algorithm (KNN) and Bayesian networks were used to predict the incident clearance time [2628]. Besides, decision trees (DT) models can specify the relative importance of different explanatory variables associated with response variables and are widely used to analyze the traffic incident duration [29]. Nevertheless, the structure of the decision tree highly depends on the data, resulting in instability. To overcome the imbalanced traffic incident duration data problem of the single-tree-based method, Ma et al. [30] found that the gradient boosting decision tree model has a superior performance in model interpretation and prediction accuracy to conventional DT models [31]. Also based on traditional DT models, the extreme gradient boosting machine algorithm was applied to analyze and predict the clearance time data [32, 33]. However, the machine learning models are usually not capable of interpreting the mechanism between estimator and explanatory variables.

Overall, the hazard-based models and the tree-based models are two commonly used statistical and machine learning models in traffic accident duration analysis. However, these aforementioned approaches generally failed to take account of the model uncertainty (e.g., the AFT models with Gamma, Weibull, or log-logistic model) [34]. In other words, traditional methods typically assumed that the model to be estimated is the “true” model and then made tests among a majority of alternative “true” model according to some criteria like BIC [35]. Consequently, it is important to consider the uncertainty between candidate models, especially when these model are considered reasonable in spite of difference in predictions [36]. Otherwise, the resulting model estimates may be biased and lead to erroneous inference in the analysis of incident clearance time. The Bayesian Model Averaging (BMA) proposed by Draper [37] provided a statistical theoretical basis for solving the model uncertainty problem in econometric modeling. This approach combines and averages all plausible models considered (models with various combinations of influential variables) through setting different prior probability distributions [38] and has been widely used in various fields, such as water main failures prediction [39], firm default prediction [40], and chemical engineering [41].

The objective of this research is to apply the BMA method to account for both model and parameter uncertainty when modeling the incident clearance time. To examine the proposed approach, 2,584 freeway incident records obtained from I-5 corridor in Seattle are analyzed. Estimation and prediction results from the proposed BMA model and conventional HBDMs are then compared and analyzed.

2. Methodology

This section describes the general features of the BMA, two conventional HBDMs and Occam’s Window Method. The last method is one of the sampling techniques used in the model space.

2.1. Bayesian Model Averaging

BMA uses posterior probability as the weight to average all plausible models considered. Thus, let denote the set of all models and let denote the future observed value of the incident clearance time using new input data. Then, in accordance with the law of total probability, the probability density function (PDF) of under the observed dataset D iswhere is the mean of the posterior distribution of based on candidate model , which is the output of BMA approach. And is the probability of the correct prediction model , which is also referred to as the posterior model probability (PMP). And in model space , . The PMP is given by Bayes’ rule:whereis the marginal model likelihood for the given model and is the vector of parameters of the model . Additionally, and are the prior density of and the likelihood given the model , respectively. refers to the prior probability that is considered as the “true” model. The posterior mean and variance of are presented as follows:

Based on the previous work, the size of model space makes the summation of (1) impractical. In order to solve this problem, Occam’s window method is used to select an appropriate collection of candidate models from the model space and will be introduced in the following section.

2.2. Occam’s Window Method

Occam’s window approach was proposed by Madigan and Raftery [42] to reduce the number of models with low posterior probability in the model space . Two basic principles were conducted to eliminate models that predict far less well than their competitors.

First, if the PMP for the model in the model space is calculated far lower than the model that provides the highest PMP, model is discarded from the model space . Those models failing to satisfying the formulashould be excluded from equation (1). The refers to the model with the highest PMP, and the value of C equals 20 since it is determined by the data analyst.

Second, if the PMPs of complex models are lower than their simpler submodels using Occam’s razor method, these models should be discarded from model space belonging to

Therefore, formula (1) can be expressed aswhere .

Additionally, the leaps and bounds algorithm is applied to implement the above principles as the search strategy. Interested readers can see the companion paper for more details about this algorithm [43].

2.3. Cox Proportional Hazards (PH) Model

The Cox proportional hazard (PH) model is one of the most commonly used semiparametric survival analysis models. This model is used for investigating the relationship between survival time of respondents and predictor variables [44] and is given as follows:where is the survival time, represents a vector of explanatory variables, are the coefficients to be estimated, which measure the impact of the p-covariates, is the hazard function, is an unknown baseline hazard, that is, the value of the hazard if all the are equal to zero, and equation (8) can also be written aswhere is defined as hazard ratios (HR), in which any two individuals are constant over time according to the assumption of the Cox PH model.

2.4. Accelerated Failure Time (AFT) Model

Based on the hazard function, the AFT model assumes that the log of the survival times is affected linearly by the covariates of and can be clearly written aswhere are the coefficients to be estimated, and is an error term. There are kinds of different parametric distributions (e.g., Weibull, log-normal, and log-logistic models), which can be used for the AFT model.

For comparison purposes, the incident data were also analyzed using the Cox PH model and the log-logistic AFT model. In the previous study, it is suggested that the log-logistic AFT model outperformed the other two AFT models (with Weibull or log-normal distribution) in terms of the goodness-of-fit statistics and predictive performance. Therefore, the two conventional HBDMs were finally selected as the benchmark models in this study.

2.5. Data Description

In this study, the incident data obtained from the I-5 corridor between Boeing Access Road and the Seattle Central Business District were retrieved from the WITS, which is used to manage the incident log data. The reason for choosing this site is the heavy traffic demand and frequent incident-induced traffic congestion events. Additionally, in previous studies, Tang et al. [32] used the data source to analyze the influence of explanatory variables and examine the prediction performance of the proposed model. And Hou et al. [45] analyzed the time-varying effects of significant variables based on this dataset. A total of 2,584 valid incidents from 1 January to 31 December 2009 were selected from the WITS dataset including 15 categorical candidate explanatory variables. Additionally, in this dataset, the mean and median of the incident clearance time were 13.58 minutes and 9 minutes. The minimum and maximum values were 1 minutes and 382 minutes, respectively. And the standard deviation was 17.35 minutes. The key information of candidate variables is presented in Table 1.

3. Results and Discussion

This section describes the modeling results for the BMA approach, Cox PH model, and the log-logistic AFT model. In terms of the model averaging strategies, it is assumed that all possible combinations of 15 candidate explanatory variables are equally likely a priori. As previously mentioned, Occam’s window method is implemented to exclude complex models that perform far less well than their simpler competitors and discard them from the model space. The results show that the BMA approach and two benchmark models differ in interpreting the explanatory variables based on the same dataset, and the former can provide better prediction performance. All statistical analyses were conducted using statistical software R.

Table 2 lists the posterior means, standard deviations, and posterior effect probabilities for the coefficients related to different explanatory variables by using the proposed method. The posterior effect probability equals the sum of the posterior probabilities for all the selected models containing that explanatory variable. For example, the posterior effect probabilities associated with the explanatory variable single lane blocked have 7.8% of its mass at zero. Unlike other statistical models, it can be observed that the variable “single lane blocked” is not included in all the selected models, although it has a significant impact on incident clearance time. 20 different models with the highest PMP were finally selected by Occam’s window approach, among which five optimal models (PMP > 0.05) shown in Table 2 account for 59.69% of the total posterior probability. Additionally, the estimated coefficients for some variables (i.e., advised WSP, disabled vehicle, debris, heavy truck, weekends, winter, weather, and incident on HOV lane) are not listed in Table 2 because the values are larger than the significance level (0.05). It is found that model 1 with the largest PMP can account for 22.8% of the total posterior probability, which means that there exists quite a bit model uncertainty.

As shown, the model averaging results of the predictors for incident clearance time generate interesting interpretations. The posterior effect probabilities of seven explanatory variables equal 100%. This result demonstrates that, for response time, traffic control, collision, multiple lane blocked, total closure, injury involved, and summer, all the selected models with the highest PMP contain these explanatory variables, and thus they are the main factors affecting the duration of traffic incident clearance. The positive coefficient in the results of the proposed model indicates that the hazard is higher, and the clearance time decreases with higher values of that variable.

It is clear that summer results in shorter clearance time, which means that the incident clearance time varies with the month of year and gets shorter in summer than other seasons, and for the remaining variables, they all result in longer clearance time.

As the BMA results in Table 2 indicate, we can see that response time shows a weak effect on incident clearance time, the coefficient of which implies that more time is required to prepare for incident response in congested traffic periods. The estimated coefficient −0.527 of traffic control on clearance time deserves attention to improve the measures undertaken by traffic incident management like incident response team (IRT) on directing upstream traffic around the incident.

Among the considered incident types, collision is the only one significant variable affecting the clearance time. And the involvement of fire and injury shows positive effect on incident clearance time. These results demonstrate that both fire and collision involvement can result in longer clearance time or smaller hazard rates and are generally the main factors causing traffic congestion.

Four types of lane closure except all travel lanes blocked all have a significant impact on incident clearance time. This is caused by the increasing operational complexity of the recovery process around the incident. On closer inspection, of all significant variables, total closure has the strongest impact on incident clearance time, which indicates that longer clearance time is generally associated with total closed lanes. To further explain the BMA model, the estimated survival probabilities are plotted in Figure 1.

Meanwhile, Tables 3 and 4 list parameter estimation results of the Cox PH model and the log-logistic AFT model for comparison purposes, respectively. The estimated coefficients for some variables are not listed in Tables 3 and 4 because the values are larger than the significance level (0.05). It is noted that, in the Cox model, the sign of the regression coefficient (coef) is opposite to that of the AFT model. The positive coefficient in the results of the Cox PH model indicates that the hazard is higher, and thus the clearance time decreases with higher values of that variable. Note that, different from the results of the BMA, all types of incident are significant in terms of values in the Cox PH model. Moreover, three lane closure types (single lane blocked, multiple lane blocked, and total closure), response time, traffic control, injury involved, fire involved, and work zone involved all result in longer clearance time or smaller hazard rates. Debris, abandoned vehicle, night, and summer all result in shorter incident clearance time or larger hazard rates. These findings are consistence with the previous study using quantile regression.

In the log-logistic AFT model, night is found statistically significant compared with the BMA model. And disabled vehicle, debris, and abandoned vehicle all result in longer clearance time in both survival models, while these variables are not statistically significant in the BMA model. The reason is that when model uncertainty is considered, their posterior effect probabilities from BMA are significantly lower than other variables. To further explain the Cox PH model, the log-logistic AFT model, the estimated survival probabilities are plotted in Figures 2 and 3.

The estimated coefficients from the Cox PH, the log-logistic AFT, and BMA models can give macroscopic and valuable insights for incident duration time. For all travel lanes blocked, the estimated coefficient −0.796 of the Cox PH model and the coefficient 0.615 of the log-logistic AFT model indicate a strong impact on clearance time, while the posterior effect probability in the BMA model equals 8.2%, which means that this variable is actually a less significant determinant of incident clearance time. Furthermore, the contributing factors resulting in longer clearance time can be identified, and thus traffic emergency management agency can improve their management planning and response strategies. As shown in Tables 24, the variables “total closure” and “fire involved” have a stronger positive impact on incident clearance time. Thus, when closing all lanes and fire involved traffic incidents are considered, more attention should be given to prevent further severe incidents.

More importantly, the posterior effect probability in the BMA model can overcome the overstatement of the evidence for an effect. The values for the coefficients indicate whether these relationships are statistically significant. In general, the values for the predictors for the single model are used as a measure of interpreting the effect on incident clearance time. However, values generally overstate the evidence for an effect [46].

Turning to the model results, Table 5 shows the posterior effect probabilities and values of several variables. For a few variables (i.e., response time, traffic control, collision, multiple lane blocked, total closure, injury involved, and summer), the values from Cox PH model and the posterior effect probability from the BMA model all indicate that there is a highly significant effect ( and ) on the clearance time. However, the BMA model and Cox PH model show the different conclusions for the variables in Table 5. The values from the Cox PH model overstate the evidence for the effects. For night, debris, and work zone involved, they are considered highly significant in terms of values (), while the posterior effect probabilities from the BMA model suggest that the evidence for an effect is not strong. Additionally, disabled vehicle, abandoned vehicle, and all travel lanes blocked are less significant determinant of incident clearance time for values (), but the posterior effect probabilities provide weak evidence for the effect on incident clearance time. And for the last variable incident on HOV lane, both approaches show extremely low certainty evidence for an effect. Thus, the Cox PH model overstates the evidence for the effect on explanatory variables compared with the posterior effect probabilities () for the BMA model according to the results in Table 5.

Furthermore, when rejecting the null hypothesis of “no effect,” values used by the Cox PH model cannot distinguish between two scenarios, which are as follows: (a) there are few data to detect the effect on explanatory variables, and (b) the data provide evidence for the null hypothesis. However, the posterior effect probability in BMA can overcome the overstatement of the evidence for an effect resulting from the above issues. For instance, all travel lanes blocked and incidents on HOV lane are found to be a less significant determinant of incident clearance time for values and posterior effect probabilities. And for all travel lanes blocked, equals 8.2%, which is exactly described in scenario (a), whereas the of incident on HOV lane indicates the evidence for the null hypothesis of “no effect.” The latter can be approximated as the posterior probability with the effect small enough, namely, , when is less than half of a standard error [47].

3.1. Comparison of the Prediction Performance

As mentioned before, BMA can improve the prediction performance than any other single candidate model due to the overlook of model uncertainty for single selected models. The incident duration data were also applied to analyze the importance of each explanatory variable contributing to response variables by using two traditional survival analysis models. It is noted that the predicted values of Cox PH model are relative to the sample observations. Thus, we applied the nonparametric step-function (NPSF) approach to predict expected durations [48]. To further demonstrate the prediction performance of BMA, the Mean Absolute Error (MAE), the Root Mean Square Error (RMSE), and the mean absolute percentage error (MAPE) were used to measure the discrepancy between BMA and Cox PH model on assessing the accuracy. These performance indexes were calculated as , and , where and are the observed value and the predicted value of clearance time for incident , respectively. Smaller MAE, RMSE, and MAPE values indicate a better prediction performance.

In this study, 2,584 incident records were randomly divided into two subsets, one of which was used as training subdataset for the application of BMA and benchmark models, and the other is used as testing subdataset to measure prediction performance. The number of sections used to build and test the model was 1500 and 1084, respectively. To make the results more credible, training and testing subdataset varied according to ten random seeds selected.

Table 6 reports the mean of the prediction performance indexes from the BMA and Cox PH model, as well as the log-logistic AFT model. Bold values are the smallest MAE, RMSE, and MAPE values among three models. As shown in Table 6, multiple random experiments in this study indicate that the proposed BMA model results in better prediction performance in contrast with two traditional survival models in terms of three performance indexes. Thus, it can be concluded that the BMA model can improve the prediction performance for the conventional survival model.

4. Conclusions

This study has applied Bayesian Model Averaging (BMA) for incident clearance time analysis and prediction. The BMA approach was compared with the Cox Proportional Hazards model and Accelerated Failure Time model (with the log-logistic distribution) in analyzing the 2,584 freeway incident records obtained from I-5 corridor in Seattle. The main conclusions are summarized as follows: (1) Response time, traffic control, collision, multiple lane blocked, total closure, injury involved, and summer are the main factors affecting the duration of traffic incident clearance. (2) The modeling results from three considered models show discrepancy in estimated coefficients, such as response time, traffic control, and night. (3) Compared to the Cox PH and the log-logistic AFT model, the posterior effect probabilities for BMA can overcome the overstatement of the evidence for the effect on explanatory variables in contrast with values for a single model. (4) The prediction performance of BMA is better than the two classical survival models when predicting the traffic incident clearance time.

For future work, BMA can be applied to explain the effect of explanatory variables on incident clearance time or other phases of incident duration. Markov chain Monte Carlo model composition should be conducted to make a comparison with Occam’s window method. Additionally, BMA can be extended to identify outliers in the clearance time data in terms of posterior model probabilities.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was sponsored jointly by the National Key Research and Development Program of China (grant no. 2018YFE0102800), the Shanghai Science and Technology Committee (grant no. 19210745700) and the Fundamental Research Funds for the Central Universities (grant no. 22120210009).