Advanced Models and Practice for Addressing Emerging CrossCutting Issues in Multimodal Transportation Research
View this Special IssueResearch Article  Open Access
Xu Miao, Bing Wu, Yajie Zou, Lingtao Wu, "Examining the Impact of Different Periodic Functions on ShortTerm Freeway Travel Time Prediction Approaches", Journal of Advanced Transportation, vol. 2020, Article ID 3463287, 15 pages, 2020. https://doi.org/10.1155/2020/3463287
Examining the Impact of Different Periodic Functions on ShortTerm Freeway Travel Time Prediction Approaches
Abstract
Freeway travel time prediction is a key technology of Intelligent Transportation Systems (ITS). Many scholars have found that periodic function plays a positive role in improving the prediction accuracy of travel time prediction models. However, very few studies have comprehensively evaluated the impacts of different periodic functions on statistical and machine learning models. In this paper, our primary objective is to evaluate the performance of the six commonly used multistep ahead travel time prediction models (three statistical models and three machine learning models). In addition, we compared the impacts of three periodic functions on multistep ahead travel time prediction for different temporal scales (5minute, 10minute, and 15minute). The results indicate that the periodic functions can improve the prediction performance of machine learning models for more than 60 minutes ahead prediction and improve the over 30 minutes ahead prediction accuracy for statistical models. Three periodic functions show a slight difference in improving the prediction accuracy of the six prediction models. For the same prediction step, the effect of the periodic function is more obvious at a higher level of aggregation.
1. Introduction
Travel time can effectively measure roadway traffic conditions [1]. Thus, accurate prediction of freeway travel time is important for traffic management agencies to provide better traffic guidance. However, it is challenging for researchers to predict travel time accurately due to the complex changes in traffic states [2]. A large number of algorithms have been proposed to improve the prediction accuracy of travel time. The existing shortterm traffic forecasting algorithms were reviewed by Vlahogianni et al. [3] and Vlahogianni et al. [1]. Existing shortterm traffic forecasting algorithms can be categorized into two major strands: statistical models and machine learning models. Linear regression analysis method [4, 5], time series method [6–8], and space time prediction methods [9, 10] are statistical models. Kalman filtering model [2, 11, 12], support vector regression model [13–15], and neural network model [16–20] are machine learning models. A series of combination models [21–25] are proposed in recent years.
Some researchers compared the performance of statistical models and machine learning models. For example, Stathopoulos et al. [26] found that fuzzy neural network outperformed Autoregressive Integrated Moving Average Model (ARIMA) in prediction performance. Vlahogianni [27] suggested that the advanced Neural Network (NN) structure can perform better than the ARIMA model. Jiang et al. [28] examined the prediction performance of different models under multiple steps ahead, and their results indicated that the machine learning models are superior to the two statistical models (i.e., vector autoregressive models and ARIMA).
Traffic data usually exhibit periodic characteristics during weekdays. Thus, considering the periodicity of data can improve the prediction performance. Up to date, three different approaches have been proposed to capture the periodic characteristics. Zou et al. [29] found that a synthetic prediction model consisting of statistical models and trigonometric polynomial function (TPF) can achieve higher prediction accuracy when the forecasting horizon is greater than half hour with 5 minutes as the aggregation level. Tang et al. [30] applied a double exponential smoothing method (DES) to describe the weekly similarities of traffic data. In the course of the study, Chen et al. [31] utilized the prediction model in accordance with the original traffic flow series compared with the intraday trend removed the by simple average (SA) approach. It is found that the accuracy of the prediction could be considerably improved by using the residual time series.
Regarding the prediction interval (steps), some existing studies have investigated the impact of data resolution on model prediction performance, but there are no definitive results. For example, Park et al. [32] considered the aggregation level from 2 minutes to 60 minutes of the ARIMA model based on travel time data. They concluded that forecasting route travel time required higher concentration levels than link travel time prediction. Vlahogianni et al. [33] found that time clustering may distort critical traffic flow information, and we need further research to determine the optimal concentration level. Some studies found that higher data resolution usually shows larger noise [34, 35].
Based on the previous studies, some studies have compared statistical models and machine learning models, and some scholars have proposed the improvement of periodic functions on travel time prediction. However, few studies have comprehensively evaluated the effects of different periodic functions on the two types of models under different prediction steps. Thus, this study focuses on multistep ahead travel time prediction by considering different periodic functions. The periodic characteristics of the travel time are captured by SA, TPF, and DES models. The residual part is modeled by the statistical models (ARIMA, space time (ST) model, vector autoregressive (VAR) model) and machine learning models(support vector machine (SVM), back propagation neural network (BPNN), multilinear regression (MLR)). In total, 18 hybrid prediction models were established and compared. In addition, the performance of prediction models was evaluated under different scenarios: multistep ahead prediction (1, 3, 6, and 12 steps ahead predictions) with different aggregation levels (5minute, 10minute, and 15minute).
The remainder of the paper is organized as follows. In Section 2, we introduce the travel time data in the study. We describe the data collection site and analyze the temporal and spatial correlation as well as the diurnal pattern observed in the data. In Section 3, we introduce periodic functions and two main methodologies used in this study: statistical models and machine learning approaches. We also discuss the evaluation measures and determine the appropriate training periods. In Section 4, we evaluate the prediction performance of the six models and compare the impacts of different periodic functions on prediction models under different scenarios. In Section 5, we provide the conclusions and some future works.
2. Travel Time Data
This study analyzed the travel time data of US290 between IH610 and FM1960 in Houston, Texas. The total length is approximately 12 miles. The segment is divided into five links by six automatic vehicle identification (AVI) readers (Figure 1). Vehicles with toll tags passed through the AVI readers will be recorded with their ID and timestamps. Travel time of the link enclosed by this pair of AVI readers is the difference in the timestamps. The length of link A to link E is 0.8, 2.6, 3.0, 1.5, and 4.1 miles, respectively. The data collection duration is from January 2008 to August 2008, a total of 174 days. The travel times were initially collected once every 30 seconds, 24 hours per day. We calculated the arithmetic mean of travel time and aggregated the travel time into 5minute, 10minute, and 15minute intervals for each link. The missing data for the five links are all less than 1%, and historical averaged based data imputation method have been implemented to ensure the selected travel time data are appropriate for model validation and evaluation in this study. This study only focuses on the weekday (MondayFriday) travel time prediction.
2.1. Temporal and Spatial Correlation of the Travel Time
We calculate the historical average travel time per mile of the five links (Monday to Friday, January to August 2008) (Figure 2). It can be found that the peak time of traffic occurs in the afternoons of all these five links, and there are mainly three types of travel time patterns. For link A, travel time increases after 12:00, peaks at about 16:30, and finishes later than 20:00. For links B and C, traffic congestion starts at 12:00, peaks around 17:35, and returns to usual after 20:00. For links D and E, traffic congestion often occurs before 16:00, peaks around 17:50, and dissipates after 20:00. In our study, link D is chosen as the target link.
Changes in traffic flow have certain temporal and spatial characteristics. Autocorrelation and crosscorrelation functions were calculated to examine the temporal and spatial correlation. The equation adopted here follows that of Zou et al. [23], as shown in equations (1)–(3).where is the sample autocorrelation function; is the time lag; ; is the number of observations; is the sample observation; and is the sample means of the series.
In this case, the crosscorrelation function measures the temporal and spatial correlation between the travel time data pairs recorded on two selected links. For travel time data pairs , an estimate of the crosscovariance function iswhere and are the sample means of the series and series, respectively. n is the number of travel time data pairs, and an estimate of the lag k_{c} crosscorrelation function iswhere and , and is the sample crosscorrelation function.
We found that the autocorrelation function of travel time shows a downward trend with time lag (Figure 3). Crosscorrelation functions between link D and links A, B, C, and E peak at the lag of −9 and −4, 0 and 0, respectively (Figure 4). As can be seen from previous analysis (Figure 2), the peak times of five links are not same. The peak of links A and B occurs earlier than link D, so the crosscorrelation functions between links A and B and link D peak reach the peak at lags of −9 and −4. The traffic state of links A and B changed 45 minutes and 20 minutes earlier than link D. Links C and E are directly connected to link D, the traffic congestion state and the peak time are more similar, the crosscorrelation functions between links C, E and link D peak at lags of 0. Furthermore, the maximum crosscorrelation values between link D and its adjacent links A, B, C, and E are 0.547, 0.720, 0.822, and 0.904.
(a)
(b)
(c)
(d)
(e)
2.2. Periodic Pattern of Travel Time
Previous research showed that travel time exhibits periodic characteristics during the weekdays. Similar periodic characteristics were found by Kamarianakis et al. [36] in occupancy, speed, and flow data. Because the periodic trend may affect the travel time prediction, this study proposed the hybrid prediction models to accommodate the periodic trend components as well as the temporal and spatial correlation observed in the data. Specifically, periodic characteristics are modeled using TPF, SA, and DES methods.
3. Methodology
3.1. Periodic Functions
3.1.1. Simple Average Method
Simple average method is one of the commonly used methods to describe the periodic characteristics [31]. During the study, the researchers set the hypothesis that the sampling travel time data of consecutive working days could be written as a series of onedimensional vectors , as shown in equation (4). The intraday trend is calculated by simple average method as equation (5).where stands for travel time data collected at time m on day d. M indicates the number of sampled days, is sampling data points per day. In this study, M = 30, m = 288.
3.1.2. Trigonometric Polynomial Function
Trigonometric polynomial adopts the sinusoids and cosinusoids to describe the periodic pattern. Equation (6) was used to calculate the average daily travel time at each station. Trigonometric polynomial function is represented in the following equation:where is the estimated periodic component at time ; indicates the number of samples per day; is the number of trigonometric polynomials; and are the coefficients.
Regarding the selection of optimal number of trigonometric series functions, Zou et al. [29] claimed that the number of trigonometric polynomials might have an impact on the prediction accuracy of the hybrid model. At the same time, they found that 15 or more trigonometric polynomials should be included in the periodical component. Therefore, in this study, the researchers set the value of n_{r} in equations (6)–(15).
3.1.3. Double Exponential Smoothing
Double exponential smoothing is one widely used method for both smoothing and forecasting time series. This approach builds the prediction in accordance of the levels mean M_{t} and the trend T_{t}. The model can be expressed as where is the observed travel time at time ; stands for the estimate of level of series at time ; indicates the estimate of slope of series at time ; and are smoothing parameters, the two parameters can be estimated using the LevenbergMarquardt algorithm.
3.2. Travel Time Prediction Models
3.2.1. Statistical Models
(1) Autoregressive Integrated Moving Average (ARIMA) Models. The ARIMA model transforms nonstationary time series into stationary time series after di differences, and then stationary sequence can be predicted by the ARMA model. From the view of mathematics, the demonstration of an ARMA (p, q) procedure is as where stands for the future travel time at time ; and are the parameters of pattern; indicates white Gaussian noise with mean zero and variance ; is the number of autoregressive terms; is the amount of lagged forecast errors. Let
ARIMA model is as equation (10):where is a nonnegative integer, which stands for the number of nonseasonal differences. If , the ARMA model could be obtained. When predicting each future travel time value, the best order of the ARIMA model is decided by Akaike information criterion (AIC).
(2) Space Time Model. As a probabilistic modeling method, ST model can provide point prediction and corresponding prediction intervals. The normal distribution is used to describe travel time in this study. The point prediction is given by where and are the location parameter and scale parameter of ; is the cumulative density function of a standard normal distribution. The is modeled through a linear combination of current and previous values of all travel time series on all links. When choosing the predictive variables, different combinations of predictive variables need to be considered. Therefore, researchers begin with the most complex models and gradually subtract predictive variables until no further improvement was obtained. For instance, if variables were selected,where are the travel time at links A, B, C, D, and E at time t and are model coefficients.
To build a model for the predictive spread, , the ST model allows for conditional heteroscedasticity by modeling as a linear function of the volatility value ,
The coefficients and are nonnegative, and their volatility values could be modeled as
(3) Vector Autoregressive Models. VAR model is regarded as one of the most widespread methods which utilize statistical methods in time series prediction. The model can include many factors consisting of the impact of upstream and downstream links on predicting future travel time. During the process of this research, a 5equation VAR model is utilized, and it can be expressed as follows: where ; = 5 × 1 constant term; = 5 × 5 coefficient matrices; and = the corresponding 5 × 1 independently and identically distributed random vector, .
The stability of the VAR model could be guaranteed through the characteristic polynomialwhere stands for a 5 × 5 identity matrix. It is a necessary and sufficient condition that all characteristic roots are located outside the unit circle for stability.
3.2.2. Machine Learning Models
(1) Support Vector Machine Model. The SVM approach is a method which could be used to map the sample space into high or even infinite dimension feature space (Hilbert space) by nonlinear mapping to construct linear regression in a new space. Given a set of data points for regression, N is the number of training samples. Normally, the objective of SVM is to find a function where = the kernel function that maps input into the feature space ; is the weighting vector; b is a constant bias.
A insensitive loss function is assumed as
Then, it could be estimated that and by working out this optimization problem:where is the maximum deviation permitted; is loss function, indicates the related penalty for stating deviation within the training process that assesses the tradeoff between the empirical risk and the smoothness of the model. The relaxation variables and are used to indicate the optimization objective into the optimization issue stated as
That issue mentioned above is worked out by utilizing the Lagrange equation. The regression function is demonstrated aswhere is the kernel function. are the solutions to dual problem. In our study, the grid analysis and cross validation are used to optimize the parameters C and . The crossvalidation method divides data into three groups, among which one subset is the validation group and the other two subsets are used as the training set; 3 models are obtained. Grid analysis is a method of programming enumeration to compare the performance of models with different parameters C and . In this paper, all combinations of Log 2C and Log 2 parameters between −5 and 5 were traversed. The parameter combination with minimum mean square error was selected.
(2) Back Propagation Neural Network Model. In short, the BPNN model is a multilayer feed forward neural network which consists of many parallel nonlinear computing elements. As we all know, initialization network is composed of input layer, hidden layer, and output layer. Within the neural network, the weights between the most important parameter connection layers can be calculated by error back propagation algorithm. When a neural network model acquires the mapping relationship between input and output variables through continuous learning, it can predict the output according to the given input variables.
First, equation (22) can be used to calculate the value of the predicted hidden layer:where stands for the production of hidden layer and S is the incentive function of neurons, h stands for the neuron number of hidden layers, num refers to the neuron number of the inputlayers, stands for the weight element between inputlayer and hiddenlayer, stands for the bias value of hidden layer.
Second, predicting value of the output layer could be calculated throughwhere Q stands for the actual output of output layer, refers to the bias value of output layer, and stands for the neuron number of the output layer. In our study, the empirical formula combined with the trial and error method was used to determine the number of nodes in the hidden layer. 4 nodes with the best performance were selected finally.
(3) Multilinear Regression Model. Compared with the above two supervised algorithms, the construction of multiple linear regressions is simpler and belongs to regression learning category. In MLR, the prediction values can be calculated by the following equation:where represents the prediction value at time t. The independent variable y(t − j) means the travel time data at the previous t − j period, lr is the number of historical travel times considered in MLR model, and r_{0} , …, r_{j} are the regression parameters which can be optimized by training samples. lr is chosen on basis of an analysis of the travel data from January to April 2008. Different numbers of lr are considered.
3.3. Hybrid Prediction Models
As mentioned in Section 2, freeway travel time has a daily periodic characteristic. Therefore, it can be assumed that the travel time has two parts. One of the two parts is the deterministic component; the other is the irregular component. In such a hypothesis, the hybrid prediction model can be used to describe or calculate the freeway travel time:where is the travel time at time t at station D; D_{t} is the periodic component; and represents the residual part after removing the periodic component.
Periodic component can be described by three kinds of functions (TPF, SA, and DES), and the residual part is modeled by six prediction models. We compare the impacts of different periodic functions on multistep ahead freeway travel time prediction models using travel time data with different aggregation levels.
3.4. Measures and Training Period
To evaluate the multistep prediction performance of all prediction models, three indicators, mean absolute error (MAE), mean absolute prediction error (MAPE) and root mean square error (RMSE) are considered comprehensively. The equations for calculating three indexes are as follows:where n is the number of observations; represents the actual travel time at time i on link D; and refers to the predicted travel time.
So far, there is no automatic way to calculate and evaluate the model training period. This study considered different training periods of 15, 20, 25, 30, 40, 50 and 60 days. For comparison, the travel time data in August (21 weekdays) were used as the test set. Figure 5 shows MAE, MAPE, and RMSE values of six travel time prediction models under different lengths of training periods. It is observed that the prediction performance of statistical models and MLR model changed slightly as the number of training period increases. The performance of SVM and BPNN has been greatly improved as training period increases when the training period was less than 30 days. If the training period was more than 30 days, the prediction accuracy of SVM and BPNN models changed slightly. Longer training period usually requires larger computational time for each model. For example, the computational time of the SVM model is 5 minutes when 10day travel time data was used for model training, and the computational time can be as high as 68 minutes when 60 days was chosen as training period. The calculation time and prediction accuracy were considered comprehensively in our study, and a 30day (July (23 days) and June (7 days)) training period is chosen for models.
(a)
(b)
(c)
4. Results and Discussion
In this part, the multistep ahead prediction performance of SVM, BPNN, MLR, ARIMA, ST, VAR under different aggregation levels (i.e., 5minute, 10minute, and 15minute) are evaluated using the travel time data observed on link D. In addition, we explored the impacts of different periodic functions on statistical models and machine learning models under different aggregation levels for the input data. The testing period is 15:30 to 19:30 from 1 August to 31 August (21 weekdays).
4.1. The Performance of Six Models
The study provides the MAE, MAPE, and RMSE values of SVM, BPNN, MLR, ARIMA, ST, and VAR models for different forecasting horizons under different aggregation levels (5minute, 10minute, and 15minute) for the input data (Tables 1–3). Tests on travel time data indicate the following findings. First, the prediction accuracy deteriorates as the forecasting step increases for all models. Second, the higher the data aggregate level, the higher the accuracy of shortterm travel time prediction results. For example, when we predict the 30minute ahead travel time of link D, the prediction result is better with 15minute data as the aggregation level than that with 10minute and 5minute data as the aggregation level. Third, for machine learning models, the prediction accuracy of MLR is lower than that of BPNN and SVM. For statistical models, the prediction accuracy of ARIMA is lower than that of ST and VAR. The possible reason is that the SVM, BPNN, ST, and VAR models use spatial and temporal information from neighboring links to predict the future travel time value at time t + p. While MLR and ARIMA models use the travel time data collected on the target link D only to predict travel time values at time t + p. Fourth, the prediction accuracy of two machine models (SVM and BPNN) is better than that of statistical models. However, the prediction accuracy of MLR model is significantly lower than that of statistical models.


 
Bold values indicate the smallest MAE, MAPE, and RMSE values in machine learning models and statistical models, respectively. 
4.2. Impacts of Periodic Functions
To investigate whether the proposed periodic functions improve the performance of six prediction models, hybrid models and prediction models are considered to predict travel time values at link D for the same testing period (15:30 to 19:30 from 1 August to 31 August). The periodic function shows a consistent rule on improving the prediction model, whether it is based on MAE index, MAPE or RMSE index. The figure shows the RMSE results of the six models and 18 hybrid models for different forecasting horizons under different aggregation levels (5minute, 10minute, 15minute) (Figure 6). Based on the observation of the results, several interesting conclusions can be drawn. First, period functions have similar impacts on SVM and BPNN models. The periodic functions have a definite improvement for more than 60 minutes ahead prediction under three data aggregation levels. Second, three periodic functions have improved the prediction performance of MLR model in multistep ahead prediction for three data aggregation levels. Third, three periodic functions can improve the prediction accuracy of travel time over 30 minutes ahead for all statistical models. Fourth, with the increase of aggregation level, the difference of prediction results of the comprehensive prediction model considering periodic functions increases gradually. For example, when the aggregation level is 5minute, the prediction results of the three SVM comprehensive models have little difference, while when the aggregation level is 15 minutes, the prediction results are significantly different.
(a)
(b)
(c)
(d)
(e)
(f)
From the above analysis, we can conclude that the periodic functions obviously improve the prediction accuracy of the six prediction models for multistep ahead prediction. Then we analyze the impact degree of different periodic functions on prediction models based on mean absolute error difference (MAED). The equation of the MAED is as follows:where n is the number of observations; represents the actual travel time at time i on link D; refers to the predicted travel time based on traditional prediction models, and refers to the predicted travel time based on models considering periodic functions. If the MAED is greater than zero, this periodic function improves the prediction accuracy of the traditional prediction model; otherwise, it reduces the prediction accuracy of the traditional prediction model. The result shows the MAED values for 18 hybrid models from 1step to 12step ahead forecasting with 5minute, 10minute and 15minute as aggregation levels (Figure 7). First, periodic functions can significantly improve the MLR model in multistep ahead prediction for three data aggregation levels. Second, for 1step and 3step ahead prediction, three periodic functions reduce the prediction accuracy of SVM and BPNN models. Third, for 1step ahead prediction, both TPF and SA improve the performance of statistical models in multistep ahead prediction for three data aggregation. When the aggregation level is 5minute, TPF can obviously improve the prediction accuracy. While when the aggregation level increases to 10minute, SA periodic function performs better. For 3step ahead prediction, three periodic functions improve the forecasting results of the statistical models obviously, and three periodic functions have slight difference in improving prediction accuracy of the statistical models. Fourth, when 6step and 12step prediction ahead, three periodic functions improve the forecasting results of the six models obviously, and three periodic functions have slight difference in improving prediction accuracy of the prediction model. Fifth, for the same prediction step, the improvement of periodic function is more obvious with the increase of data aggregation level. For example, for 12step ahead prediction, hybrid models perform better with 15minute as aggregated level than that of with 5minute as aggregated level.
(a)
(b)
(c)
(d)
In this section, we discuss the aggregation level and periodic function suggestions. According to the conclusion of Table 1 and Figure 7, the higher the data aggregate level, the higher the accuracy of shortterm travel time prediction results. When prediction time was greater than 15 minutes, the highest accuracy could be obtained by using the aggregation level of 15 minutes. According to Figures 66(a) and 6(b), the periodic functions cannot improve the prediction performance of SVM and BPNN models when prediction time was less than 60 minutes, it is recommended that no periodic function should be considered. When the prediction time is greater than 60 minutes, TPF periodic functions are recommended. As can be seen from Figure 7, both TPF and SA have improved the performance of statistical models and MLR model in multistep ahead prediction for three data aggregation and SA performed better. For different minutes ahead prediction, aggregation level and periodic function suggestions are shown in Table 4.

5. Conclusions
This paper evaluated the multistep ahead prediction performance of SVM, BPNN, MLR, ARIMA, ST, and VAR models using the freeway travel time data collected from vehicle identification readers along US290 in Houston, Texas. The performances of the six prediction models under different aggregation levels (5minute, 10minute, and 15minute) were compared. The impacts of different periodic functions on machine learning and statistical models under different aggregation levels (5minute, 10minute, and 15minute) are also investigated. Several important conclusions can be drawn based on the results. First, the periodic functions can improve the prediction performance of machine learning models for more than 60 minutes ahead prediction and improve the over 30 minutes ahead prediction accuracy of all statistical models. Second, the considered three periodic functions have slight difference in improving prediction accuracy of the six prediction models during multistep ahead prediction. Third, with the increase of prediction steps, the impact of periodic function on the prediction model becomes obvious. Fourth, for the same prediction step, the effect of periodic function is more obvious with the increase of data aggregation level. For future work, since nonrecurrent events (incidents, special events, etc.) may disturb the cyclical pattern of travel time, it will be interesting to analyze and compare the impacts of periodic functions on prediction models under nonrecurrent traffic conditions. In addition, artificial intelligence has greatly promoted the development of traffic science. Especially deep learning algorithms, such as deep residual networks, cyclic neural networks and convolutional neural networks, have been rapidly developed in transportation field. It is also interesting to examine the impact of different periodic functions on deep learning algorithms.
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This research was funded by the National Key Research and Development Program of China (grant no. 2018YFE0102800).
References
 E. I. Vlahogianni, M. G. Karlaftis, and J. C. Golias, “Shortterm traffic forecasting: where we are and where we’re going,” Transportation Research Part C: Emerging Technologies, vol. 43, pp. 3–19, 2014. View at: Publisher Site  Google Scholar
 J. Xia, M. Chen, and W. Huang, “A multistep corridor traveltime prediction method using presencetype vehicle detector data,” Journal of Intelligent Transportation Systems, vol. 15, no. 2, pp. 104–113, 2011. View at: Publisher Site  Google Scholar
 E. I. Vlahogianni, J. C. Golias, and M. G. Karlaftis, “Shortterm traffic forecasting: overview of objectives and methods,” Transport Reviews, vol. 24, no. 5, pp. 533–557, 2004. View at: Publisher Site  Google Scholar
 M. Elhenawy and H. A. Rakha, Expected Travel Time and Reliability Prediction Using Mixture Linear Regression, Transportation Research Board, Washington DC, USA, 2016.
 X. Zhang and J. A. Rice, “Shortterm travel time prediction,” Transportation Research Part C: Emerging Technologies, vol. 11, no. 34, pp. 187–210, 2003. View at: Publisher Site  Google Scholar
 A. Guin, “Travel time prediction using a seasonal autoregressive integrated moving average time series model,” in Proceedings of the 2006 IEEE Intelligent Transportation Systems Conference, IEEE, Toronto, Canada, September 2006. View at: Publisher Site  Google Scholar
 S. Ishak and H. AlDeek, “Performance evaluation of shortterm timeseries traffic prediction model,” Journal of Transportation Engineering, vol. 128, no. 6, pp. 490–498, 2002. View at: Publisher Site  Google Scholar
 Z. Tongyu, K. Xueping, L. Weifeng, Z. Yuan, and D. Bowen, “Travel time prediction for float car system based on time series,” in Proceedings of the 2010 12th International Conference on Advanced Communication Technology (ICACT), IEEE, Phoenix Park, South Korea, February 2010. View at: Google Scholar
 W. Min and L. Wynter, “Realtime road traffic prediction with spatiotemporal correlations,” Transportation Research Part C: Emerging Technologies, vol. 19, no. 4, pp. 606–616, 2011. View at: Publisher Site  Google Scholar
 Q. F. Yang, X. W. Wei, C. Y. Lin, Z. L. Li, and X. Y. Liu, “Reliability prediction of travel time based on spatiotemporal bayesian model,” Journal of South China University of Technology (Natural Science Edition), vol. 44, pp. 115–122, 2016. View at: Google Scholar
 S. I. Chien and C. M. Kuchipudi, “Dynamic travel time prediction with realtime and historic data,” Journal of Transportation Engineering, vol. 129, pp. 609–616, 2003. View at: Publisher Site  Google Scholar
 L. Chu, J.S. Oh, and W. Recker, Adaptive Kalman Filter Based Freeway Travel Time Estimation, Transportation Research Board, Washington DC, USA, 2005.
 W.C. Hong, “Traffic flow forecasting by seasonal SVR with chaotic simulated annealing algorithm,” Neurocomputing, vol. 74, no. 1213, pp. 2096–2107, 2011. View at: Publisher Site  Google Scholar
 W. ChunHsin, H. JanMing, and D. T. Lee, “Traveltime prediction with support vector regression,” IEEE Transactions on Intelligent Transportation Systems, vol. 5, pp. 276–281, 2004. View at: Google Scholar
 K. L. Li, Y. H. Ma, Y. M. Tian, and J. Xie, “An improved LSSVR ensemble learning in internet traffic prediction,” Applied Mechanics and Materials, vol. 121126, pp. 3794–3798, 2011. View at: Publisher Site  Google Scholar
 J. W. C. V. Lint, “Reliable realtime framework for shortterm freeway travel time prediction,” Transportation Research Record: Journal of the Transportation Research Board, vol. 132, pp. 921–932, 2006. View at: Google Scholar
 J. W. C. V. Lint, “Online learning solutions for freeway travel time prediction,” IEEE Transactions on Intelligent Transportation Systems, vol. 9, pp. 38–47, 2008. View at: Google Scholar
 J. Y. Wang, K. I. Wong, and Y. Y. Chen, “Shortterm travel time estimation and prediction for long freeway corridor using NN and regression,” in Proceedings of the 2012 15th International IEEE Conference on Intelligent Transportation Systems, IEEE, Anchorage, AK, USA, September 2012. View at: Publisher Site  Google Scholar
 X. Zeng and Y. Zhang, “Development of recurrent neural network considering temporalspatial input dynamics for freeway travel time modeling,” Computeraided Civil and Infrastructure Engineering, vol. 28, no. 5, pp. 359–371, 2013. View at: Publisher Site  Google Scholar
 J. Wang, I. Tsapakis, and C. Zhong, “ A space time delay neural network model for travel time prediction,” Engineering Applications of Artificial Intelligence, vol. 52, pp. 145–160, 2016. View at: Google Scholar
 X. Yang, Y. Zou, J. Tang, J. Liang, and M. Ijaz, “Evaluation of shortterm freeway speed prediction based on periodic analysis using statistical models and machine learning models,” Journal of Advanced Transportation, vol. 2020, Article ID 9628957, 16 pages, 2020. View at: Publisher Site  Google Scholar
 J. Tang, L. Zheng, C. Han et al., “Statistical and machinelearning methods for clearance time prediction of road incidents: a methodology review,” Analytic Methods in Accident Research, vol. 27, Article ID 100123, 2020. View at: Publisher Site  Google Scholar
 Y. Zou, X. Zhu, Y. Zhang, and X. Zeng, “A spacetime diurnal method for shortterm freeway travel time prediction,” Transportation Research Part C: Emerging Technologies, vol. 43, pp. 33–49, 2014. View at: Publisher Site  Google Scholar
 N. Zou, J. Wang, and G. L. Chang, “A reliable hybrid prediction model for realtime travel time prediction with widely spaced detectors,” in Proceedings of the 2008 11th International IEEE Conference on Intelligent Transportation Systems, IEEE, Beijing, China, October 2008. View at: Publisher Site  Google Scholar
 C. M. Kuchipudi and S. I. J. Chien, “Development of a hybrid model for dynamic traveltime prediction,” Transportation Research Record: Journal of the Transportation Research Board, vol. 1855, no. 1, pp. 22–31, 2003. View at: Publisher Site  Google Scholar
 A. Stathopoulos, L. Dimitriou, and T. Tsekeris, “Fuzzy modeling approach for combined forecasting of urban traffic flow,” ComputerAided Civil and Infrastructure Engineering, vol. 23, no. 7, pp. 521–535, 2008. View at: Publisher Site  Google Scholar
 E. I. Vlahogianni, “Enhancing predictions in signalized arterials with information on shortterm traffic flow dynamics,” Journal of Intelligent Transportation Systems, vol. 13, no. 2, pp. 73–84, 2009. View at: Publisher Site  Google Scholar
 H. Jiang, Y. Zou, S. Zhang, J. Tang, and Y. Wang, “Shortterm speed prediction using remote microwave sensor data: machine learning versus statistical model,” Mathematical Problems in Engineering, vol. 2016, Article ID 9236156, 13 pages, 2016. View at: Publisher Site  Google Scholar
 Y. Zou, X. Hua, Y. Zhang, and Y. Wang, “Hybrid shortterm freeway speed prediction methods based on periodic analysis,” Canadian Journal of Civil Engineering, vol. 42, no. 8, pp. 570–582, 2015. View at: Publisher Site  Google Scholar
 J. Tang, H. Wang, Y. Wang, X. Liu, and F. Liu, “Hybrid prediction approach based on weekly similarities of traffic flow for different temporal scales,” Transportation Research Record: Journal of the Transportation Research Board, vol. 2443, no. 1, pp. 21–31, 2014. View at: Publisher Site  Google Scholar
 C. Chen, Y. Wang, L. Li, J. Hu, and Z. Zhang, “The retrieval of intraday trend and its influence on traffic prediction,” Transportation Research Part C: Emerging Technologies, vol. 22, pp. 103–118, 2012. View at: Publisher Site  Google Scholar
 D. Park, L. R. Rilett, B. J. Gajewski, C. H. Spiegelman, and C. Choi, “Identifying optimal data aggregation interval sizes for link and corridor travel time estimation and forecasting,” Transportation, vol. 36, no. 1, pp. 77–95, 2009. View at: Publisher Site  Google Scholar
 E. Vlahogianni and M. Karlaftis, “Temporal aggregation in traffic data: implications for statistical characteristics and model choice,” Transportation Letters, vol. 3, no. 1, pp. 37–49, 2011. View at: Publisher Site  Google Scholar
 F. Qiao, X. Wang, and L. Yu, Optimizing Aggregation Level for Its Data Based on Wavelet Decomposition, Transportation Research Board, Washington D.C., 2003.
 X. Liu, X. Fang, Z. Qin, C. Ye, and M. Xie, “A shortterm forecasting algorithm for network traffic based on Chaos theory and SVM,” Journal of Network and Systems Management, vol. 19, no. 4, pp. 427–447, 2011. View at: Publisher Site  Google Scholar
 Y. Kamarianakis, H. Oliver Gao, and P. Prastacos, “Characterizing regimes in daily cycles of urban traffic using smoothtransition regressions,” Transportation Research Part C: Emerging Technologies, vol. 18, no. 5, pp. 821–840, 2010. View at: Publisher Site  Google Scholar
Copyright
Copyright © 2020 Xu Miao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.