Abstract

Directing against the shortcoming of low accuracy in short-term traffic flow prediction caused by strong traffic flow fluctuation, a novel method for short-term traffic forecasting based on the combination of improved grey Verhulst prediction algorithm and first-order difference exponential smoothing is proposed. Firstly, we constructed an improved grey Verhulst prediction model by introducing the Markov chain to its traditional version. Then, based on an introduced dynamic weighting factor, the improved grey Verhulst prediction method, and the first-order difference exponential smoothing technique, the new method for short-term traffic forecasting is completed in an efficient way. Finally, experiment and analysis are carried out in the light of actual data gathered from strong fluctuation environment to verify the effectiveness and rationality of our proposed scheme.

1. Introduction

In recent years, the popularization of seamless links among heterogeneous traffic equipment brought about higher requirements on the real-time and reliability of short-term traffic flow prediction. With continuous improvement of traffic information processing, how to predict the short-term traffic flow accurately and effectively has aroused wide attention of scholars domestically and abroad [13], whereupon numerous of prominent research results have emerged. So far, relevant academic circles mainly focus on the construction and optimization of prediction models in terms of time series, linear regression, historical average model, Kalman filtering, grey theory, chaos theory, nonparametric regression, neural network, support vector machine, dynamic traffic assignment model, and so forth [49]. These algorithms and models mentioned above are relatively mature, and their prediction effects are acceptable under the environment of favorable traffic flow stability. However, once the traffic data are seriously fluctuated, one single model can neither guarantee the prediction accuracy nor break through certain limitations of operations to predict short-term traffic flow under the environments of heterogeneous traffic equipment transmission.

In order to solve the aforementioned problems, domestic and foreign scholars have contrived an improved model to realize fusion prediction with the advantages of different models integrated of short-term traffic flow. For example, Xie et al. [10] improved the search efficiency of -nearest neighbor algorithm by cooperating the multivariate statistical regression model and the pattern distance search method based on the analysis of original -nearest neighbor algorithm for short-term traffic flow prediction. Their experimental results indicated that the prediction effect is better when the values are reasonable. Fan et al. [4] exploited the characteristics of time-varying and nonlinearity of traffic flow and proposed a new hybrid forecasting model based on the nonparametric regression model and the BP neural network model; they also employed fuzzy control to determine the weight of each single model. However, the basic support platform of the hybrid model is the traffic flow database, so the demand of data volume is large. Once the data volume is insufficient, the prediction accuracy will be depressed. Xiao et al. [11] proposed an improved binding cycle truncation accumulated generating operation seasonal grey rolling forecasting model based on the properties of similar seasonality within intraday and weekly trends. The model weakens the random disturbance and highlights the intrinsic grey exponent rule after accumulating the sequence, so that the model has better performance under different traffic flow conditions. Lin-chao et al. [12] proposed a short-term traffic flow prediction model based on support vector regression which is suitable for real-time monitoring and then analyzed the model parameters by grid search under the premise of considering the influence of space-time factors. The model can achieve ideal prediction effect even if the real-time traffic data are insufficient. Nevertheless, the accuracy of the model will decrease when the traffic status changes. In order to solve such problem, Ma et al. [13] proposed a two-dimensional prediction method by using the Kalman filtering theory based on historical data. The advantage of this method is that the two predicted values are fused by using an equation with weight coefficients where the weight coefficients can be generated in real time in the process of prediction. Experimental results show that the model has an admirable predictive effect. Chan et al. [14] used exponential smoothing to preprocess traffic data that is taken as the input of network and then used Levenberg Marquardt (LM) variant algorithm to train the network weights, making the generalization ability of network enhanced.

However, thanks to the diversified developments of traffic information processing and data transmission techniques within heterogeneous traffic network, as well as the impact caused by dynamic changes of road topology, traffic accidents, severe weather, driving styles, and so forth, short-term traffic data are instantaneous and the irregular volatility is always changing [15, 16]. In this case, the above combinational algorithm can just solve the short-term traffic prediction problem in certain circumstance, and the prediction effect can hardly be achieved in the light of the real-time requirement. Hence, it is urgent to construct new models and algorithms to deal with the short-term traffic forecasting problem aiming at the strong fluctuation triggered by events. Thus, in this paper, we intend to propose a novel method for short-term traffic prediction based on the energetically grey Verhulst prediction algorithm and the first-order difference exponential smoothing technique to solve the problem of low prediction accuracy caused by traffic event-triggering strong fluctuation.

The layout of this paper is arranged as follows. Firstly, based on the introduction of the traditional grey Verhulst model, an improved grey Markov forecasting model is devised by introducing Markov chain. Secondly, combining the advantages of forecasting by utilizing first-order difference exponential smoothing algorithm and introducing a dynamic weighting factor, a new method for short-term traffic forecasting is concretely constructed according to the afore contrived models. Finally, comparative analysis of the examples illustrated that our proposed model and algorithm are more effective.

2. Short-Term Traffic Flow Prediction Model Based on Grey Difference Exponential Smoothing

2.1. Short-Term Traffic Flow Prediction Model Based on Grey Markov Theory
2.1.1. Introduction of Grey Verhulst Model

The grey system theory was first put forward by Professor Deng Julong, a Chinese scholar, in 1980s [17]. Its quantitative model is mainly based on the structure of number generating, which makes the prediction effect no longer affected by the empirical statistical law gained from the analysis of mass data. At the same time, it surmounted the limitation of white system and the black system relying solely on the probability and statistics method. Therefore, the grey system theory has been widely used in the fields of agricultural production, industrial control, traffic management, and so forth [18].

Classical grey system theory mainly includes GM model, GM model, grey Verhulst model, and so forth [19]. Considering the strong randomness and nonlinearity of traffic flow with strong fluctuation in heterogeneous information model, the data sequence significantly gives expression to the characteristics of nonmonotonic oscillatory development or the S-shaped property of saturation trend, which makes the grey GM prediction model unsuitable. Therefore, we choose the grey Verhulst model in this paper to forecast short-term traffic. The basic modeling process is described as follows.

The nonnegative data sequence is defined aswhile stands for the one-time accumulation sequence (1-Ago):

Then, assuming that represents the consecutive neighbor sequence of ,

The grey Verhulst model and its whitening equation can be defined as follows.

Definition 1 (see [20, 21]). The grey Verhulst model iswhere and are the parameters of the equation.

Definition 2 (see [20, 21]). The whitening equation of the grey Verhulst model iswhere , , and are the parameters of the whitening equation.

As deduced in [20, 21], the following conclusions are drawn.

Theorem 3. If the grey Verhulst model is defined as formula (4) and represents the parameter column, letThe least squares estimator of the parameter column satisfies

Theorem 4. If the grey Verhulst model is defined as definition (2), the solution of the whitening equation can be deduced asAccording to Theorems 3 and 4, the following conclusions are drawn.

Inference 5. The time response sequence of grey Verhulst model can be defined asLet ; we iterate formula (9) to beThe reduction formula can then be defined as

Pointing at the traffic flow with the trend of increasing saturation, numerous traffic prediction algorithms have been proposed taking advantage of the grey Verhulst model to achieve preferable prediction effects. However, as a complicated nonlinear system involving multitudinous uncertainties, the probability of fortuitous events on urban roads is highly fluctuant, which leads to the deviation of forecasting results. Therefore, it is necessary to take the nonlinearity and time-varying characteristics of overall interactions into account in line with various influential factors. So, we improve the algorithm aiming at the accuracy of prediction model next.

2.1.2. Short-Term Traffic Flow Prediction Model Based on Grey Markov Theory

During the process of traffic data aggregation, which is coordinated by complex human-vehicle-environment interaction, the current traffic flow is often affected by previous moments. Therefore, in order to improve the prediction accuracy of grey Verhulst model, mathematical description of traffic flow aggregation is given in advance by constructing the Markov state transition probability matrix [22].

It is obvious that the state transition probability matrix should be updated over time when Markov is used to optimize the grey model. That is to say, at time point , the state transition probability of Markov chain should be recalculated according to real-time traffic data along with the state transition probability matrix updated synchronously. Therefore, at time point , the probability of traffic flow aggregation from state to can be described asAnd the updated Markov transition probability matrix is formulated to bewhere each element is constrained as , , and , .

The grey Markov forecasting model is shown in Figure 1. According to the flow chart of this model, the value of grey Verhulst model to be predicted is at time point . In order to make the predicted value as close to the true value as possible, the true value at the time point is utilized and represented as , and the transition probability from the true value of time point to the predictive value is denoted by . Thus, the cumulative sum of the product of the traffic flow average value and its corresponding probability from the current state to other states can be taken as the predicted value of the grey Markov model:where is the adjustment factor and .

The adjustment factor is used to solve the data fluctuation problem of collaborative traffic flow prediction in terms of human, vehicle, and environment interaction. However, the smoothness of short-term traffic flow forecasting remains to be improved once events trigger strong fluctuations caused by traffic flow aggregation. Therefore, in order to continuously improve the prediction smoothness together with accuracy, a grey difference exponential smoothing short-term traffic prediction method is further constructed by introducing the first-order difference exponential smoothing model in the next subsection.

2.2. Short-Term Flow Traffic Prediction Model Based on Grey Difference Exponential Smoothing

Exponential smoothing [23] is a widely used sequential prediction algorithm, whose predictive values are obtained by assigning different weights to historical records. Similar to the exponential smoothing algorithm, the first-order difference exponential smoothing algorithm achieves the predicted value by giving different weights to different data sequences. The basic modeling process is shown as follows.

For data sequence (1), the difference sequence is represented asAnd the first-order difference exponential smoothing equation iswhere . Therefore, the forecast result should be

Herein, and when . Thus, formula (17) can be rewritten as

We use dynamic weighting factors to weight the models of grey Markov model and first-order difference exponential smoothing model. The predictive value of the grey exponential smoothing model is thus achieved:

In formula (19), and are the predictive value of grey Markov and first-order difference exponential smoothing, respectively, while and are their corresponding weight factors. The weight factors are calculated as follows:

In formula (20), the weight of and depends on the absolute value of the difference between the predicted results of different models and the short-term traffic flow. In the meantime, in order to avoid the failure of formula (20), when the two prediction errors are 0, tends to zero. The pseudocode of grey difference exponential smoothing prediction model is shown in Algorithm 1.

           Input: Historical traffic flow data set
           Output: Short-term traffic flow forecasting result set
    Initial state probability matrix  , , , ,
    for   to   do
         Calculate forecasting result    and
         ,
         Calculate the error    and  
         ,  
        Update the weights of    and  
         ,
         Update the state transition matrix    by  
   end
   return  

According to the synthetic forecasting algorithm, the prediction process of grey difference exponential smoothing prediction model is shown in Figure 2.

3. Example and Analysis

In order to verify the prediction effect of our model, the benchmarks of mean absolute error (MAE), mean square error (MSE), mean absolute percentage error (MAPE), and mean square percentage error (MSPE) are taken as evaluation indexes. Those evaluation indexes are defined as formulas (21)–(24). In those formulas, and are the real value and predicted value of the traffic flow at time , respectively, and stands for the number of samples in the short-term traffic flow.(1)Mean absolute error is(2)Mean square error is(3)Mean absolute percentage error is(4)Mean square percentage error is

The urban expressway in Nan’an District of Chongqing Municipality in China is selected in observation phase to verify the actual prediction effect of the grey difference exponential smoothing model, which is composed of 6 bidirectional lanes. The schematic diagram of the section observation is shown in Figure 3. The data detected include the traffic volume, speed, and occupancy of each lane, which are collected via microwave-based vehicle sensors.

The road traffic flow is counted every 5 minutes, and the 226 observation data points of No. 1 Lane are used as the basic data for verification. The fluctuation trend of the data is shown in Figure 4.

As illustrated in Figure 4, there is a strong fluctuation in historical traffic flow, which brought about great difficulties to the prediction. When using the first-order difference exponential smoothing algorithm to predict the short time traffic flow, it is necessary to determine the value of in the definite formula (18). In this paper, five experiments are carried out according to the parameters listed in Table 1, and the evaluation indexes corresponding to the first-order difference exponential smoothing algorithm are obtained in terms of different benchmarks.

According to the results in Table 1, when , the first-order difference exponential smoothing algorithm produces the best effect, and the fitting patterns between the predicted value and actual value are shown in Figure 5.

The initial state probability transfer matrix is calculated from the historical traffic sequence when the grey Markov model is used to predict the short-term traffic flow. The state is divided according to the same interval that every 30 vehicles are used as a state. In the following prediction process, we added real-time traffic data to update Markov state probability transfer matrix in real time. The prediction result of grey Markov model is determined by formula (14), and the fitting effect of predicted value and actual value is shown in Figure 6.

Comparing Figure 6 with Figure 5, the grey Markov model can commendably reflect the trend of traffic flow. However, urban road traffic as a complex nonlinear system shows high uncertainty at different times. The grey Markov chain model cannot accurately predict once sharp fluctuations occur. Meanwhile, compared with the grey Markov chain prediction model, the first-order difference exponential smoothing model is more accordant with the trend of original data, but the prediction results of the algorithm depend on the value of and, more importantly, are affected by time delays.

Therefore, combining the advantages of grey Markov model and first-order difference exponential smoothing, a new predictive value is obtained by using dynamic weighting factors according to formula (19) and formula (20). The prediction results are shown in Figure 7.

Based on the short-term traffic data of the observed road sections, the test data are predicted, respectively, by formulas (14), (18), and (19). The fitting results of the first 120 data points and the last 106 data points are separately illustrated in Figures 8 and 9.

The evaluation indexes of the above prediction models are shown in Table 2 for the first-order difference exponential smoothing parameter . Simulation results showed that the grey difference exponential smoothing algorithm can fit short-term traffic flow well and can be applied to urban traffic signal control for short-term traffic flow data with strong fluctuation.

Compared with the above models, the first-order difference exponential smoothing algorithm can closely reflect the trend of the original data sequence in the prediction trend of the data, but its predicted value depends on the value of and the trend of the data sequence. For the abrupt data points, the accuracy of its prediction will often be affected to distinct degrees. Compared with the first-order difference exponential smoothing model, the prediction accuracy of grey Markov model considers the development trend of the whole data and the state transition probability of historical traffic flow data. Therefore, the model can reflect the overall equilibrium level for data series with large fluctuation. The grey difference exponential smoothing model combines the advantages of the two models and uses the dynamic weight to obtain the predictive value. The performance evaluation indexes MAE, MSE, MAPE, and MSPE of the improved method are better than the first-order difference exponential smoothing model and grey Markov model for the same traffic flow sequence. However, it can also be seen from Figure 6 and evaluation indexes that the prediction effect of the grey difference exponential smoothing algorithm using dynamic weighted calculation is not absolutely well. Therefore, the method needs to be further optimized and improved.

4. Conclusion

This paper mainly contributes to the prediction effect of short-term traffic flow. To overcome the shortcoming of low accuracy in short-term traffic flow prediction caused by strong traffic flow fluctuation, a novel method for short-term traffic forecasting based on the combination of improved grey Verhulst prediction algorithm and first-order difference exponential smoothing is proposed. The main conclusions are as follows.

(1) The grey differential exponential smoothing model combines the advantages of the grey Markov model and the first-order differential exponential smoothing model. The experimental results illustrated that this method is suitable for forecasting short-term traffic flow with large fluctuation. Therefore, this method is practical and feasible.

The analysis shows that this method has smaller prediction error compared with the grey Markov model and the first-order difference exponential smoothing model, and the prediction result is closer to the actual value.

(2) In a word, the method proposed in this paper has obvious advantages. However, due to high uncertainty and nonlinearity of the short-term traffic flow, how to improve the prediction accuracy with fluctuation remains an open problem. So this will be our continuous research orientation.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work is supported by the National Science Foundation of China (NSFC) under Grants 61573076 and 61703063; the Scientific Research Foundation for the Returned Overseas Chinese Scholars under Grant 2015-49; the Program for Excellent Talents of Chongqing Higher School under Grant 2014-18; Science and Technology Research Project of Chongqing Municipal Education Commission of China under Grants KJ1705121 and KJ1705139; the Chongqing Natural Science Foundation of China under Grant CSTC2017jcyjA1665.