Research Article  Open Access
Forecasting of ShortTerm Metro Ridership with Support Vector Machine Online Model
Abstract
Forecasting for shortterm ridership is the foundation of metro operation and management. A prediction model is necessary to seize the weekly periodicity and nonlinearity characteristics of shortterm ridership in realtime. First, this research captures the inherent periodicity of ridership via seasonal autoregressive integrated moving average model (SARIMA) and proposes a support vector machine overall online model (SVMOOL) which insets the weekly periodic characteristics and trains the updated data day by day. Then, this research captures the nonlinear characteristics of the ridership via successive ridership value inputs and proposes a support vector machine partial online model (SVMPOL) which insets the nonlinear characteristics and trains the updated data of the predicted day by time interval (such as 5min). Afterwards, to avoid the drawbacks and to take advantages of the strengths of the two individual online models, this research takes the average predicted values of two models as the final predicted values, which are called support vector machine combined online model (SVMCOL). Finally, this research uses the 5min ridership at Zhujianglu and Sanshanjie Stations of Nanjing Metro to compare the SVMCOL model with three wellknown prediction models including SARIMA, backpropagation neural network (BPNN), and SVM models. The resultant performance comparisons suggest that SARIMA is superior for the stable weekday ridership to other models. Yet the SVMCOL model is the best performer for the unstable weekend ridership and holiday ridership. It shows that for metro operation manager that gear toward timely response to realworld unstable and abnormal situations, the SVMCOL may be a better tool than the three wellknown models.
1. Introduction
Shortterm ridership forecasting is a vital component of metro operation and management. Accurate predictions can reflect realtime changes in ridership. The prediction results can become important inputs for decisionmaking in evaluating rail transit service level and system operating status and provide an important basis for station passenger crowd regulation and emergency response. In addition, shortterm ridership forecasting is the key to the success of revenue management for railway operators [1].
In the last two decades, traditional metro ridership forecasting is based on travel demand forecasting models including the steps of trip generation, trip distribution, mode choice, and assignment [2, 3]. This type of longterm forecasting has been applied in the planning and construction of metro, but it cannot be adapted to the needs of the operations management.
Though the spatialtemporal characteristics of metro ridership are not completely the same as those for vehicle traffic flow [4], shortterm forecasting methods can also be divided into two categories: the theory driven method and the data driven method. Theory driven method is based on traffic flow mechanism to investigate traffic dynamics [5, 6]. The data driven method on the other hand is based on the data of traffic flow series itself to construct models and make predictions. The data driven model is the main method of shortterm prediction and can be divided into linear, nonlinear, and hybrid forecasting methods. The linear forecasting method mainly includes time series model [7–9] and Kalman filtering model [10–12]. The nonlinear forecasting method includes nonparametric regression [13, 14], neural network algorithm [15–17], support vector machine [18–20], and Gaussian maximum likelihood model [21]. The hybrid forecasting method combines at least two methods for prediction to achieve better performance in accuracy and reliability. Hybrid models mainly include wavelet decomposition hybrid model [22, 23], Bayesian decomposition hybrid model [24, 25], empirical mode decomposition hybrid model [26], neural network hybrid model [27, 28], and support vector machine hybrid model [29–35].
Whether it is traffic flow or passenger, time series model has become one of the classic models of shortterm flow prediction [36]. Of all the time series models, seasonal autoregressive integrated moving average (SARIMA) model considers the periodicity feature of the time series, so it can capture the inherent periodicity of traffic flow data. Williams et al. [9–11] used the SARIMA model for shortterm traffic flow prediction and verified its good performance. But time series model is a linear model, and its prediction performance may worsen significantly if the time series are nonstationary and nonlinear. Nevertheless SARIMA model is widely used to be the benchmark to evaluate the forecasting performance of a novel model.
Neural networks are among the most widely used nonlinear models. A neural network trains neurons based on historical data, maps the complicated nonlinear relation between input and output data, and uses the relationship for predictions for given inputs. Neural network algorithms have the adaptive and learning advantages and are flexible without the need to construct detailed and explicit models like other methods. Vlahogianni et al. [37] optimized neural network structure to forecast urban traffic flow parameters. But the neural network algorithm cannot make expected risk minimization because of the empirical risk minimization principle that may also lead to two major drawbacks: local minima and overfitting [38]. The local minima are associated with the training process of neural network, which is to minimize the difference between the predicted outputs and the observed outputs by optimizing the network weights. Overfitting leads to poor generalization ability and may produce inaccurate predictions with some particular testing data.
Compared with neural network algorithm, support vector machine (SVM) model can strike a compromise between prediction accuracy and generalization ability based on the structural risk minimization principle. With the help of intelligent use of kernel function, SVM can solve the problems of small sample, nonlinearity and the curse of dimensionality, overfitting, and local minima. Zhang and Xie [19] proposed a υsupport vector machine model for shortterm traffic volume prediction and showed that it outperformed the multilayer feedforward neural network (MLFNN) model. Zhang et al. [30] proposed a novel hybrid model that identified the SVM input dimensions via SARIMA model to forecast shortterm traffic volume, taking advantage of the individual strengths of the two models. Hong [33] presented a traffic flow forecasting model to forecast interurban traffic flow, which combines the seasonal support vector regression model with chaotic immune algorithm (SSVRCIA), and yielded more accurate forecasting results than the SARIMA, BPNN, and seasonal Holt–Winters models. Wang and Shi [34] constructed a new kernel function using a wavelet function to capture the nonstationary characteristics of the shortterm traffic speed data, proposed a shortterm traffic speed forecasting hybrid model (ChaosWavelet AnalysisSupport Vector Machine model, CWSVM), and achieved the encouraging results. Chen et al. [35] proposed an approach which hybridizes SVR model with adaptive genetic algorithm (AGA) and the seasonal index adjustment, namely, AGASSVR, to forecast holiday daily tourist flow.
The research of shortterm metro ridership forecasting is a rather new undertaking. Tsai et al. [1] proposed two novel neural network structures based on temporal feature extraction and successfully applied them in railway shortterm passenger demand forecasting in Taiwan. Wei and Chen [26] used empirical mode decomposition to extract neural network input variables to forecast the shortterm ridership of Taipei Rapid Transit Muzha Line. Sun et al. [29] proposed a novel hybrid model WaveletSVM, and the experimental results showed that the approach has appeared to be the promising and robust. These studies indicated that metro ridership has significant characteristics of periodicity and nonlinearity reflecting a variety of factors; however, how these characteristics are embedded into the model without affecting the computational complexity of the model is worth discussing. And, for neural network or support vector model, the previous literature also did not discuss the training time to see if it meets the demand of practical operation. If the training time is too long and leads to serious forecasting delay, the prediction model cannot meet the demand of practical operation even if it has good prediction performance. In addition, most existing research on shortterm metro ridership forecasting focused mainly on normal situations; it is not clear how the applicability and the prediction accuracy of the model is when it comes to holidays, inclement weather, large sports events, or emergencies. Sun et al. [29] selected the data including a Valentine’s Day (not a major holiday) as training data, not as a predictor. Finally, the shortterm prediction interval is long (i.e., 15min) in these literatures, and, for the actual operation of the metro, it cannot meet the requirement of the operator because the departure intervals are short.
The reliability and the operability of the models play a crucial role in the accuracy and realtime implementation of the prediction, so the choice of the model is very important in a practical application. Since the characteristics of metro ridership are quite different from those in other transportation systems, most of forecasting models provide unsatisfactory prediction effectiveness. After comparing time series model, neural network model, and SVM model, this paper selects SVM model as the base shortterm prediction model, considering capturing in realtime the periodicity and nonlinearity characteristics of shortterm ridership as mentioned previously. With this base model, this paper proposes a support vector machine overall online (SVMOOL) model, which extracts input features via SARIMA model, trains the updated data by day, and optimizes the parameters by a particle swarm optimization (PSO) algorithm, to capture the periodicity of ridership in realtime. This paper also proposes a support vector machine partial online (SVMPOL) model, which extracts input features based on the temporal continuity of ridership model, trains the updated data by time intervals (such as 5min), and also optimizes the parameters by a PSO algorithm to capture the nonlinearity of ridership. Afterwards, the support vector machine combined online (SVMCOL) model is proposed by combining the SVMOOL model and the SVMPOL model.
The main contributions of this paper are as follows.
This paper proposes a novel hybrid model combining the SVMOOL model and the SVMPOL model for shortterm ridership forecasting that better captures the periodicity and nonlinearity characteristics by the updated data set. The SVMCOL model takes advantages of the individual strengths of the two models. The actual results of 5min shortterm ridership forecasting show the feasibility and effectiveness of the proposed combined model in realtime implementation.
While the SARIMA model is superior for the stable weekday ridership to other models, experiments results indicate that the SVMOOL model is superior to SARIMA model, BPNN model, or SVM model in terms of MAE and RMSE for the weekend and holiday ridership test. It should be noted that the prediction of ridership under abnormal situations (such as holiday) is evidently more challenging than doing so under normal conditions (such as weekday ridership) and, hence, is much desired by the operator. Therefore, the proposed SVMCOL model is found to be suitable and useful in realworld operations.
The experiments using LibSVM package on desktop computers indicate that the SVMOOL model needs about one hour for three weeks’ data (4284 observations) to construct the prediction function and the forecasting time takes less than 1 second for a onestep prediction using SVM. In the process of the implementation experiments, the SVMPOL model needs less than 1 s to construct due to the small data sample and the forecasting time needs less than 1 s for a onestep prediction. Therefore, the training time and the forecasting time can meet the realtime demand for the onestep prediction in implementation as well.
In general, shortterm forecasting represents prediction for a specific time interval, such as 5 min, 10 min, and 15 min. For metro ridership, 5min interval will be more useful for metro operation and management because the departure interval of the metro vehicle is really short. In addition, it is obvious that ridership during workdays is different from that on weekends or holidays. As discussed by Chen et al., some prediction models that work well for workdays data may yield unsatisfactory results for weekends or holidays data. In order to discuss the applicability of the proposed model, three samples were selected. The first sample contains weekdays, weekends, and no holidays, and the second and third samples contain weekdays, weekends, and holidays.
This paper attempts to develop an online hybrid model to improve the forecasting performance of metro ridership. The rest of this paper is organized in the following manner. A brief theoretical background of the SVM model is presented first, followed by detailed description on SVMOOL model, SVMPOL model, and SVMCOL model. After that, a brief description of the data source and the implementation of the models are given. Finally, results analysis and conclusions are presented.
2. Methodology
To introduce the SVMOOL, SVMPOL, and SVMCOL models, SVM model is illustrated here first.
2.1. Support Vector Machine for Regression
A detailed description of SVM algorithm is given in Vapnik [38]. Assume that training input data and the corresponding training output data are , where and , and denotes the total number of data. The basic idea of SVM is to map the lowdimensional input space to the highdimensional feature space using a function . The linear regression function can be stated aswhere and are coefficients. For SVM, these coefficients can be obtained by solving the following optimization problems:where (≥0) is the insensitive loss function, and are slack variables, and is a regularization parameter. The maximal dual function in (2) has the following form:where and are Lagrange multipliers.
Ultimately, the decision function given by (1) has the explicit form:where is the kernel function. There are several types of kernel functions, including polynomial, radial basis, and sigmoid. Generally, a Gaussian radial basis function (see (5)) is widely used because of better prediction performance:
2.2. Input Features and Parameter Optimization
Identifying input features is crucial step in SVM modeling. Metro ridership has significant characteristics of periodicity and nonlinearity. Abe [39] discovered that excessive features caused not only long training time but also poor generalization ability. Some researchers documented in detail the identification of input features. For example, Zhang et al. [30] identified the SVM input dimensions via SARIMA. Wu et al. [40] extracted input features from successive actual values before the prediction time; that is to say, if the value of future time is regarded as output, then the real values of past time serve as inputs. Cao et al. [41] used principal component analysis, kernel principal component analysis, and independent component analysis for inputs extraction. Huang and Wang [42] and Lin et al. [43] used genetic algorithm (GA) and particle swarm optimization (PSO) algorithm to extract input features, respectively.
Parameter optimization is to obtain better forecasting accuracy of the SVM model. The parameters optimized are mainly the penalty coefficient, the insensitive loss coefficient, and the corresponding parameters of kernel function. The LibSVM package [44] uses the gridsearching algorithm combined by crossvalidation to determine these parameters but the process takes lengthy computation time. Hong et al. [45], Lin et al. [43], and Hong et al. [45] successfully used GA, PSO, and the ant colony optimization (ACO) algorithm to find the most optimal parameters, respectively. The advantages of PSO lie in easier application, fewer parameters to adjust, and faster convergence to optimum. As a result, PSO is used to optimize the parameters in this study. PSO simulates social behavior, like birds flocking to a promising position, to achieve precise objectives in a multidimensional space [46]. PSO gains the optimal solution through collaboration between individuals.
2.3. Support Vector Machine Online Model
2.3.1. Support Vector Machine Overall Online Model
Support vector machine overall online (SVMOOL) model is based on the theory of SVM, to extract input features, to train the batched updated training data, to use intelligent algorithms, to find the optimal parameters, and to get timevarying prediction function to realize the shortterm forecasting.
Due to apparent periodicity feature of the rail transit ridership, SARIMA model is used to extract input features because SARIMA model is able to capture the periodicity of time series. A time series is generated by the SARIMA(p,d,q)(P,D,Q process of Box and Jenkins as described by Williams et al. [8, 10] and Zhang et al. [30] described the process how to extract the features via SARIMA model in detail.
Considering the computation time of the training data and the realtime demand of the onestep prediction, the SVMOOL model is constructed by updating the training data day by day. That is to say, the training data is updated by adding the ridership data of the most recent day, and the timevarying prediction function is then constructed. Stating in simpler words, assume that denotes the ridership value at time of day , , where denotes the number of the data points each day. All of the prediction values of ridership after day are forecasted by the training data of ridership values. According to SVMOOL model described above, the prediction function is obtained by using the SARIMA model to extract input features from the training data and using PSO algorithm to optimize parameters, then forecasting value of every time interval , until the real values of day are totally obtained. After that, the training data is updated by adding the actual ridership values of day . New prediction function is then constructed to forecast every value of day , by retraining data and updating the parameters, and the process repeats. This process of constructing SVMOOL model is shown in Figure 1.
2.3.2. Support Vector Machine Partial Online Model
Support vector machine partial online (SVMPOL) model is also based on the theory of SVM, to extract input features, to train the realtime updated testing data, to use intelligent algorithm, to find the optimal parameters, and to get realtime prediction function to realize the shortterm forecasting.
According to the input feature extraction approaches mentioned previously and considering the temporal continuity of the realtime data, SVMPOL model extracts input features from successive actual values before the prediction time to capture nonlinear features of the ridership. In addition, parameters are also optimized by PSO.
The SVMPOL model makes full use of the temporal continuity of ridership data and takes advantage of SVM’s capability of addressing small samples. The testing data is updated by adding the ridership value of every time interval of the prediction day at same time deleting the earliest value. The realtime forecast function is obtained by training updated data and optimizing parameters in realtime to predict the value in the next time until the end of the prediction day. Stating in simpler words, assume that denotes the ridership value at time of the prediction day, , where denotes the number of the data points every day. The rest of the ridership after time needs to be forecasted with the passenger values. According to SVMPOL model as described above, the prediction function is obtained by extracting successive ridership values prior to the prediction time as the inputs and using PSO to optimize parameters, then the ridership corresponding output in time is achieved. After that, the testing data is updated by adding the actual value of time and deleting the earliest data. New prediction function is then constructed by retraining data and updating the parameters to forecast the ridership values in time , and the process continues. This process of constructing SVMPOL model is shown in Figure 2, where denotes the size of the moving window and denotes the number of the input features via continuity.
2.3.3. Support Vector Machine Combined Online Model
As described previously, this paper proposes a SVMOOL model to address the periodicity of ridership and a SVMPOL model to address the nonlinearity of ridership. But the SVMOOL model updates the training data day by day and cannot capture the realtime local variations of ridership on the day being predicted. And considering the computation time of the testing data and the realtime demand of the onestep prediction, the testing data contains oneday data at most for constructing the SVMPOL model and the internal mechanism of metro ridership to study is insufficient. To avoid the drawbacks and to take advantages of the strengths of the two individual online models, the average predicted values of two models are the final results, which are called support vector combined online (SVMCOL) model.
3. Data Set and Evaluation Criteria
3.1. Data Set
At present, Automatic Fare Collection (AFC) System has been able to realize realtime data collection of metro passengers in and out station records [47] (though there is a slight delay in data transmission.). By simple statistics, the ridership data can be achieved for the required time interval. That is to say, the shortterm ridership data of metro can be collected online, which puts forward higher requirements for shorttime prediction. Operators expect faster and more accurate predictions, in order to plan ahead to accommodate the changes in passenger flow.
A ridership dataset of metro is collected to investigate the validity of the proposed SVMOOL, SVMPOL, and SVMCOL model for forecasting shortterm ridership. The dataset is collected from the entrance transaction records of Nanjing Metro’s Automatic Fare Collection (AFC) Systems. In general, shortterm forecasting represents prediction for a specific time interval, such as 5 min, 10 min, and 15 min. For metro ridership, 5min interval will be more useful for metro operation and management because the departure interval of the metro vehicle is really short. Taking the operation time of Nanjing Metro into consideration, the time period of data collection for each day is from 6:00 AM to 11:00 PM. There are 204 observations collected with a 5min interval every day. The collected data is divided into two sets of training data plus testing data. In addition, it is obvious that ridership during workdays is different from that on weekends or holidays. As discussed by [48], some prediction models that work well for workdays data may yield unsatisfactory results for weekends or holidays data. In order to discuss the applicability of the proposed model, three samples were selected. The first sample contains no holidays, and the second and third samples contain holidays of ChingMing Festival and May Day. The specific sample information is as follows.
3.1.1. Sample 1
The dataset is collected from the entrance transaction records of the Sanshanjie station during the period from November 5 to December 2, 2012, so there are 5712 observations in total for these 28 days. The first training data set is data collected from November 5 to November 25, and the first testing data set contains the remaining seven days’ ridership values, or 1428 observations, as shown in Figure 3. The weekend ridership pattern is different from weekday’s obviously and the metro ridership shows the weekly periodic characteristics. This is because the weekday ridership is mainly composed of the commuted passenger flow, which is more stable. To the contrary, the weekend ridership mainly consists of the leisure and travel passenger flow, which is relatively fluctuant and has the obvious nonlinear characteristics.
3.1.2. Samples 2 and 3
The dataset is collected from the entrance transaction records of the Zhujinglu station during the period from March 12 to May 6, 2012, so there are 11424 observations in total for these 56 days. The second training data set is data collected from March 12 to April 1, and the third training data set is data collected from April 9 to April 29. Both two training data sets contain three weeks’ ridership values, or 4284 observations. Both two testing data sets contain the remaining seven days’ ridership values, respectively, or 1428 observations, as shown in Figures 4 and 5. It must be noted that the ChingMing Festival is on April 2 (Monday) to April 4 (Wednesday); meanwhile March 30 (Saturday) and April 1 (Sunday) change weekday. The May Day is on April 29 (Sunday) to May 1 (Tuesday); meanwhile April 28 (Saturday) changes weekday. The holiday ridership pattern is similar to weekend’s as a whole but is different from the local.
3.2. Data Normalization
Usually, normalizing raw input data can improve the convergence rate and performance of an SVM model. A common practice of data normalization was used to transform the raw data into a range . In this study, each input data point is scaled according towhere is the normalized value, is any input vector of ridership data, and are, respectively, the maximum value and the minimum value of the training data in the period of training data.
3.3. Performance Indices
The mean absolute error (MAE), the mean absolute percent error (MAPE), and the root mean square error (RMSE) are commonly used criteria to evaluate the forecasting model. Generally, the smaller the MAE, MAPE and RMSE values, the better the prediction performance. The three performance criteria are, respectively, defined aswhere is the actual observed value in time and is the forecasting value in time , and is the number of the observations every day.
4. Model Implementation
In this section, specific applications of the SVMOOL, the SVMPOL, and the SVMCOL models described previously are addressed.
In the methodology section, several methods of choosing the appropriate input features are introduced. The SVMOOL model’s input features are extracted using the SARIMA model. The SARIMA model is formulated with statistical software SAS. The model forms generated from the three training data sets are all SARIMA(1,0,1)(0,1,1)_{1428}. For example, the specific equation is shown as the following, which constructs by the second training data set at Zhujianglu station:where is the real value in time , is error between the real value , and the predicted value in time .
Therefore, for the prediction at time , the real values for time , , serve as inputs. Afterwards, the εSVM model and the Gaussian radial basis function are implemented using the LibSVM software package developed by Wu et al. [40]. The Python codes were developed to integrate the LibSVM package with the PSO algorithm for parameters optimizing. The fivefold crossvalidation technique and PSO are applied to obtain the optimal parameters (shown in Table 1) with the training data to construct the final εSVM model for future forecasting. The testing data are then used as input to the final εSVM model to produce predicted outputs.

The SVMOOL model updates training data set day by day. For example, using the second training data set from March 12 to April 1, predictions of ridership for every time interval on April 2 are made, then actual observed values of April 2 are added to the initial training data set to produce an updated training data set. Then the updated training data set from March 12 to April 2 is used to forecast the ridership of every interval on April 3 and the process repeats.
For the SVMPOL model, the testing data is updated by time interval (i.e., 5min) for the day being predicted, and the number of input features extracted via continuity, or value is determined to be 4 through several trails. In each of the 7 testing days, the first 10 data points (from 6:05 am to 6:50) were used as the testing data, with the 11th data point (at 6:55 am) being the target. Then 10point window “walks”, incorporating the 11th data point, which results on a new 10point window (from 6:10 am to 6:55), having then the 12th data point (at 7:00 am) as the target. The process continues until the last observation (at 23:00 pm) becomes the target.
For the combined model, after the values from SVMOOL and SVMPOL models are calculated, the final prediction value is the average prediction of the previous two models.
5. Results Analysis
After the SVMOOL, SVMPOL, and SVMCOL models are implemented with the data sets, this research selects SARIMA, SVM, and BPNN models (i.e., backpropagation neural network) as the benchmark for onestep prediction are shown in Tables 2, 3, and 4.



5.1. Weekday Ridership Forecasting Results
In addition, the pattern of weekday’s ridership is similar, so Table 1 shows the forecasting ridership results of three weekdays. As shown in Table 2, the SARIMA model is the best among them in terms of forecasting accuracy for weekday’s ridership. It is demonstrated that the SARIMA model is good at predicting the ridership with periodic and stability characteristics as shown in Figure 6. The SVMOOL model is superior to the two models (BPNN and SVM models) because the updating data set, but the performance of the SVMCOL model, is not very satisfactory and is affected by the SVMPOL model, which is not applicable to the weekday ridership forecasting independently.
5.2. Weekend Ridership Forecasting Results
Table 3 shows that the SVMCOL model is the best among them in terms of forecasting accuracy, which the performance improves 40% compared with SARIMA model and improves 10% compared with the BPNN and SVM models for the value RMSE and MAE. It confirms that the combined model captures the weekly periodic and nonlinear characteristics of time series data for the estimation of shortterm ridership (as shown in Figure 7). Though the SVMPOL model is not getting good results, it is much better than the SARIMA model. The SVM and BPNN models are both better than the SARIMA model for the weekend ridership forecasting, which also demonstrate that SVM and BPNN are suitable for nonlinear and fluctuant passenger flow.
5.3. Holiday Ridership Forecasting Results
As shown in Table 4 and Figure 8, it is not difficult to find that the SARIMA model is the worst performance for the holiday ridership and cannot meet the accuracy of shorttime prediction. This is because the ridership data in samples 2 and 3 contains the unstable holiday passenger flow of the ChingMing Festival and May Day and demonstrates that the SARIMA model does not apply to nonstationary ridership. The SVMOOL and SVMCOL two models both outperform the three models (such as SARIMA, BPNN, and SVM models) and the SVMCOL model is the best among them in terms of forecasting accuracy, because the two models are constructed on the updated data set and more responsive to the change of passenger flow. The results of the SVMPOL model outperform the SARIMA and SVM models in the case of a small sample (with only 10 samples) as shown in Table 4. It is demonstrated that the SVMPOL model can capture the change of passenger flow in realtime and has special advantages for small sample prediction. In a word, compared with the offline models, the online models achieve better prediction performance. Of course, the prediction performance of BPNN is slightly better than the SVMPOL model, which is due to the small sample information. The results demonstrate the effectiveness of the proposed model. It is noted that the predicted performance of the SVMOOL model on April 2 and 30, 2012, at Zhujianglu station is equal to the SVM model because the two models both own the same training sample.
6. Discussion
The training time and the forecasting time are the keys to realtime implementation. The experiments using LibSVM package on desktop computers indicate that the training time needs about one hour for three weeks’ data (4284 observations) to construct the prediction function and the forecasting time needs less than 1 second for a onestep prediction using SVM. According to the SVMOOL model updating testing data set by day, the SVMOOL model uses the training data sample size from 21 days’ observations to 22 days’ or 27 days’ observations, but the training time only increases 10 min and the forecasting time needs less than 1 s. Because the SVMOOL model is retrained once a day, the obtained forecasting function can be used for onestep predictions for the day, therefore realtime implementation is possible. The SVMPOL model is retained in realtime in 5min interval. The obtained forecasting function can be used to onestep prediction for the next 5min. In the process of the implementation experiments, the SVMPOL model needs less than 1 s to construct due to the small data sample and the forecasting time needs less than 1 s for onestep prediction. Therefore, the training time and the forecasting time can meet the realtime demand for the onestep prediction in the implementation as well.
7. Conclusions
The key to metro operation and management is based on the changes of the ridership to effectively deploy and use the system resources and to timely adjust operation strategy to ensure that metro is safe to complete the transportation service task. The results of shortterm ridership forecasting can provide useful information to decision makers of metro system, and the prediction accuracy directly influence the legitimacy and effectiveness of any changes in operations, such as adjustments to headway, train dispatching, and the activation of station passenger crowd regulation plan or emergency response plan.
This paper proposes a novel hybrid model combining the SVMOOL model and the SVMPOL model for shortterm ridership forecasting that better captures the periodicity and nonlinearity characteristics by the updated data set. The SVMCOL model takes advantages of the individual strengths of the two models. While the SARIMA model is superior for the stable weekday ridership to other models, experiments results indicate that the SVMOOL model is superior to SARIMA model, BPNN model, or SVM model in terms of MAE and RMSE for the weekend and holiday ridership test. The actual results of 5min shortterm ridership forecasting show the feasibility and effectiveness of the proposed combined model in realtime implementation.
It should be noted that the prediction of ridership under abnormal situations (such as holiday) is evidently more challenging than doing so under normal conditions (such as weekday ridership), and hence, much desired by the operator. Therefore, the proposed SVMCOL model is found to be suitable and useful in realworld operations, particularly in prediction under abnormal conditions. And, further studies need apply the proposed model to other abnormal situations (such as horrible weather, large sports events or emergencies, this study chooses the weekday, weekend, and holiday ridership as the demonstration). In addition, different characteristics (the impact of different meteorological conditions, the number of metro station entrances, etc.) can be considered as the input features in further studies. Jia et al. [49] indicate that, with the consideration of additional rainfall factor, the traffic flow prediction accuracy is improved.
Data Availability
Detailed data are included within the supplementary materials.
Disclosure
An earlier version of this paper has been presented in the Transportation Research Board 92nd Annual Meeting (Washington DC, 2013).
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This research has been supported by the Fundamental Research Funds for the Central Universities (no. KYLX16_0270). The authors thank the Nanjing Metro for providing the data used in this research. Thanks are due to ChihChung Chang and ChihJen Lin for permission to use the LibSVM package in this research.
Supplementary Materials
APPENDIX Table 1: the origin 5min entrance ridership data at Sanshanjie Station of Nanjing Metro from November 5 to December 2, 2012. APPENDIX Table 2: the origin 5min entrance ridership data at Zhujianglu Station of Nanjing Metro from March 12 to April 8, 2012. APPENDIX Table 3: the origin 5min entrance ridership data at Zhujianglu Station of Nanjing Metro from April 9 to May 6, 2012. (Supplementary Materials)
References
 T.H. Tsai, C.K. Lee, and C.H. Wei, “Neural network based temporal feature models for shortterm railway passenger demand forecasting,” Expert Systems with Applications, vol. 36, no. 2, pp. 3728–3736, 2009. View at: Publisher Site  Google Scholar
 H. BarGera and D. Boyce, “Originbased algorithms for combined travel forecasting models,” Transportation Research Part B: Methodological, vol. 37, no. 5, pp. 405–422, 2003. View at: Publisher Site  Google Scholar
 G. Jovicic and C. O. Hansen, “A passenger travel demand model for Copenhagen,” Transportation Research Part A: Policy and Practice, vol. 37, no. 4, pp. 333–349, 2003. View at: Publisher Site  Google Scholar
 M.C. Chen and Y. Wei, “Exploring time variants for shortterm passenger flow,” Journal of Transport Geography, vol. 19, no. 4, pp. 488–498, 2011. View at: Publisher Site  Google Scholar
 L. ChangJen and S.P. Miaou, “Realtime prediction of traffic flows using dynamic generalized linear models,” Transportation Research Record, no. 1678, pp. 168–178, 1999. View at: Google Scholar
 S. Sundaram, H. N. Koutsopoulos, M. BenAkiva, C. Antoniou, and R. Balakrishna, “Simulationbased dynamic traffic assignment for shortterm planning applications,” Simulation Modelling Practice and Theory, vol. 19, no. 1, pp. 450–462, 2011. View at: Publisher Site  Google Scholar
 B. M. Williams, P. K. Durvasula, and D. E. Brown, “Urban freeway traffic flow prediction: application of seasonal autoregressive integrated moving average and exponential smoothing models,” Transportation Research Record, no. 1644, pp. 132–141, 1998. View at: Publisher Site  Google Scholar
 B. M. Williams, “Multivariate vehicular traffic flow prediction: evaluation of ARIMAX modeling,” Transportation Research Record, no. 1776, pp. 194–200, 2001. View at: Google Scholar
 B. M. Williams and L. A. Hoel, “Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: theoretical basis and empirical results,” Journal of Transportation Engineering, vol. 129, no. 6, pp. 664–672, 2003. View at: Publisher Site  Google Scholar
 Z. R. Ye, Y. L. Zhang, and D. R. Middleton, “Unscented Kalman filter method for speed estimation using single loop detector data,” Transportation Research Record, no. 1968, pp. 117–125, 2006. View at: Google Scholar
 J. Xia, M. Chen, and Z. Qian, “Predicting freeway travel time under incident conditions,” Transportation Research Record, no. 2178, pp. 58–66, 2010. View at: Publisher Site  Google Scholar
 Z. Ye and Y. Zhang, “Speed estimation from single loop data using an unscented particle filter,” ComputerAided Civil and Infrastructure Engineering, vol. 25, no. 7, pp. 494–503, 2010. View at: Publisher Site  Google Scholar
 B. L. Smith, B. M. Williams, and R. K. Oswald, “Comparison of parametric and nonparametric models for traffic flow forecasting,” Transportation Research Part C: Emerging Technologies, vol. 10, no. 4, pp. 303–321, 2002. View at: Publisher Site  Google Scholar
 S. Clark, “Traffic prediction using multivariate nonparametric regression,” Journal of Transportation Engineering, vol. 129, no. 2, pp. 161–168, 2003. View at: Publisher Site  Google Scholar
 T. TsungHsien, L. ChiKang, and W. ChienHung, “Design of dynamic neural networks to forecast shortterm railway passenger demand,” Journal of the Eastern Asia Society for Transportation Studies, no. 6, pp. 1651–1666, 2005. View at: Google Scholar
 Y. Mo and Y. Su, “Neural networks based realtime transit passenger volume prediction,” in Proceedings of the 2009 2nd Conference on Power Electronics and Intelligent Transportation System, PEITS 2009, pp. 303–306, December 2009. View at: Publisher Site  Google Scholar
 Y. Huang and H. Pan, “Shortterm prediction of railway passenger flow based on RBF neural network,” in Proceedings of the 4th International Joint Conference on Computational Sciences and Optimization, CSO 2011, pp. 594–597, April 2011. View at: Publisher Site  Google Scholar
 Y. Zhang and Y. Xie, “Forecasting of shortterm freeway volume with vsupport vector machines,” Transportation Research Record, vol. 2024, no. 1, pp. 92–99, 2007. View at: Publisher Site  Google Scholar
 Y. Zhang and Y. Xie, “Travel mode choice modeling with support vector machines,” Transportation Research Record, no. 2076, pp. 141–150, 2008. View at: Publisher Site  Google Scholar
 Q. Chen, W. Li, and J. Zhao, “The use of LSSVM for shortterm passenger flow prediction,” Transport, vol. 26, no. 1, pp. 5–10, 2011. View at: Publisher Site  Google Scholar
 W.H. Lin, “A Gaussian maximum likelihood formulation for shortterm forecasting of traffic flow,” in Proceedings of the 2001 IEEE Intelligent Transportation Systems, pp. 150–155, Oakland, Calif, USA, 2001. View at: Publisher Site  Google Scholar
 Y. Xie, Y. Zhang, and Z. Ye, “Shortterm traffic volume forecasting using Kalman filter with discrete wavelet decomposition,” ComputerAided Civil and Infrastructure Engineering, vol. 22, no. 5, pp. 326–334, 2007. View at: Publisher Site  Google Scholar
 D. BotoGiralda, F. J. DíazPernas, D. GonzálezOrtega et al., “Waveletbased denoising for traffic volume time series forecasting with selforganizing neural networks,” ComputerAided Civil and Infrastructure Engineering, vol. 25, no. 7, pp. 530–545, 2010. View at: Publisher Site  Google Scholar
 C. P. I. van Hinsbergen, J. W. C. van Lint, and H. J. van Zuylen, “Bayesian committee of neural networks to predict travel times with confidence intervals,” Transportation Research Part C: Emerging Technologies, vol. 17, no. 5, pp. 498–509, 2009. View at: Publisher Site  Google Scholar
 X. Fei, C. C. Lu, and K. Liu, “A bayesian dynamic linear model approach for realtime shortterm freeway travel time prediction,” Transportation Research Part C: Emerging Technologies, vol. 19, no. 6, pp. 1306–1318, 2011. View at: Publisher Site  Google Scholar
 Y. Wei and M. Chen, “Forecasting the shortterm metro passenger flow with empirical mode decomposition and neural networks,” Transportation Research Part C: Emerging Technologies, vol. 21, no. 1, pp. 148–162, 2012. View at: Publisher Site  Google Scholar
 H. Xiao, H. Sun, and B. Ran, “Special factor adjustment model using fuzzyneural network in traffic prediction,” Transportation Research Record, no. 1879, pp. 17–23, 2004. View at: Google Scholar
 Y. Zhang and Z. Ye, “Shortterm traffic flow forecasting using fuzzy logic system methods,” Journal of Intelligent Transportation Systems: Technology, Planning, and Operations, vol. 12, no. 3, pp. 102–112, 2008. View at: Publisher Site  Google Scholar
 Y. Sun, B. Leng, and W. Guan, “A novel waveletSVM shorttime passenger flow prediction in Beijing subway system,” Neurocomputing, vol. 166, pp. 109–121, 2015. View at: Publisher Site  Google Scholar
 N. Zhang, Y. Zhang, and H. Lu, “Seasonal autoregressive integrated moving average and support vector machine models: prediction of shortterm traffic flow on freeways,” Transportation Research Record, vol. 2215, pp. 85–92, 2011. View at: Publisher Site  Google Scholar
 W.C. Hong, Y. Dong, F. Zheng, and S. Y. Wei, “Hybrid evolutionary algorithms in a SVR traffic flow forecasting model,” Applied Mathematics and Computation, vol. 217, no. 15, pp. 6733–6747, 2011. View at: Publisher Site  Google Scholar  MathSciNet
 W.C. Hong, “Traffic flow forecasting by seasonal SVR with chaotic simulated annealing algorithm,” Neurocomputing, vol. 74, no. 1213, pp. 2096–2107, 2011. View at: Publisher Site  Google Scholar
 W.C. Hong, “Application of seasonal SVR with chaotic immune algorithm in traffic flow forecasting,” Neural Computing and Applications, vol. 21, no. 3, pp. 583–593, 2012. View at: Publisher Site  Google Scholar
 J. Wang and Q. Shi, “Shortterm traffic speed forecasting hybrid model based on chaos–wavelet analysissupport vector machine theory,” Transportation Research Part C: Emerging Technologies, vol. 27, no. 1, pp. 219–232, 2013. View at: Publisher Site  Google Scholar
 R. Chen, C.Y. Liang, W.C. Hong, and D.X. Gu, “Forecasting holiday daily tourist flow based on seasonal support vector regression with adaptive genetic algorithm,” Applied Soft Computing, vol. 26, pp. 435–443, 2015. View at: Publisher Site  Google Scholar
 S. Ishak and H. AlDeek, “Performance evaluation of shortterm timeseries traffic prediction model,” Journal of Transportation Engineering, vol. 128, no. 6, pp. 490–498, 2002. View at: Publisher Site  Google Scholar
 E. I. Vlahogianni, M. G. Karlaftis, and J. C. Golias, “Optimized and metaoptimized neural networks for shortterm traffic flow prediction: a genetic approach,” Transportation Research Part C: Emerging Technologies, vol. 13, no. 3, pp. 211–234, 2005. View at: Publisher Site  Google Scholar
 V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, NY, USA, 1995. View at: Publisher Site  MathSciNet
 S. Abe, “Analysis of support vector machine,” in IEEE Signal Processing Society Workshop, pp. 8889, IEEE Press, New York, NY, USA, 2002. View at: Google Scholar
 C.H. Wu, J.M. Ho, and D. T. Lee, “Traveltime prediction with support vector regression,” IEEE Transactions on Intelligent Transportation Systems, vol. 5, no. 4, pp. 276–281, 2004. View at: Publisher Site  Google Scholar
 L. J. Cao, K. S. Chua, W. K. Chong, H. P. Lee, and Q. M. Gu, “A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine,” Neurocomputing, vol. 55, no. 12, pp. 321–336, 2003. View at: Publisher Site  Google Scholar
 C. Huang and C. Wang, “A GAbased feature selection and parameters optimizationfor support vector machines,” Expert Systems with Applications, vol. 31, no. 2, pp. 231–240, 2006. View at: Publisher Site  Google Scholar
 S.W. Lin, K.C. Ying, S.C. Chen, and Z.J. Lee, “Particle swarm optimization for parameter determination and feature selection of support vector machines,” Expert Systems with Applications, vol. 35, no. 4, pp. 1817–1824, 2008. View at: Publisher Site  Google Scholar
 C. C. Chang and C. J. Lin, “LIBSVM: A Library for Support Vector Machines,” Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, 2001. View at: Google Scholar
 W.C. Hong, Y. Dong, F. Zheng, and C.Y. Lai, “Forecasting urban traffic flow by SVR with continuous ACO,” Applied Mathematical Modelling: Simulation and Computation for Engineering and Environmental Systems, vol. 35, no. 3, pp. 1282–1291, 2011. View at: Publisher Site  Google Scholar  MathSciNet
 J. Kennedy and R. Eberhart, “Particle swarm optimization,” in Proceedings of the IEEE International Conference on Neural Networks (ICNN’ 95), vol. 4, pp. 1942–1948, Perth, Western Australia, NovemberDecember 1995. View at: Publisher Site  Google Scholar
 W. Zhu, W. Wang, and Z. Huang, “Estimating train choices of rail transit passengers with real timetable and automatic fare collection data,” Journal of Advanced Transportation, vol. 2017, Article ID 5824051, 12 pages, 2017. View at: Publisher Site  Google Scholar
 C. Chen, Y. Wang, L. Li, J. Hu, and Z. Zhang, “The retrieval of intraday trend and its influence on traffic prediction,” Transportation Research Part C: Emerging Technologies, vol. 22, pp. 103–118, 2012. View at: Publisher Site  Google Scholar
 Y. Jia, J. Wu, and M. Xu, “Traffic flow prediction with rainfall impact using a deep learning method,” Journal of Advanced Transportation, vol. 2017, Article ID 6575947, 10 pages, 2017. View at: Publisher Site  Google Scholar
Copyright
Copyright © 2018 Xuemei Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.