Research Article

Bus Single-Trip Time Prediction Based on Ensemble Learning

Table 1

Comparison of advantages and disadvantages of popular models.

CategoryCommonly used modelsAdvantagesDisadvantagesReferences

History of the same periodSmoothing methodEasy understandability, better results in normal conditions and with large time granularityExcessive reliance on data patterns from historical dataOmkar and Kumar [32]

Time seriesKalman filteringApplicable to time series data and interpretabilityUnsuitable for capturing nonlinear data patternsZhou et al. [33]
AR(Auto regressive)Li et al. [34]
ARIMAGummadi and Edara [35]

Machine learningSVM SVRSuitable for learning nonlinear features in dataLow computational efficiency at high data volumesLi and Xu [36]
K-nearest neighborSun et al. [37]
Linear regressionKhiari and Olaverri-Monreal [38]
Decision treeAlajali et al. [39]
Random forestZhou et al. [40]

Deep learningRNNApplicable for learning linear and nonlinear patterns with good data fitting capabilityLow interpretability and low efficiencyPang et al. [41]
LSTMAgafonov and Yumaganov [23, 42]
GRUShu et al. [43]

Ensembled modelAdaBoostApplicable to select the appropriate base model for ensemble according to the characteristics of different datasetsProne to overfitting, low interpretability, and poor results when data are unbalancedZhou et al. [44]
Bootstrapped aggregationVaish et al. [45]
Stacked generalizationSharma et al. [46]
Gradient boosting Machines, GBMMonego et al. [47, 48]
Gradient boosted regression Trees, GBRTChen et al. [49]

Combined modelDirect averaging, weighted averaging, and other combinationsHigh applicability with various sub-models and combinationsSubjective on choosing the combination method and sub-modelsYan et al. [50]