Research Article  Open Access
A New Hybrid Model FPASVM Considering Cointegration for Particular Matter Concentration Forecasting: A Case Study of Kunming and Yuxi, China
Abstract
Air pollution in China is becoming more serious, especially particulate matter (PM) pollution, because of rapid economic growth and the fast expansion of urbanization. To address these growing environmental problems, this paper uses daily PM2.5 and PM10 concentration data from January 1, 2015, to August 23, 2016, in Kunming and Yuxi (two important cities in Yunnan Province, China) to present a new hybrid model, CIFPASVM, for forecasting PM2.5 and PM10 concentrations. The proposed model involves two parts. Firstly, since SVM by itself does not assess the possible correlation between different variables, cointegration theory is introduced to obtain the input-output relationship. Secondly, the nonlinear dynamical system is modeled with a support vector machine (SVM), in which the parameters c and g are optimized by the flower pollination algorithm (FPA). Six benchmark models, including FPASVM, CISVM, CIGASVM, CIPSOSVM, CIFPANN, and a multiple linear regression model, are considered to verify the superiority of the proposed hybrid model. The empirical results demonstrate that the proposed model CIFPASVM outperforms all considered benchmark models in prediction accuracy, and applying the model to forecasting can support effective monitoring and management of future air quality.
1. Introduction
Air pollution has a great impact on humans and the environment [1, 2]. Information on air pollution caused by CO, NO, NO_{2}, SO_{2}, O_{3}, and particulate matter (PM_{2.5} and PM_{10}) is urgently needed because of its harmful effects on human health [3]. In recent years especially, many regions of China have suffered from hazy weather, including Jianghuai, North China, Huanghuai, the area south of the Yangtze River, and other areas. The affected regions cover about 25% of the country, and the affected population is above six hundred million [4]. Furthermore, hazy weather is harmful to the human respiratory and cardiovascular systems and can induce chronic disease and cancer. In addition, it can affect mental and reproductive health. Related studies found that extreme particulate matter (PM_{2.5} and PM_{10}) is one of the main factors of hazy weather [5]. It is therefore urgent to monitor particulate matter, and forecasting it is an important task. In view of this situation, this paper introduces a new hybrid model to forecast the daily particulate matter of Kunming and Yuxi, China.
In recent years, many researchers have concentrated on techniques for predicting PM concentration. Particulate matter pollution forms an open, nonlinear, dynamic, and complex system, so it is difficult to derive an accurate formula to predict the value of PM. Fortunately, a data-driven, empirically based, or "black-box" modeling approach, which is designed to identify relationships between input and output without considering the mechanism that generates particulate matter, can be employed to predict PM concentration. With the development of artificial intelligence, machine learning techniques such as ANN and SVM have been applied to time series of air pollutants. Grivas and Chaloulakou provided reliable predictions of PM_{10} hourly concentrations by evaluating the potential of various developed neural network models [6]. Cai et al. applied an artificial neural network to predict hourly air pollutant concentrations in Guangzhou, China [7]. Caselli et al. developed a backpropagation neural network to predict the daily PM_{10} concentration 1, 2, and 3 days in advance [8]. De Gennaro et al. developed an artificial neural network (ANN) to forecast PM_{10} daily concentration in two contrasted environments in NE Spain [9]. Ding et al. predicted air pollutant concentration using a feedforward neural network inspired by the mechanism of the human brain [10]. Meanwhile, the support vector machine is widely employed in predicting air pollutant concentrations. Suárez Sánchez et al. proposed a regression model of air quality using the support vector machine (SVM) technique in the Avilés urban area (Spain) at local scale [11]. García Nieto et al. presented a method of daily air pollution modeling using the SVM technique in the Oviedo urban area (Northern Spain) at local scale [12]. But it is difficult for one single machine learning algorithm to achieve highly precise prediction [13].
So researchers combined different algorithms into hybrid models to forecast air pollutants (CO, NO, NO_{2}, SO_{2}, O_{3}, and particulate matter). Díaz-Robles et al. proposed a hybrid model combining ARIMA and ANN to improve forecast accuracy for the air quality of Temuco, Chile [14]. Feng et al. used an artificial neural network to predict ozone concentration at a single site, achieving better forecast accuracy on a very large data set [15]. Fu et al. introduced a feedforward neural network with rolling mechanism and grey model to forecast PM_{2.5} and PM_{10} concentrations in Hangzhou, Shanghai, and Nanjing, China [16]. Niu et al. introduced a hybrid model based on CEEMD, GWO, and SVM for daily PM_{2.5} concentration forecasting in Harbin and Chongqing, China [4]. Xu et al. proposed a hybrid model named ICEEMDWOASVM for forecasting major pollutants (CO, NO, NO_{2}, SO_{2}, O_{3}, and particulate matter) in Harbin, Chongqing, and Taiyuan, China [17]. Inspired by the above research, this paper proposes a new hybrid model that combines different algorithms to improve the accuracy of prediction.
In traditional methods, many researchers established models using only one time series, so these models may lose accuracy because they use insufficient information. Fortunately, Engle and Granger provided the cointegration theory to overcome the problems of nonstationarity of time series and deal with "spurious regression" [18]. A forecast based on cointegration theory can put two or more sequences into the model and enhance its performance. Because of its great effect, the theory has been studied extensively in economics during the past decades, and it has more recently begun to be applied in engineering research. Using the cointegration theory, Belloumi examined the causal relationship between per capita energy consumption and per capita gross domestic product for Tunisia during the 1971–2004 period [19]. Shahbaz et al. reexamined the relationship between electricity consumption, economic growth, and employment in Portugal using cointegration [20]. Jahangir Alam et al. investigated the possible existence of dynamic causality between energy consumption, electricity consumption, carbon emissions, and economic growth in Bangladesh [21]. Saboori et al. established a long-run as well as causal relationship between economic growth and carbon dioxide (CO_{2}) emissions for Malaysia [22]. Dogan analyzed the short- and long-run estimates as well as the causality relationships between economic growth, electricity consumption from renewable sources, and electricity consumption from nonrenewable sources for Turkey in a multivariate model wherein capital and labor are included as additional variables [23]. In the study of hydrology, Zhang et al. introduced CI to reveal the long-term balance relationship and short-term fluctuations of the original and decomposed runoff and sediment load time series [24]. In meteorology, de Cian et al. presented an empirical study of the relationship between residential energy demand and temperature [25]. For these reasons, this paper makes use of the cointegration theory to find the causal relationship between PM_{2.5} and PM_{10} of Kunming and Yuxi.
In machine learning, the support vector machine (SVM) performs well in depicting nonlinear relationships. But the accuracy of SVM depends on two parameters, and the optimization methods for selecting these parameters are complex and varied. Hu et al. proposed a hybrid forecasting approach that consists of the empirical wavelet transform, coupled simulated annealing, and least squares support vector machine for enhancing the accuracy of short-term wind speed forecasting [26]. Zhang et al. built a predictive model based on support vector regression and a differential evolution algorithm to forecast the electricity load [27]. Liang et al. proposed a hybrid model based on wavelet transform and least squares support vector machine, optimized by an improved cuckoo search, to predict the short-term electric load [28]. Wu and Peng built a novel hybrid approach for wind power generation forecasting based on a cloud-based evolutionary algorithm and least squares support vector machine [29]. Santamaría-Bonfil et al. proposed a hybrid methodology based on support vector regression and a genetic algorithm for wind speed forecasting [30]. W. Sun and J. Sun presented a novel hybrid model based on least squares support vector machine optimized by cuckoo search to monitor and control the PM_{2.5} concentration [31]. Sreekumar et al. presented three forecasting models for power systems, namely, a three-day trained support vector regression model and parameter-optimized SVR models using a genetic algorithm and particle swarm optimization [32]. This paper introduces a new optimization method using the flower pollination algorithm to obtain suitable parameters for support vector regression; this algorithm is reported to be more efficient than traditional methods such as GA and PSO [33].
To improve the predictive accuracy of PM_{2.5} and PM_{10} concentration, a hybrid model based on cointegration theory (CI), support vector machine (SVM), and flower pollination algorithm (FPA) is established. Firstly, the cointegration theory is utilized to obtain the causal relationship among the four particulate matter sequences of Kunming and Yuxi. Then the SVM technique, optimized by FPA, which can achieve a balance between exploration and exploitation, is built to forecast particulate matter (PM_{2.5} and PM_{10} concentrations) [33]. The particulate matter data sets from two cities (Kunming and Yuxi) in Yunnan Province are collected to evaluate the effectiveness of the proposed model. The remaining part of the article is organized as follows. Section 2 introduces the techniques of cointegration theory, support vector machine, and flower pollination algorithm. Next, the data of the study areas, the evaluation criteria, and the results of the proposed hybrid model are introduced in Section 3. Finally, the conclusion and future work are presented in Section 4.
2. Mathematical Methods
2.1. Cointegration Theory (CI)
The cointegration theory was proposed by Engle and Granger to overcome the "spurious regression" of time series [18]. Cointegration mainly depicts the long-term balance relationships among nonstationary time series [24]. If a nonstationary time series $y_t$ is stationary after $d$ times differencing, the time series is said to be integrated of order $d$, represented as $y_t \sim I(d)$. Accordingly, $I(0)$ denotes a stationary time series.
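The notion of integration order can be illustrated with a minimal sketch. The helper name `difference` is hypothetical and the numbers are made up for illustration; the point is only that a random walk, a standard example of a series integrated of order one, is reduced to its increments by one round of differencing.

```python
def difference(series, d=1):
    """Apply first differencing (y_t - y_{t-1}) d times."""
    out = list(series)
    for _ in range(d):
        out = [b - a for a, b in zip(out, out[1:])]
    return out


# Build a random-walk-like series as the running sum of some increments.
increments = [1, -1, 2, 0, -2]
walk = [0]
for step in increments:
    walk.append(walk[-1] + step)

# One differencing pass recovers the increments.
recovered = difference(walk, d=1)
```

In practice the order of integration is decided by a unit root test on the differenced series rather than by inspection.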
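The Augmented Dickey-Fuller (ADF) test is one of the most popular tests to determine the stationarity of a variable series [34]. The ADF test depends on the following regression formula:
$$\Delta y_t = \alpha + \beta t + \delta y_{t-1} + \sum_{i=1}^{p} \gamma_i \Delta y_{t-i} + \varepsilon_t,$$
where $\alpha$ is the constant term; $\beta$, $\delta$, and $\gamma_i$ are the parameters; $\Delta y_t$ is the first differencing of $y_t$; $t$ is the time; and $\varepsilon_t$ is the white noise term. Meanwhile, the lag length $p$ is determined by the AIC and SC.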
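As a rough illustration of the regression behind the test, the sketch below computes the t-statistic of the coefficient on the lagged level in a simplified Dickey-Fuller regression with a constant only. This is an assumption made for brevity: the trend term and the lagged differences of the full ADF regression are omitted, the function name `dickey_fuller_stat` is hypothetical, and the synthetic series are not the paper's data.

```python
import math
import random


def dickey_fuller_stat(y):
    """t-statistic of delta in: dy_t = alpha + delta * y_{t-1} + e_t."""
    x = y[:-1]                                  # lagged levels y_{t-1}
    dy = [b - a for a, b in zip(y, y[1:])]      # first differences
    n = len(x)
    mx, md = sum(x) / n, sum(dy) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (di - md) for xi, di in zip(x, dy))
    delta = sxy / sxx                           # OLS slope
    alpha = md - delta * mx                     # OLS intercept
    rss = sum((di - alpha - delta * xi) ** 2 for xi, di in zip(x, dy))
    se = math.sqrt(rss / (n - 2) / sxx)         # standard error of delta
    return delta / se


rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(300)]   # stationary series
walk = [0.0]
for e in noise:                                      # random walk built from it
    walk.append(walk[-1] + e)

stat_noise = dickey_fuller_stat(noise)
stat_walk = dickey_fuller_stat(walk)
```

A strongly negative statistic speaks against a unit root. The statistic has to be compared with Dickey-Fuller critical values rather than with the usual t distribution, which this sketch does not tabulate.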
Engle and Granger proposed the EG test to examine the cointegration between two time series [18]. Firstly, the test establishes a regression model of the data by OLS and obtains the residuals. Then, it checks the residual series using the ADF test. If the residual series is stationary, the two time series are cointegrated and have a causal relationship in the short and long run.
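The first step of the EG procedure can be sketched as follows, under the assumption of a single regressor with an intercept. The function name `ols_residuals` and the synthetic series are illustrative only; the second step would apply a unit root test to `resid`.

```python
import random


def ols_residuals(y, x):
    """Fit y = a + b * x by least squares; return (a, b, residuals)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = my - b * mx
    resid = [yi - a - b * xi for xi, yi in zip(x, y)]
    return a, b, resid


rng = random.Random(1)
x = [0.0]
for _ in range(299):
    x.append(x[-1] + rng.gauss(0.0, 1.0))               # nonstationary regressor
y = [2.0 + 0.5 * xi + rng.gauss(0.0, 0.1) for xi in x]   # shares the trend of x

a, b, resid = ols_residuals(y, x)
```

Because `y` is constructed to follow `x` up to stationary noise, the residual series here is stationary by design, which is the situation the EG test is meant to detect.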
The Johansen test was proposed by Søren Johansen to test cointegration among several, say $k$, $I(1)$ time series [35]. The test permits more than one cointegrating relationship. There are two types of Johansen test (trace and maximum eigenvalue). The null hypothesis for both tests is that the number of cointegration vectors is $r = r^{*} < k$; the alternative is $r = k$ for the trace test and $r = r^{*} + 1$ for the maximum eigenvalue test. Both Johansen tests are based on the vector autoregressive model.
2.2. Support Vector Machine (SVM)
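The support vector machine is a popular technique whose fundamental theory was introduced by Vapnik [36]. One of the advantages of SVM is the minimization of structural risk, which minimizes the upper bound of the generalization error rather than the local training error [37]. The SVM pursues the best tradeoff between the model's empirical error and the model complexity [30]. The regression formula is defined as
$$f(x) = w^{T}\varphi(x) + b,$$
where $b$ is the bias term and $\varphi(x)$ is the feature map. In the standard $\varepsilon$-insensitive formulation, $w$ is obtained by solving
$$\min_{w,\,b,\,\xi,\,\xi^{*}} \ \frac{1}{2}\|w\|^{2} + C\sum_{i=1}^{n}\left(\xi_i + \xi_i^{*}\right) \quad \text{s.t.} \quad y_i - w^{T}\varphi(x_i) - b \le \varepsilon + \xi_i, \quad w^{T}\varphi(x_i) + b - y_i \le \varepsilon + \xi_i^{*}, \quad \xi_i,\ \xi_i^{*} \ge 0,$$
where $C$ is the complexity penalization term, $\xi_i$ and $\xi_i^{*}$ are the slack variables, and $\alpha_i$, $\alpha_i^{*}$ correspond to the dual variables for the active constraints [38].
The technique converts a nonlinear problem into a linear problem using the kernel function $K(x_i, x_j) = \varphi(x_i)^{T}\varphi(x_j)$. In this paper, the RBF kernel is adopted, which can be expressed by
$$K(x_i, x_j) = \exp\left(-g\,\|x_i - x_j\|^{2}\right).$$
Finally, the nonlinear formula can be obtained by
$$f(x) = \sum_{i=1}^{n}\left(\alpha_i - \alpha_i^{*}\right)K(x_i, x) + b.$$
Here $C$ and $g$ are the two parameters (c and g) optimized by FPA in this paper.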
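To make the kernel expansion concrete, the following sketch evaluates an RBF-kernel regression function for given dual coefficients. It assumes the usual form in which the prediction is a bias plus a kernel-weighted sum over support vectors; the names `rbf_kernel` and `svr_predict` are hypothetical, and the support vectors and coefficients are toy values, not fitted to any data.

```python
import math


def rbf_kernel(u, v, g):
    """K(u, v) = exp(-g * ||u - v||^2)."""
    return math.exp(-g * sum((a - b) ** 2 for a, b in zip(u, v)))


def svr_predict(x, support_vectors, dual_coefs, b, g):
    """f(x) = sum_i coef_i * K(x_i, x) + b, with coef_i = alpha_i - alpha_i^*."""
    return b + sum(c * rbf_kernel(sv, x, g)
                   for sv, c in zip(support_vectors, dual_coefs))


# Toy values, purely illustrative.
svs = [[0.0, 0.0], [1.0, 1.0]]
coefs = [0.5, -0.25]
pred = svr_predict([0.0, 0.0], svs, coefs, b=1.0, g=0.5)
```

A larger `g` makes the kernel narrower, so each support vector influences a smaller neighborhood; this is one reason the choice of the kernel parameter matters for accuracy.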
2.3. Flower Pollination Algorithm (FPA)
The novel swarm intelligence (SI) technique of FPA was first proposed by Yang [33]. Flower pollination is an intriguing process in the natural world. Its evolutionary characteristics can be used to design new algorithms.
The main purpose of a flower is ultimately reproduction via pollination. Pollination can take two major forms: abiotic and biotic. About 90% of flowering plants belong to biotic pollination; that is, pollen is transferred by a pollinator such as insects and animals. About 10% of pollination takes abiotic form which does not require any pollinators. The flower constancy may have evolutionary advantages because this will maximize the transfer of flower pollen to the same or conspecific plants, thus maximizing the reproduction of the same flower species [33].
Pollination can be achieved by self-pollination or cross-pollination. Cross-pollination, or allogamy, means pollination can occur from pollen of a flower of a different plant, while self-pollination is the fertilization of one flower from pollen of the same flower or different flowers of the same plant. Biotic cross-pollination may occur at long distance, and the pollinators can fly a long distance, which is considered as the global pollination. The algorithm idealizes the characteristics of the pollination process, flower constancy, and pollinator behavior as the following rules:
(1) Biotic and cross-pollination are considered as the global pollination process, with pollen-carrying pollinators performing Lévy flights.
(2) Abiotic and self-pollination are considered as local pollination.
(3) Flower constancy can be considered as the reproduction probability, which is proportional to the similarity of the two flowers involved.
(4) Local pollination and global pollination are controlled by a switch probability $p \in [0, 1]$. Due to physical proximity and other factors such as wind, local pollination can have a significant fraction in the overall pollination activities.
There are two key steps in the algorithm: global pollination and local pollination. In the global pollination step, pollen can travel over a long distance because insects can fly and move over a longer range. The first rule plus flower constancy can be represented mathematically as
$$x_i^{t+1} = x_i^{t} + L\,\left(g_{*} - x_i^{t}\right),$$
where $x_i^{t}$ is the pollen $i$ at iteration $t$, and $g_{*}$ is the current best solution found among all solutions at the current iteration. The parameter $L$ is the strength of pollination, which is drawn from a Lévy distribution
$$L \sim \frac{\lambda\,\Gamma(\lambda)\sin(\pi\lambda/2)}{\pi}\,\frac{1}{s^{1+\lambda}}, \quad s \gg s_0 > 0.$$
The local pollination (Rule 2) and flower constancy can be represented as
$$x_i^{t+1} = x_i^{t} + \epsilon\,\left(x_j^{t} - x_k^{t}\right),$$
where $x_j^{t}$ and $x_k^{t}$ are random pollen from different flowers of the same plant species, and $\epsilon$ is drawn from a uniform distribution in $[0, 1]$. From many simulations, $p = 0.8$ works better for most applications. The flower pollination algorithm (FPA) is presented in Figure 1.
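The two pollination steps can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: Lévy steps are sampled with Mantegna's algorithm, a draw below `p` selects global pollination (some implementations use the opposite convention), candidates are clipped to the search bounds, and a candidate replaces the current flower only if it is not worse. The names `levy_step` and `fpa_minimize` are hypothetical, and the sphere function stands in for the objective; in the hybrid model the objective would instead be an SVM validation error as a function of (c, g).

```python
import math
import random


def levy_step(rng, lam=1.5):
    """One Levy-distributed step via Mantegna's algorithm (assumed sampler)."""
    sigma = (math.gamma(1 + lam) * math.sin(math.pi * lam / 2)
             / (math.gamma((1 + lam) / 2) * lam * 2 ** ((lam - 1) / 2))) ** (1 / lam)
    u = rng.gauss(0.0, sigma)
    v = rng.gauss(0.0, 1.0)
    return u / abs(v) ** (1 / lam)


def fpa_minimize(f, dim, bounds, n_flowers=20, n_iter=200, p=0.8, seed=0):
    rng = random.Random(seed)
    lo, hi = bounds
    clip = lambda z: min(max(z, lo), hi)
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_flowers)]
    fit = [f(x) for x in pop]
    best_i = min(range(n_flowers), key=fit.__getitem__)
    best, best_fit = pop[best_i][:], fit[best_i]
    history = [best_fit]
    for _ in range(n_iter):
        for i in range(n_flowers):
            if rng.random() < p:
                # Global pollination: Levy flight relative to the best flower.
                new = [clip(x + levy_step(rng) * (g - x))
                       for x, g in zip(pop[i], best)]
            else:
                # Local pollination: mix two randomly chosen flowers.
                j, k = rng.sample(range(n_flowers), 2)
                eps = rng.random()
                new = [clip(x + eps * (a - b))
                       for x, a, b in zip(pop[i], pop[j], pop[k])]
            new_fit = f(new)
            if new_fit <= fit[i]:              # greedy acceptance
                pop[i], fit[i] = new, new_fit
                if new_fit < best_fit:
                    best, best_fit = new[:], new_fit
        history.append(best_fit)
    return best, best_fit, history


sphere = lambda x: sum(v * v for v in x)
best, best_fit, history = fpa_minimize(sphere, dim=2, bounds=(-5.0, 5.0))
```

The greedy acceptance step means the best fitness recorded in `history` never increases from one iteration to the next.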
2.4. The Hybrid Model CIFPASVM
In this section, the proposed novel hybrid model CIFPASVM is described in detail. First, we obtain the causal relationship among the four particulate matter time series by CI, using the unit root test and the cointegration test. Then, the nonlinear model between the input and the target is built by SVM, which is optimized by FPA. Finally, the prediction of PM is obtained by the proposed hybrid model. The structure of the proposed hybrid model is illustrated in Figure 2.
3. Empirical Study
3.1. Study Areas Description
To verify the effectiveness of the proposed hybrid model, Kunming and Yuxi are selected as the study areas (Figure 3). The detailed information of the study areas is as follows.
Kunming is the capital and largest city of Yunnan Province, Southwest China, with a population of 6.677 million in 2016. It is located between 24°23′ and 26°22′N latitude and 102°10′ and 103°40′E longitude, with a total area of 21,600 square kilometers. The city is situated in a fertile lake basin on the northern shore of Lake Dian and is surrounded by mountains to the north, west, and east; the altitude of downtown is 1891 meters. Kunming has a subtropical monsoon climate, and the average temperature is around 16.5°C. The annual precipitation is about 1450 mm, making it a high humidity area. Besides, Kunming is a major tourist and trade city, with the GDP being 4300 billion yuan in 2016. With the rapid development of Kunming, its environmental problems need more attention.
Yuxi is located in the center of Yunnan Province, about 90 kilometers south of Kunming. It is located between 23°19′ and 24°53′N latitude and 101°16′ and 103°09′E longitude. Like many of the central and eastern parts of the province, it is part of the Yunnan-Guizhou Plateau. The area is 15,285 km^{2} and the population is approximately 2.5 million. Tempered by the low latitude and moderate elevation, Yuxi has a mild subtropical highland climate, with short, mild, dry winters and warm, rainy summers. The annual average temperature is about 15.4–24.2°C and the precipitation is about 787.8–1000 mm. In addition to its complex natural conditions, Yuxi is an important economic center and its GDP is 1309 billion yuan in 2016, so it plays a pivotal role in the development of Yunnan Province.
3.2. Data Description
The data sets of daily PM_{2.5} and PM_{10} concentration of Kunming and Yuxi used in this paper are retrieved from the website of the online air quality monitoring and analysis platform of China (https://www.aqistudy.cn/historydata/). The daily PM_{2.5} and PM_{10} concentration data from January 1, 2015, to August 23, 2016, in Kunming and Yuxi are collected (Figure 4). Each data set is divided into two sets: the training data set, including 491 data points (from January 1, 2015, to May 5, 2016), and the testing data set, consisting of the remaining 110 data points (from May 6, 2016, to August 23, 2016).
Figure 4: Daily PM_{2.5} and PM_{10} concentrations. (a) Kunming. (b) Yuxi.
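The sizes of the training and testing sets follow directly from the calendar, as the short check below shows. It assumes one record per calendar day with no missing days, which the data description implies but does not state explicitly; the variable names are illustrative.

```python
from datetime import date, timedelta

start, split, end = date(2015, 1, 1), date(2016, 5, 5), date(2016, 8, 23)

# Enumerate every calendar day in the study period, inclusive of both ends.
days = [start + timedelta(days=i) for i in range((end - start).days + 1)]

train_days = [d for d in days if d <= split]   # January 1, 2015 to May 5, 2016
test_days = [d for d in days if d > split]     # May 6, 2016 to August 23, 2016
```

The split is chronological, so the model is always evaluated on days that come after every training day.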
Table 1 shows the statistics of the training, testing, and total data for daily PM_{2.5} and PM_{10} concentrations of Kunming and Yuxi. The recorded daily maximum PM_{2.5} is 95.8 and 91 for Kunming and Yuxi, both appearing on March 22, 2015. The recorded daily maximum PM_{10} is 129 and 121 for Kunming and Yuxi, appearing on July 9, 2016, and June 19, 2015, respectively.
 
Min: the minimum. Max: the maximum. Std: the standard deviation. SK: the skewness. CV: the coefficient of variation. 
3.3. Evaluation Criteria
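The root-mean-square error (RMSE), the mean absolute error (MAE), the mean bias error (MBE), and Pearson's correlation coefficient ($R$) are used to evaluate the reliability of the CIFPASVM model. RMSE and MAE measure residual errors, which give a global idea of the difference between the observed and forecast values. RMSE is used to measure the sensitivity and extremum effect of the predicted value. MAE is used to evaluate the absolute error range of the predicted value. $R$ is used to show the linear correlation between the observed data and the forecasted values. Lower values of MAE and RMSE indicate a better model. MBE indicates whether the model overpredicts or underpredicts in general. MBE is better when it is close to 0, while $R$ is better when it is close to 1. RMSE, MAE, MBE, and $R$ are calculated as follows:
$$\begin{aligned}
\mathrm{RMSE} &= \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(y_i - \hat{y}_i\right)^{2}},\\
\mathrm{MAE} &= \frac{1}{N}\sum_{i=1}^{N}\left|y_i - \hat{y}_i\right|,\\
\mathrm{MBE} &= \frac{1}{N}\sum_{i=1}^{N}\left(\hat{y}_i - y_i\right),\\
R &= \frac{\sum_{i=1}^{N}\left(y_i - \bar{y}\right)\left(\hat{y}_i - \bar{\hat{y}}\right)}{\sqrt{\sum_{i=1}^{N}\left(y_i - \bar{y}\right)^{2}\,\sum_{i=1}^{N}\left(\hat{y}_i - \bar{\hat{y}}\right)^{2}}},
\end{aligned}$$
where $y_i$ is the observed value and $\hat{y}_i$ is the forecasted value corresponding to $y_i$. $N$ is the number of observations in the validation set. $\bar{y}$ and $\bar{\hat{y}}$ stand for the mean of the observed values and the mean of the forecasted values, respectively.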
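The four criteria are straightforward to compute, as the following sketch shows. The function name `evaluate` is hypothetical, the three-point example uses made-up numbers, and MBE is taken here as forecast minus observed, which is an assumption about the sign convention.

```python
import math


def evaluate(obs, pred):
    """Return RMSE, MAE, MBE, and Pearson's R for paired observations and forecasts."""
    n = len(obs)
    err = [p - o for o, p in zip(obs, pred)]        # forecast minus observed
    rmse = math.sqrt(sum(e * e for e in err) / n)
    mae = sum(abs(e) for e in err) / n
    mbe = sum(err) / n
    mo, mp = sum(obs) / n, sum(pred) / n
    cov = sum((o - mo) * (p - mp) for o, p in zip(obs, pred))
    var_o = sum((o - mo) ** 2 for o in obs)
    var_p = sum((p - mp) ** 2 for p in pred)
    r = cov / math.sqrt(var_o * var_p)
    return {"RMSE": rmse, "MAE": mae, "MBE": mbe, "R": r}


# Made-up values for illustration only.
scores = evaluate([10.0, 20.0, 30.0], [12.0, 18.0, 33.0])
```

In this toy case the errors are 2, -2, and 3, so MAE is 7/3 while MBE is 1: positive and negative errors cancel in MBE but not in MAE, which is why the two are reported together.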
3.4. Process of Cointegration Test
3.4.1. Result of Unit Root Tests
To estimate the cointegration of the time series variables, all of the time series variables first need to be tested for stationarity in order to avoid problems with spurious correlation. The Augmented Dickey-Fuller (ADF) unit root tests are employed to test the stationarity of the time series variables investigated in this study. Table 2 shows the results of the ADF tests, and the results indicate that all the time series variables are stationary at the 0.01 significance level. Therefore, all the time series variables are regarded as integrated of order zero, that is, $I(0)$.

3.4.2. Result of Cointegration Test
Table 3, obtained from the Johansen cointegration test, shows that the four variables Kunming_PM2.5, Kunming_PM10, Yuxi_PM2.5, and Yuxi_PM10 are cointegrated, as indicated by the star, which marks rejection of the null hypothesis at the 1% level (the trace statistic exceeds the 1% critical value). The results of the trace test and the maximum eigenvalue test verify the presence of a long-run relationship between Kunming_PM2.5, Kunming_PM10, Yuxi_PM2.5, and Yuxi_PM10.

3.5. Results and Analysis
3.5.1. Results of the Proposed Model
In this study, the input consists of the particulate matter concentrations of the past two days in Kunming and Yuxi. The following day's particulate matter concentration in Kunming or Yuxi is, respectively, selected as the output of the new hybrid model.
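The input-output construction described above can be sketched as follows. The sketch assumes that all four series (PM_{2.5} and PM_{10} for both cities) enter as inputs with lags of one and two days; the function name `make_lagged` and the toy numbers are illustrative and are not the paper's data.

```python
def make_lagged(series_by_name, target, lags=2):
    """Build inputs from the past `lags` days of every series and next-day targets."""
    names = sorted(series_by_name)
    n = len(series_by_name[target])
    X, y = [], []
    for t in range(lags, n):
        # For each series: value at t-1, then t-2, and so on.
        row = [series_by_name[name][t - k]
               for name in names for k in range(1, lags + 1)]
        X.append(row)
        y.append(series_by_name[target][t])
    return X, y


# Made-up values for illustration only.
data = {
    "Kunming_PM2.5": [1, 2, 3, 4],
    "Kunming_PM10": [10, 20, 30, 40],
    "Yuxi_PM2.5": [5, 6, 7, 8],
    "Yuxi_PM10": [50, 60, 70, 80],
}
X, y = make_lagged(data, target="Kunming_PM2.5")
```

With four series and two lags, every input row has eight features, and the first two days of the record are used only as history.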
Figures 5 and 6 show that the proposed model predicts PM_{2.5} and PM_{10} in Kunming and Yuxi well. According to the results in the testing period, the performance of the proposed model for PM_{2.5} in Yuxi is better than that in Kunming. For PM_{2.5} forecasting, the hybrid model CIFPASVM obtains an RMSE, MAE, and MBE of 6.58, 5.31, and −2.57 in Kunming, respectively, and yields an RMSE, MAE, and MBE of 4.96, 4.06, and −2.22 in Yuxi, respectively (Table 4). Meanwhile, the accuracy for PM_{10} in Kunming is higher than that in Yuxi. For PM_{10}, the proposed model achieves an RMSE, MAE, and MBE of 7.86, 6.6, and −3.22 in Kunming, while the values of RMSE, MAE, and MBE are 10.35, 8.37, and −3.32 in Yuxi (Table 5). Overall, the hybrid model CIFPASVM provides accurate 1-day-ahead predictions of the PM time series for Kunming and Yuxi.


Figures 5 and 6: Forecasting results of the proposed model for Kunming and Yuxi. Each figure has panels (a) PM_{2.5} and (b) PM_{10}.
3.5.2. Model Comparisons
In this study, forecasting performance for the daily particulate matter (PM_{2.5} and PM_{10}) of Kunming and Yuxi is compared among the proposed hybrid model CIFPASVM (Model 1), the hybrid model CIPSOSVM (Model 2), the hybrid model CIGASVM (Model 3), the hybrid model CISVM (Model 4), the hybrid model FPASVM (Model 5), the hybrid model CIFPANN (Model 6), and multiple linear regression (Model 7). The forecasting performances of the different models are presented in Tables 4 and 5. The empirical study shows that the proposed hybrid model CIFPASVM is superior to all the considered benchmark models, which suggests that the hybrid model combines the advantages of its individual components.
As for the forecasting of PM_{2.5} in Kunming and Yuxi in Table 4, the proposed hybrid model CIFPASVM has the best performance among all the hybrid models. In particular, compared with CIPSOSVM and CIGASVM, the proposed hybrid model achieves the highest accuracy in both regions. This reveals that FPA has a better optimizing performance than the traditional optimization methods (PSO and GA). What is more, the hybrid model FPASVM yields a worse predictive result in this study, which means the cointegration theory plays an important role in the hybrid model. Meanwhile, we can also conclude that the prediction of PM_{2.5} in Yuxi is superior to that in Kunming.
Next, the performance of the proposed hybrid model CIFPASVM and the compared models in the prediction of PM_{10} in Kunming and Yuxi is displayed in Table 5. Firstly, the proposed model has the best overall performance in Kunming and Yuxi among all the models. Secondly, comparing CIFPASVM with CIPSOSVM and CIGASVM shows that FPA finds better values of the two SVM parameters than PSO and GA. Thirdly, the three statistical errors (MAE, RMSE, and MBE) of FPASVM are the highest in both Kunming and Yuxi, which means CI has an important role in prediction.
Then, it should be noted that Model 6 and Model 7 are considered classical models for PM concentration forecasting. Model 6, in which an artificial neural network is selected as the main algorithm to capture the nonlinear relationship between input and output, performs better than Model 7 but worse than the proposed Model 1 according to the four indicators (MAE, RMSE, MBE, and $R$). Meanwhile, Model 7, the most traditional method, which fits a linear relationship by the least squares method, obtains the lowest accuracy among all seven considered models.
Above all, the hybrid model CIFPASVM in this paper is simple and quite efficient in the prediction of PM.
4. Conclusions
In order to predict particulate matter pollution, a serious environmental issue, this paper proposes a new model called CIFPASVM, which combines the flower pollination algorithm with the support vector machine (FPASVM) on the basis of cointegration theory (CI). The model consists of two parts. The first part introduces the information of related ambient sequences into the hybrid model by cointegration theory, so that it can make full use of the information for prediction. The cointegration theory provides a useful and effective tool for extracting functional relationships between inputs and outputs, and it can avoid the occurrence of spurious regression. To establish the forecasting part, SVM, in which the parameters c and g are optimized by FPA, is employed in this study. In the empirical study, the proposed hybrid model CIFPASVM is utilized to forecast daily PM_{2.5} and PM_{10} concentrations in Kunming and Yuxi. It is compared with six benchmark models: the FPASVM model, which has no cointegration theory as foundation; the CISVM model, which uses no optimization algorithm; two other models based on cointegration theory but optimized by the traditional algorithms PSO and GA, called CIPSOSVM and CIGASVM; and two classical methods, the CIFPANN model and the multiple linear model. The results indicate that the proposed hybrid model CIFPASVM is superior to all considered benchmark models in both Kunming and Yuxi in terms of predictive accuracy.
However, in this paper, we only take the correlation of particulate matter (PM_{2.5} and PM_{10}) and the influence of the surrounding city into consideration, without considering the possible impacts of other pollutants, such as NO, CO_{2}, and SO_{2}. These factors are likely to be important for prediction. Investigating how to select appropriate and reasonable components to construct the model may be a future research direction. Another potential direction is to extend this hybrid model with such additional inputs to further enhance and optimize its performance.
Abbreviations
ADF:  Augmented Dickey-Fuller
AIC:  Akaike information criterion 
ANN:  Artificial neural network 
ARIMA:  Autoregressive integrated moving average model 
CEEMD:  Complementary ensemble empirical mode decomposition
CI:  Cointegration theory 
FPA:  Flower pollination algorithm 
GA:  Genetic algorithm 
GWO:  Grey wolf optimizer 
ICEEMD:  Improved complementary ensemble empirical mode decomposition
MAE:  Mean absolute error 
MBE:  Mean bias error 
OLS:  Ordinary least square 
PM:  Particulate matter
PSO:  Particle swarm optimization 
RMSE:  Root-mean-square error
SC:  Schwarz criterion
SI:  Swarm intelligence 
SVM:  Support vector machine 
SVR:  Support vector regression 
WOA:  Whale optimization algorithm 
RBF:  Radial basis function. 
Conflicts of Interest
The authors declare no conflicts of interest.
Authors’ Contributions
Weide Li designed research; Demeng Kong drafted this paper; Jinran Wu performed research and analyzed data.
Acknowledgments
The authors acknowledge the National Natural Science Foundation of China (Grant no. 41571016) for providing support for this research.
References
[1] G. Emenius, G. Pershagen, N. Berglind et al., “NO_{2}, as a marker of air pollution, and recurrent wheezing in children: A nested case-control study within the BAMSE birth cohort,” Occupational and Environmental Medicine, vol. 60, no. 11, pp. 876–881, 2003.
[2] W. R. Wan Mahiyuddin, M. Sahani, R. Aripin, M. T. Latif, T.-Q. Thach, and C.-M. Wong, “Short-term effects of daily air pollution on mortality,” Atmospheric Environment, vol. 65, pp. 69–79, 2013.
[3] A. C. Comrie and J. E. Diem, “Climatology and forecast modeling of ambient carbon monoxide in Phoenix, Arizona,” Atmospheric Environment, vol. 33, no. 30, pp. 5023–5036, 1999.
[4] M. Niu, Y. Wang, S. Sun, and Y. Li, “A novel hybrid decomposition-and-ensemble model based on CEEMD and GWO for short-term PM_{2.5} concentration forecasting,” Atmospheric Environment, vol. 134, pp. 168–180, 2016.
[5] I. B. Konovalov, M. Beekmann, F. Meleux, A. Dutot, and G. Foret, “Combining deterministic and statistical approaches for PM_{10} forecasting in Europe,” Atmospheric Environment, vol. 43, no. 40, pp. 6425–6434, 2009.
[6] G. Grivas and A. Chaloulakou, “Artificial neural network models for prediction of PM_{10} hourly concentrations, in the Greater Area of Athens, Greece,” Atmospheric Environment, vol. 40, no. 7, pp. 1216–1229, 2006.
[7] M. Cai, Y. Yin, and M. Xie, “Prediction of hourly air pollutant concentrations near urban arterials using artificial neural network approach,” Transportation Research Part D: Transport and Environment, vol. 14, no. 1, pp. 32–41, 2009.
[8] M. Caselli, L. Trizio, G. De Gennaro, and P. Ielpo, “A simple feedforward neural network for the PM_{10} forecasting: Comparison with a radial basis function network and a multivariate linear regression model,” Water, Air, and Soil Pollution, vol. 201, no. 1–4, pp. 365–377, 2009.
[9] G. De Gennaro, L. Trizio, A. Di Gilio et al., “Neural network model for the prediction of PM_{10} daily concentrations in two sites in the Western Mediterranean,” Science of the Total Environment, vol. 463-464, pp. 875–883, 2013.
[10] W. Ding, J. Zhang, and Y. Leung, “Prediction of air pollutant concentration based on sparse response backpropagation training feedforward neural networks,” Environmental Science and Pollution Research, vol. 23, no. 19, pp. 19481–19494, 2016.
[11] A. Suárez Sánchez, P. J. García Nieto, P. Riesgo Fernández, J. J. del Coz Díaz, and F. J. Iglesias-Rodríguez, “Application of an SVM-based regression model to the air quality study at local scale in the Avilés urban area (Spain),” Mathematical and Computer Modelling, vol. 54, no. 5-6, pp. 1453–1466, 2011.
[12] P. J. García Nieto, E. F. Combarro, J. J. Del Coz Díaz, and E. Montañés, “A SVM-based regression model to study the air quality at local scale in Oviedo urban area (Northern Spain): A case study,” Applied Mathematics and Computation, vol. 219, no. 17, pp. 8923–8937, 2013.
[13] W. D. Li, D. M. Kong, and J. R. Wu, “A Novel Hybrid Model Based on Extreme Learning Machine, k-Nearest Neighbor Regression and Wavelet Denoising Applied to Short-Term Electric Load Forecasting,” Energies, vol. 10, no. 5, p. 694, 2017.
[14] L. A. Díaz-Robles, J. C. Ortega, J. S. Fu et al., “A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile,” Atmospheric Environment, vol. 42, no. 35, pp. 8331–8340, 2008.
[15] Y. Feng, W. Zhang, D. Sun, and L. Zhang, “Ozone concentration forecast method based on genetic algorithm optimized back propagation neural networks and support vector machine data classification,” Atmospheric Environment, vol. 45, no. 11, pp. 1979–1985, 2011.
[16] M. Fu, W. Wang, Z. Le, and M. S. Khorram, “Prediction of particular matter concentrations by developed feedforward neural network with rolling mechanism and gray model,” Neural Computing and Applications, vol. 26, no. 8, pp. 1789–1797, 2015.
[17] Y. Xu, W. Yang, and J. Wang, “Air quality early-warning system for cities in China,” Atmospheric Environment, vol. 148, pp. 239–257, 2017.
[18] R. F. Engle and C. W. J. Granger, “Cointegration and error correction: representation, estimation, and testing,” Econometrica, vol. 55, no. 2, pp. 251–276, 1987.
[19] M. Belloumi, “Energy consumption and GDP in Tunisia: Cointegration and causality analysis,” Energy Policy, vol. 37, no. 7, pp. 2745–2753, 2009.
[20] M. Shahbaz, C. F. Tang, and M. Shahbaz Shabbir, “Electricity consumption and economic growth nexus in Portugal using cointegration and causality approaches,” Energy Policy, vol. 39, no. 6, pp. 3529–3536, 2011.
[21] M. Jahangir Alam, I. Ara Begum, J. Buysse, and G. Van Huylenbroeck, “Energy consumption, carbon emissions and economic growth nexus in Bangladesh: cointegration and dynamic causality analysis,” Energy Policy, vol. 45, pp. 217–225, 2012.
 B. Saboori, J. Sulaiman, and S. Mohd, “Economic growth and CO_{2} emissions in Malaysia: A cointegration analysis of the Environmental Kuznets Curve,” Energy Policy, vol. 51, pp. 184–191, 2012. View at: Publisher Site  Google Scholar
 E. Dogan, “The relationship between economic growth and electricity consumption from renewable and nonrenewable sources: A study of Turkey,” Renewable and Sustainable Energy Reviews, vol. 52, pp. 534–546, 2015. View at: Publisher Site  Google Scholar
 J. Zhang, Y. Zhao, and W. Xiao, “Multiresolution cointegration prediction for runoff and sediment load,” Water Resources Management, vol. 29, no. 10, article no. A009, pp. 3601–3613, 2015. View at: Publisher Site  Google Scholar
 E. de Cian, E. Lanzi, and R. Roson, “Seasonal temperature variations and energy demand: A panel cointegration analysis for climate change impact assessment,” Climatic Change, vol. 116, no. 34, pp. 805–825, 2013. View at: Publisher Site  Google Scholar
 J. M. Hu, J. Z. Wang, and K. L. Ma, “A hybrid technique for shortterm wind speed prediction,” Energy, vol. 81, no. 1, pp. 563–574, 2015. View at: Publisher Site  Google Scholar
 F. Zhang, C. Deb, S. E. Lee, J. Yang, and K. W. Shah, “Time series forecasting for building energy consumption using weighted Support Vector Regression with differential evolution optimization technique,” Energy and Buildings, vol. 126, pp. 94–103, 2016. View at: Publisher Site  Google Scholar
 Y. Liang, D. Niu, M. Ye, and W. Hong, “Shortterm load forecasting based on wavelet transform and least squares support vector machine optimized by improved cuckoo search,” Energies, vol. 9, no. 10, pp. 827–843, 2016. View at: Publisher Site  Google Scholar
 Q. Wu and C. Peng, “A least squares support vector machine optimized by cloudbased evolutionary algorithm for wind power generation prediction,” Energies, vol. 9, no. 8, article no. 585, 2016. View at: Publisher Site  Google Scholar
 G. SantamaríaBonfil, A. ReyesBallesteros, and C. Gershenson, “Wind speed forecasting for wind farms: a method based on support vector regression,” Renewable Energy, vol. 85, pp. 790–809, 2016. View at: Publisher Site  Google Scholar
 W. Sun and J. Sun, “Daily PM_{2.5} concentration prediction based on principal component analysis and LSSVM optimized by cuckoo search algorithm,” Journal of Environmental Management, vol. 188, pp. 144–152, 2017. View at: Publisher Site  Google Scholar
 S. Sreekumar, J. Verma, A. Sujil, and R. Kumar, “Comparative Analysis of Intelligently Tuned Support Vector Regression Models for Short Term Load Forecasting in Smart Grid Framework,” Technology and Economics of Smart Grids and Sustainable Energy, vol. 2, no. 1, 2017. View at: Publisher Site  Google Scholar
 X. S. Yang, “Flower pollination algorithm for global optimization,” in Unconventional Computation and Natural Computation, vol. 7445 of Lecture Notes in Computer Science, pp. 240–249, Springer, Berlin, Germany, 2012. View at: Publisher Site  Google Scholar
 D. A. Dickey and W. A. Fuller, “Distribution of the estimators for autoregressive time series with a unit root,” Journal of the American Statistical Association, vol. 74, no. 366, part 1, pp. 427–431, 1979. View at: Google Scholar  MathSciNet
 S. Johansen, “Estimation and hypothesis testing of cointegration vectors in Gaussian vector autoregressive models,” Econometrica, vol. 59, no. 6, pp. 1551–1580, 1991. View at: Publisher Site  Google Scholar  MathSciNet
 V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, NY, USA, 1995. View at: Publisher Site  MathSciNet
 M. Shenify, A. S. Danesh, M. Gocić et al., “Precipitation estimation using support vector machine with discrete wavelet transform,” Water Resources Management, vol. 30, no. 2, pp. 641–652, 2015. View at: Publisher Site  Google Scholar
 B. Scholkopf and A. Smola, Learning with Kernels, The MIT Press, Cambridge, Mass, USA, 2002.
Copyright
Copyright © 2017 Weide Li et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.