Abstract

Energy crisis has made it urgent to find alternative energy sources for sustainable energy supply; wind energy is one of the attractive alternatives. Within a wind energy system, the wind speed is one key parameter; accurately forecasting of wind speed can minimize the scheduling errors and in turn increase the reliability of the electric power grid and reduce the power market ancillary service costs. This paper proposes a new hybrid model for long-term wind speed forecasting based on the first definite season index method and the Autoregressive Moving Average (ARMA) models or the Generalized Autoregressive Conditional Heteroskedasticity (GARCH) forecasting models. The forecasting errors are analyzed and compared with the ones obtained from the ARMA, GARCH model, and Support Vector Machine (SVM); the simulation process and results show that the developed method is simple and quite efficient for daily average wind speed forecasting of Hexi Corridor in China.

1. Introduction

Energy crisis has made it urgent to find alternative energy sources for sustainable energy supply. Since the first worldwide oil crisis in the early 1970s, energy supply has become tighter and tighter. Many countries have enacted laws to promote renewable energy supply and to reduce the fossil fuel-based energy usage. In the twenty-first century, as the prices of crude oil in the international oil market has gone up like a rocket and the oil resources has been diminishing, renewable energy, such as wind energy and solar power, has attracted more and more attention from both developed and developing countries, especially from the countries that are heavy energy consumers. In recent years, the collective installed capacity of wind power systems increase with an average annual growth rate of 24.8%. There are over 100 thousand wind turbine units have been installed. The installed capacity has reached 94.1 million kilowatts and the revenue of wind power industry has exceeded 37 billion U.S. dollars. With the pressure of reducing greenhouse-gas emissions and mitigating the impact of climate changes, both developed and developing countries have been devoting more and more efforts in renewable energy development. Amongst the affordable renewable energy sources, wind power is considered as one of the most attractive clean energy sources and has becomes the first choice for renewable energy development.

The development of wind power industry is one of the China's responses to energy challenges. As China's economic development and industrialization and urbanization are speeding up, energy demand grows rapidly so that China has become the world's second energy producing and consuming country. China, however, has relative shortage of high-quality energy sources and the majority of energy sources are fossil based, such as coal and gas, which are neither environmental friendly nor sustainable in long term. In order to fix China's energy problems, a fundamental strategy is to promote energy conservation as well as to vigorously develop new energy and renewable energy sources, such as solar energy, biomass and wind energy. Amongst these alternatives, wind power has more advantages, such as a high degree of industrial maturity, low cost of power generation, good physical and social environmental impact, and become the first choice of renewable energy sources in China. In addition, wind speed is one of the most difficult weather parameters to forecast as a result of the continuous and chaotic fluctuations of the wind, this can give rise to difficulties in the energy transportation and power balance of the network. A great number of authors have proposed different kinds of methods to forecast wind speed in [112]. As can be seen from the literature review, hybrid models based on first definite season index method and ARMA or GARCH have not been proposed for long-term wind speed forecasting. Consequently, in this paper the approach for wind speed prediction in the Hexi Corridor Region of China is presented.

There are four advantages of the proposed methodology. Firstly, wind speed of Zhangye area exhibit seasonality and time-varying volatility attributed to weather, pressure difference and so on. We wanted to detach seasonality that would be easily comprehensible in application. Then, GARCH model is usually applied in stock prediction, due to wind speed have uncertain changes that are similar to stock, consequently, this study adopts GARCH model to predict wind speed in the Hexi Corridor Region of China. Furthermore, ARMA and GARCH eliminated the seasonal factor effects create commendable improvements that are relatively satisfactorily for current research. The wind speed forecasting errors are in general ranging from 25% to 40% [13]. It is not only with relation to the prediction methods, but also with relation to the forecast period and wind speed characteristics of the forecast location. In general, the shorter the forecast period is and the more relax the wind speed of the forecast location are, the smaller the forecast error will be, otherwise, the larger the forecast error will be [14]. However, this study concludes that the long-term forecasting errors 27.4126% and 26.0001%. Finally, there are few papers to forecast long-term wind speed, but it is very important for the programme and dispatching of wind park. The main objective of this paper is to offer developing methods of long-term wind speed forecasting, which can provide tendency analysis and reference for wind power equipment operation and wind power system planning. However, study on long-term wind speed forecasting is rarely reported in the literature. Therefore, a simple and efficient forecasting method for long-term wind speed forecasting is significant, necessary and highly desirable for the development of the wind energy systems.

The rest of paper is organized as follows. Section 2 describes the forecasting model, including the Jonckheere-Terpstra test, the first definite season index method for eliminating the seasonal factor effects and forecasting models based on either the ARMA model or the GARCH model (ES-ARMA model and ES-GARCH model). Section 3 introduces two loss functions for evaluation of forecasting performance. Section 4 presents a case study which applies the developing forecasting procedure to the wind speed data sets obtained in the Hexi Corridor Region Zhangye city of China from 2001 to 2006 (six years) and the analysis of the results.

2. The Developing Forecasting Model for Long-Term Wind Speed Prediction

The developing forecasting models are primarily for predicting long-term wind speed based on historical wind speed data (come from Meteorological Bureau in Gansu Province http://cdc.cma.gov.cn/), which usually exhibit periodic patterns and seasonal factor effects. It can be built based on the ARMA and the GARCH model with data set verification and transformation phase, so the developing models are proposed: first, we eliminate the seasonal factor effects, and then use ARMA model and GARCH model to predict, these models are called ES-ARMA model and ES-GARCH model.

Moghram and Rahman [15], review five short-term load forecasting methods. The conclusion reached is that there is no one best approach, model performance under specific conditions should be analyzed and understood and incremental improvements made based on knowledge gained [16]. The developed hybrid ES-ARMA model and ES-GARCH model are more proper forecasting technique for daily average wind speed prediction in Zhangye city of Hexi Corridor through the historical data analysis, research, and compared with other conventional forecasting methods. So from this view, the simple and efficient method is the best option for the dispatching department of wind park.

The model consists of three typical stages: (1) perform the Jonckheere-Terpstra test for checking whether the given wind speed series of 2006 have significant difference with the ones from 2001 to 2005 or not, whether they exhibit a periodic pattern or not; (2) if the series are not significant, they have periodic pattern, preprocess the series by separating the seasonal factor effects using the first definite season index method; and (3) construct an ARMA model or a GARCH model for actual forecasting the future trend of the preprocessed series. Details about these stages are given in the following subsections.

2.1. Jonckheere-Terpstra Test

The Jonckheere-Terpstra test is a nonparametric test for ordered differences among classes (segments of data series). It tests the null hypothesis that the distribution of the response variable does not differ among the classes. The Jonckheere-Terpstra test statistics can be computed by the following two steps. (i)Form Mann-Whitney counts , where , for pairs of rows in the contingency table as follows: where is the response in the row. (ii)Compute the Jonckheere-Terpstra test statistics by .

This test will reject the null hypothesis of no difference among classes for large values of . Asymptotic -values for the Jonckheere-Terpstra test are obtained by using the normal approximation for the distribution of the standardized test statistics. The standardized test statistics can be computed by where is the number of sample groups, is the number of the th sample group (SAS/STAT User’s Guide). If -values less than the significant level , then the distributions of the response variables have significant difference, otherwise, they do not have significant difference.

2.2. First Definite Season Index Method

Suppose there is a data series and can be recorded in an array format as ;  ;  , where is the number of columns, which denotes the number of data points in one cycle, and is the number of rows, which denotes the number of cycles in the data series, the average value for each cycle can be calculated by: Dividing the time series by the cycle average value, one can have a normalized time series as: The seasonal index for the th data element in a cycle is defined as the average value of , which are corresponding to the same time sampling point in every cycle, and can be calculated by Thanks to the following fact: the seasonal index satisfies the normalized condition.

Dividing each data element in the time series by the corresponding seasonal index as follows: one can obtain a time series that has the seasonal factor effects separated.

2.3. ES-ARMA Forecasting Model

It is possible to build a linear stochastic model of a wind speed time series using the Autoregressive Moving Average (ARMA) model. An ARMA model of order () can be defined as or where is the number of autoregressive terms, and is the number of lagged forecast errors in the prediction equation, is series that eliminating seasonal effects and is stationary white noise with zero mean.

The order of an ARMA model can be determined by using the autocorrelation function (ACF) and the partial autocorrelation function (PACF), proposed by Box and Jenkins. Autocorrelation and partial autocorrelation graphs, which provide information about the AR and MA orders, are then drawn based on the specified lag numbers. Autoregressive (AR) process order is determined from the partial autocorrelation graph and similarly MA process order is determined from the autocorrelation graph [17].

2.4. ES-GARCH Forecasting Model
2.4.1. Construction of a GARCH Model

To relax the traditional assumptions of homogeneity, in 1982, Engle [18] suggests that the conditional variance can be modeled as a function of the squared lagged residual. That is, the predictable wind speed is dependent on past value. After that, in 1986, Bollerslev [18] generalizes the conditional variances in the ARCH model by assuming that the conditional variances are not only subject to the influences caused by the squared error terms, but also to the previous conditional variances.

After the seasonal effects are separated, a wind speed time series becomes stationary. It is possible to build GARCH model; the regression model of GARCH () for the time series can be described as where where is a sequence of independent, uniformly distributed random variables with zero mean and unit variance, that is, , . The model requires the following conditions:

The unobservable parameters are estimated through the Maximum Likelihood (ML) estimation. ML estimates are known to be asymptotically unbiased and efficient. The parameter estimation is based on maximizing a likelihood function for the available data [19]. Rigorous statistical hypothesis testing is applied to validate the model assumptions of the GARCH specification. The diagnosis checks verify the statistical significance and assumptions of the parameters in the GARCH model and its residuals (actual value minus predicted value) [19]. Statistical tests on the residuals include the -statistic, Ljung-Box statistic and plots, such as the autocorrelation and partial autocorrelation.

The Akaike’s Information Criterion (AIC) is a metric that can be used to select optimal model from a class of competing models. The AIC compares the accuracy of the model against its complexity measured by the number of parameters. For a model, the AIC is given by where is the number of parameters in the model and is the natural logarithm of the maximum likelihood [20].

2.4.2. Forecasting Procedure Using the GARCH Model

The procedure consists of the following steps: (i) eliminate the seasonal effects; (ii) forecast using the least square method; (iii) conduct the ARCH test. If the result is Yes, go to step (iv), otherwise, stop. The flow chart of this procedure is shown in Figure 1.

3. Evaluation of Forecasting Performance

Edison Electric Institute’s “Guide to Forecasting Methodology” [21] presents a brief description and examples of the major forecasting methods currently in use. These are: (1) Time Trend Forecast, (2) Time Series Forecast, (3) Informed Opinion, (4) End Use, (5) Econometric, and (6) Hybrid End Use and Econometric. The guide presents a set of criteria for evaluating for forecasting method: understandability, credibility, accuracy, cost, maintainability, and adaptability. Generally, the forecasting method used will depend on the relative emphasis placed on the criteria [16].

Two loss functions can be served as the criteria to evaluate the forecasting performance relative to wind speed value including mean absolute error (MAE) and mean relative error (MRE). The loss functions are expressed as follows:

The MAE measures are not very useful, because they have some relations with the original data. The MRE is widely used, and is the basic measures in this evaluation [22, 23]. We take the absolute values because the magnitude of error is more significant than the direction of error.

4. Case Study

4.1. Test Data Set

Although Hexi Corridor region in Gansu province is not the richest wind energy source in China, its reserves of wind energy is the fifth in China. Zhangye area is in the middle of Hexi Corridor and has become an important production base in Gansu Province. It potentially is a good site for developing a wind farm. Accurate prediction of long-term wind speed would be very helpful to aid the evaluation. Therefore, this case chooses the wind speed data obtained from the Zhangye area for a period of time as the test data set, which contains the daily wind speed values for the period of 2001–2006, as depicted in Figure 2.

The data set is divided into two parts: the first part contains the data from 2001 to 2005 and it is used for training the forecasting procedure; and the second part contains the data in year 2006 and it is used as the future wind speed values for testing the forecasting procedures. Every part is divided into four seasons: Spring, Summer, Autumn, and Winter, as shown in Table 1.

The implementation of the developing forecasting procedure and ARMA model are on a PC with one Pentium IV processor and 512 MB RAM at 1000 MHz. The software used are MATABLE and EVIEWS software package. The running times, including training and prediction, are within one minute for each test presented.

4.2. The Jonckheere-Terpstra Test

For the test data set, including the two parts, perform the Jonckheere-Terpstra test. The results of the test are tabulated in Table 2.

It can be seen from Table 2 that the difference of observed J-T statistic and mean J-T statistic is not obvious. The probability of J-T statistic is  .998. If the significant level is set to be  .05, that is, equals  .05, then the wind speed in each year have similar distribution as . The results show that the data in 2006 do not have significant difference with the data from 2001 to 2005, that is to say, the wind speed data have periodic characteristic.

4.3. Eliminating the Seasonal Effects Using the First Definite Season Index Method

Modeling seasonal time series has been one of the main research endeavors for decades. In the early 1920s, the decomposition model along with seasonal adjustment was the major research focus due to Persons [24, 25] work on decomposing a seasonal time series [26]. The significant assumption in many different seasonal adjustment approaches is that seasonality can be separated from other components of the time series. That is to say, the seasonal time series can be decomposed into a seasonal component and a nonseasonal component . The nonseasonal component can be further predicted by many methods. Seasonal adjustment is a process that estimates the seasonal component called seasonal factors and then dividing the original series by seasonal factors for multiplicative models or additive models. Applying the first definite season index method described in Section 2.2, the wind speed series that has the seasonal effects eliminated is obtained and plotted in Figure 3. This preprocessed wind speed series become smooth and is ready for building a linear stochastic model using ARMA or GARCH model.

4.4. Building a Linear Stochastic Model
4.4.1. Forecasting Procedure Using the ES-ARMA Model

(A) Construction of the ARMA Model
Based on the autocorrelation and partial autocorrelation graphs shown in Figures 4 and 5, two observations can be made: (1) the autocorrelation coefficients function (ACF) values vary significantly from zero to three lags and then statistically rapidly declines to zero; and (2) the partial autocorrelation coefficient function (PACF) value begins with a very high value at one lag and then statistically rapidly decrease to zero. Based on these two observations, the order of the ARMA model can be determined as , , that is, ARMA (1, 3).

(B) Forecasting Result
Figure 6 shows the predicted wind speed value by applying the ES-ARMA model.

4.4.2. Forecasting Procedure Using the ES-GARCH Model

(A) Construction of GARCH Model
GARCH model in EVIEWS soft is used to forecast the preprocessed wind speed series of Zhangye city. According to the flow chart of applying GARCH model shown in Figure 1, firstly, we used the least square method to estimate wind speed data; secondly, we preformed the ARCH test over the residuals by analyzing the autocorrelation and partial autocorrelation of the residuals and looking at the values of the -statistics, which is not strong. It has been found that the residuals have ARCH effect. The results of the ARCH test are given in Table 3. We then drew the residuals’ histogram, which follows student’s distribution. Moreover, when the parameters are estimated, in general we have checked that all of them are significant, according to the ARCH test; Consequently, the optimal lag order of conditional variance has been selected by the AIC, in all cases, the criteria agree and lead to the choice of . Therefore we chose GARCH (1,1) model to predict the future wind speed as. Figure 7 presents the predicted wind speeds using the ES-GARCH model.

4.5. Evaluation of Forecasting Results

We evaluate the variance forecasting ability of ARMA (1,3) and GARCH (1,1), ES-ARMA (1,3) model and ES-GARCH (1,1) model that have been eliminated the seasonal effects. Two loss functions (MAE and MRE) are considered as the evaluation methods of variance forecasts and the results are reported in Tables 4 and 5. These two loss function values of ES-ARMA (1,3) model and ES-GARCH (1,1) model are almost smaller than those of ARMA (1,3) and GARCH (1,1) model except the MRE of Summer by ES-ARMA model. This indicates that ES-ARMA (1,3) model and ES-GARCH (1,1) model have superior volatility forecasting ability for the case. Although ES-ARMA model only improves 4.673% compared with ARMA model, the forecasting values of ARMA model become a stationary line in Figure 8, and it is impossible to reflect the future trends and changes of wind speed. That is to say, this forecasting model is not persuasive. However, ES-ARMA model and ES-GARCH model give similar prediction trends for wind speed and improve their forecasting accuracy effectively. In addition, traditional SVM can be used to forecast wind speed, it is obvious that its accuracy is lower than the proposed models as seen in Table 6.

4.6. Discussions and Analysis

The forecasting errors are defined as the differences between the actual wind speed and the predicted one obtained from the forecasting models (ES-ARMA model and ES-GARCH model). The positive error values would imply that the model underestimates the actual wind speed while the negative values indicate that the model overestimates the actual wind speed. Our explanation to why the preprocessing eliminated seasonal effects helps is that seasonal variation of the time series with seasonality may account for the preponderance of its total change. Models that ignore these seasonal patterns will results in a high error and poor forecasting accuracy. The process of deseasonalizing removes these large seasonal variations from the original data, so helps improving the modeling accuracy.

It is also shown in Tables 4 and 5 that the proposed forecasting procedure using the ES-GARCH model gave the minimum MRE, the forecasting procedure using the ES-ARMA model gave the moderate MRE, whereas, ARMA and GARCH model gave the maximum MRE. Also, it has been observed that the proposed forecasting procedures using the ES-ARMA and ES-GARCH models lead to 4.673% and 8.2642% reductions in total MRE respectively in comparison with the ARIMA and GARCH model as shown in Figures 9 and 10. Furthermore, Table 6 indicates that the proposed two models are better than traditional SVM.

It has potential economic benefits for the proposed approaches, which has been used for the estimation of the reserve of effective wind energy in the Zhangye city with the Hexi Corridor region of China. The wind energy can be predicted using the following formula: , where is wind energy in , is air density in kg/m3, is wind speed in m/s, A is wind energy density in , which is the wind energy flow through the cross-sectional area that is perpendicular to the wind speed within a unit time.

When ES-GARCH is used for the prediction of wind energy in real-world wind power systems, a significant difference can be expected. For instance, based on the annual reserve energy of wind in Hexi Corridor region in China, when the wind energy is used to replace the steam power, although ES-GARCH improves 8.2642%, if ES-GARCH model is used, it can be expected to save 1606.2 million kilowatts of steam power, which are equivalent to the reduction of 469,551.8 tons coal annually. Moreover, it can be expected to reduce CO2 emissions by 123,026 tons, SO2 emissions by 6775.4 tons, and nitrogen oxide emissions by 6001 tons. This can yield an economic cost saving of 3685.3 million to 4625 million RMB annually.

5. Conclusions

The development of wind power industry is the important choice of China's response to energy challenges. China has become the world's second energy producing and consuming country, but generally speaking, China has relative shortage of high-quality energy sources, energy consumption is dominated by coal, environmental pressure, the energy supply system faces major challenges. To solve China's energy problems, the fundamental thing is to overall transform the economic development approach to promote energy conservation, at the same time, adhere to diversified development, vigorously develop new energy and renewable energy. Compared with solar energy, biomass and other renewable energy sources, wind power has a high degree of industrial maturity, low cost of power generation, good physical and social environmental impact, and so forth, should be the first choice of renewable energy development in future in China.

Although wind speed generally shows nonlinear, nonstationary and chaotic characteristics, it has been found based on the analysis of existing wind speed data that the wind speed of the Hexi Corridor region in China is periodic with one cycle per year. Seasonal variation is one of the most commonly encountered phenomena in many fields, such as electricity load, wind speed. How to best model and forecast the variation is a key task in planning and other related decision-making activities. This paper provides both approaches separated seasonal factors to predict wind speed of Zhangye area, the simulation results indicate the adequacy of the proposed models used in wind speed forecast. This procedure can enhance volatility forecasting ability of the well-known ARMA and GARCH model. Moreover, two loss functions (MAE and MRE) are used as the criteria for measuring the forecasting performance. Based on the loss functions, the results show that the new forecasting procedure using either the ES-ARMA model or the ES-GARCH model can give better performance. Consequently, this study has been able to touch on the effective methods to perform the long-term prediction problems.

Acknowledgment

This research was supported by the National Basic Research Program of China (Grant no. 2009CB421402).