Abstract

Monthly electric energy consumption forecasting is important for electricity production planning and electric power engineering decision making. Multiwindow moving average algorithm is proposed to decompose the monthly electric energy consumption time series into several periodic waves and a long-term approximately exponential increasing trend. Radial basis function (RBF) artificial neural network (ANN) models are used to forecast the extracted periodic waves. A novel hybrid growth model, which includes a constant term, a linear term, and an exponential term, is proposed to forecast the extracted increasing trend. The forecasting results of the monthly electric energy consumption can be obtained by adding the forecasting values of each model. To test the performance by comparison, the proposed and other three models are used to forecast China's monthly electric energy consumption from January 2011 to December 2012. Results show that the proposed model exhibited the best performance in terms of mean absolute percentage error (MAPE) and maximal absolute percentage error (MaxAPE).

1. Introduction

Electric energy consumption is essential for promoting economic development and raising people’s living standard [14]. In contrast to other energy sources, electric energy cannot be stored for large-scale consumption. From an economic viewpoint, the supply and demand of electric energy must be balanced at any given time. Therefore, a precise forecasting of electric energy consumption is very important for the economic operation of an electric power system. Studies on electric energy consumption forecasting are usually divided into two categories on the basis of forecasting indicators: short-term [58] and mid-/long-term forecasting [911]. The current study fits into the second category because monthly electric energy consumption forecasting is an important basis for electricity production planning and electric power engineering decision making.

Researchers have developed two ideas for mid-/long-term electric energy consumption forecasting. One idea focuses on the relationship between electric energy consumption and its influencing factors. Models originating from this idea usually formulate an equation to quantitatively simulate such relationship. These models obtain the forecasting results by inputting the values of the influencing factors during the forecasting period into the equation [1214]. Most of these models exhibit satisfactory performance in simulating historical data. However, the exact values of the influencing factors during the forecasting period are difficult to determine; therefore, these models usually cannot produce satisfactory results when used for forecasting. The other idea concentrates on the changing rules of the electric energy consumption trend itself. Related models use the historical electricity consumption data to formulate trend simulation equations and obtain the forecasting results by means of trend extrapolation [1517]. The forecasting model proposed in the current study belongs to the latter category.

The difficulty in forecasting monthly electric energy consumption by means of trend extrapolation lies in the complexity of the consumption curve. The consumption curve generally includes two subtrends. First, a long-term increasing trend typically exists in most cases. Basic electric energy consumption usually increases with the development of social economy. Second, periodic waves always exist, as a result of the effects of people’s living habits and work styles according to alternating seasons. Therefore, the presence of both a long-term increasing trend and periodic waves with different frequencies and amplitudes increases the difficulties in trend extrapolation.

The artificial neural network (ANN) is a traditional nonlinear trend extrapolation model. The primary advantage of ANN is its capability of modeling nonlinear relations without the supervision of human experts [18, 19]. When ANN is used for forecasting by means of trend extrapolation, several continuous electric energy consumption data would be selected as input, and the output would be the forecasting result. However, the very complex nature of the monthly electric energy consumption curve greatly affects the generalization ability of the ANN. That is, when used to forecast the monthly electric energy consumption, ANN produces satisfactory simulation results but results in poor forecasting precision. The key idea in improving forecasting precision is to simplify the monthly electric energy consumption curve by means of trend extraction.

Zhao and Wei [20] conducted a study on the trend extraction problem and proposed several applicable methods. González-Romera et al. [2123] have tested these methods and recommended the moving average as their first choice. They used a moving average algorithm with a 12-month window width to decompose the monthly electric energy consumption time series into a wave curve and a long-term increasing curve. The extracted wave and increasing curves were separately forecasted by Fourier series and ANN models. The forecasting results of the monthly electric energy consumption were obtained by adding the forecasting results of the extracted two curves. Their models have been proven to be better than time series [24, 25] and single ANN models.

Meng et al. [26] pointed out that this model still has at least two flaws. First, the extracted wave trend is still very complex, containing several subwave trends with different frequencies and amplitudes. For this reason, González-Romera et al. [2123] selected the Fourier series and not ANN to forecast the wave curve. However, the Fourier series is limited by the algorithm itself, such that it can only simulate waves with invariable amplitudes. In fact, the amplitudes of periodic waves extracted from the monthly electric energy consumption time series usually increase gradually with the development of social economy. Second, the extracted increasing curve is usually not very smooth because of the effects of changes in the external environment (e.g., economic crisis, political unrest, and unpredictable weather conditions). Using ANN to forecast this curve tends to result in large forecasting errors. For these reasons, Meng et al. [26] adopted wavelet transform to decompose the monthly electric energy consumption time series into a long-term increasing curve and several simple wave curves. They used ANN to forecast the extracted wave curves and selected the GM (1, 1) [2729], which is a widely used small-sample exponential trend extrapolating model, to forecast the extracted long-term increasing curve, which usually presents an approximately exponential trend. The forecasting results of the monthly electric energy consumption were obtained by adding the forecasting results of all the extracted curves.

Nevertheless, the forecasting model proposed by Meng et al. [26] also has its limitations. The electric energy consumption trend has numerous waves with different frequencies and amplitudes as a result of the effects of the people’s changing living habits and work styles according to alternating seasons; therefore, the least common multiple of all the periods of waves should be 12 months. Thus, the wave period may be 12, 6, 4, 3, or 2 months. However, limited by the algorithm itself, the period of waves extracted by wavelet transform could only exist in multiples. That is, the period of waves could be 12, 6, and 3 months. Therefore, the algorithm cannot cover all the possible periods. Moreover, in essence, the forecasting equation of GM (1, 1) [30] is a homogeneous exponential curve, which is simply a special form of the approximately exponential curve. GM (1, 1) is suitable only for a time series with a steadily increasing rate. This characteristic does not necessarily apply to the extracted long-term increasing curve of the monthly electric energy consumption time series.

Therefore, the present study proposes a multiwindow moving average algorithm to obtain accurate extracted results of the wave curves. This algorithm can extract wave curves under all the possible periods. Furthermore, a hybrid growth model is proposed to improve the forecasting results of the extracted long-term increasing curve. A constant term is added to the forecasting equation of GM (1, 1), as well as a linear term, because the approximate exponential trend of the extracted long-term increasing curve is unstable and a linear trend may appear in special periods.

2. Novel Forecasting Models

2.1. Multiwindow Moving Average Algorithm

The moving average algorithm is written as follows: where is a smoothed datum, is a datum of the original time series, and is the window size. Aside from the long-term increasing trend, time series contains the periodic waves, wherein periods are multiples of .

Let be the remained datum; then time series would present a wave curve with a period of .

Figure 1 shows the process of the multiwindow moving average algorithm.

The left branch of Figure 1 is selected as an example to explain the multiwindow moving average process. As shown in Figure 1, the original time series (denoted by OTS) is proposed by a moving average algorithm with a window size of 2 (denoted by W2). Then, the smoothed time series (denoted by S2) and the remained time series (denoted by R2) are obtained. By using a moving average algorithm with a window size of 4 to process S2, the smoothed time series (denoted by S4) and the remained time series (denoted by R4) are obtained. By using a similar algorithm to process S4, the S12 and R12 time series are also obtained. By following the decomposition process shown in the right branch of Figure 1, R3, R6, R12, and S12 are also obtained.

As the OTS are decomposed twice, R2, R3, …, R12 and S12 are multiplied by 1/2 and renamed as AR2, AR3, …, AR12 and AS12 to ensure that the summation of all the extracted time series is equal to the OTS. Furthermore, although the left and right branches can both obtain AS12 and AR12, the two AS12 and two AR12 should be summed separately. Accordingly, the results are named as SAS12 and SAR12, respectively.

The extracted time series are AR2, AR3, AR4, AR6, SAR12, and SAS12. The sum of all the extracted time series is equal to the OTS. As the maximum period of all the waves in monthly electric energy consumption time series is 12, time series SAS12 should have no periodic waves and should thus be a long-term increasing trend. Other time series are wave trends with different periods.

2.2. Hybrid Growth Model

The forecasting/simulation equation of GM (1, 1) is usually written as follows: where is the th value of the time series and and are the parameters.

Clearly, (3) has no constant term because it is a homogeneous exponential equation.

Equation (3) can be written as follows:

If , then (4) can be further written as follows:

The hybrid growth equation, which includes a constant term, a linear term, and an exponential term, can then be written as follows:

Given that time series data exist, for each group of estimated parameters, (6) can be written as follows: where is the residual.

If then (7) can be written as follows:

To obtain the optimal parameter estimations, the residual sum of squares must be minimized.

Let

To obtain the minimum of , its derivative with respect to the parameter vector must be equal to 0. Consider

Then,

Up to this point, the optimal parameter estimations of (6) can be obtained.

However, (6) is an iterative equation. To obtain the optimal starting point of iteration, let

The optimal estimation of is obtained by

Equation (14) has been proven to be a nonconstrained optimization, and its solution [31] is

Thus, the hybrid growth model, which is used to forecast the extracted long-term increasing trend, is completed.

2.3. Forecasting Process

The above models can be used for both one-step and multistep ahead forecastings. As the former is used more often, and especially useful for the production operation management of the power-generation companies, this paper designs the one-step ahead forecasting process as follows.

(A) Curve Extraction. The aforementioned multiwindow moving average algorithm is used to decompose the OTS of the monthly electric energy consumption. A long-term increasing curve and several wave curves with different periods are obtained.

(B) Subtrend Forecasting. The aforementioned hybrid growth model is used to forecast the extracted long-term increasing curve. For each forecasting point, continuous 12 data preceding it are used to estimate the equation parameters (, , , and ). If in (6), then the forecasting result could be obtained.

The radial basis function (RBF) ANN model is used to forecast the extracted wave curves. Similar to [26], for each forecasting point in an extracted wave curve, 60 consecutive previous values of it were selected for RBF ANN training. The 13th to 60th values were taken as outputs. For each output value, 12 previous values of it were selected as the inputs. After training, the ANN is used for forecasting. Based on the same rule, when forecasting the th point, the 12 consecutive previous values of it are entered and the output of ANN is considered the forecasting result.

(C) Monthly Electric Energy Consumption. The forecasting result of monthly electric energy consumption is obtained by adding the forecasting results of each extracted curve.

3. Model Test

3.1. Data Selection and Preliminary Analysis

At present, China is the largest electric energy consumer in the world. In 2012, its global consumption share reached 21.94% [32]. In this section, the monthly electric energy consumption of China was used to validate the proposed model. Figure 2 shows the consumption curve (in 108 kWh) from January 1991 to December 2012.

As shown in Figure 2, China’s monthly electric energy consumption curve contains a long-term approximately exponential increasing trend and several periodic wave trends. Clearly, the amplitudes of the periodic wave trends increase gradually, thus requiring the use of ANN to forecast these wave trends.

The monthly electric energy consumption curve of China was decomposed to several subcurves by using the aforementioned multiwindow moving average algorithm. Figure 3 shows the decomposed results from January 1993 to December 2012.

The SAS12 in Figure 3 has no wave trends. This finding verifies that the maximum period of all periodic waves is 12. Furthermore, the increasing trend of SAS12 is unstable and near a linear trend in special periods (this characteristic can also be found in Figure 2), thus implying the necessity of using the hybrid growth model in forecasting the increasing trend.

3.2. Results and Discussions

To evaluate the performance by comparison, the forecasting model proposed in this paper (denoted by M1), model used in [26] (denoted by M2), single RBF ANN model (denoted by M3), and ARIMA model (denoted by M4) were all used to forecast China’s monthly electric energy consumption from January 2011 to December 2012. For the single RBF ANN, the training and forecasting rules were the same as those in forecasting the extracted wave curves. For the ARIMA model, the first order difference was used to obtain a stationary time series. According to the autocorrelation function and partial autocorrelation function analysis to the difference time series, the numbers of autoregressive term and moving average term are all set to 2. Similar to the RBF ANN model, for each forecasting point in an extracted wave curve, 60 consecutive previous values of it were selected to build the ARIMA model. Real data (denoted by RD) and forecasting results of the four models are listed in Table 1.

To evaluate the forecasting performance of each model by comparison, the mean absolute percentage error (MAPE, in %) and the maximal absolute percentage error (MaxAPE, in %) [26, 33] were used as indicators of forecasting precision. Consider where is the electric energy consumption value of the th month, represents its forecasting result. Consider

Despite having similar functions of determining which is the best model, these two error analysis indicators have fine distinctions. MAPE is the most widely used indicator of accuracy, because this indicator reflects the general closeness of the forecasting results to RD. MaxAPE generates the worst forecasting result and reflects the maximal forecasting risk.

By using the data in Table 1 and (16) and (17), the results of each error analysis indicator were obtained. These values (in %) are listed in Table 2.

On the whole, as the trend extraction algorithms simplified the forecasting curves, M1 and M2 are better than M3 and M4 for each indicator. As the period of the extracted waves of M1 is more reasonable than M2 and the hybrid growth model is more applicable to the extracted long-term increasing curve than GM (1, 1), M1 is always better than M2 for each indicator. Especially for MAPE, which is usually considered the most important indicator, M1 (4.37%) is much smaller than M2 (5.53%).

4. Conclusions

Monthly electric energy consumption time series usually has two characteristics. First, the main trend of the said time series presents an approximately exponential increasing feature. Second, the period of the wave trends can only be 12, 6, 4, 3, or 2 months. Thus, this study proposed a multiwindow moving average algorithm to decompose the said time series and a novel hybrid growth model to forecast the extracted long-term increasing trend. To evaluate the performance by comparison, the proposed and other three models were used to forecast China’s monthly electric energy consumption from January 2011 to December 2012. Empirical results show that, because of the aforementioned innovations, the MAPE and MaxAPE of the proposed model are smaller than those of the other three models.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This study was supported by the National Natural Science Foundation of China (NSFC) (71201057 and 71071052) and the Fundamental Research Funds for the Central Universities (2014MS148).