#### Abstract

To accurately forecast energy consumption plays a vital part in rational energy planning formulation for a country. This study applies individual models (BP, GM (1, 1), triple exponential smoothing model, and polynomial trend extrapolation model) and combination forecasting models to predict China’s energy consumption. Since area correlation degree (ACD) can comprehensively evaluate both the correlation and fitting error of forecasting model, it is more effective to evaluate the performance of forecasting model. Firstly, the forecasting model’s performances rank in line with ACD. Then ACD is firstly proposed to choose individual models for combination and determine combination weight in this paper. Forecast results show that combination models usually have more accurate forecasting performance than individual models. The new method based on ACD shows its superiority in determining combination weights, compared with some other combination weight assignment methods such as: entropy weight method, reciprocal of mean absolute percentage error weight method, and optimal method of absolute percentage error minimization. By using combination forecasting model based on ACD, China’s energy consumption will be up to 5.7988 billion tons of standard coal in 2018.

#### 1. Introduction

Energy is a substantial foundation for the consistently high growth rate of China’s economy. Since the policy of Reforming and Opening was initialized, China has become the second largest energy consumer of the world by 2005. As the most flourishing developing country that has a significant influence on current global energy field, China’s energy consumption has become a global concern [1]. Its primary energy consumption has been more than doubled in less than a decade, according to the National Energy Administration of China. The GDP of China maintained a high growth rate of 10.3% in 2010, reaching RMB (Chinese currency) 39.80 trillion Yuan. Meanwhile, its energy consumption is up to 3.25 billion tons of standard coal, which increased by 5.9% compared to 2009 [2, 3]. It is well known that the consistently high growth rate of China’s economy is supported by huge energy consumption. With tremendously increasing energy consumption, the problems of greenhouse gas emissions and energy shortage have become more serious.

In order to steer its energy-economy-environment system onto a more sustainable pathway, China is now changing its economic development model from energy-intensive economy to low-carbon economy. During the process of economic development model transition, one of challenges that China continues to face in the energy field is the consistent imbalance between energy supply and demand. Most of all, the imbalance will not be solved in the foreseeable future. It will be more serious in the coming years. To accurately forecast China’s energy consumption can be profoundly instructive to the economic sustainable development of China and even the world. Energy consumption forecasting is a vital part of energy policy for a country all over the world.

For a rapidly developing country, the main factors that influence China’s energy consumption are sustaining high economic growth rate, increasing human population, large-scale industrialization, motorization process, energy structure optimization, reducing greenhouse gas, and improving energy efficiency [4]. Among these influential factors, the economic growth rate is identified as a major determining factor, and the primary energy structure is identified as a major restricted factor. It is very difficult to accurately forecast China’s energy consumption for these reasons. In view of this, many scholars at home and aboard try to create advanced energy consumption forecasting models to help China make better government policies. Various studies of energy consumption forecasting models can be broadly classified into two overall groups: single forecasting model and combination forecasting model.

A substantial body of the research had been conducted on the single forecasting model in the energy consumption field during the past decade. Christodoulakis et al. [5] forecasted the future demand for energy in Greece based on scenario analysis. Murat and Ceylan [6] adopted an artificial neural network approach to forecast the transport energy demand, by use of socioeconomic and transport related indicators in social economy and transport. Crompton and Wu [7] applied a Bayesian vector autoregressive method to forecast China’s energy consumption. Chen [8] proposed a collaborative fuzzy-neural approach to forecast the long-term load, which aggregated the fuzzy different experts’ annual energy consumption forecasts. Geem [9] developed an artificial neural network model by using various independent variables (such as GDP, population, oil price, number of vehicle registrations, and passenger transport amount) as input parameters to forecast South Korea’s transport energy demand. Forouzanfar et al. [10] introduced a new multilevel genetic programming approach for forecasting transport energy demand in Iran, and its result had smaller error compared with the result obtained using neural network or fuzzy linear regression approach. Kumar and Jain [11] applied Grey-Markov model to forecast crude-petroleum consumption, Grey model with rolling mechanism to forecast coal and electricity (in utilities) consumption, and singular spectrum analysis to predict natural gas consumption in India. Lu et al. [12] adopted GM (1, 1) by modifying the value of the development coefficient to reflect the influence of economic growth on the forecasting results. Hodge et al. [13] used ARIMA models to forecast future wind power output from historical data and applied a systematic approach to determine the best values for the assortment of variables associated with the models, such as training period length and model orders. Kankal et al. [14] forecasted energy consumption in Turkey based on socioeconomic and demographic variables (gross domestic product (GDP), population, import and export amounts, and employment) using artificial neural network and regression analyses. Lee and Tong [15] developed an improved grey forecasting model for Chinese energy consumption, which combined residual modification with genetic programming sign estimation. Chenchen et al. [16] established a new class of weakening buffer operators with exponential type to forecast energy demand by using the exponential average, weighted average, and geometrical average weakening buffer operators. Pao [17] found ANN method more appropriate than the linear method to forecast the electricity consumption in Taiwan. Dong et al. [18] adopted support vector machines, a new neural network algorithm, to forecast building energy consumption in the tropical region. Chedid et al. [19] built an econometric model to predict the future production trend of crude oil and natural gas based on the behavior of them. Some energy or related agencies practiced energy elasticity coefficients method [20], input-output method [21], cointegration theory, and error correction models [22] to forecast energy consumption, and so on.

But for all of these single forecasting models mentioned above, there are some limitations due to the hypothetical conditions, the applying scope, and especially not using lots of useful information effectively. Therefore, combination method was put forward to make good use of source information from different perspectives. Since combination forecasting model properly links the different forecasting methods and makes the best use of the information given by each of them, it can overcome the limitations of individual models and gather the information more effectively.

A linear combination forecast model was first established by Bates and Granger [23] in 1969 which considered as a successful alternative for using an individual forecasting method. Xue et al. [24] made use of neural network, grey forecast, and time series to construct a combination forecasting model to predict Chinese energy consumption. Under departures from elliptically symmetric distributions, Elliott and Timmermann [25] proved that the optimal combination weights under asymmetric loss can be much more different than under mean square error (MSE) loss. Guohao et al. [26] developed a combination forecasting model by use of grey system and multiple regression method to predict China’s coal demand, and the variable combination weight was the type of function that changed with forecast time. Chai et al. [27] found that the outcome of combining forecasting will be more precise based on the Bayesian error correction approach. In order to improve the forecasting accuracy of energy consumption of Hebei Province in China, Song et al. [28] proposed a combination forecasting model based on introduction orderly weighted average (IOWA) operator. Xu and Wang [29] used polynomial curve and moving average combination projection model to estimate the future natural gas consumption in China from 2009 to 2015 and employed ordinary least squares to estimate the combination weights. Yu et al. [2] proposed a combination forecasting model that optimized the coefficients of the three forms of the model (linear, exponential, and quadratic model) based on PSO-GA. Lemke and Gabrys [30] developed a ranking-based combination of methods to improve forecasting performance over simple model selection approaches. Zhou and Wang [31] proposed a combination forecasting model based on multicriteria, in which the combination weights are obtained by using genetic algorithms. Sánchez [32] proposed an adaptive forecast combination (AEC) procedure based on a two-step procedure: firstly, different combination procedures are used; secondly, the AEC is used to combine the combinations from the first step. In view of the individual forecasting model with the smaller variance of mean absolute percentage error (MAPE) has higher stability, its combination weight should be larger. Meanwhile information entropy is a measure of the uncertainty associated with a random variable; an assessment method of the combination weights is adopted on the basis of entropy approach by Zhao [33]. The literature in the field of nonlinear forecasting combination is very sparse due to the lack of evidence of success such as what stated in [34]. Only the linear combination method is discussed in this paper.

Although a large body of literature applied various methods to construct combination forecasting models, there are some limitations of choosing the individual models for combination and determining the combination weight. The weights of individual forecasting model to be combined are based on the relative past forecasting performance, and no single method can guarantee more accurate results. The commonly used criterions in evaluating out-of-sample forecast capability include simple average, root mean square error, mean absolute error, and mean absolute percentage forecast error. Creating an efficient method of distributing combination weights is not easy because of the difficulty in choosing the estimation criterion.

The method for optimally selecting models is vital for the combination forecasting performance. Therefore, ACD is proposed to be the selection criterion in constructing the combination model in this paper. Also the combination weights are determined based on the sequence relative area correlation degree. The ACD of individual model is obtained by finding the area of the curves between actual and forecasted values, which is a comprehensive performance indicator of correlation and matching error between actual and forecasted values.

In this paper, based on China’s energy consumption data (1979~2010), a combination forecasting model is developed to forecast the energy consumption of China during 2011~2018. Most of all, ACD is proposed as the selection criterion of individual model and the method of determining the combination weights.

The rest of this paper is organized as follows. In Section 2, four single forecasting methods are briefly introduced; the individual model selection method based on ACD is proposed; a combination forecasting model is developed, in which the combination weights are also determined based on the ACD of individual forecasting model. Section 3 presents the empirical analysis using the methods mentioned in Section 2, all of which are adopted to forecast China’s energy consumption. Section 4 concludes the paper and suggestions on China’s energy policy.

#### 2. Methodology

Combination forecasting method is under the guideline of linking the different forecasting methods and making good use of the information given by each one. The performance of combination forecasting model is mainly associated with the performance consistency of each individual forecasting model and the assignment combination weights. Meanwhile the forecasting error becomes a random variable because of the great uncertainty of future. In order to develop a highly forecasting performance combination model to predict China’s energy consumption, individual model choice and combination weights assignment are emphasized in this paper. In this section, the theories of four individual forecasting models are discussed, individual model selection based on ACD is proposed, also the combination weights assignment based on ACD is adopted, and a new combination forecasting modeling method is developed.

##### 2.1. Individual Forecasting Model

Although some individual models adopted to forecast time series are considered simple, they are popular and surprisingly powerful. BP neural network, GM (1, 1), exponential smoothing model, and polynomial trend extrapolation model are presented as follows since they are used widely and efficiently in the field of energy consumption forecasting.

###### 2.1.1. BP Neural Network Model

For the ability of learning, storing, and recalling information based on a given data set, neural network model has been successfully employed in the field of energy consumption forecasting. Many of multivariate nonlinear projects can be solved by neural network. BP neural network is a kind of multilayer forward propagation neural network based on the data stream forward propagation and inverse error propagation algorithm. BP consists of input layer, hidden layer, and output layer, with the different layer neurons by connective weight and threshold. More detailed introduction about BP neural network can be found in [35]. The total available data are split into a training set and a test set in the process of BP forecasting. The training set is to calibrate the neural network and the test set is to evaluate BP’s forecasting performance. The modeling steps of BP neural network forecasting model are briefly described as below [36].

*Step 1. *Normalize the input and output data.

*Step 2. *Initialize the value for every connective weight of network, and set up the maximum permitted error, learning rate, and momentum term.

*Step 3. *Prepare the training input data and expect output data for training network.

*Step 4. *Perform feed-forward computation during the process of network training.

Input of input layer is ; input of hidden layer is as formula (1); output of hidden layer is as formula (2):
where is connecting weight between node and node and is the threshold of node .

*Step 5. *The computation of error back-forward propagation is as follows:
where is the error between input layer and output layer and is the learning rate.

*Step 6. *In order to meet the need of the maximum permitted error during the process of network training, fore-forward and back-forward propagation computation are computed recursively until convergence.

*Step 7. *Evaluate the network performance by use of the test data. Also the BP neural network forecasting model is established.

###### 2.1.2. GM (1, 1) Model

Grey system theory is proposed by Professor Deng in 1982. It is spread into the field of prediction of known information to uncertain areas. GM (1, 1) is applied widely in grey system forecasting. The modeling steps of GM (1, 1) forecasting model are described as follows [37].

*Step 1. *Assume the actual data series with samples is expressed as
where the actual data is assumed to be positive, which is the need of Grey system forecasting theory.

*Step 2. *Generate a new data series by accumulated operation (AGO)
where

*Step 3. *Establish GM (1,* N*) matrix equation.

Since both data series and the solution of a first order difference equation meet the requirements of the exponential increase law, should conform to a first order difference equation as . By use of dispersed method solving, GM (1,* N*) matrix equation can be established: , where

Then the only unknown parameter is matrix .

*Step 4. *Solve parameters matrix based on least squares method. Let ; then
where .

*Step 5. *By taking inverse AGO on sequence , the actual data forecasting is obtained. Also, GM (1, 1) forecasting model is established:
where .

###### 2.1.3. Triple Exponential Smoothing Model

Triple exponential smoothing model (TESM) has been around since 1950s and is still one of the most popular forecasting methods. It produces an exponentially weighted moving average in which each smoothing calculation or forecasting is dependent on the previous data. The modeling steps of TESM are as follows [38].

*Step 1. *Implement three-time smoothing calculation in line with formula (10) as follows:
where is the smoothing factor (), when closer to 1 means more weigh to the recent observations and hence more rapidly changing forecast; is the actual data set; and is the estimated value for the order exponential smoothing.

*Step 2. *Construct the triple exponential smoothing model as
where ,

Let and the triple exponential smoothing model be as
The estimating value of each moment can be calculated by assigning .

###### 2.1.4. Polynomial Trend Extrapolation Model

Polynomial trend extrapolation model (PTEM) is a very simple forecasting method that adopts previous data trends to project future data. Polynomial trend extrapolation forecasting model is to identify the overall past trend and to fit proper polynomial . The object function for can be expressed as where ; is an actual data point; ; and is the coefficient of .

The necessary condition for the extreme values of polynomial is .

Therefore where .

The matrix representation of formula (16) is as follows:

Therefore the coefficients of can be obtained through formula (16) based on least squares method.

##### 2.2. Combination Forecasting Model

Combination forecasting model is a reliable method of improving forecast accuracy by making good use of the different strength of various models. If there are models to forecast an actual data set , the method of combination forecasting model constructed by these individual models is described as where is the weight of the model and ; is the forecasting value of the model; and is the forecasting value of combination forecasting model.

In line with formula (17), we can infer that the forecasting performance of combination model is typically influenced by the individual forecasting model used and the combination weights assignment. This paper proposes the selection of individual forecasting model and the combination weights assignment based on ACD to improve the performance of combination forecasting model.

###### 2.2.1. Selection of Forecasting Method Based on ACD

The ability of evaluating and selecting a best-fitting model among the different methods that is appropriate for a particular time series is considered to be one of the most important aspects in the field of combination forecasting. MAPE is the most widely adopted one to describe the accuracy of forecasting models in statistics. The expression of MAPE is as follows: where is the actual value; is the forecast value; and is the number of data series.

Although MAPE is very simple, a very high MAPE might distort a comparison among models when calculating the MAPE for a few data. In other words, MAPE cannot always be effective to represent the correlation between forecasting model and actual time series’ development. This paper proposes a new area correlation degree criterion for the model selection, which is easy to identify a candidate model for the selection of combination forecasting. The principle of ACD criterion is the better high performance, the larger ACD. Since it comprehensively evaluates both the correlation and fitting error of the forecasting and actual values’, ACD model selection criteria can be introduced as an indicator for the performance of forecasting model. ACD is obtained by way of calculating the area of curves between actual and forecasted value.

First of all, given is the actual data, is the forecasting data. Then and can consist of two curves (actual curve and forecasting curve) as in Figure 1. When the area is zero, the forecasting model is completely fitting to the actual data and the ACD is 1. The smaller the area of the curves between the actual and forecasted value is, the larger the ACD becomes. At the same time, the forecasting model has more high performance.

The process of calculating the ACD is as follows [39].

*Step 1. *Given is the actual data, is the forecasting data:

*Step 2. *Define as a sign function as follows:(1), when ;(2), when ;(3), when .

*Step 3. *Calculate the area of curves between actual and forecasted curves. Firstly, we calculate , which is the area of time span from number (*i*) to number (*i*+1).(1)If , then
(2)If , then
where

*Step 4. *Define as the ACD of actual data curve and forecasting data curve:
where is the number of individual models for combination model selecting candidates and is the number of data series for calculation.

In view of removing the effect of base number (various energy consumption at each time moment), ACD is calculated based on the area of the curves between* x*-axis and forecasted percentage error curve in this paper.

###### 2.2.2. Combination Weights Assignment Based on ACD

The assessment of the individual weights has become a hot point for research institutes. Generally, it can be objectively obtained by solving a mathematical optimal object function with some constraints by regression method. There are some drawbacks during the assignment of weights; for example, the objective function is not easy to resolve and the weights are not the optimal ones for the nonpositive constraint condition of weights. Most of all, it cannot grasp the key principle of combination forecasting that makes full use of information from different aspects.

As mentioned above, the changing trend of area between actual and forecasted can be an indicator for the stability of individual forecasting model. Since the stability of individual forecasting model is higher when the changing trend of the curve area is smaller, its weight of combination forecasting should be larger. For the individual forecasting model, the better the performance is, the larger the ACD is. For these reasons, an assessment method of the combination weights is adopted on the basis of ACD. The combination weights assignment based on ACD is formed as follows.

*Step 1. *Calculate ACD of each individual forecasting model, given that is the ACD of number individual forecasting model.

*Step 2. *Obtain the variance coefficient :
where is the ACD of number individual forecasting model, .

*Step 3. *Assess the individual weights :
where is the weight of number individual forecasting model,

#### 3. Result and Discussion

##### 3.1. Data Source

Actual data on “Total Energy Consumption in China 1979~2010” is from various issues of the China Statistical Yearbook and China Energy Statistical Yearbook. Meanwhile, the unit of energy is ten thousand tons of standard coal. Figure 2 shows the annual energy consumption of China during the period of 1979~2010.

Due to strong economic growth, China’s demand for energy is surging rapidly. From 1979 to 2010, the energy consumption of China has increased from 0.58588 billion tons of standard coal to 3.25 billion tons of standard coal. During the early period of 1979–1996, with the energy consumption growth slowing down, the energy-intensive of China economy continued to decline. From 1997 to 1999, the energy consumption of China declined because of 1997 Asian Financial Crisis. Since 2000 to 2003, its energy consumption had entered a new round of increasing process since China had gradually recovered from the influence of 1997 Asian Financial Crisis. Especially during the period of 2003~2010, because of conducting large-scale infrastructure construction and the high economy growth, China’s energy consumption showed a rapid growth.

##### 3.2. Forecast Result

According to the above energy consumption trend and its annual data from 1979 to 2010 shown in Figure 2, this section contains the forecasting results obtained from the forecasting models described above. All the computations required for the present study have been carried out on MATLAB 2010.

###### 3.2.1. Result Using BP

Under the restriction of a limited training set, we use BP neural networks in this paper with a 4-11-1 structure (three layers: one input layer with 4 neurons, one hidden layer with 11 neurons, and one output layer with 1 neuron, with a learning rate of 0.01) to construct the forecasting model. BP neural network was trained with the annual data from 1979 to 2004, and the remaining data from 2005 to 2010 was used for evaluating its forecasting performance. Forecasting result by BP is shown in Figure 3.

###### 3.2.2. Result Using GM (1, 1)

Using the energy consumption data of 1995~2010 for the initial data sequence to create the GM (1, 1) forecasting model, where and are obtained as 94084 and −0.07833, respectively, the forecasting model that based on GM (1, 1) is formed as , where in 1996. China’s energy consumption forecasting result based on GM (1, 1) is as shown in Figure 4.

###### 3.2.3. Result Using TESM

The energy consumption data of 1979~2010 is adopted for the initial data sequence to construct TESM. In light of the fact that the smoothing factor closer to 1 means more weigh to the recent observations and hence more rapidly changing forecast, is assumed to be 0.27 by forecasting simulation tests. The triple exponential smoothing mode is formed as , where in 2011. In light of the forecasting process based on TESM mentioned above in the description of TESM process, China’s energy consumption forecasting result based on TESM is as shown in Figure 5.

###### 3.2.4. Result Using PTEM

Using the energy consumption data of 1979~2010 for the initial data sequence to construct PTEM as , where when in 1979, China’s energy consumption forecasting result based on PTEM is as shown in Figure 6.

###### 3.2.5. Result Using Combination Forecasting Model

Appropriate individual forecasting models should be applied to develop the combination forecasting model. Since combination forecasting model is usually constructed by 3~5 types of individual model, this paper will select 3 types of individual methods from BP, GM (1, 1), exponential smoothing model, and polynomial trend extrapolation model. Area correlation degree of each individual forecasting model is calculated based on the forecast result as reported in Table 1.

To remove the effect of base number (energy consumption at each time) during the process of calculating the correlation areas, ACD is calculated in this paper based on the area of the curves between* x*-axis and each individual forecasting model’s percentage error curve during the period of 2005~2010. The ACD results of each individual forecasting model are reported in Table 2.

Because the better the performance, the larger the ACD, according to this study, TESM, BP, GM, and PTEM are listed in terms of the forecasting performance as shown in Table 2. Therefore, TESM, BP, and GM are chosen for developing the combination forecasting model. Also the combination weights assignment based on ACD are reported in Table 3.

For the purpose of evaluating the performance of combination weight assignment method based on ACD, it also adopts some other combination weight assignment methods such as entropy weight method, reciprocal of MAPE weight method, and optimal method of APE minimization method. Table 4 shows the forecasting results of combination models based on different weights assignment methods (the combination forecasting model is developed by BP, GM, and TESM).

The forecasting results of China’s energy consumption of 2011~2018 by combination forecasting model are shown in Table 5, which is developed by using of BP, GM, and TESM based on ACD.

##### 3.3. Discussion

Facing great energy challenges and an energy dilemma, it is essential for China to forecast energy consumption for its sustainable development. Moreover, combination forecasting can improve the fitting effect and increase the forecasting accuracy, which makes good use of the information source from each individual forecasting model. Besides that, the relative stability of forecasting model is improved. For the reasons above, it is useful to project with a relatively simple system based on combination forecasting model in this paper. Thus, the specific purpose of this study was to determine a suitable combination forecasting model for short-term forecasting of China’s energy consumption with limited data. In view of the fact that the area of the curves between actual and forecasted value is a more comprehensive performance indicator for forecasting correlation and matching error, ACD is proposed as the individual model selection criterion to develop combination model and the determination method of the combination weights. It establishes a procedure for increasing the efficiency of the forecasting combination method based on ACD. Consequently, four kinds of individual forecasting model, model selection method for the purpose of constructing combination forecasting model, and combination weight assignment method are emphatically researched in this paper. In light of all the forecasting results of China’s energy consumption based on this study, we can infer the following.

As can be seen from the individual model forecasting results reported in Figures 3 to 6, China’s energy consumption is also on its rapid growing process during the coming years. During the periods of 1997~1999 and 2000~2003, the simulation values based on three types of individual models (TESM, GM, and PTEM) have significant deviation from actual value. The reasons for huge simulation errors are the Asian Financial Crisis at 1997 and the fact that China’s energy consumption has entered into a new round of increasing process since 2003. It proves that sudden change factors that influence China’s energy consumption will be shown in forecasting results. It does not lie in forecasting model itself, but in the distortion of the behavioral data series with the disturbance of the distracter.

Table 1 summarizes the simulation results of China’s energy consumption during the period of 2005~2010 based on BP, TESM, GM, and PTEM. These four individual forecasting models rank as BP, GM, TESM, and PTEM by using MAPE to be an indicator of forecasting performance. All the individual forecasting models have highly accurate forecasts (MAPE results are less than 4%).

Most of all, this study proposed ACD as an indicator of forecasting performance. Table 2 shows that these four individual forecasting models rank as BP, TESM, GM, and PTEM in terms of the forecasting correlation and fitting error based on ACD. The performance of forecasting based on ACD is different from that based on MAPE. Since MAPE might distort a comparison among models when calculating a few data, it is not always effective to represent the correlation between forecasting model and actual data series’ development trend. Therefore, the MAPE of forecasting result based on GM, TESM, and PTEM in the period of 1997~2010 is calculated as 5.43%, 4.65%, and 5.79%. The performance of forecasting models rank based on MAPE by using relative more data is in accordance with its ranking based on ACD by using more data. We can infer that ACD is more effective to represent the correlation and fitting error between forecasting model and actual time series’ changing trend when calculating a few data. Hence ACD can be an indicator of the forecasting model’s performance.

By using ACD to choose appropriate candidate individual models for constructing combination model and determining the combination weight, a new combination forecasting method is proposed in this paper. Table 3 reports the combination weight assignment based on ACD. It also reveals that the larger ACD, the higher the forecasting performance. Furthermore, Table 4 lists the combination forecasting results based on different weights assignment methods. It concludes that combination forecasting model can improve forecasting accuracy by making good use of source information from each individual forecasting model. In comparison of forecasting performance, only the combination forecasting model based on optimal method is better than the combination forecasting model based on ACD. Taking into account the time cost of calculation and the fact that it does not grasp the key principle of combination forecasting, combination assignment based on optimal method does not make full use of information from different aspects. As a result, ACD can be used to combine the selected forecasts; in other words, it is feasible to determine the combination weights based on ACD.

From the above analysis and comparison, we adopt BP, TESM, and GM to develop a combination forecasting model based on ACD for the purpose of forecasting China’s energy consumption of 2011~2018. From the forecasting results in Table 5, we can infer that China is still on a rapid energy consumption path. The total energy consumption of China in 2018 will increase to 5.798812 billion tons of standard coal. With the rapid growth of energy consumption, China should adopt energy-saving policies to reduce greenhouse gas emissions intensity, develop circular economy, and actively promote low-carbon technologies to respond to climate change.

#### 4. Conclusions

A new combination forecasting model based on ACD is developed to accurately predict China’s energy consumption by using the data of “Total Energy Consumption in China 1979~2010”. Since ACD can be an indicator of the correlation and fitting error of the forecasted and actual values, it is proposed to choose appropriate candidate individual models for constructing combination model and to determine the combination weight based on ACD in this paper. Based on the forecasting results, it can provide more accurate and reliable forecasting results by using combined forecasting method based on ACD. Therefore, ACD is more effective to evaluate the forecasting performance than MAPE with fewer data. All of these prove that it is feasible for us to develop a combination forecasting model based on ACD. The combination forecasting model proposed in this paper can be reliably and accurately used for forecasting energy consumption of China. It also can be widely applied to most of other countries’ energy consumption forecasting.

Moreover, in light of China’s energy consumption forecasting results of 2011~2018, we can conclude that the current status of China’s energy policies still faces challenges. The policy implication of China in this study is summarized as follows.(1)Insisting on the implementation of environmental protection and resource conservation as well as energy saving as a fundamental national policy: to be the largest developing country and energy consumer in the world, with the largest population, complex climate, insufficient energy resources, and fragile eco-environment, China should pay more attention to the global climate change.(2)Upgrading the traditional industries: China has cautiously scheduled to restructure and rejuvenate high-energy-consuming industries.(3)Realizing the energy conservation by promoting energy-saving technology and developing a circular economy, especially improving energy utilization efficiency.(4)Developing the low-carbon energy by accelerating the development of hydropower, geothermal, solar power, wind power, biomass energy, natural gas, and other clean resources: most of all, we should pay more attention to develop and utilize nonfossil energy. Meanwhile, since China energy consumption structure mainly depends on coal, the clean coal technology is still essential.(5)Joining forces with other countries to respond to energy dilemma and climate change problems.

#### Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.