#### Abstract

This paper employs chaos theory into power load forecasting. Lyapunov exponents on chaos theory are calculated to judge whether it is a chaotic system. Delay time and embedding dimension are calculated to reconstruct the phase space and determine the structure of artificial neural network (ANN). Improved back propagation (BP) algorithm based on genetic algorithm (GA) is used to train and forecast. Finally, this paper uses the load data of Shaanxi province power grid of China to complete the short-term load forecasting. The results show that the model in this paper is more effective than classical standard BP neural network model.

#### 1. Introduction

Chaos theory is the important component of the nonlinear science [1]. It is the random phenomena which appeared in the deterministic nonlinear dynamic system. Chaos is not a disorder but has a delicate inner structure. It reveals the order and regularity hidden behind the disordered and complex phenomena. Since the 90s, chaos theory has been well developed. Many subjects are infiltrated and promoted [2] under this tendency. So the research on chaos gets an access to a breakthrough. At the meanwhile the application about chaos theory gets a widely growing.

Short-term power load forecasting is a multidimensional nonlinear system. It is easy to get the load time series in power system. But these data are nonlinear and difficult to establish a matched mathematical model to forecast the next-hour load. Recently, more and more nonlinear time series forecasting models based on chaos theory [3, 4] are applied to power short-term load forecasting. And they have achieved good prediction results. So chaos theory is employed to analyze the characters of the load time series and applied into the power system forecasting in this paper.

There are many models to be adopted into power system load forecasting. They can generally be summarized as follows: time series model, regression model, expert system model, grey theory model, and fuzzy logic model. But according to chaotic characters of the load time series, ANN [5–9] is established and applied into the power system well. In view of neural network parallel processing and powerful nonlinear mapping ability, chaotic time series can be learned for unknown dynamic system and then predicted and controlled. As chaotic time series have an inner deterministic regularity which stems from nonlinearity, it represents the relevance of the time series on time delay state space. The feature makes the system have some kind of memory capacity. However it is difficult to express the regularity with the common analytical methods. So chaos theory and ANN are deserve to be studied and applied into power load forecasting.

Many intrinsic deficiencies of ANN are still in existence. The structure is difficult to confirm. The blindness that initial weights are chosen results into slow convergence speed and easily falling into local minimum. However GA [10–12] has the opposite characteristics of neural network. So this paper introduces GA to overcome these deficiencies of ANN.

#### 2. Chaos Theory

Power system loads are a set of time series. Chaos theory can analyze chaotic characteristics of time series and reveal the sequence itself of the objective regularities to avoid the predicted human subjectivity and improve the accuracy and credibility of load forecasting.

At present, phase-space delay coordinate reconstitution method is employed to analyze chaotic characteristics of time series. Generally, the dimension is very great even infinite. In fact, phase-space delay coordinate reconstitution method can expand the given time series to three-dimensional and even higher-dimensional space, and the information which exposed sufficiently from time series can be classified and extracted.

##### 2.1. Phase-Space Reconstitution

Phase-space reconstitution theory is the basis for chaotic time series forecasting. Packard and Takens proposed the technology of phase-space reconstitution for chaotic time series with the interval in electric power system. The structural character of system attractors is contained in this time series. Then the time series can be marked as where is the number of phase points of reconstructed phase-space. is the delay time, . is the embedding dimension, that is, the dimension of phase-space reconstitution.

Takens proved that phase-space reconstitution through selecting an appropriate delay time and a sufficiently large embedding dimension has the same with the actual geometric and information properties of dynamical systems and does not depend on the specific details of the reconstitution process. Thus, these phase points continuations which are reconstituted in the -dimensional phase-space form the trajectory of the input signal data. And the trajectory describes the evolutionary trace of the system state over time, and all fall to the same chaotic attractor of phase space.

##### 2.2. Lyapunov Exponents

Chaos is characterized by extreme sensitivity to movement on the initial conditions. Lyapunov exponents quantify the exponential divergence of initially close state-space trajectories and estimate the amount of chaos in a system. When the largest Lyapunov exponent [13] of the system is larger than zero (), it indicates that there is the chaotic attractor which can be used to measure the chaotic degree.

M. T. Rosenstin, J. J. Collins, and G. J. Deluca proposed an approach of small data sets. This method is more reliable, with a smaller calculation and easier to operate than others. So the largest Lyapunov exponent is computed by this approach. The process is as follows.(1)For , compute delay time and mean period by FFT.(2)Figure out correlation dimension and determine the embedding dimension through .(3)Reconstruct the phase-space based on and .(4)Find out nearest neighbor of each point , and limit short-term separation, (5)Calculate the distance after discrete time step for each pair of neighbors (6)Compute by the follow formula: where is the number of nonzero. Make regression line with the method of least squares, and the slope of this line is the largest Lyapunov exponent .

#### 3. Chaos Forecast Improved by ANN

Kolmogorov theory supposes if is a nonconst and monotonically increasing function. is compact subset of . is continuous real-valued function of . So for every , should there be and to make meet

That is to say, there is a three-layer network of which hidden-layer function is , input and output functions are linear. The network’s input-output relationship can approach . Kolmogorov theory pledges the feasibility of neural network used in time series forecast on mathematics. Based on Kolmogorov and chaos theory, ANN can be designed as follows (BP is used in the paper).

Chaos neural network in Figure 1 contains three layer: one input layer, one hidden layer (sometime maybe more than one), one output layer . This model has input nodes, hidden nodes, and one output node. Connection weights between input layer and hidden layer are marked as and between hidden layer and output layer are as . stand for the nodes of input, hidden and output. So Figure 1 can use the following formulas to describewhere is the Sigmoid function. That is, .

In order to get the better weights and thresholds and determine the structure of BP neural network, GA can be drawn in to improve the forecasting accuracy of chaotic time series.

#### 4. Improved GA Optimizes ANN

There are some deficiencies of BP neural network, such as a lower pace, being easy to local minimum, and the uncertainty structure. But GA can overcome these and improve network performance and convergence rate and optimize chaos neural network further.

The methods and steps to achieve genetic algorithm and optimize chaos neural network are the following.

*(1) Determine the Initial Population*

After analyzing the chaotic characters of the time series, we get a data set of the weights and thresholds that are unknown. These data are encoded and made as an individual. Several individuals can constitute the initial population.

*(2) Calculate the Fitness*

Suppose is the error of the network. The fitness function is taken as like
So the smaller the error is, the greater the fitness is.

*(3) Selecting Operation*

Calculate the fitness of every individual . Then copy them according to the following formula :
reflects the individual is copied the probability to the next generation. Use the following formula to select how many individuals could be generated:
where is the average of . is rounded and marked as . reflects the individual is copied the times to the next generation. If , it shows this individual is gone out and not to generate.

*(4) Crossover Operation*

Crossover operation is used to enhance the global search ability of GA. After selecting operation, randomly two individuals are selected to match but avoid choosing the individuals of the same gene. Randomly select a cross point of each individual, and according to the probability change the gene of two cross-points to form two new individuals.

*(5) Mutation Operation*

Mutation operation is mainly used to enhance local search ability of GA. After crossover operation, randomly select a mutation point to change the code according to the probability . If 1, change into 0. Otherwise, it is the opposite.

*(6) Optimization Operation*

In order to overcome precocity of GA, the improvement is necessary to operate. The parent and the offspring individuals are looked as a whole to form individuals. Order them in the light of the fitness, and is used to mark the th individual. So the th replication probability is determined in the way of the following formula:

*(7) Iteration Operation*

For the new population, repeat the previous operation until the relative error between two iteration operations meets the accuracy requirements. Error formula is the following formula:
where is the relative error between two iterations; and are separately the fitness of every individual in the th and ()th iteration; is the given standard, and here . If , iteration operation will finish and the optimal solution be outputted.

On the basis of the above, GA is just like the diagram in Figure 2.

#### 5. Application and Analysis

There are some power data from 0:00 at 1/1/2009 to 23:00 at 3/30/2009 in Shaanxi province power grid of China to use as the sample data for short-term power load forecasting. Then the data can be established into the time series . It is described in Figure 3.

The time series are analyzed by chaos theory. Then there are some parameters to determine. Delay time can be got through mutual information [14]. Embedding dimension can be calculated by Cao method [15]. Use and to reconstruct the phase space. Then wolf method [16] is employed to compute the largest Lyapunov exponent . It is obvious that power load time series is chaotic.

Since , it complies with the requirements of short-term electric power load. The graphics of the three parameters are in Figure 4.

**(a) Calculate delay time by mutual information**

**(b) Calculating embedding dimension by Cao method**

**(c) Calculating the largest Lyapunov exponent by wolf method**

When it got the parameters, the phase space can be reconstructed in Figure 5.

(a) - phase diagram |

(b) - phase diagram |

(c) - phase diagram |

(d) -- phase diagram |

Figure 5 shows that power load time series are chaotic. From the phase diagrams, there is one out prominent line. It means that the power loads are affected by accidental factors. However the power load time series are in the chaotic state.

After completing phase-space reconstitution, the data that come from 0:00 at 3/31/2009 to 23:00 at 3/31/2009 in Shaanxi province power grid of china are used as the testing sample. The structure of BP neural network is employed 4-9-1. Relative error and root-mean-square relative error are used as the final evaluating indicators:

The results are in Table 1. It compares the method in this paper with standard BP neural network method which takes the continuous five data as the training samples without improved BP.

Table 1 shows that standard BP neural network method for RMSER is 2.133% but the method in this paper is 1.080% and most of the latter’s errors are in less than 2%. So the latter has much less errors than the former and superior to the latter.

Standard BP neural network method just uses the historical load and the uncertain structure of BP neural network to forecast the next-hour load. Added the deficiencies of BP, the results could not be very accurate and the speed not so quick. But the method in this paper takes the chaotic characters of the power load time series, appropriately determining the structure of input layer and hidden layer in BP network into account. So this can make the network training well and improve the prediction accuracy further obviously.

Above all, Figure 6 shows the different methods, containing the results and the errors curves.

**(a) The results of the different methods**

**(b) The errors of the different methods**

As can be seen from the graphics in Figure 6, it shows that the curve of the method in this paper can better fit real load curve than standard BP. The errors computed by the former are closer zero than by the latter. So for short-term power load forecasting, the model in this paper is more effective than classical standard BP neural network model.

#### 6. Conclusion

This paper employs the chaos theory into eclectic system load forecasting. From Lyapunov exponents to phase-space reconstitution, it tells that power load time series are nonlinear and chaotic and have all the characters of chaos. The chaotic phase diagram also shows there is a chaos attractor. So chaos theory introduced into power load forecasting can describe the nonlinear dynamic behavior of the system and get the accuracy and the precision improved greatly.

Meanwhile, BP neural network under being improved by GA based on chaos theory is used for forecasting to improve the accuracy and the training rate further. In the last, applied into Shaanxi province power grid short-term load forecast of China and compared with the standard BP neural network model, the method mentioned in this paper gets more accurate results and more efficient training rate. So this way can bring a broad application prospects in power load forecasting.