Abstract

Load forecasting problem is a complex nonlinear problem linked with economic and weather factors. Long-term load forecasting provides useful information for maintenance scheduling, adequacy assessment, and limited energy resources for electrical power systems. Fuzzy time series forecasting models can be used for long-term load forecasting. However, the interval length has been chosen arbitrarily in the implementations of known fuzzy time series forecasting models, which has an important impact on the performance of these models. In this paper, a time-variant ratio multiobjective optimization fuzzy time series model (TV-RMOP) is proposed, and its performance is tested on the prediction of enrollment at the University of Alabama. Results clearly promote the forecasting accuracy as compared to the conventional models. A genetic algorithm is used to search for the length of intervals based on the training data while Pareto optimality theory provides the necessary conditions to identify an optimal one. The TV-RMOP model is applied for the long-term load forecasting in Shanghai of China.

1. Introduction

Electric power load forecast has been a research topic for many decades and the accuracy of load forecast is crucial to electricity power industry due to its direct influence on generating planning. During the past years, there have been numerous attempts to improve the accuracy of load forecasting methods [1, 2]. The forecasting model and methods can be divided into two categories: statistical methods and computational intelligence algorithms. The former method include regression analysis and time series [3, 4]. The latter include neural network, expert system, fuzzy logical, and so forth [57]. Now more and more scholar and researchers pay much attention to uncertainty hybrid method [810].

There are three different types of electric load forecasting depending on the time horizon and the operating decision that needs to be made: short-, medium-, and long-term forecasting. In general, long term forecasting includes prediction making for a range more than a year and is needed for power supply and delivery system. Therefore, the weather factor is not the main factor to affect the long-term load forecast, only using historical load data can predict the 1 year ahead load. Fuzzy time series [11, 12] prediction method is likely to get a satisfactory result.

Fuzzy time series has been widely studied for recent years for the aim of forecasting. The basic process of conventional model is shown in Figure 1(a). Related studies mainly focus on three factors: the partition of discourse, the content of forecasting rules, and the methods of defuzzification, all of which influence the prediction accuracy of forecasting models. A number of fuzzy time series forecasting models have been presented since Song and Chissom [13, 14] first presented the concepts of fuzzy time series based on the fuzzy set theory. However, the length of intervals has been chosen arbitrarily in known models. This paper focuses on the partition of discourse in TV-RMOP. We propose a novel approach to determine the length of intervals based on multiobjective optimization method in order to obtain more accurate forecasts in fuzzy time series. The proposed approach is based on two-parameter constrained multiobjective optimization. A genetic algorithm is used to search for the two parameters based on the training data. The TV-RMOP model is shown in Figure 1(b). Its performance is tested on the prediction of enrollment at the University of Alabama and applied for the long-term load forecasting in Shanghai of China. Results investigated that TV-RMOP model considerably improves the forecasting performance.

The rest of this paper is organized as follows. Section 2 introduces the methodology background briefly. The presented model is described, and the forecasting results on the prediction of enrollment at the University of Alabama are presented and discussed in Section 3. In Section 4, TV-RMOP model is applied for the long-term load forecasting in Shanghai of china. Finally, conclusions are presented in Section 5.

2. Background

2.1. Fussy Time Series

The concepts of fuzzy time series are described as follows [13, 14].

Let be the universe of discourse, where . A fuzzy set of is defined as , where is the membership function of the fuzzy set , . is a generic elements of fuzzy set . is the degree of belongingness of to , , and .

Definition 1. Let be the universe of discourse and also a subset of real numbers. It is assumed that is defined on and is the collection of , then is called a fuzzy time series on .

Definition 2. It is assumed that is a fuzzy time series and , where is a fuzzy relation and × is an operator which is caused by . The relationship between and can be denoted by when is the first-order fuzzy time series model of .

Definition 3. Let be a fuzzy time series. For any , and have only finite elements and therefore is a time-invariant fuzzy time series; otherwise, it is a time-variant fuzzy time series.

2.2. Multiobjective Optimization and Pareto Points

It is common in decision-making problems to have a set of parameters that play a dominant role in the process. It is also desirable for the decision made to be the best possible with respect to the available data. In the context of mathematics, the best decision coincides with the optimization of a function of the decision parameters, known as cost function or objective.

On the one hand, if the goal is to optimize a single cost function, then the problem is classified as single objective optimization. On the other hand, the process of simultaneously optimizing two or more conflicting objectives, subject to certain constraints, is identified as multiobjective optimization [1719].

Such problems are called multiobjective optimization problems and formulated as follows: where denotes the number of objective functions; , identify the number of inequality and equality constraints, respectively. is an -dimensional vector in the solution space , and .

Multiobjective optimization problems do not have a single global solution in contrast to the single objective problems. As a result, it is required to specify a number of criteria-conditions for which a set of solutions can be identified as optimal. In general, the optimal solutions for can be obtained based on the Pareto optimality theory. The set of optimal solutions, which is known as the Pareto optimal set , are those solutions such that each of these cannot improve any objective function in without the worsening of some other objective functions at the same time. The optimal solution set is defined as The corresponding points, which are known as the nondominated points, form the Pareto front in the objective space. Since there exists different tradeoff solutions to minimize , one possible way to solve the multiobjective problem is to select one solution manually from the Pareto optimal set.

3. Modeling and Verification

3.1. Proposed TV-RMOP Model

Time series is divided into two parts. One part is used for training purpose while the other part is used for testing. The proposed model was applied to the training data to make estimation and then the predicted values were found for the test data. In the training phase, five measures are adopted to evaluate the model prediction accuracy with the intention to formulate the multiobjective problem. These measures are objective functions with parameters and defined as follows: where is the actual value and is the predicted value at time instant . is the number of predictions made for a prespecified time interval. All measures should be minimized in the optimization procedure.

As shown in Figure 2, the TV-RMOP model can be summarized as follows.

Step 1. Define the universe of discourse and intervals using equations defined in the upper-right dashed box. Where initial and ratio are two unknown parameters.

Step 2. Define fuzzy sets based on the universe of discourse and fuzzify the historical data.

Step 3. Establish fuzzy relationships: .

Step 4. Forecast. Let ; the forecast is the midpoint of the interval which corresponds to .

Step 5. Calculate RMSE, MAE, MAPE, MAP, and THEIL, respectively. Each of them is a function with variable initial and ratio.

In the training phase, the training samples () are used to calculate the above five functions. The determination of variable initial and ratio is formulated as a multiobjective optimization problem:

In the test phase, an optimal solution is brought into the model, and the forecast value is determined in the same way as the training phase.

Genetic algorithm can be used to search for a solution of multiobjective optimization problems. The process is shown in the dashed box on the left in Figure 2. In this paper we use a MATLAB function “gamultiobj” to optimize the initial and ratio parameters.

3.2. Verification of Proposed Model

In order to show what we have achieved from the TV-RMOP model, we apply it to the enrollment of University of Alabama and then have compared the obtained results with those obtained from Chen’s method [20]. Table 1 lists the enrollment from 1971 to 1992. The observations from 1971 to 1989 are used for the training (), while the observations from 1990 to 1992 are used for test.

The minimal value of is 13055, then is taken as constrains of variable initial. is taken as constrains of the other variable ratio, because when the ratio is greater than 0.2, the length of interval will be too large, and the number of intervals will decrease. It results in diminishing the fluctuation in fuzzy time series. Then the problem can be formulated as

We use a MATLAB function “gamultiobj” to solve (5). The initialization of genetic algorithm parameters is set as follows. The fraction of population on non-dominated front is 0.1, the number of individuals is 50, maximum number of generations allowed is 100, and termination tolerance on fitness function value is . The Pareto solutions of training phase are provided in Table 2 (seen in Figure 3).

We select 12135.5 and 0.0455 as optimal parameters to forecast the enrollment from 1990 to 1992. For this initial and ratio, the universe of discourse is defined as . The intervals are finally defined as: , ,   , ,    , , , , , , . Tables 3 and 4 list the forecast results and corresponding measures, respectively. The best RMSE value is 500 with the interval length of 500 for Chen’s method [20]. The RMSE value is 425.46 for Kuo’s model [15] and 327.54 for Wong’s model [16]. The proposed model produces 233.5 for RMSE. Obviously, the TV-RMOP model gives more accurate forecast than [15, 16, 20].

4. Applying TV-RMOP Model on Long Term Load Forecasting

Long term load forecasting is needed for power supply and delivery system. We use twenty years (1990~2010) load of Shanghai region in China for forecast (data from the 2011 Shanghai statistical yearbook). The load is shown in Table 5. Where loads from 1990 to 2007 are used for the training (), and loads from 2008 to 2010 are used for test ().

The minimal value of is 264.74, [240, 260] is taken as constrains of variable initial. (0, 0.4] is taken as constrains of the other variable ratio. Then the problem can be formulated as

The initialization of genetic algorithm parameters is set as follows. The fraction of population on non-dominated front is 0.1, the number of individuals is 50, maximum number of generations allowed is 100, and termination tolerance on fitness function value is . The Pareto solutions of training phase are provided in Table 6, also seen in Figure 4.

Select 250.8 and 0.24 as optimal parameters to forecast the loads from 2008 to 2010. Tables 7 and 8 list the forecast results and corresponding measures, respectively. It is seen that the MAPE and MAP measures are 7.45% and 10.29%, which meet the actual demand.

5. Conclusions

Fuzzy time series forecasting method are getting quite popular in recent years. Most studies focus on the forecasting rules and methods of defuzzification. How to determine the lengths of intervals is less studied. The decision on what the lengths will be is important for forecasting accuracy.

In this study, a new model TV-RMOP is proposed, which formulates the forecasting as a multiobjective optimization problem and solve it by the use of genetic algorithms. The optimal lengths of intervals can be determined in the proposed model. The model is tested for forecasting the enrollment of University of Alabama. The experimental results show that the TV-RMOP model is more accurate than existing models. Finally, we apply this model for long-term load forecasting of Shanghai Region with satisfactory result.

Acknowledgments

The authors would like to thank the anonymous reviewers in MPE for helpful suggestions and correction. This work was conducted by using of the MATLAB software and the authors declared that it has no conflict of interests to this work. The work was financially supported by Shanghai Municipal Nature Science Foundation under Grant 10ZR1401400.