Abstract

Probabilistic short-term wind power forecasting is greatly significant for the operation of wind power scheduling and the reliability of power system. In this paper, an approach based on Sparse Bayesian Learning (SBL) and Numerical Weather Prediction (NWP) for probabilistic wind power forecasting in the horizon of 1–24 hours was investigated. In the modeling process, first, the wind speed data from NWP results was corrected, and then the SBL was used to build a relationship between the combined data and the power generation to produce probabilistic power forecasts. Furthermore, in each model, the application of SBL was improved by using modified-Gaussian kernel function and parameters optimization through Particle Swarm Optimization (PSO). To validate the proposed approach, two real-world datasets were used for construction and testing. For deterministic evaluation, the simulation results showed that the proposed model achieves a greater improvement in forecasting accuracy compared with other wind power forecast models. For probabilistic evaluation, the results of indicators also demonstrate that the proposed model has an outstanding performance.

1. Introduction

As one of the most mature renewable energy sources, the generation of wind power has been growing rapidly worldwide over the last decade. The record shows that the global total installed wind generator capacity has exceeded 310 GW, and, in China, the current installed wind farm capacity is approximately 91.42 GW with a growth rate of 21.4% in 2013 [1]. However, since the wind is a variable and uncontrollable resource, the uncertainty of wind power generation is higher than those of traditional ones. The large-scale integration of wind power imposes difficulties in terms of power system operation and management [2]. Thus wind power forecasting is a significant process to provide a tool for the energy reserve scheduling.

Short-term wind power forecasts with a prediction horizon from 30 minutes to 72 hours are mainly used for decision-making areas such as Economic Dispatch (ED). The strategies for short-term wind power forecasting contain mainly statistical models and physical models. The common methods for the former have been developed based on single methods or hybrids of several methods, such as the Persistence, Kalman filters [3], Autoregressive and Moving Average Model (ARMA) [4, 5], Artificial Neural Networks (ANN) [6, 7], Support Vector Machine (SVM) [8, 9], and Gaussian Processes (GP) [10]. Since wind varies rapidly with time, the statistical models which purely use historical wind speed data are effective only for very short-term forecasts (about 1–4 hours ahead). Physical models have a better performance for longer horizons and complicated terrain, because they adopt Numerical Weather Prediction (NWP) and improve its resolution to accurately predict a certain point (such as wind turbines at each) of weather parameters. However, this type of model also has limitations, such as the need of knowing physical properties and the huge amount of computation [11]. To overcome these limitations, some researchers present a prediction method for longer forecasting horizons by using NWP data and statistical methods [12]. Some of them used artificial intelligence algorithms as the final step of downscaling of NWP to obtain accurate predictions. In [13], a hybrid approach that integrates global Numerical Weather Prediction model with ANN is proposed. ANN is used to carry out a downscaling of the NWP forecast. To improve the accuracy of the prediction system, different NWP forecasting models combined with different neural network banks are discussed in detail to compare the prediction results [14].

It should be noted that the majority of wind power forecasting tools at present can provide only deterministic values, which are also known as spot or point forecasts [15]. Such predictions cannot provide any information about the uncertainty of the predicted values, which limits their use in stochastic optimization and dispatching of wind power. Recently, some efforts have been made to build probabilistic wind power forecasting models. The problem is how to well represent the forecast errors of wind power. There are two main approaches to deal with this problem: the nonparametric approaches and the parametric approaches. The probabilistic models available today mainly concentrate on nonparametric approaches using quantile regression. A method to compute specified quantiles of the wind power generation is proposed in [16]. In addition, a nonparametric method that performs the interval estimation of wind speed and forecasts the wind power with the operation probability of wind turbine provides another modeling approach [17]. The main advantages of these approaches are that the shape of the distributions does not have to be assumed and any information about these distributions can easily be included. But these approaches have the drawback that can only provide predefined quantiles but not full probability density function (PDF). Otherwise, nonparametric approaches usually need more parameters than the parametric ones, which always means higher requirements on the computation and the amount of training data [15]. Several parametric approaches with shape assumption of predictive distribution have also been proposed. In recent researches, the forecast errors of wind power generation are usually assumed as a variable that obeys the law of Gaussian [18, 19] or Beta [20, 21] distribution. However, studies have shown that the shape of PDF of wind power forecast error varies with the forecast timescales and magnitudes. For very short-term wind power forecasting, Gaussian distribution is unsuitable to represent the error distributions at minute-level forecast timescales [22].

In this paper, we expand our previous work on short-term wind power forecasting. A novel probabilistic wind power forecasting approach is proposed based on Sparse Bayesian Learning (SBL) method. Real wind farm datasets including NWP data and measured data from a Supervisory Control and Data Acquisition (SCADA) system are analyzed and applied to forecast the wind power in the horizon of 1–24 hours. The major work and contributions of this paper are as follows:(1)As a parametric approach, the proposed approach based on SBL and NWP can improve the forecasting accuracy significantly for deterministic evaluation and provide full probabilistic information of wind power generation.(2)After analyzing the forecast data, the predicted wind speed data from NWP is corrected first using a separate model before wind power forecasting.(3)Selectors are used to determine the inputs of each model. And the historical wind speed data from a SCADA system is used as an additional input to the wind power forecasting model for 1–4-hour-ahead prediction since it is effective in this range of time horizons.(4)When using SBL method in each model, the modified-Gaussian kernel function is adopted and Particle Swarm Optimization (PSO) is used to optimize the parameters of the kernel functions.

This paper is organized as follows. Section 2 describes the properties of meteorological variables in NWP. Section 3 expresses the SBL algorithm applied to build the models. Section 4 constructs the NWP wind speed correction model and the probabilistic wind power forecast model and gives the detailed optimization method. The case study and simulation results are presented and analyzed in Section 5. Finally, Section 6 summarizes the main conclusions.

2. Data Analysis

2.1. NWP Model

Numerical Weather Prediction (NWP) model uses atmospheric hydrodynamics and thermodynamics to predict the future weather under initial values and boundary conditions. With a high accuracy, NWP based on high-speed computers can describe the atmosphere in detail and provide the forecast of weather from different grid points and layers. NWP models can be divided into global models or regional ones. In our research, we use the Weather Research and Forecasting (WRF) NWP model. WRF is a new regional NWP model created jointly by more than 150 research institutions in the United States and other countries [23].

Short-term wind power forecasting needs the predictions form WRF model with a high spatial resolution. To increase the resolution, there are two steps: firstly, the accurate coordinate point of a wind turbine in a wind farm is inferred from a weather observation point; secondly, the observed altitude of meteorological data is transformed to the hub height of the wind turbine. A 3-layer nested grid is built and the NWP predicted data can be abstracted from the corresponding point in the 3rd layer. The data includes several kinds of meteorological variables: wind speed, wind direction, temperature, air pressure, and humidity. The data is provided at 20:00 GMT every day with an interval of 10 minutes for the following 72 hours. As we focus on probabilistic short-term wind power forecasting in the horizon of 1–24 hours, the NWP data is hourly sampled.

The other available data is the hourly measured data (wind speed and wind power) from the Supervisory Control and Data Acquisition (SCADA) system.

2.2. Error Analysis of NWP Speed Data

It is clear that wind power mainly depends on actual wind speed and the most important task for wind power forecasting is to get accurate wind speed prediction. However, there exists error between the NWP predicted wind speed and the measured wind speed. Figure 1 shows the approximately linear relationship between the predicted values and the measured values of wind speed, with a large fluctuation existing. Therefore, the statistical analysis of wind speed prediction error is necessary. Statistical analysis is to describe the average deviation between the predicted values and measured values over a period of time.

The Root Mean Square Error (RMSE) is the most commonly used error evaluation indicator. RMSE can be decomposed into two parts: the bias and the variance of the error. These measures are defined as follows:where denotes the predicted value of wind speed at time , denotes the measured value for the same period, denotes the number of prediction data, denotes the difference between the mean values of and , and denotes the standard deviation, reflecting the fluctuation of error around Bias.

From (3)–(5), we can see that RMSE is decomposed into Bias and Sde. As shown in Table 1, there exists large error in the original NWP wind speed data. This proves that it is necessary to correct the error of NWP wind speed prediction first in wind power forecasting using NWP. The detailed correction method is illustrated in Section 4.2.

3. Sparse Bayesian Learning

Recently, Bayesian methods have shown some unique advantages in statistical modeling and data analysis with uncertainty and variability [24]. Bayesian methods are adopted in modeling distributions in the application of forecasting [24].

Sparse Bayesian Learning (SBL), known as relevance vector machine formerly, is proposed by Tipping firstly as a new supervised learning method [25]. Using the Lagrange coefficient vector as the weights parameters and introducing prior probability to establish corresponding saliency framework, Tipping removed the irrelevant point to obtain a sparse model.

Compared with SVM which has similar function structure, SBL has obvious advantages. SBL is conducted under Bayesian framework and can be used in probabilistic forecasting. For one training sample set, SBL is sparser than that of SVM, representing the main feature of data. And in SVM, we have to estimate the possible values of the error trade-off parameter and the insensitivity parameter which is wasteful of computation. SBL also overcomes the limitation that the kernel function must meet the Mercer condition and provides probabilistic regression [26]. Of course, like other intelligent methods, SBL has drawbacks when used in wind power forecasting and the main one is the Gaussian assumption of the underlying distribution. The influence of Gaussian assumption for hour-level forecast timescales will be considered in further research.

SVM has been used popularly in wind speed and power forecasting. But the application of SBL present is mainly in image classification, text recognition, and timing analysis [27]. The detail of SBL can be found from [25], so here we will provide a brief description.

3.1. Method Specification

SBL that Tipping described is as follows: Given a dataset , denotes an input vector and denotes a scalar-valued target. Tipping assumes are samples distributed independently identically with additive noise which is assumed to be a random variable with Gaussian distribution of mean zero and variance

Considering the function structure of SVM, a general basis function is identified in SBL. According to the assumption of , the likelihood of the complete dataset can be expressed aswhere   is the weight vector and is .

The kernel method is usually used for the general basis function: , , and represents the kernel function. The way to optimize the kernel function will be an important work presented in Section 4.2 of this paper.

3.2. Bayesian Inference of the Parameter

Using maximum likelihood estimation directly to calculate and in (3) will result in the fact that most values of are nonzero. In SVM, overfitting occurs when there are too many nonzero values of . To avoid this, Tipping set the prior distribution of as Gaussian distribution and further set the prior distribution of and as Gamma distribution . is a hyperparameter associated independently with .

After the probability distribution given above, the posterior distribution of , , and can be inferred under the Bayesian theorem

The distribution of can be calculated from (3): the mean value of is , and the variance of is , where is . Accordingly, the problem is transformed to estimate the optimal values of the hyperparameters and :where   and are the maximum likelihood estimations of and , respectively.

From (9), it can compute and by equating the differentiation over and of the function to zero. In real application, the initial values of and would be given first, and and can be approximated through iteration. After the process of iterative reestimation, most of tends to infinity, where corresponding tends to zero. That is why SBL is a sparse model.

3.3. Probabilistic Forecasting

The distribution of the target value for a new input vector can be written as

can be approximated by , where and are both delta functions. The new target value can be calculated as follows:

4. Modeling Process

4.1. Forecasting Model Structure

The NWP data includes wind speed, wind direction, temperature, air pressure, and humidity. In general, the wind power generation mainly depends on actual wind speed. However, we do not know whether the other variables (wind direction, temperature, air pressure, and humidity) also play an important role. It is necessary to select the most beneficial training data for accurate prediction. According to our previous work presented in [28], the ARD toolbox in Netlab and the results in [28] are used to select appropriate inputs for each model.

From large body of empirical analysis and previous work, we know that the character of wind changes with diurnal cycle, and the NWP data shows different properties depending on the forecast horizon of  1–24 hours. On the other hand, a suitable number of training samples are needed in SBL which has bad performance in large training dataset case. Therefore, the training dataset (NWP and historical data) was separated into 24 subsets according the horizon of 1–24 hours to build models.

In addition, when using SBL in each model, the selection of kernel function and the optimization of its parameters have a significant impact on the model performance. In this paper, the application of SBL is improved by adopting modified-Gaussian kernel function and Particle Swarm Optimization (PSO) to optimize the parameters of the kernel functions.

According to Section 2, there are some systematic and stochastic biases present in the original NWP wind speed data. To improve the accuracy of the wind power forecast, it is important to get accurate wind speed prediction. Therefore, a wind speed correction model is applied first in this paper to correct the error in NWP wind speed prediction.

Figure 2 shows the detailed structure of the NWP wind speed correction model. ARD is used to determine which variables in NWP are relevant to the measured wind speed. Correction models 1–24 based on optimized SBL (OSBL) give the result of corrected speed for 1–24-hour-ahead prediction.

According to the analysis above, the whole modeling process of the probabilistic short-term wind power forecasting model based on SBL and NWP is illustrated in Figure 3. We donate our method of probabilistic wind power forecasting in this paper as OSBL-CS (meaning based on optimized SBL and corrected speed).

As shown in Figure 3, firstly, the corrected wind speed is obtained through the correction model. And then, the original NWP speed is replaced with this corrected wind speed. Secondly, ARD is also applied to choose variables which are relevant to wind power. As mentioned before, the historical data of wind speed is useful for 1–4-hour-ahead wind power forecasting and therefore it is included as an additional input for models in the horizon of 1–4 hours. Finally, with these combined data as inputs, a separate model for each of the 24 hourly horizons is built for probabilistic wind power forecasting.

4.2. Optimization of Kernel Function

Kernel function implies the similarity of input data, and the parameters of kernel functions have a “zoom-in” or “zoom-out” effect on this scale. Both of them have a significant impact on the performance of the model.

Gaussian kernel function is usually adopted in kernel function based methods and has been proven to be effective [29]:where is the width of kernel function.

In recent researches using kernel-based methods, the entire model always uses only one scale parameter in the application of the Gaussian kernel function and uses cross-validation method to find the optimal value of . However, for multidimensional signal having complex frequency components, it tends to be difficult to learn the model accurately if there is only one fixed scale parameter. The inputs of each model in this paper have multidimensional variables (selected from wind speed, wind direction, temperature, air pressure, and humidity). To solve this problem, we modify the original Gaussian kernel function and introduce a number of scale parameters. Based on the form of Gaussian kernel function, the kernel function becomeswhere denotes the number of variables comprised in the input vector and denote the corresponding scale parameters.

Selection of kernel parameters also plays a crucial role in the prediction results. For (10), due to an increase in the number of kernel parameters, it is difficult to find the optimal values of using cross-validation method. We adopted Particle Swarm Optimization (PSO) to search for the optimal kernel scale parameters. Compared to a similar method Genetic Algorithm (GA), PSO can reach the global optimum more quickly, and there is no need to adjust so many parameters. The detailed method of PSO can be found in [30].

The set of parameters in PSO is relatively fixed. The number of particles for each parameter we set is 30 and the adaptive function of particles is Root Mean Square Error (RMSE), as defined in Section 2.2. The maximum number of iterations is 100. The speed and location of particles are updated by the following functions:where and are speed and location of the th particle in the th iteration in j-dimension; and are, respectively, the individual best location and group best location; and are uniform random numbers ; and are accelerating factors; is inertia weight factor.

5. Experimental Validation

5.1. Case Study Description

In order to evaluate our approach, two real-world datasets based on wind farms are used. The first wind farm is located in Gansu Province, in western China, denoted by WF-G. WF-G is located in high altitude area, where the wind is strong and the wind direction is relatively single. The second is located in Jiangsu Province, in eastern China, denoted by WF-J. The humidity of WF-J is relatively high. It is a good way to validate the prediction performance of the proposed approach under different weather conditions.

As shown in Table 2, WF-G has 33 wind turbines in one group, and WF-J is a larger wind farm with 67 wind turbines in one group. The installed capacity of single wind turbine in both wind farms is 1500 kW. The time span of the NWP dataset and measured dataset is from April 2011 to May 2013. We use the data in the whole year round from April 1, 2011, to April 1, 2012, as the training set and the remaining data as the test set.

5.2. Forecasting Accuracy Evaluation

Before evaluating and comparing the models, we should introduce several indicators. To the probabilistic forecasting based on SBL, one aspect is to evaluate the error of forecasted expectation values (deterministic evaluation) and the other is to compare the difference between the wind power predictive distribution and the actual one (probabilistic evaluation). These indicators are listed as follows.

The Root Mean Square Error (RMSE), the Mean Absolute Error (MAE), andthe Normalized Mean Absolute Percentage Error (NMAPE) are used as deterministic evaluation indicators. RMSE has been defined as (2); MAE and NMAPE are defined aswhere denotes the predicted value at time , denotes the measured value for the same period, denotes the number of prediction data, and is the installed capacity of wind farm.

The reliability evaluation (RE) criterion [31] is employed to evaluate the probabilistic correctness of the predictive distribution:where is the number of test samples and is the number of times that actual test samples lie within the -confidence prediction interval. To ensure high reliability, RE should be as close to zero as possible; that is, the smaller the , the better.

In addition, the Skill Score (SC) criterion is employed to evaluate the quality of probabilistic forecasting as a whole:where is the number of quantiles and is the quantile with proportion in predictive distribution. is an indicator for , being equal to 1 if and being zero otherwise. In general, the SC is a minus, and the higher the SC, the better.

For the deterministic evaluation, the RMSE, MAE, and NMAPE are popular indicators [32, 33]. According to the specified accuracy requirement of Chinese government, NMAPE should be less than 10% for 1–4-hour forecast and less than 20% for 5–24-hour forecast. Thus, NMAPE is used as the main indicator in deterministic evaluation.

Evaluation of the quality of the probabilistic forecasting results is quite different from common statistical approach. The RE, the SC, and the Sharpness Evaluation (SE) are main indicators introduced in [34] for probabilistic forecasting of wind power. The method in this paper has the underlying Gaussian assumption of the distribution shape. We mainly use the RE and the SC as the indicators, since the SE directly relates to the variance ; the smaller the , the sharper and the better.

5.3. Simulation Results
5.3.1. Effectiveness of NWP Wind Speed Correction

The ARD toolbox is employed to choose inputs of the model. The relevance values of each NWP variable to the error of measured wind speed and NWP predicted speed are listed in Table 3 [28]. According to the results, we choose wind speed, temperature, and humidity as inputs of each correction model for WF-J and add air pressure for WF-G.

To illustrate the effectiveness of the NWP speed correction model, we introduce a common correction method based on Linear Regression (LR) as a benchmark. LR is a statistical method using the least squares approach to model the relationship between one or more explanatory variables and the dependent variable. For the horizon of 1–24 hours, a comparison of corrected error using LR and OSBL is displayed in Figure 4.

Figure 4 shows that the corrected results in both farms are much better than the original NWP data, especially using the proposed NWP speed correction model in this paper.

Table 4 shows the RMSE and MAE results of the whole correction process on test datasets. From Table 4, the correction model in this paper successfully reduced the prediction error. The improvement of accuracy based on RMSE is 20.36% for WF-G and 33.05% for WF-J.

5.3.2. Deterministic Evaluation of Wind Power Forecasting

This section aims to evaluate the forecasted expectation values using the probabilistic model OSBL-CS. According to the ARD results, in addition to the corrected wind speed, we also choose temperature, air pressure, and humidity as inputs of wind power forecasting model for WF-G and choose wind direction and temperature as inputs for WF-J, which is reasonable for the different weather conditions in the two farms.

In order to validate its performance, some methods commonly used for deterministic wind power forecasting are adopted as the benchmarks: the Persistence method, MLP neural networks, SVM, and original SBL. The Persistence method simply uses the current value as the forecast. The MLP neutral networks have been applied to researches on short-term wind power forecasting widely. Like OSBL-CS, the model using the method of MLP also corrects wind speed first and adds historical data for 1–4 hours ahead, denoted by MLP-CS. The first part of MLP-CS contains 9 hidden layer neurons; the second part using the corrected wind speed as inputs contains 7 hidden neurons.

SVM has similar function structure to SBL; hence a SVM based model which has the same wind power forecasting process with OSBL-CS is built for comparison. If we use the modified-Gaussian kernel function proposed in this paper, it is difficult to estimate so many additional parameters accurately and it becomes meaningless in practical application of SVM. Therefore, standard Gaussian kernel function is adopted in the model based on SVM, and cross-validation method is used for parameters optimization. Furthermore, an original SBL based model using Gaussian kernel function which is the same as SVM-CS is built to compare with the improved model of OSBL-CS.

The deterministic results of 24-hour-ahead wind power forecast in each wind farm are shown in Figure 5. At the prediction interval of 1–4 hours, these methods all have a good performance. It shows that the additional inputs of historical data for each model successfully reduced the prediction error. With the increase of the forecast horizon, the forecast error also increases. The predictive wind power curve of the OSBL-CS model basically captures the actual wind power curve in both wind farms. In some moments, the wind power generation changes dramatically, and the prediction effect of the OSBL-CS model becomes a bit worse. But on the whole, the OSBL-CS model has a better performance compared with the SVM model.

Tables 5 and 6 show the calculation results on the whole test sets of RMSE, MAE, and NMAPE in each wind farm. The proposed OSBL-CS model has a better performance than the other models. Compared to the Persistence method, the improvement of accuracy is 31.52% for test dataset of WF-G and 41.92% for WF-J. Compared with other artificial intelligence methods, the SBL based models (SBL-CS and OSBL-CS) have better forecast accuracy. Compared to the original SBL-CS model, the OSBL-CS model has improved the accuracy of 2.65% for WF-G and 7.36% for WF-J. According to the above analysis, this proposed model has the feature that the improvement of performance in WF-J is higher than the one in WF-G. And the OSBL-CS model has lower average forecast error in WF-J. That is mainly because WF-G is located in the windy area and the wind speed changes more dramatically.

The NMAPE values in Tables 4 and 5 embody the overall forecast performance. As the specified requirement of NMAPE by Chinese government for 1–4-hour and 5–24-hour forecast, the detailed results of NMAPE in each forecast hour for WF-G are calculated, as shown in Figure 6. It can be seen from this figure that NMAPE according to OSBL-CS model is lower than the others and can satisfy the government’s requirement successfully.

We should note that the forecast accuracy of OSBL-CS is not so competitive for 1-hour-ahead forecast, as the NMAPE values of the Persistence model and the MLP-CS model are a bit smaller. It suggests that, for very short-term wind power forecasting, the OSBL-CS could not outperform others universally. But the other points in Figure 6 show a significant improvement. From the above, we can conclude that the OSBL-CS is a better model for deterministic wind power forecasting compared with the other four models in this paper.

5.3.3. Probabilistic Evaluation of Wind Power Forecasting

To fully validate the performance of the OSBL-CS model, another aspect is to evaluate the wind power predictive distribution.

An example of 24-hour-ahead wind power forecast for a wind turbine in WF-J containing the information of uncertainty (the and the confidence intervals) is shown in Figure 7. As shown in this figure, the actual wind power values are mostly included in the and range of the predicted values. The value of is small in the horizon of 1–4 hours, which indicates that the Sharpness Evaluation values are also small in such prediction period, and the value of becomes large especially when the measured power fluctuates obviously.

In order to evaluate its probabilistic forecasting performance, quantitative analysis is still needed. The reliability represents the statistical consistency of predictive distribution and observations. The reliability evaluation plays the role of basic evaluation [35]. By definition (16), for high reliability, the actual test samples are expected to lie within the -confidence prediction interval. We choose the nominal confidence levels as 90%, 95%, and 99% (while is 0.1, 0.05, and 0.01, resp.) and calculate the RE values in the forecast horizon of 1–24 hours, as shown in Table 7.

According to Table 7, in the horizon of 1–24 hours, the RE values for the probabilistic forecasting results using the proposed OSBL-CS model are small, especially for the confidence levels of 95% ( is 0.05) and 99% ( is 0.01). The most absolute RE values at confidence level of 95% are less than 2% and less than 3% at 99%. Particularly, the RE values corresponding to the horizon of 1–10 hours are closer to zero, which means that the predictive distributions match the observations better. And the OSBL-CS model has a good performance of probabilistic forecasting in this horizon. Overall, the RE values obtained by the Gaussian distribution model OSBL-CS show high reliability of the predictive distribution.

The Skill Score (SC) criterion provides comprehensive information on probabilistic forecasting. Since the forecast results obey the law of Gaussian distribution, for (17), we choose seven quantiles according to , , and points around expectation, and the corresponding is 0.0013, 0.0228, 0.1587, 0.5, 0.8413, 0.9772, and 0.9987. The SC values are calculated in the horizon of 1–24 hours for the probabilistic forecasting results given by OSBL-CS model and the original SBL-CS model is also used as a comparison in both farms WF-G and WF-J. The SC values are contained in Tables 8 and 9.

According to Tables 8 and 9, the performance of the OSBL-CS model is clearly better compared with the SBL-CS model. We can see that the OSBL-CS model gives good probabilistic forecasting performance in both farms, particularly in the horizon of 1–4 hours due to the high reliability and the small Sharpness Evaluation values. It should be noted that, in the horizon of 10–15 hours for WF-G, the SC values are a bit higher, which means that the performance is relatively worse. The wind characteristics and the geographical environment of WF-G may account for this phenomenon. During this period, the wind speed of WF-G may be high enough that the wind turbine would be close to the full state, and the temperature in this place is always high. Thus, the OSBL-CS model is not absolutely suitable for probabilistic forecasting in this horizon for WF-G.

6. Conclusion

Research on probabilistic short-term wind power forecasting is an important and challenging task. In this paper, a new supervised learning method SBL is introduced and optimized for the training of the forecast model. In order to improve the forecast accuracy, some methods were used: the wind speed data in NWP is firstly corrected before it is used to forecast; selectors are used to determine the inputs of each model and for 1–4-hour-ahead forecasts, and historical data is added to modeling process; the SBL is optimized by adopting modified-Gaussian kernel function and Particle Swarm Optimization (PSO) to optimize the parameters of the kernel function. The simulation results in deterministic and probabilistic evaluation reveal the effectiveness of the proposed OSBL-CS model.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

The authors are thankful to the National Fund for Creative Groups of China (Grant no. 61121003) for their support of this study.