Abstract

The paper illustrates an adaptive approach based on different topologies of artificial neural networks (ANNs) for the power energy output forecasting of photovoltaic (PV) modules. The analysis of the PV module’s power output needed detailed local climate data, which was collected by a dedicated weather monitoring system. The Department of Energy, Information Engineering, and Mathematical Models of the University of Palermo (Italy) has built up a weather monitoring system that worked together with a data acquisition system. The power output forecast is obtained using three different types of ANNs: a one hidden layer Multilayer perceptron (MLP), a recursive neural network (RNN), and a gamma memory (GM) trained with the back propagation. In order to investigate the influence of climate variability on the electricity production, the ANNs were trained using weather data (air temperature, solar irradiance, and wind speed) along with historical power output data available for the two test modules. The model validation was performed by comparing model predictions with power output data that were not used for the network's training. The results obtained bear out the suitability of the adopted methodology for the short-term power output forecasting problem and identified the best topology.

1. Introduction

Among renewable energy sources (RES), solar energy has the greatest energy potential and PV arrays permit to produce electric power directly from sunlight; furthermore, during the operational phase, the energy production occurs without fossil-fuel consumption or noise, and not posing health and environmental hazards. These features will make the PV devices one of the most important among the technologies based on the exploitation of RES [15]. Nevertheless, the technological and environmental benefits of PV technology are hindered by economic and technical factors. The high cost of production and installation make the PV technology feasible to the customer only if there are public funding opportunities. Furthermore, there are various concerns associated with PV modules, such as the impact of their interconnection to the grid [6]. Some studies have been carried out on this, for example, [7], but, in general, there is little information on the topic. The most severe disturbance caused by the connection of a large amount of PV generation to the grid would be encountered when a band of cloud sweeps over an area with a large concentration of PV generators. This could result in a fairly large and sudden variation in the PV output. The condition would be aggravated if this change in irradiance occurred during a rapid increase in load [8]. For these reasons, it is clear that the availability of reliable predictive tools is very important for the dissemination of PV technologies, to optimize the performance of PV systems in the planning and operational phase and finally to correctly assess the economic return. In order to evaluate the real performance of PV panels is very important the correct prediction of power output; an increase of even a few degrees of the PV panel together with a lower solar irradiance can considerably reduce the conversion efficiency of the system thus reducing the power output [9]. Indeed, an important consideration in achieving the efficiency of a PV panel is to evaluate the performance for any weather conditions and to match the maximum power point. Many methods based on the MPPT (maximum power point technique) have been reported in the literature, many others applied empirical correlations to evaluate the thermoelectrical performance of a PV system. However, these approaches require detailed knowledge of physical parameters of the PV system and manufacturing specifications. Another approach is represented by adaptive systems. An adaptive system is a system that is able to adapt its behaviour according to changes in its environment or in parts of the system itself. An adaptive system, such as artificial neural networks (ANN), does not require any physical definitions for a PV array but should allow predicting, in a fast and reliable procedure, the power output of the PV module varying the weather conditions. This paper presents a comparison of different types of ANNs that better forecasts the PV power output. The authors have tested the use of ANN to predict the power output of a PV panel using the data monitored in a test facility.

2. The Power Output of a PV Module

To design and assess the performances of a PV system, an accurate PV model should predict a reliable current-voltage (I-V) and power-voltage (P-V) curves under real operating conditions.

The “five-parameters model” represents the most common equivalent circuit that better describes the electrical behaviour of a PV system. The equivalent circuit is composed of a photocurrent source , a diode in parallel with a shunt resistance , and a series resistance as shown in Figure 1.

Based on this simplified circuit, the mathematical model of a photovoltaic cell can be defined in accordance with the following expression that permits to retrieve the I-V curve: in which depends on the solar irradiance, is the diode reverse saturation current and is affected by the silicon temperature, n is the ideality factor, and is the cell absolute temperature.

As it is known, the performance of a photovoltaic panel is defined according to the “peak power,” which identifies the maximum electric power supplied by the panel when it receives a solar irradiance of 1 kW/m2 at a cell temperature of 25°C. For given values of G, and , the operating point can be identified by drawing lines of the different loads on the I-V characteristic (Figure 2); the maximum power points are indicated by red circles.

In actual conditions, it is essential to evaluate the operating condition under all possible circumstances of G, , wind speed W, air temperature , and electric load . The temperature thus is a key parameter that affects the energy conversion efficiency of a PV panel: increasing the temperature decreases the delivered power.

Furthermore, in the literature, it is possible to find different algorithms for seeking the maximum power point [1012]. In detail, the indirect methods have the particular feature that the MPP is estimated from the measures of the PV generator’s voltage and current PV, the irradiance, or using empiric data, by mathematical expressions of numerical approximations. In the most of the maximum power point tracking (MPPT) methods, described in the literature, the optimal operation point of a generic PV system is estimated by linear approximations [13, 14] as where and are the maximum voltage and current, respectively, and are two constants of proportionality (voltage and current factors) dependents on the characteristics of the PV array used, is the open circuit voltage, and is the short circuit current.

Nevertheless, the direct methods can also be used; they offer the advantage that they obtain the actual maximum power from the measures of the PV generator’s voltage and current PV. In that case, they are suitable for any irradiance and temperature [15]. All algorithms, direct and indirect, can be included in some of the DC/DC converters, maximum power point tracking (MPPTs), for the stand-alone systems [10].

Recently, the fuzzy logic controllers (FLCs) and artificial neural network (ANN) methods have received attention and increased their use very successfully in the implementation for MPP searching [1626]. The fuzzy controllers improve control robustness and have advantages over conventional ones. They can be summarized in the following way [27]: they do not need exact mathematical models, they can work with vague inputs and, in addition, can handle nonlinearities, and are adaptive, in nature; likewise, their control gives them robust performance, under parameter variation, load and supply voltage disturbances. Based on their heuristic nature and fuzzy rule tables, these methods use different parameters to predict the maximum power output: the output circuit voltage and short circuit current [17]; the instantaneous array voltage and current [1820]; instantaneous array voltage and reference voltage (obtained by an offline trained neural network) [16]; instantaneous array voltage and current of the array and short circuit current and open circuit voltage of a monitoring cell [21, 22] and solar irradiance, ambient temperature, wind velocity and instantaneous array voltage and current, used in [23, 25, 26].

Next, three different ANNs are proposed with the aim to forecast power output of PV modules.

3. Generalities on Adaptive and ANN Systems

Adaptive systems and ANNs are nonlinear elaboration information systems whose operation function draws its inspiration by biological nervous system. When there is no clear relationship between the inputs and outputs, it is not easy to formulate the mathematical model for such as system; on the contrary, the ANN can model this system using samples [27].

Their ability to learn from experimental data makes ANN very flexible and powerful than any other parametric approaches. Therefore, neural networks have become very popular for solving regression and classification problems in many fields [28]. Because the neural network does not require any detailed information about the system or process, it operates like a black box [29].

4. The Artificial Neuron

An ANN consists of many interconnected processing nodes known as neurons that act as microprocessors (Figure 3).

Each artificial neuron (Figure 4) receives a weighted set of inputs and produces an output.

The activation potential of an AN is equal to where is the number of elements in the input vector , are the interconnection weights, and is the “bias” for the neuron [30]; the bias is a coefficient that controls the activation of the signal handled by the AN. The neuron output depends only on information that is locally available at the neuron, either stored internally or arrived via the weighted coefficients.

5. The Activation Function

The neuron output is calculated by the summation of weighted inputs with a bias through an “activate on function” as follows:

The activation function is intended to limit the output of the neuron, usually between the values or [–1, +1]. Typically it is used the same activation function for all neurons in the network, even if it is not necessary [31]. The most commons activate functions are the step function, the linear combination, and the sigmoid function as shown in Figure 5.

In the step function, the output of this transfer function is binary, depending on whether the input meets a specified threshold. The “signal” is sent; that is, the output is set to one, if the activation meets the threshold:

The step activation function is especially useful in the last layer of an ANN to perform a binary classification of the inputs.

A linear combination, usually more useful in the first layers of an ANN, where the weighted sum input of the neuron plus a linearly dependent bias becomes the system output. A number of such linear neurons perform a linear transformation of the input vector as in which is a scale parameter.

A sigmoid activation function produces an output value between 0 and 1. Furthermore, the sigmoid function is continuous and differentiable. Due to these reasons, this activation function is used in ANN models in which the learning algorithm requires derivatives. Often, sigmoid function refers to the special case of the logistic function defined by the formula where is a constant that control the shape of the curve. The sigmoid function such as the logistic function also has an easily calculated derivative, which can be important when calculating the weight updates in the network. It thus makes the network more easily mathematically manipulable and was attractive to early computer scientists who needed to minimize the computational load of their simulations.

6. Architecture or Topology of an ANN

Generally, an ANN is usually divided into three parts: the input layer that collects the inputs , the hidden layer , and the output layer that issues the outputs . If a neural network is composed by a single layer of unidirectional connections from the input nodes to output nodes is called Perceptron.

This configuration is the simplest and is not able to solve not linearly separable problems. For these kind of complex problems is more useful to use a multilayer perceptron (MLP) ANN that is a feed forward ANN model that maps sets of input data onto a set of appropriate outputs. The feed forward was the first and arguably simplest type of ANN developed. In a feed forward ANN the connections between the units do not form a directed cycle; the information moves in only one direction, forward, from the input nodes, through the hidden nodes (if any) and to the output nodes. By this way, there are no cycles or loops in the network.

According to the above definitions, a feed forward MLP consists of multiple layers of nodes in a directed graph, with each layer fully connected to the next one. Except for the input nodes, each node is a neuron (or processing element) with a nonlinear activation function.

On the contrary, a radial neural network (RNN) is a class of neural network where connections between units form a directed cycle. This creates an internal state of the network that allows the ANN to exhibit a dynamic behaviour. Unlike feed forward ANN, RNNs can use their internal memory to process arbitrary sequences of inputs. This makes them applicable to tasks such as the recognition of time series, where they have achieved the best known results.

7. Training Algorithm

Before the neural network can be used to a specific problem, its weights have to be tuned. This task is accomplished by the learning process in which the network is trained. This algorithm iteratively modifies the weights until a specific condition is verified. In most applications, the learning algorithm stops when the error between desired output and the calculated output produced by the ANN reach a predefined value. The error is updated by optimizing the weights and biases. After the training process, the ANN can be used to predict the output parameters as a function of the input parameters that have not been presented before. An epoch is a collection of all available samples; it is also the term used for a training iteration of the system: when one epoch has passed, the adaptive system has been presented with the available data once. As adaptive systems are for the most part trained iteratively, many epochs are usually required to fully train a system.

Concerning the learning algorithm, there are generally two typologies of ANN learning algorithm [32]:(i)supervised learning;(ii)unsupervised learning.

Supervised learning is characterised by a training set composed of pairs of inputs and corresponding desired outputs. The error produced by the ANN is then used to update the weights (back propagation).

In unsupervised learning algorithms, the network is only provided with a set of inputs and without desired output. The algorithm guides the ANN to self-organize and to adapt its weights. This kind of learning is used for tasks such as data mining and clustering, where some regularities in a large amount of data have to be found.

The information in the previous layers obtained by updating the weighting coefficients is supplied to the next layers through the intermediate hidden layers. More hidden layers can be added to obtain a quite powerful multilayer network. The MLP architecture has been successfully employed as a universal function approximation in many modelling situations [28].

8. Generalities on the PV Panel Behaviour

The electrical power produced by PV devices is linked to the solar irradiance on the panel and the operating temperature, but also depends on the connected electrical load as shown in Figure 2; indeed, the load defines the operating point on the P-V characteristic. For given values of irradiance, temperature, and electrical load, the operating point can be identified by drawing on the P-V characteristic the lines of the different . Therefore, in correspondence with a generic constant load connected to a photovoltaic panel, the working point will move along the load curve under the effect of temperature variations and solar irradiance during the day. The maximum power point (MPP) is identified by a red circle and its coordinates in the P-V plane are ; in the I-V plane, the coordinates of MPP are . A careful analysis of P-V curves permits to immediately recognize as the electrical behaviour of a generic PV panel can be represented in three modes or regimens:(i)when the ratio between the working voltage and the voltage of maximum power at given temperature is less than 0.95, the characteristic P-V is practically linear and the power is strongly correlated to the incident solar irradiance; for constant solar irradiance, there is no temperature influence in the power output;(ii)when the ratio for a given solar irradiance and temperature is greater than 1.05, the P-V characteristics of the panel decreases much more rapidly and the influence of solar irradiance becomes less significant (saturation conditions); for constant solar irradiance there is a linear correlation between temperature and the power output;(iii)the regimen identified by a ratio characterizes the state of a PV panel connected to a maximum power point tracking system (MPPT) in which the load dynamically adapts to generate the maximum power (red circle).

9. Data Acquisition System: Input Data Vector

To employ and train an ANN, a large database of specific data that represent the analysed physical system is required. To this aim, a test facility was built up on the roof of the Department of Energy, Information Engineering, and Mathematical Models (DEIM) at the University of Palermo. The monitoring system consists of two photovoltaic modules and a pyranometer tilted at 38° facing south, a precision resistance set used as calibrated load and a multimeter. Concerning the data acquisition of climate parameters, a network of weather stations was built up [33]. The thermal regimen of the PV modules has been measured with thermocouples (type T, copper-constantan) installed at the rear film of the module. All data were collected every 30 minutes and stored for the further calculations and comparisons. The physical data used for the training of the ANN were as follows:(i)air temperature [°C];(ii)cell temperature [°C];(iii)solar irradiance [W/m2];(iv)wind speed [m/s];(v)open circuit voltage [V];(vi)short circuit current [A].

These last two parameters are important to improve the evaluation the PV panel power output. Their values are evaluated by using the following expressions [34]: where the subscript ref identifies the reference conditions and and are the short circuit current and open circuit voltage temperature coefficients, respectively [35].

The dataset used for the following analyses consists in more than 6000 data points. The 15% of data will be used as a test dataset (not used for the ANN training phase).

The monitoring campaign involved the measurement of the performances of two different photovoltaic panels: a Kyocera KC175-GH-2 polycrystalline panel and a Sanyo HIT240 HDE4 monocrystalline panel. The principal characteristic of the two panels are showed in Tables 1 and 2.

The measurement campaign about the power output of the PV modules took several months and was characterized by a frequent change of the resistive loads to the aim of acquiring data relating to the entire P-V curve. All data are subject to a preprocessing step that consists in a preliminary analysis that permits to identify possible outliers, to remove uncorrected values, to carry out a statistical analysis, and to perform a correlation analysis.

To simulate the presence of a MPPT device, individual records characterized by a were extracted from the original database.

After the preprocessing step, the database was validated and the correlation analysis has permitted a first evaluation of the mutual relationships among the considered variables.

Figures 6 and 7 show the linear correlation between the power output and all the other features. The higher the bar goes, the more the features are correlated. In both cases the preliminary correlation analysis identified a strong correlation between and the solar irradiance; a moderate correlation with air temperature and wind speed was found.

A statistical analysis permitted to assess the maximum (Max), mean (Mean) and minimum (Min) values and the standard deviation (StDev) of all considered features (Tables 3 and 4).

In our study, for the topology of the tested ANN, we decided to use an input vector with six components: , G, , W, , and ; the output vector has only one component: the power output P, as shown in Figure 8.

10. ANN Topologies

After the preprocessing phase, the authors explored different topologies of ANN. In the following part, will be described only the best ANN solutions:(i)one hidden layer MLP;(ii)RNN MLP;(iii)gamma memory ANN.

For each topology are analysed the design and the algorithm, each neural network was trained and was validated with a post processing phase.

11. Description of the Implemented ANN Topology

11.1. One Hidden Layer MLP

The one hidden layer MLP is a kind of ANN consisting of three layers of ANs in a directed graph, with each layer fully connected to the next one. In this work, except for the input ANs, each node is a neuron with a sigmoid activation function and a common supervised learning technique for training the network was used. The tested topology is one of the simplest available for ANNs and is composed by two input sources, two function blocks, two weight layers, one hidden weight layer, and one error criterion block.

Figure 9 schematizes the tested one hidden layer MLP topology to evaluate power output of a PV panel.

11.2. RNN MLP

The RNN MLP is a simple ANN topology that employs a recursive flow of the signal to preserve and to use the temporal sequence of events as a useful information. This topology is composed of two input sources, two weight layer, one hidden weight layer, two recursive function blocks, and one error criterion.

Figure 10 shows the RNN MLP topology for the power output evaluation. The recursivity is iconized by a feedback connection where μ is the weight of the feedback used to scale the input. In our test, each signal flowing into the recursive function block is linked to a different value of μ.

11.3. Gamma Memory ANN

The gamma memory (Figure 11) processing element (PE) is used in dynamic systems to remember past signals [36]. It enables the usage of past information to predict current and future states. The gamma neuron is ideal for neural networks since the time axis is scaled by the parameter μ, which can be treated as any weight and adapted using back propagation.

The application of gamma memory permitted to employ an ANN to emulate the trends. In this work was proposed an ANN constituted by two input sources, three gamma memory blocks, three weight layer, three function blocks, and one error criterion block (Figure 12).

12. Postprocessing Phase: Performance Assessment of ANNs

After the training, for each ANN, the postprocessing phase evaluate the difference between the calculated and the measured output vector. The data used for this phase are not used for the training process. The performance assessment is carried out by means of three indexes:(i)the mean error (ME) is where is the number of samples,(ii)the mean absolute error (MAE) represents the quantity used to measure how close forecasts or predictions are to the eventual outcome: (iii)the standard deviation shows how much variation or “dispersion” exists from the average (mean or expected value). A low standard deviation indicates that the sample data tend to be very close to the mean; high standard deviation indicates that data are spread out over a large range of values:

13. Results and Discussions

As previously described, each ANN was characterized by a training phase, a postprocessing phase evaluates the error, and the absolute error between the measured and the calculated operating temperature data. To better analyse the validity of the ANN, different simulations were carried out changing the time of the training phase and/or the epochs. In all cases, the training phase has been suspended in order to avoid the over-fitting. Furthermore, for each topology was identified the confidence plot that contains the 95% of the outputs.

To better understand how ANNs performance can be evaluated, Figure 13 shows the calculated power output versus measured power output (data points not used for training phase).

In Tables 5 and 6, the results of several ANNs tested topologies are reported.

The result coming from the ANNs designed to predict the power output produced by a PV panel shows that this kind of approach is very promising. Mean errors appear to be generally very low (1 W). ANN topologies based on MLP for both panels were very good in terms of prediction error even if they required a longer time for the training phase. The results of the RNNs and gamma memory ANNs are characterized by good performances with shorter training time for the Kyocera module. The Sanyo panel has generally required longer training time but with excellent results in term of mean error especially with the gamma memory ANN.

14. Conclusions

In the paper, different network architectures have been tested in order to forecast the electric power generated by a PV module in real conditions. Data used to train the networks were acquired using two different types of PV modules connected to calibrated electrical loads. Climatic variables were acquired by means of a weather station. The performances evaluation of the ANNs was performed by comparing the prediction with the real power output and the errors were generally contained within the 0.05–1% of the module peak power output. ANNs with simpler architecture generally required longer training time while more complex ANNs have requested shorter training time. Results show that adaptive techniques are able to predict the power output of a PV panel with great accuracy and short computational time. These algorithms can play a dominant role concerning remote management of PV in a probable future when this technology will be extremely widespread in the territory.

Nomenclature

:Activation potential
AN:Artificial neuron
ANN:Artificial neural network
:Bias coefficient
FLCs:Fuzzy logic controllers
:Solar irradiance [W/m2]
:Current [A]
:Diode reverse saturation current [A]
:Maximum current [A]
:Photocurrent [A]
:Short circuit current [A]
:Scale parameter
:Constants of current proportionality
:Constants of voltage proportionality
MPP:Maximum Power Point
:Maximum Power Point technique
:Ideality factor
:Number of elements in the input vector
:Power output [W]
PV:Photovoltaic
:Electric load [Ω]
RNN:Radial neural network
:Shunt resistance [Ω]
:Series resistance [Ω]
:Air temperature [°C]
:Cell absolute temperature [°C]
:Voltage [V]
:Maximum voltage [V]
:Open circuit voltage [V]
:Weights
:Wind speed [m/s]
:Interconnection
:Neuron output
:Short circuit current temperature coefficients [mA/°C]
:Open circuit voltage temperature coefficients [V/°C].

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.