Research Article | Open Access
Comparative Study of Ground Measured, Satellite-Derived, and Estimated Global Solar Radiation Data in Nigeria
In this study, the performance of three global solar radiation models and the accuracy of global solar radiation data derived from three sources were compared. Twenty-two years (1984–2005) of surface meteorological data consisting of monthly mean daily sunshine duration, minimum and maximum temperatures, and global solar radiation collected from the Nigerian Meteorological (NIMET) Agency, Oshodi, Lagos, and the National Aeronautics Space Agency (NASA) for three locations in North-Western region of Nigeria were used. A new model incorporating Garcia model into Angstrom-Prescott model was proposed for estimating global radiation in Nigeria. The performances of the models used were determined by using mean bias error (MBE), mean percentage error (MPE), root mean square error (RMSE), and coefficient of determination (). Based on the statistical error indices, the proposed model was found to have the best accuracy with the least RMSE values (0.376 for Sokoto, 0.463 for Kaduna, and 0.449 for Kano) and highest coefficient of determination, values of 0.922, 0.938, and 0.961 for Sokoto, Kano, and Kaduna, respectively. Also, the comparative study result indicates that the estimated global radiation from the proposed model has a better error range and fits the ground measured data better than the satellite-derived data.
Solar radiation is the most important source of energy on earth because it plays a major role in sustaining all the activities and processes that support life of both plants and animals on earth. Solar radiation data are needed in a variety of technological areas: agriculture, engineering, forestry, meteorology, water resources management, and the designing and sizing of solar energy systems. Among the various professionals that use solar radiation data, solar energy devices design experts are more concerned about the accuracy of the data since efficient design, sizing, and performance of solar energy devices depend on the accuracy of the available insolation data of the site.
One of the ways of getting accurate and reliable global solar radiation data for solar energy system design is by ground measurements at the site of interest. Ground measurements are typically pin point measurements which are temporally integrated. This involves installation of solar sensor such as pyranometer for continuous, long-term measurements of solar data. Compared to the measurements of other meteorological parameters, the equipment for solar radiation measurements is very expensive and requires experts for operation and maintenance. Although ground measure data are said to be accurate and reliable, the cost implication and technicality involved have made such data unavailable in many locations. This has led to the search for alternative means of getting solar data for research and development of solar energy systems.
Reference  identified three classes of methods generally used in getting global solar radiation data apart from direct ground measurements. The first class uses empirical approach where meteorological data are employed with regression techniques. The second class uses solar constant by considering the depletion of insolation value due to clearness index variation and the third class is based on satellite measurements. The first and second classes above (fall under) are classified as global solar radiation estimation techniques. Angstrom proposed the first model for estimating monthly average daily global solar radiation in 1924. Ever since, many of such models have been developed and tested by researchers around the world. Several of these models have been reported for their accuracy in predicting global solar radiation in different locations across the globe [2–6].
Satellite-derived data classified as the third class sources are instantaneous spatial averages remotely measured several kilometres from the earth’s surface by geostationary and polar orbiting satellites. Satellite measurements provide easy access to long-term, cheap, and verifiable means of deriving regular solar radiation data for any desired location in the world. Satellite-derived data fit better to a selected site than ground measurements from a site farther than 25 km away . The ground measured and satellite-derived solar radiation data complement each other and are required to build a comprehensive solar radiation database. It is difficult to have high capability solar radiation monitoring network, and accuracy of the interpolation of data decreases with the increase in distance between sites. Satellite measurements are not as accurate as ground measurements and short time interval data are needed for the engineering and site-specific studies. Hence, combining these two, ground-based and satellite- derived measurements create a comprehensive solar radiation database .
The availability of ground measured solar radiation data is not guaranteed for all locations. Likewise meteorological parameters required for estimating global solar radiation data may not be easy to come by in some locations but satellite-derived data are available for every location in the world. The focus of this study is therefore to compare the following: (i) the performance of some global solar radiation models and (ii) estimated global solar radiation data and satellite-derived data with ground measured data so as to determine their relative levels of accuracy and reliability for solar energy system design and sizing.
2. Research Methodology
2.1. Study Area
Three locations from the North-Western region of Nigeria were selected for this study. The geographical locations of the sites are shown in Table 1.
2.2. Data Collection
The twenty-two years’ (1984–2005) meteorological data consisting of monthly mean daily sunshine duration, minimum and maximum temperatures, and global solar radiation used for this study were collected from two data sources: the Nigerian Meteorological (NIMET) Agency, Oshodi, Lagos, and the archives of the National Aeronautics Space Agency (NASA).
2.3. Data Analysis Techniques
Microsoft Excel software package was used for the collation of the monthly mean values of the data collected from NIMET and in carrying out other statistical analysis and computation.
2.4. Models for Estimating Global Solar Radiation
The solar radiation data for the selected sites were estimated from sunshine duration and air temperature using the Angstrom-Prescott model, Garcia model, and a newly developed model incorporating Angstrom-Prescott and Garcia models. The three models were used to allow for comparison of their relative performance in the context of the research.
2.4.1. Angstrom-Prescott Model
The first model for estimating the monthly average daily global solar radiation was proposed by Angstrom in 1924. The proposed relation was deduced based on a correlation between the ratios of average daily global radiation to the corresponding value on an entirely clear day. In 1940, Prescott modified the Angstrom relation with a view to resolving the ambiguity characterizing the definition of the clear sky global solar radiation. The modified version is known as Angstrom-Prescott model. The Angstrom-Prescott model and its corresponding equations are presented in (1)–(5) as given by The values of can be calculated using the equation given by  aswhere is sunset hour angle in degree defined as is declination angle given as is the latitude of the location; is day number of the year starting from the first of January; is solar constant given as 1367 (Wm−2); is monthly average daily global radiation on a horizontal surface (MJm−2day−1); is monthly average daily extraterrestrial radiation on a horizontal surface; is monthly average daily number of hours of bright sunshine; is monthly average daily maximum number of hours of possible sunshine given as , are regression constants to be determined.
2.4.2. Garcia Model
Garcia proposed a model for estimating global solar radiation in 1994. Garcia model is an adaptation of Angstrom-Prescott model with a slight modification that makes it temperature-based type as described inwhere , are regression constants to be determined and is the difference between maximum and minimum temperature values.
2.4.3. Proposed Model
A multilinear two-parameter regression model was developed for the estimation of global solar radiation in the selected sites. Garcia model was incorporated into Angstrom-Prescott model to form a new model with three regression constants. The proposed model is of the form: where , , and are the regression constants to be determined. All other symbols remain as earlier defined.
2.5. Statistical Analysis of the Empirical Models
Statistical indicators which include mean bias error (MBE), mean percentage error (MPE), root mean square error (RMSE), coefficient of correlation (), and coefficient of determination () were used to test the accuracy of the estimated values. The comparison of the satellite-based data and estimated global radiation values with the ground measured data was done using percentage error.
The expressions for the MBE (MJm−2day−1), MPE (MJm−2day−1), and RMSE (MJm−2day−1) as stated by  arewhere , , , and are th estimated, measured, mean estimated, and mean measured values, respectively, of solar radiation and is the total number of observations.
MBE and MPE provide information on the long-term performance of models. A positive value and a negative value of MBE and MPE indicate the average amount of overestimation and underestimation in the calculated values, respectively. RMSE provides information on short-term performance of the models. It is always positive. It is recommended that a zero value for MBE is ideal while a low RMSE is desirable [12–14]. The -value defines the linear relationship between the measured and estimated values of global solar radiations while statistic gives the percentage variation of the dependent variable in connection with the independent variables. For better modelling, and should approach unity as closely as possible .
3. Result and Discussion
The regression constants for the proposed model as well as those of Angstrom-Prescott and Garcia models were obtained using IBM SPSS 20 software. Linear regressions were carried out between the monthly observed clearness index and other meteorological parameters using 15 years’ data (1984–1998). The values of the regression constants obtained were substituted into (1), (6), and (7). The modified equations of Angstrom-Prescott model, Garcia model, and the proposed model, respectively, obtained are presented below.
Modified Models Equations for Sokoto
Modified Models Equations for Kano
Modified Models Equations for KadunaThe global solar radiation for the three selected locations was estimated using the modified equations of the three models as presented in (9)–(11). Long-term average (22 years) sunshine duration and air temperature (maximum and minimum temperatures) data obtained from NIMET were used as input parameters for the models. The input parameters used in this analysis are presented in Table 2.
A comparison of the monthly mean values of the estimated global solar radiation from the three models with ground measured data for Sokoto, Kaduna, and Kano is shown in Figures 1–3. represents ground measured data, while , , and represent estimated global solar radiation from Angstrom-Prescott model, Garcia model, and the proposed model, respectively.
From Figures 1–3, it can be observed that shows better agreement with the measured data than and in the three locations. Angstrom-Prescott model () shows the highest level of overestimation and underestimation among the three models used. This indicates that the proposed model is better than Angstrom-Prescott model and Garcia model in predicting the monthly mean global solar radiation in the selected locations.
3.1. Statistical Error Indicators of the Studied Models
The calculated values of the error indices of the studied models for the three locations are summarised in Table 3. As stated earlier, a low RMSE is desirable while and should approach unity as closely as possible. A positive value or a negative value of MBE and MPE indicate overestimation or underestimation in the calculated values, respectively.
It is observed from Table 3 that the error indices of the studied models vary from one location to another. This could be due to the variability in the atmospheric parameters which influences the solar radiation in those locations. On the long-term performance of the models, the MBE and MPE values for the three models show slight overestimation and slight underestimation of the estimated global solar radiation, respectively. However, it can be noted that Angstrom-Prescott model gave the highest values of MBE and MPE in all the locations. This indicates a poor performance of the model on long-term basis.
The proposed model (model 3) gave the lowest RMSE values in the three locations (0.376 for Sokoto, 0.463 for Kaduna, and 0.449 for Kano) while model 1 (Angstrom-Prescott model) produced the highest RMSE values in the range 0.654–1.337. The statistical error indices presented in Table 3 also shows that model 3 and model 2 have higher values of coefficient of correlation () and coefficient determination () when compared to model 1. However, model 3 produced the highest values in all the locations (0.922 for Sokoto, 0.938 for Kano, and 0.961 for Kaduna). The value of 0.961 shows that 96.1% of the clearness index for Kaduna is accounted for by model 3. The coefficient of determination, with values of 0.922–0.961 obtained for model 3, indicates that this model fits the data very well. This implies that the monthly mean global solar radiation estimated from Model 3 are more accurate than the monthly mean global solar radiation estimated from models 1 and 2. Also, it can be concluded base on the result of the statistical error indicators that Model 2 (Garcia model) performed better than model 1 (Angstrom-Prescott model) in the study locations. However, it is noteworthy that the inclusion of air temperature to Angstrom-Prescott model (1) in the proposed model (equation (7)) has a significant impact on the accuracy of the sunshine-based model. This is in agreement with  where the inclusion of air temperature as input parameter improves the performance of global solar radiation in areas with high-temperature difference.
On the whole, it could be observed that model 3 gives the best error estimates in terms of RMSE and the highest values too. This implies that model 3 has the overall best performance among the three studied models. Hence, the proposed model (model 3) is recommended for estimating the monthly mean daily global solar radiation on the horizontal surface in Sokoto, Kano, Kaduna, and other locations with similar meteorological parameters.
3.2. Comparison of Ground Measurements with Estimated Values and Satellite Data
The monthly mean values of the twenty-two years (1984–2005) global solar radiation data obtained from NIMET were compared with estimated values from the proposed model and satellite-derived data spanning the same years. Tables 4 and 5 present the summary of the comparison in terms of their percentage error. From Table 4, it can be observed that the satellite-derived global solar radiation data exceeds the ground measurements in all the months of the year with percentage error ranging from 2.2% to 25.0% in Sokoto. Kano has an average percentage error margin of −0.2% which indicates that the variation between the ground measurements and satellite data is minimal. For Kaduna, the percentage error is maximum in December being 2.9% and minimum in September being −7.1%.