Abstract

Evaporation estimation is very essential for planning and development of water resources. The study investigates the ability of new method, dynamic evolving neural-fuzzy inference system (DENFIS), in modeling monthly pan evaporation. Monthly maximum and minimum temperatures, solar radiation, wind speed, and relative humidity data obtained from two stations located in Turkey are used as inputs to the models. The results of DENFIS method were compared with the classical adaptive neural-fuzzy inference system (ANFIS) by using root mean square error (RMSE), mean absolute relative error (MARE), and Nash-Sutcliffe Coefficient (NS) statistics. Cross validation was applied for better comparison of the models. The results indicated that DENFIS models increased the accuracy of ANFIS models to some extent. RMSE, MARE, and NS of the ANFIS model were increased by 11.13, 11.45, and 6.83% for the Antalya station and 20.11, 12.94%, and 8.29% for the Antakya station using DENFIS.

1. Introduction

One of the major issues that may negatively affect the agriculture of dry regions such as Iran is the evaporation of water resources. It has been estimated that evaporation is responsible for annual loss of more than 40% of water resources. Such loss of water may severely undermine not only the productivity of agricultural sector but environmental projects as well. The pan evaporation tests are prone to multiple errors originating from pan size and material, water depth in the pan, sun and wind exposure, and animal activity in the vicinity. The factors effectively deciding the degree of evaporation in given area are air and soil temperature, relative humidity, sunshine and wind speed, vapor pressure deficit, and atmospheric pressure. Thus, empirical evaporation prediction formulas [1, 2] need to account for several climatic factors associated with this phenomenon. However, the large amount of data required to develop such complex empirical relationships restricts their actual use in the field [3].

Evaporation pans are valuable in hydrology since they are simple strong instruments that incorporate the important physical factors, in particular radiation, temperature, humidity, and wind speed, into a single measure of evaporative demand [4, 5]. However, it is unfeasible to place evaporation pans at each point where there is an arranged or existing reservoir and irrigation. It is additionally exceedingly improbable to have in distant areas where exact instruments can not be set up. A practical means of predicting the pan evaporation where no pans are accessible is of impressive noteworthiness to the hydrologists, meteorologists, and agriculturists [6].

Research has shown the artificial intelligence methods such as ANFIS and ANN to be very successful in civil engineering and especially in water resources analysis applications [716]. ANFIS, for example, was successfully used by Terzi et al. [13] to analyze the daily meteorology data of the Lake Egirdir in Turkey. In a study by Moghaddamnia et al. [11], pan evaporation was modeled by both ANFIS and artificial neural network (ANN) and their results showed only a slight difference between these methods. In a study by Shiri et al. [12], ANFIS was used to estimate the daily pan evaporation in Illinois (USA) and reported accurate estimations especially when using only a limited number of climatic parameters. In a study by Sanikhani et al. [3], ANFIS with grid partitioning (GP) and ANFIS with subtractive clustering (SC) were used to model the daily pan evaporation based on air temperature, wind speed, solar radiation, and relative humidity. Ultimately, this study reported that both ANFIS methods have the same degree of accuracy, which is better than the accuracy of multivariate nonlinear regression (MNLR), ANN, Stephens-Stewart (SS), and Penman [2] models. In a study by Goyal et al. [8], ANFIS was used to estimate the pan evaporation in subtropical climates, and more specifically in Karso watershed in India. Given the good accuracy of soft computing models, the majority of studies in this field have used these models for predicting the daily pan evaporation. In a study by Shirsath and Singh [17], the ANN and regression based models were used to estimate daily pan evaporation and compared with the multiple linear regression. In a study by Chang et al. [18], the self-organizing map neural network and back propagation neural network were compared in estimating daily pan evaporation. In a study by Furuhashi et al. [19], the suitability of hybridizing the Cuckoo optimization algorithm with ANN and ANFIS methods was tested for estimating daily evaporation. In a study by Lin and Lee [20], two data-driven methods, ANN, co-active neurofuzzy inference system, and multiple linear regression were applied to simulate daily evaporation at Pantnagar, India. In a study by Lughofer [21], radial basis neural networks and ANFIS approaches were utilized for multi-lead ahead prediction of evaporation from Layang reservoir, located in the southeast part of Malaysia. According to the authors’ knowledge, the accuracy of dynamic evolving neural-fuzzy inference system has not been previously investigated in modeling pan evaporation.

In the present study, a new method, dynamic evolving neural-fuzzy inference system (DENFIS), was applied for evaporation modeling. This is the first study that uses DENFIS for solving this problem.

2. Materials and Methods

2.1. Case Study

In the current study, monthly maximum and minimum temperatures, solar radiation, wind speed, relative humidity, and pan evaporation data measured in Antalya (latitude 36.42°N, longitude 30.44°E, and altitude 64 m) and Antakya (latitude 36.33°N, longitude 36.30°E, and altitude 100 m) stations located in Mediterranean Region of Turkey were used (Figure 1). According to the Köppen criteria, the Mediterranean climate is identified by hot, dry, sunny summers and rainy winter season (Csa, Csb) and this is actually very different from the monsoon climate [22]. The climate is defined by mild and rainy winters with warm and dry summers. This condition is because of the air mass occupying the region. Actually, most of the heavy rainfall during the summer periods is related to the maritime tropical air mass. This air mass originates from the Atlantic Ocean. Also, the maritime polar air mass that comes from Western Europe is the other reason of the heavy rainfalls which cause the storms and severe floods. This phenomenon leads to a severe damage to the agricultural land and green houses. In the Mediterranean zone, the summer season is subjected to the only maritime tropical and continental tropical air masses which causes a rainless summer. The average annual precipitation changes between 800 and 2000 millimeters. The frontal activities, exposure, the direction of mountain ranges, and altitude are some causes of the distribution of the precipitation. The Taurus Mountains prevent the fronts coming from the Mediterranean Sea. Therefore, the southwest slopes of the mountain are the rainy area of the region. Also, the annual precipitation in the southern high slopes of the Taurus Mountains increases up to 1000 millimeters because of orographic rainfall and decrease of yearly precipitation in shadow areas less than 500 millimeters in the poljes occurring in the vicinity of Antalya, for instance. At this area, rainfall begins at the beginning of autumn to the early month of the spring season. It means half of the total annual precipitation is related to the winter season. The average relative humidity is over 60%. Also, because of the decrease in temperature in the late night and early morning particularly in hot summer days, the relative humidity exceeds 90%. Today’s dew and fog happen along the coastal belt. A reduction in evapotranspiration is caused due to high relative moisture causes. That is why the influence of the drought is lower than other components of the Mediterranean coastal areas, because of high moisture content [22].

Data used in the present study include 203 monthly values from 1983 to 2010 for the Antakya and 362 values from 1967 to 2006 for the Antalya station. Whole data were divided into three equal subsets and each subset was used for testing the applied models. Basic statistical properties of the used data are provided for each data set in Table 1. For Antalya station, the least evaporation value belongs to 1st data set while the 3rd data set includes the highest evaporation. In Antakya station, the lowest and highest values correspond to 1st and 2nd data sets, respectively; Antalya data show more scattered distribution compared to Antakya.

2.2. Dynamic Evolving Neural-Fuzzy Inference System (DENFIS)

The principles of evolving neural networks and in particular evolving neurofuzzy method were first introduced by Kasabov and Song [23]. An important outcome of efforts of Kasabov and others in this field is the dynamic evolving neural-fuzzy inference system (DENFIS) [23], which is an extension of Evolving Fuzzy Neural Network (EFuNN), itself based on the idea of developing a connectionist model in Evolving Connectionist System (ECOS) architecture [23]. Even before its development into DENFIS, EFuNN was known to have extensive ability to refine the model structure via an incremental sample-wise lifelong learning process by creating new structural components in a dynamic manner by using its five different input layers.

Input and output neurons can be fuzzified by a fuzzy quantization approach. Thus, fuzzy neural networks can be seen as connectionist structures with rules expressed with fuzzy logic [19, 20]. These networks also possess all attributes of traditional neural networks such as recall, reinforcement, and hidden layers. Neurofuzzy inference systems are defined as the systems that employ fuzzy rules and associated fuzzy inference mechanisms for learning and rule optimization purposes. ANFIS is an example of this data-centered approach to learning. In DENFIS, the described model architecture is implemented in an evolving approach based on cross-linking to Takagi-Sugeno fuzzy systems.

The notable difference of this approach from other evolving fuzzy systems is the method of making a prediction for a new sample. In this respect, DENFIS follows a model-based lazy learning approach, where network assesses the position of the input vector in the feature space, and forms, accordingly, a fuzzy inference system for predicting the output through a dynamic process based on the nearest fuzzy rules created during the incremental learning. In other words, the classical lazy learning of this network employs a sample-based approach, where local samples taken from the area closest to the query point are used to construct a small local model on demand [21]. This learning process is involved with the following synergies:(1)Use of a fuzzy model with Takagi-Sugeno architecture: this means converting the model to a neurofuzzy format in a way that produces the same learning problem, which means linear consequent, nonlinear antecedent parameters, rules, or neurons need to be evolved on demand.(2)Use of a clustering-based process for evolution of rules or neurons (evolving a new cluster means evolving a new rule).(3)Local learning of consequent parameters.The notable differences of this method from others include the use of ECM as Evolving Clustering Method, use of triangular membership functions (in place of Gaussian functions) as fuzzy sets, and the use of an alternative weighting scheme for local learning of consequent parameters.

DENFIS uses Takagi-Sugeno type fuzzy inference engine [24]. Such inference engine consists of fuzzy rules inferred in in which “ is ”, , and are fuzzy arguments as prior defined form fuzzy rules. are prior parameters defined over universes of discussion. ; and ; and are fuzzy arrangements determined by fuzzy membership functions ; ; , and . The parameter is a resultant variable, and polynomial functions ; are used. All fuzzy membership functions are triangular functions in DENFIS models. The membership functions are given as in which denotes the cluster center on the dimension, , and , to 2; here Dthr is a clustering variable.

A zero-order Takagi-Sugeno type fuzzy inference system is defined as a system where resultant functions are crisp fixed parameters; that is, . If , are linear functions, the system is named a first-order Takagi-Sugeno type fuzzy inference system. Similarly, it is named high-order Takagi-Sugeno fuzzy inference system if these functions are nonlinear.

The conclusion of inference of the output, , for the input vector is the weighted mean of each rule’s output pointed out as follows [25]:

Here: .

2.3. Adaptive Neurofuzzy Inference System

ANFIS is a multilayer feed-forward neurofuzzy network capable of combining the linguistic flexibility of fuzzy logic with the numeric capabilities of artificial neural networks (ANNs). Desirability of ANFIS lies in its ability to synergize the merits of ANN and fuzzy logic to map an input space to an output space more efficiently than either approach and achieve more effective forecasting models through its enhanced learning and data classification capabilities. Given the outstanding ability of ANFIS to infer fuzzy rules or expert knowledge from numerical data, this technique has found countless applications in classification, rule-based process control, and pattern recognition, for example, to analyze and predict the wind speed, dynamic load of power systems, and faults in engines. In a way similar to ANN’s mechanism of solving function estimation problems, before making any prediction, ANFIS model needs to undergo a training phase specific to the data at hand and the target application [26, 27]. Theoretically, option variables can serve as the sole basis of development of nonparametric option pricing models with predictive and nonlinear modeling methods; but the use of human expert knowledge and decisions can significantly contribute to quality and speed of this process. Thus, an ideal prediction model is the one where option variables and human expert decisions and knowledge are all incorporated into the process. Such need makes ANFIS a perfectly suitable tool for merging the human expert knowledge and decisions and the option variables for better training and learning and thus better modeling of option prices.

From the functional perspective, ANFIS architecture is, on the one hand, an equivalent of fuzzy model as defined by Takagi-Sugeno-Kang (TSK model) and, on the other hand, a rough equivalent of Radial Basis Function Networks (RBFNs). But equivalence of RBFN and TSK fuzzy model is subject to the following requirements [28]:(a)For both models, the aggregation method used to extract the overall outputs must be the same (weighted average or weighted sum).(b)There must be an equal number of activation functions and fuzzy IF-THEN rules.(c)For the rule bases consisting of several inputs, each activation function must be equal to a composite input membership function. This can be achieved by several methods, the simplest of which is to incorporate Gaussian membership functions with the same variance into the rule base, while using the algebraic product for the “AND” operation. The product of the Gaussian membership functions will yield a multidimensional Gaussian RBFN.(d)The activation functions on the output of neurons should have the same functions as their corresponding fuzzy rules.

3. Application and Results

The DENFIS method was employed for estimating pan evaporation based on the climatic data of maximum and minimum temperatures, solar radiation, wind speed, and relative humidity. The results of the proposed model were compared with those of the classical ANFIS model. The employed models were evaluated by using three commonly applied comparison criteria, namely, root mean square error (RMSE), mean absolute relative error (MARE), and Nash-Sutcliffe Coefficient (NS). The expressions of the applied statistics arewhere and , respectively, are the observed and modeled pan evaporation at the th time step, is number of time steps, and is mean of the observed pan evaporation.

First, whole data set was divided into three equal parts. Two parts were used in training of the applied models while the remaining one part was used for testing. Thus, three applications were obtained for each method. Then, the models were compared with each other according to the mean of the used statistics. Table 2 reports the periods of training and test data sets used for each DENFIS and ANFIS model. The accuracy of DENFIS models is mostly dependent on the choice of distance threshold value (Dthr) [29] and, therefore, different Dthr values were tried to find the optimal ones.

The optimal Dthr values obtained for the DENFIS models are 0.02, 0.01, and 0.013 for the M1, M2, and M3 data sets, respectively. For the ANFIS models, different number of membership functions (MFs) were tried and 2 Gaussian MFs for each input provided the best accuracy. 100 iterations were used for each model following the suggestions of [30]. Comparison of the DENFIS and ANFIS models is made in Table 3 for modeling pan evaporation of Antalya station. According to the mean of the comparison statistics, the DENFIS has a better accuracy than the ANFIS model. Both methods have the highest accuracy for M3 data set while the M2 provides the worst results. The worst accuracy of the M2 models may be due to the highest skewness of this data set (see Table 1). The other reason may be the different data ranges between training and test data sets in M2. The minimum of the test data set (1.3 mm) is lower than that of the training period (1.5 mm) and this may cause some extrapolation difficulties for the applied models in estimating low pan evaporation values in the test phase. The DENFIS increased the accuracy of ANFIS by 11.13, 11.45, and 6.83% with respect to RMSE, MARE, and NS, respectively. Figure 2 illustrates the observed and estimated pan evaporation of Antalya by DENFIS and ANFIS in the test period. The ANFIS has higher than the DENFIS for the M1 and M3 data sets. For the DENFIS, however, the slope of the fit lines is closer to 1 than those of the ANFIS in case of M1 and M3. For the M2 data set, the DENFIS seems to be more successful than the ANFIS from the fit line coefficients and viewpoints.

Comparison of the DENFIS and ANFIS in estimating pan evaporation of Antakya station is made in Table 4. The optimal Dthr values obtained for the DENFIS models are 0.017, 0.02, and 0.013 for the M1, M2, and M3 data sets, respectively. It is clear from the table that both applied models give the worst result for the M3 data set. Here, also the main reason might be the fact that the maximum pan evaporation value (9.8 mm) of the test period is higher than the corresponding value (8.1 mm) of the training period for the M3 case. This gives the applied models some extrapolation difficulties in catching high pan evaporation values in the test stage. The RMSE, MARE, and NS accuracies of the ANFIS model were increased by 20.11, 12.94, and 8.29% using DENFIS model, respectively. The scatterplot comparison of both the methods is made in Figure 3 for the test stage. It is apparent from the figure that the slope and bias coefficients of the fit line equations are closer to those of the exact line (, slope: 1 and bias: 0) with a higher for the DENFIS model compared to those of the ANFIS in all three cases.

The results of the DENFIS and ANFIS models are tested by using one-way analysis of variance (ANOVA) and reported in Table 5. It is obvious from the table that the DENFIS models generally yield small testing values with higher significance levels for the ANOVA. It can be said that the DENFIS is more robust in pan evaporation estimation than the ANFIS. Recently, Kisi [31] modeled pan evaporation of Mersin and Antalya stations which are located in Mediterranean Region of Turkey by using least square support vector regression (LSSVR), multivariate adaptive regression spline (MARS), and M5 model tree (M5Tree) methods and he found mean RMSE values as 0.904, 1.013, and 1.190 mm for the LSSVR, MARS, and M5Tree models in Mersin Station while corresponding values were 0.808, 0.996, and 0.859 mm in Antalya station, respectively. By comparing these results with those given in Tables 3 and 4, it can be said that the DENFIS model provided satisfactory estimates of the pan evaporation for the both stations.

4. Conclusions

This study investigated the accuracy of DENFIS as a new method in pan evaporation modeling. Monthly climatic data obtained from two stations from Turkey were used for testing the used models. Results were compared with those of the classic ANFIS models by using cross validation and RMSE, MAE, and NS statistics. According to the obtained results DENFIS model improved the ability of ANFIS by 11.13, 11.45, and 6.83% from the viewpoints of RMSE, MARE, and NS, respectively, for Antalya station, while, in Antakya station, the ANFIS accuracy was improved by 20.11% for RMSE, 12.94% for MARE, and 8.29% for NS. ANOVA test results indicated the superior accuracy of the DENFIS to the ANFIS models. Comparison with previous studies suggested the use of this method (DENFIS) in pan evaporation modeling.

The ability of DENFIS was compared with ANFIS method using two stations’ data. The proposed method can also be compared with other data-driven methods in modeling pan evaporation using more data having different climatic conditions.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (2017R1A2B2010120).