Abstract

Saturated hydraulic conductivity (), among other soil hydraulic properties, is important and necessary in water and mass transport models and irrigation and drainage studies. Although this property can be measured directly, its measurement is difficult and very variable in space and time. Thus pedotransfer functions (PTFs) provide an alternative way to predict the from easily available soil data. This study was done to predict the in Khuzestan province, southwest Iran. Three Intelligence models including (radial basis function neural networks (RBFNN), multi layer perceptron neural networks (MLPNN)), adaptive neuro-fuzzy inference system (ANFIS) and multiple-linear regression (MLR) to predict the were used. Input variable included sand, silt, and clay percents and bulk density. The total of 175 soil samples was divided into two groups as 130 for the training and 45 for the testing of PTFs. The results indicated that ANFIS and RBFNN are effective methods for prediction and have better accuracy compared with the MLPNN and MLR models. The correlation between predicted and measured values using ANFIS was better than artificial neural network (ANN). Mean square error values for ANFIS, ANN, and MLR were 0.005, 0.02, and 0.17, respectively, which shows that ANFIS model is a powerful tool and has better performance than ANN and MLR in prediction of .

1. Introduction

Soil hydraulic properties such as saturated hydraulic conductivity govern many soil hydrological processes; therefore, they are very important and even necessary in water and mass transport models and irrigation and drainage studies [1]. Direct measurement of soil hydraulic properties including is costly and time-consuming and becomes impractical due to spatial and temporal variabilities when hydrologic predictions are needed for large areas. Also it requires sophisticated measurement devices and skilled operators [2]. In the past few decades, as an alternative, indirect approximation of hydraulic properties from some basic and easily measured soil properties (such as clay, sand, and silt contents, and bulk density) using pedotransfer functions (PTFs) has received considerable acceptance [37]. “Pedotransfer function” was first introduced for empirical regression equations relating water and solute transport parameters to the basic soil properties that are available in soil survey [8].

The is an important soil hydraulic property often estimated using PTFs. Different methods such as regression models [3, 911] and artificial neural networks (ANN) are available for derivation of PTFs. In recent years, PTFs constructed by using artificial neural networks (ANN) (especially feed forward ANN) have proven popular with many researchers. ANN-PTFs have been developed by researchers such as Minasny et al., Minasny and McBratney, and Pachepsky et al. [5, 6, 11]. The overall conclusion made by these (and other) investigators was that when the number of input parameters is greater than three, ANN usually performs better than regression techniques, particularly when uncertainties in the quality of the data were small [12]. Multilayer perceptron (MLP) and radial basis function (RBF) are two of the most widely used neural network architecture. General difference between MLP and RBF is that RBF is a localist type of learning which is responsive only to a limited section of input space. RBF utilizes a local learning strategy versus MLP global learning and this leads to a higher rate of accuracy and faster training of RBF [13].

Adaptive neuro-fuzzy inference system (ANFIS) can be applied as a further alternative technique [14]. Similar to ANN, the neuro-fuzzy system constructs input-output membership function relationships. The fuzzy system is set to learn from the training data by adjusting the parameters of the membership functions using the least-squares method. The trained system can be used as a PTF to predict properties of an unknown soil sample [11]. A close spatial relationship between soil hydraulic properties and other easily available soil and terrain attributes, as auxiliary variables, can be exploited to predict these hydraulic characteristics, as target variables, with a reasonable accuracy at unobserved locations [15].

State of the art shows that, in most previous studies, was predicted using regression and ANN models [3, 16, 17]. Many comparisons of PTFs have been made with respect to different data sets, different mathematical procedures (regression versus ANN models), and different input parameters. However, there are few studies which have used ANFIS model for developing PTFs. Wieland and Mirschel compared a feed-forward neural network (NN), a radial basis function network (RBF), and a trained fuzzy algorithm for regional yield estimation of agricultural crops (winter rye, winter barley) [18]. As mentioned before, ANFIS model has more ability to develop PTFs than ANN and regression models. Moreover unlike the tradition, radial basis function (RBF) technique, as far as our knowledge goes, has not been used for this comparison purpose. Therefore the objectives of this study were (a) to employ the techniques of ANN (MLR and RBF methods), ANFIS, and multiple linear regression (MLR) to predict saturated hydraulic conductivity using easily available soil properties, (b) to evaluate the performance of these techniques and determine the best methodology for prediction of saturated hydraulic conductivity.

An ANN is a highly interconnected network of many simple processing units called neurons, which are analogous to the biological neurons in the human brain. Neurons having similar characteristics in an ANN are arranged in groups called layers. The neurons in one layer are connected to those in the adjacent layers but not to those in the same layer. The strength of connection between the two neurons in adjacent layers is represented by what is known as connection strength or weight. An ANN normally consists of three layers, an input layer, a hidden layer, and an output layer. In a feed-forward network, the weighted connections feed activations only in the forward direction from an input layer to the output layer. On the other hand, in a recurrent network additional weighted connections are used to feed previous activations back into the network. The structure of a feed-forward ANN is shown in Figure 1 [19].

In Figure 1, the circles represent neurons; the lines joining the neurons represent weights; sand, silt, clay, and are input variables; represents the output variables; and represent the weights between input and hidden and hidden and output layers, respectively. Parameters and represent the bias of the corresponding hidden and output layer neurons. The role of bias in a neuron is to displace the original functional domain by a magnitude equal to that of the bias and thereby translate the area of influence to its activation state [19].

An important step in developing an ANN model is training of its weight matrix. The weights are initialized randomly in a suitable range and then updated using certain training mechanism. There are primarily two types of training mechanisms: supervised and unsupervised. A supervised training algorithm requires an external teacher to guide the training process. This typically involves a large number of examples (or patterns) of inputs and outputs for training. The inputs in an ANN are the cause variables and outputs are the effect variables of the physical system being modeled. The primary goal of training is to minimize the objective (error) function by searching for a set of connection strengths that cause the ANN to produce outputs that are equal to or closer to target data [19].

Multilayer perceptron (MLP) and radial basis function (RBF) are two of the most widely used neural network architecture in literature for classification or regression problems. They are robust classifiers with the ability to generalize for imprecise input data. Both artificial neural network and fuzzy logic are used in ANFIS's architecture. ANFIS is consisted of if-then rules and couples of input-output. ANFIS training uses learning algorithms of neural network [14, 20, 21]. An adaptive network, as its name implies, is a network structure consisting of nodes and directional links through which the nodes are connected. Moreover, parts or all of the nodes are adaptive, which means each output of these nodes depends on the parameters pertaining to this node and the learning rule specifies how these parameters should be changed to minimize a prescribed error measure [14]. ANFIS is a multilayer feed-forward network where each node performs a particular function on incoming signals. Both square and circle node symbols are used to represent different properties of adaptive learning (Figure 2). To perform desired input-output characteristics, adaptive learning parameters are updated based on gradient learning rules [14, 22, 23]. For simplicity, we assume that the fuzzy inference system under consideration has two inputs, and , and one output . suppose that the rule base contains two fuzzy if-then rules of Takagi and Sugeno’s type.

Rule 1. If is and is then .

Rule 2. If is and is then ,

where and are the fuzzy sets, are the outputs within the fuzzy region specified by the fuzzy rule, and , , and are the design parameters that are determined during the training process. The ANFIS architecture to implement these two rules is shown in Figure 2, in which a circle indicates a fixed node, whereas a square indicates an adaptive node.

Layer 1. Every node in this layer is a square node with a node function: where is the input to node , and is the linguistic label (small, large, etc.) associated with this node function, and where , can adopt any fuzzy membership function. Usually, we choose and to be bell-shaped with a maximum equal to 1 and a minimum equal to 0, such as where (, , and ) is the parameter set. Parameters in this layer are referred to as premise parameters.

Layer 2. The nodes in this layer are fixed. These are labeled as M to indicate that they play the role of a simple multiplier. The outputs of these nodes are given by which are the so-called firing strengths of the rules.

Layer 3. Every node in this layer is a circle node labeled as N. The th node calculates the ratio of the th rule’s firing strength to the sum of all rules’ firing strengths: For convenience, outputs of this layer will be called normalized firing strengths.

Layer 4. In this layer, the nodes are adaptive nodes. The output of each node in this layer is simply the product of the normalized firing strength and a first-order polynomial (for a first-order Sugeno model). Thus, the outputs of this layer are given by Parameters in this layer will be referred to as consequent parameters.

Layer 5. The single node in this layer is circle node labeled as S that computes the overall output as the summation of all incoming signals; that is, It is seen that there are two adaptive layers in this ANFIS architecture, namely, the first layer and the fourth layer. In the first layer, there are three modifiable parameters , which are related to the input membership functions. These parameters are the so-called premise parameters. In the fourth layer, there are also three modifiable parameters , pertaining to the first-order polynomial. These parameters are the so-called consequent parameters [14].

2. Materials and Methods

2.1. Data Collection

The data used in this study were obtained from the reports of basic soil science and land reclamation studies conducted by Khuzestan Water and Power Organization. Khuzestan province is located in the southwest of Iran from east 47°40′ to 50°33′ longitude and north of latitude. Khuzestan province has mild winters and very hot summers [24].

The data set which was used to develop the PTFs included the data from 175 soil profiles from different parts of Khuzestan province. Soil samples were taken from the 0–30 cm layer. The data consisted of five soil properties (four as independent variables and one as dependent variable). Independent variables were the percentages of clay (C), silt (Si), sand (S), and bulk density . Saturated hydraulic conductivity was considered as dependent (output) variable. Soil samples were air-dried and sieved by a 2 mm sieve for physical analysis. Particle size distribution was determined using pipet method. was determined on undisturbed samples using cylinder method, being made of 10 cm and 100 cm3 cylinders, after drying 24 h in 105°C ovens. Saturated hydraulic conductivity was measured using inversed auger hole method. The principle of the auger hole test above the water table consists of boring a hole to given depth, filling it with water and measuring the rate of fall of the water level [25].

MLP, RBF, ANFIS and MLR of pedotransfer functions were used to predict the . The data set was divided into two separate data sets: the training one (80%) and the testing one (20%). The training data set was used to train the ANN, ANFIS, and MLR models, whereas the testing data set was used to verify the accuracy and the effectiveness of the trained models. A computer program was run in MATLAB (ver.7.6.0.323) to train and to test the data set using ANFIS and ANN structures for prediction. The performances of new techniques (ANN and ANFIS) were compared with MLR method.

2.2. Regression Analysis

In the multiple-linear regression (MLR) analysis, first, the most essential input variables were selected using backwards stepwise method, and then linear, quadratic, and possible interaction terms of these basic soil properties were investigated using the SPSS software. The general form of the regression equations was: where is the dependent variable representing , is the intercept, are regression coefficients, and are independent variables referring to basic soil properties.

The same data sets were used in the derivation and testing of PTFs developed using ANN, ANFIS, and regression methods for reliable comparison. The resulting functions were tested using data sets not included in the derivation procedure.

2.3. Developing PTFs Using Artificial Neural Network Model (ANN)

First step in developing an ANN model involves identifying input and output variables and normalizing the data between 0 and 1. The method of channelized normalization was used to normalize the input and output data, wherein the data representing separate physical variables/parameters are normalized separately. Then the best ANN architecture is determined by finding the optimal number of hidden neurons through training of the various architectures using a trial and error method. Once the best ANN architecture is trained, it is validated using the testing data set.

In this study, two different types of ANN were developed. The first ANN model is multilayer perceptron (MLP) which is the most commonly used neural network structure in ecological modeling and soil science [26], whereas the second ANN model is radial basis function (RBF).

2.4. Neural-Fuzzy Model Design

For designing the neural-fuzzy system, the feed-forward multilayer neural networks with hybrid training algorithm and Takagi-Sugeno fuzzy inference system were used. In this model, the “gaussmf” membership function was used for input function, the “linear” membership function was used for output function, and defuzzification operation was performed by weight average function.

2.5. Performance Evaluation Criteria

Two standard statistics were used as evaluation criteria to evaluate the performance of ANN models (MLP and RBF) and ANFIS methods. These were mean square errors (MSE) and coefficient of determination as calculated using the following equations: where is the number of data set, is the measured value of output variable, is the predicted value of output variable, and is the average of predicted value of output variable.

3. Results and Discussion

Before developing PTFs using ANN, ANFIS, and MLR models, descriptive statistics of data were derived using SPSS (version 16). Descriptive statistics for the soil physical and hydraulic parameters which were used in the development (train) and validation (test) of PTFs using models are summarized in Table 1.

The studied soils have wide ranges of physical properties; for instance, the ranges of sand, silt, and clay contents, and are 4–70, 12–66, 9.4–62%, and 0.96–1.77 Mg m−3 for training or derivation data set, respectively. Testing or validation data set had similar ranges. The studied soils are mostly originated from alluvial processes which probably cause such wide range in soil physical properties. The soils are on average with medium textures, where the mean silt percent and are 44.6 and 1.45 Mg m−3, respectively.

The values of the studied soils are mostly very high, ranging from 6.4–207 (cm day−1), with standard deviation of 29.4 and 10.7 cm day−1 for training and testing data sets, respectively. A reason for high values of might be the large area of the studied region. Good structure and existence of macropores due to wide ranges of particle size distribution might be also responsible for high values of .

3.1. PTFs Development Using MLR Model

In the regression analysis, normalizing the data distribution is one of the primary assumptions that have to be carried out. Therefore, the normality of the data was evaluated using the Kolmogrov-Smirnov method. ,  , sand, silt and clay data did not conform to normal distribution and were normalized using the natural-based logarithmic transformation. After normalizing data, in order to develop PTFs for predicting through MLR model, first the most essential input variables were selected using stepwise method, and then linear interaction terms of these basic soil properties were investigated by means of SPSS 16 software.

After training the regression model with training data set, the derived regression equation was as follows:

In derived equation clay, sand, silt, and bulk density were chosen as the independent variables. After determining regression equation, the accuracy of MLR model was evaluated through comparing its predicted with experimental data. The obtained values of and MSE using MLR model for predicting are tabulated in Table 2. As observed from this table for test data set, the and MSE values that have been obtained are 0.5 and 0.17, respectively. Merdun et al. obtained higher and RMSE values varied from 0.637 to 0.979 and from 0.013 to 0.938 for regression method, respectively [3]. The scatter plot of the measured against predicted values obtained from the MLR model for the test data set with a poor correlation coefficient is illustrated in Figure 3.

3.2. PTFs Development Using ANN Models

In current study two different algorithms of ANNs including radial basis function (RBF) and multilayer perceptron (MLP) models were investigated to predict through employing the same data set which was used by MLR. In order to reach this end, all data sets were first normalized between 0 and 1 to achieve effective network training. Luk et al. stated that neural networks trained on normalized data achieve better performance and faster convergence in general, although the advantages diminish as network and sample size become large [27]. Normalizing the data set was done through where is the normalized value, is the actual value, is average value, for each parameter, is the maximum value and is the minimum value. A three-layered feed-forward ANN architecture with an input layer, one hidden layer, and an output layer was developed for predicting by means of both ANN models.

3.3. MLP Network

Table 3 shows the results of statistical analyses between the observed and MLP ANN-simulated values of for training and testing stages. For MLP network, the best architecture consists of four neurons in the input layer, seven neurons in the hidden layer, and one neuron in the output layer with Tansig and Purelin threshold functions for hidden and output layers, respectively, gave the best results. The and MSE values among the observed and predicted are 0.66 and 0.02, respectively. The values are in accordance with the previous studies using ANN methods. Agyare et al. while estimating obtained and NMSE about 0.6 and 0.42, respectively [26]. Merdun et al. obtained ranges and RMSE varied from 0.444 to 0.952 and from 0.020 to 3.511, respectively [3]. Figure 4 shows the relationship between the measured and predicted values for testing stage indicating that MLP network can predict with acceptable accuracy. In order to employ RBF, Gaussian function that is the most widely used in applications was chosen as a threshold function for hidden layer. In the next step a regression analysis of the network response between ANN outputs and the corresponding targets was performed. Table 3 tabulated that the provided predictions between ANNs outputs and the corresponding targets using RBF model (, ) and MLP were approximately similar and differences were not significant.

The levels of and MSE derived by both ANN models had higher accuracy than those derived by multiple linear regressions for predicting which was a support for those previous studies conducted by Merdun et al., Tamari et al., Yilmaz et al., and other researchers [3, 28, 29]. This is due to that, unlike the traditional regression PTFs ANNs do not require a priori regression model which relates input and output data that in general is difficult because these models are not known [30]. Also Minasny and McBratney, Pachepsky et al., and Tamari et al. stated that when the number of input parameters is greater than three, ANNs usually perform better than regression techniques, particularly when uncertainties in the quality of the data were small [11, 28, 31]. Additionally many investigations have indicated that a neural network with one hidden layer is capable of approximating any finite nonlinear function with very high accuracy [32, 33]. The scatter plots between measured and predicted using MLP and RBF models for testing stage with acceptable accuracy are indicated in Figures 4 and 5, respectively.

3.4. PTFs Development Using ANFIS Model

In order to develop PTFs for prediction through ANFIS model the same normalized data which employed by both ANN models were used as the input variables. In the ANFIS system, each input parameter might be clustered into several class values in layer 1 to build up fuzzy rules, and each fuzzy rule would be constructed using two or more membership functions in layer 2. Several methods have been proposed to classify the input data and to make the rules, among which the most widespread of them are grid partition and subtractive fuzzy clustering [3436]. In this study subtractive fuzzy clustering was taken into consideration to develop the rule base relationship between the input and output variables. Sugeno fuzzy model, also known as Takagi-Sugeno-Kang (TSK) model [37], that is, one of the most widespread methodologies for developing fuzzy rules was used in the current study to predict soil .

The performance of the ANFIS model for the test data set and the related statistical evolutionary results are given in Figure 6 and Table 4, respectively. The values of 0.71 and 0.005 for and MSE parameters, respectively, for ANFIS testing stage, show higher prediction accuracy of ANFIS model than the ANN models.

4. Conclusions

Predicting the saturated hydraulic conductivity of soil is one of the important subjects in modeling water flow and solute transport processes in the vadose zone. This paper presents the development and validation of PTFs for prediction of from basic soil properties by using ANN and ANFIS models in Khuzestan province, southwest Iran. The predictive capabilities of these methods were also compared using some evaluation criteria of testing data. The results revealed that the ANFIS model had superiority to the ANN models for prediction. Both applied ANNs algorithms exhibited acceptable accuracy. The comparison of the RBF and MLP networks indicates that they had approximately similar performance for prediction as the obtained results using RBF were slightly better than MLP ones, but their differences were not found significant. Among the employed ANN models, RBF prediction was more accurate than MLP network prediction.