Discrete Dynamics in Nature and Society

Discrete Dynamics in Nature and Society / 2016 / Article

Research Article | Open Access

Volume 2016 |Article ID 8957530 | https://doi.org/10.1155/2016/8957530

Renfu Jia, Shibiao Fang, Wenrong Tu, Zhilin Sun, "Driven Factors Analysis of China’s Irrigation Water Use Efficiency by Stepwise Regression and Principal Component Analysis", Discrete Dynamics in Nature and Society, vol. 2016, Article ID 8957530, 12 pages, 2016. https://doi.org/10.1155/2016/8957530

Driven Factors Analysis of China’s Irrigation Water Use Efficiency by Stepwise Regression and Principal Component Analysis

Academic Editor: Juan Pavón
Received23 Jul 2015
Accepted09 Dec 2015
Published28 Mar 2016

Abstract

This paper introduces an integrated approach to find out the major factors influencing efficiency of irrigation water use in China. It combines multiple stepwise regression (MSR) and principal component analysis (PCA) to obtain more realistic results. In real world case studies, classical linear regression model often involves too many explanatory variables and the linear correlation issue among variables cannot be eliminated. Linearly correlated variables will cause the invalidity of the factor analysis results. To overcome this issue and reduce the number of the variables, PCA technique has been used combining with MSR. As such, the irrigation water use status in China was analyzed to find out the five major factors that have significant impacts on irrigation water use efficiency. To illustrate the performance of the proposed approach, the calculation based on real data was conducted and the results were shown in this paper.

1. Introduction

Agriculture is the basis for human survival, consuming up to 75% of current human water use to feed about 7.2 billion people in the world [1]. At present, intensifying water scarcity and water pollution cause the freshwater resources for irrigation to reduce, hindering agriculture’s development [2]. And then the demand of food and living space for the additional 1.8 billion people by 2050 according to population projections will put further enormous pressure on irrigation water use and limit the arable land [3, 4]. In other words, future increases in agricultural production will have to come mainly by growing more food on existing land and water [5]. And the decrease of water resources and severe environmental pollution are changing the way the agriculture is developing, with emphasis on sustainable and clean development [6]. As such, improving the efficiency of irrigation water use attracts more and more attention currently and this is consistent with the sustainable development.

In China, about 80% of production of food comes from irrigated farmland. The irrigated area reaches to 5 × 107 hectares, accounting for about 95% of arable land [7]. And the irrigation water use is always the biggest consumer of water. However, the proportion of water use for irrigation has been narrowing from 80% in 1970s to less than 60% at present. In the past three years, total amount of water use in China is about 614 billion m3/year on average, among which the irrigation water use is about 346.9 billion m3/year accounting for about 56.5% [810]. And the proportion for irrigation continues to decline in its trend, which will widen the gap between supply and demand of irrigation water use.

In addition, the water-wasting phenomena always exist in the agriculture. Compared with the developed countries’ irrigation water use efficiency of about 70%–80%, China’s water utilization efficiency is very low. China’s irrigation water use efficiency is about 45% at present, wasting more than 50% of water in the process of water delivery and irrigation. Therefore, the grain production is also very low in China; only 1 kg grain can be produced by 1 m3 of water [11]. Given all of the above, the government of China put forward the most stringent water resource management to enhance the water use efficiency. Under this situation, firstly the government needs to find out the major factors which influence the water use efficiency significantly, and these major factors can be called driven factors.

In real world, the correlations among the factors influencing the irrigation water use efficiency (IWUE), namely, multicollinearity, will increase the complexity of factor analysis. As such, PCA is introduced to the driven factor analysis of IWUE [12]. PCA can compress data by reducing the dimensionality of a data set which includes a large number of interrelated variables without losing information. After compression, the original data is transformed into a new set of variables, namely, new principal components, which are uncorrelated to each other [13].

In this paper, to overcome the mentioned weakness of the stepwise regression, PCA approach is introduced and combined with the stepwise regression for driven factors analysis of irrigation water use efficiency in China. First, stepwise regression is used to find out the explanatory variables which have no multicollinearity with other variables. Second, the PCA approach is used to reduce the number of the remaining variables transforming them into a new set of variables, called principal components, which are uncorrelated with each other. At last, a linear regression is established for modeling the relationship between a scalar dependent variable (IWUE) and one or more explanatory variables (driven factors).

The rest of the paper is organized as follows: “Review of the Research on Factor Analysis’’ provides a detailed review of the state of the art on the methods that can be used to solve similar problems. Then this paper justifies the interest of the proposed approach with respect to others. “Methodology” introduces the methodology of the proposed model. Then in the “Driven Factors Screening,” the process for selecting driven factors has been explained. In “Calculation,” the case study is presented to show the computation process. Finally, the results and conclusions are summarized in “Results” and “Conclusions,” respectively.

2. Review of the Research on Factor Analysis

2.1. Subjective Weighting Methods

Subjective weighting methods evaluate the factors using qualitative measures according to empirical expertise, such as AHP (Analytic Hierarchy Process), Delphi method, fuzzy comprehensive method, DARE method, and order relation analysis method. Among the subjective weighting methods, AHP method is a common mean for deciding the indicators’ weight which is frequently used in various fields. For example, Wang et al. [14] used the AHP-GRA analytic hierarchy process-gray relational analysis method to make the decision for ergonomic evaluation.

The biggest advantage of subjective weighting method is that it could reasonably determine the order of all the indicators. That is to say, though the subjective weighting method cannot well know the weighting coefficient of each indicator, it can effectively ensure the sequence of given indicators according to their influencing degrees. Then the biggest disadvantage of this method is its subjectiveness and randomness, because the different the experts, the different the weighting coefficients. As such, in some cases, applying subjective weighting methods may cause the significant difference compared with actual situation.

2.2. Objective Weighting Methods

Determinations or evaluation results of objective weighting methods are based on the strong mathematical theory reducing the burden of decision-makers. Except the popular objective methods like least squares and eigenvector method [15, 16], there are other advanced methods such as mean-squared deviation weight method, simple correlation function method, principal component analysis, and factor analysis [17, 18]. To date, the research history of objective weighting methods is still short and immature. And this weighting method is confined to the realistic problem domain, which fails to own the generality. In addition, the computational methods are relatively complex without the consideration of subjective intent of the policy makers, causing poor participation.

Among the mentioned objective weighting methods above, principal component analysis (PCA) is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. The number of principal components is less than or equal to the number of original variables. This transformation is defined in such a way that the first principal component has the largest possible variance (i.e., accounts for as much of the variability in the data as possible), and each succeeding component in turn has the highest variance possible under the constraint that it is orthogonal to the preceding components. The resulting vectors are an uncorrelated orthogonal basis set. The principal components are orthogonal because they are the eigenvectors of the covariance matrix, which is symmetric. PCA is sensitive to the relative scaling of the original variables.

The limitations of PCA areas follows: (1) getting the weight of principal components rather than the objective weight of each independent indicator and (2) invalid results once the correlations among indicators are too low.

2.3. Advantages of Proposed Methods

To sum up, the determination of indicators’ weighting in the multiobjective decision is significantly important. Weighting is a complicated metric for subjective assessment and objective reality. It reflects the indicator’s important level in the evaluation process. Rationality of weighting’s value affects the judging results significantly. As such, we need to figure out the value of indicator’s weighting effectively and scientifically, as well as the appropriate weighting methods. In this paper, the proposed methods were put forward combining the subjective and objective weighting methods. And the methods’ interests compared with other methods are shown in the following.

There were too many original indicators collected from the literature in the beginning, and we just need some of them. So we used the subjective weighting methods to screen the indicators firstly because it could reasonably determine the order of all the indicators. The useless indicators we deleted were meaningless for the assessment, so their weighting values were not necessary to be estimated accordingly (i.e., why we denied objective methods). We chose Delphi method to get the effective indicators so that the efficiency of assessment operation was relatively high.

Then, the driven factors after screening need to be dealt with on the strong basis of mathematical theory because we need to know their meanings that influence the water use efficiency. We used the method of correlation coefficient and modified PCA method to analyze the indicators. In accordance with the limitations of PCA, we introduced the MSR to reduce the invalid results. And we abstracted all the principal components to retain the original data so we can get the objective weight of each independent indicator.

3. Methodology

3.1. PCA Model

According to [13, 19, 20], PCA is applied in this paper to eliminate the multicollinearity among the variables.

After literature analysis, a lot of factors that affect the irrigation water use efficiency have been found. Then 22 driven factors were introduced to the irrigation water use efficiency analysis and taken as the variables in the PCA model. For the period of 1997–2010, 14 samples were collected for analysis of IWUE in China. Consider a data matrix, , with column-wise zero empirical means (the sample mean of each column has been shifted to zero), where each of the rows represents a different repetition of the experiment, and each of the columns gives a particular kind of datum. Driven factors are exhibited in columns, and the data matrix is shown in the following:

In matrix , represents the number of samples and represents the number of driven factors.

Mathematically, the transformation is defined by a set of -dimensional vectors of weight or loadings that map each row vector of to a new vector of principal component scores , given by

As such, the individual variables of considered over the data set successively inherit the maximum possible variance from , with each loading vector constrained to be a unit vector. And the new variables in are uncorrelated with each other. New variables are presented as follows:

For the first principal component in the above, the corresponding loading vector has to satisfy

A standard result for a symmetric matrix such as is that the quotient’s maximum possible value is the largest eigenvalue of the matrix, which occurs when is the corresponding eigenvector. With found, the first component of a data vector can then be given as a score in the transformed coordinates, or as the corresponding vector in the original variables.

The th component can be found by subtracting the first principal components from :

Then we find the loading vector which extracts the maximum variance from this new data matrix in the following:

It turns out that this gives the remaining eigenvectors of , with the maximum values for the quantity in brackets given by their corresponding eigenvalues. Then the th principal component of a data vector can therefore be given as a score in the transformed coordinates or as the corresponding vector in the space of the original variables, where is the th eigenvector of .

The full principal components decomposition of can therefore be given as .

3.2. Stepwise Regression

In statistics, stepwise regression includes regression models in which the choice of predictive variables is carried out by an automatic procedure [21, 22]. Usually, this takes the form of a sequence of -tests or -tests, but other techniques are possible, such as adjusted , Akaike information criterion, Bayesian information criterion, Mallows’ Cp, PRESS, or false discovery rate. The frequent practice of fitting the final selected model followed by reporting estimates and confidence intervals without adjusting them to take the model building process into account has led to calls to stop using stepwise model building altogether [23, 24] or to at least make sure model uncertainty is correctly reflected [25, 26].

Stepwise regression includes three main approaches: forward selection, backward elimination, and bidirectional elimination. A widely used algorithm was first proposed by [27]. This is an automatic procedure for statistical model selection in cases where there are a large number of potential explanatory variables, and no underlying theory on which to base the model selection. The procedure is used primarily in regression analysis, though the basic approach is applicable in many forms of model selection. This is a variation on forward selection. At each stage in the process, after a new variable is added, a test is made to check if some variables can be deleted without appreciably increasing the residual sum of squares (RSS). The procedure terminates when the measure is (locally) maximized or when the available improvement falls below some critical value.

One of the main issues with stepwise regression is that it searches a large space of possible models. Hence it is prone to the overfitting of the data. In other words, stepwise regression will often fit much better in sample than it does on new out-of-sample data. This problem can be mitigated if the criterion for adding (or deleting) a variable is stiff enough. The key line in the sand is at what can be thought of as the Bonferroni point, namely, how significant the best spurious variable should be based on chance alone. On -statistic scale, this occurs at about , where is the number of predictors. This fence turns out to be the right trade-off between overfitting and missing signal. If we look at the risk of different cutoffs, then using this bound will be within a factor of the best possible risk. Any other cutoff will end up having a larger such risk inflation [28, 29].

3.3. Stepwise Regression-PCA Model

The purpose of irrigation water use efficiency study is to find out five major driven factors from all the driven factors of China. In other words, the final results must be presented using the original data, and the number of chosen factors cannot be less than five. The major challenges in the mathematical model selecting are shown below.(1)While eliminating the multicollinearity, the factors strongly unrelated with other factors must be contained.(2)The variables which actually carry data information must be included as many as possible.(3)The data can be retransformed back to its origin.

As such, the stepwise regression model combined with PCA model is applied in this paper. Mathematically, one or some factors among these driven factors may be uncorrelated with other factors. So we use variance inflation factor (VIF) which quantifies the severity of multicollinearity in an ordinary least squares regression analysis to measure how much the variance (the square of the estimate’s standard deviation) of an estimated regression coefficient is increased because of collinearity. Then the factors uncorrelated with others can be found. For the remaining factors, we use PCA model to eliminate their multicollinearity by transforming them into a new set of variables while carrying all the original information. At last, stepwise regression is used to establish an optimal regression model to fit the data. The flowchart of irrigation water use efficiency analysis is shown in Figure 1.

4. Driven Factors Screening

The change of agricultural water use efficiency is driven by multifactors such as climatic conditions, resource endowment, irrigation technology and management, and capital investment. Through literature review [3045], we have listed the driven factors as many as possible. Then these driven factors were screened according to four rules in the following.(1)Comprehensive. The driven factors must be related to the irrigation water use efficiency closely and the factors can reflect the China’s agricultural development status synthetically.(2)Objectivity. The driven factors must be objective to eliminate the subjective factors as far as possible. As such, the experts could judge the importance of factors fairly.(3)Independence. Although it is inevitable that the interconnected relationship exists among variables, we could choose the factors inferred with high independent in accordance with the experts’ empirical conclusion.(4)Convenient. Other than the three rules above, we must remove the factors whose data cannot be collected for convenient. It is helpful to decrease the difficulties of this study.

According to the four rules above, the flowchart of driven factors screening is shown in Figure 2. After the screening, the final driven factors are shown in the Table 1.


IndicatorsExplanationClassificationCorrelation coefficient with irrigation water use efficiency

Annual precipitation (AP)The total rainfall in a year; millimetre (mm)Climatic condition0.36
Average temperature per year (AT)Average temperature according to daily temperature in a year; °CClimatic condition0.075
Rainfall frequency (RF)Accumulated frequency of rainfall of one yearClimatic condition−0.16
Drought index (DI)Dividing total evaporation by precipitation in a yearClimatic condition−0.337
Proportion of farmers (PR)Proportion of farmers in total employmentHuman resources0.482
Arable land area (AA)Agricultural land, reclaimed tideland and river filed; (103 ha)Resource endowment−0.521
Per capita arable land (PA)Dividing arable land by total population in ChinaResource endowment0.46
Local water resource amount (LW)The amount of local surface water resources and groundwater resources quantity; (108 m3)Resource endowment0.457
Per capita water resource amount (PW)Local water resource amount/populationResource endowment0.526
Per capita water consumption (PWC)Total water consumption in a year/populationResource endowment−0.4
Per capita income of farmers (PIF)Yuan/yearEconomic condition−0.5
Per capita GDP (PGDP)Yuan/yearEconomic condition−0.5
First industrial output (FIO)Agricultural output proportion accounted for the total industrial outputEconomic condition0.535
Rice planting area proportion (RPA)Rice planting area proportion in the total sown areaEconomic condition0.1
Grain crop area proportion (GCA)Grain crop area proportion in the total sown areaEconomic condition0.1
Crop production (CP)104 ton (104 t)Economic condition−0.3
Spending on science and technology (SST)Science and technology spending proportion accounted for the financial expenditureTechnology condition−0.4
Water-saving irrigation area (WSIA)103 Hectare (103 ha)Technology condition−0.6
Highly efficient water-saving irrigation area (HWSIA)103 Hectare (103 ha)Technology condition−0.5
Effective irrigation area (EIA)Land area truly irrigated; 103 Hectare (103 ha)Technology condition−0.5
Spending on agriculture (SA)Spending proportion on agricultureManagement indices−0.3
Water price (WP)YuanManagement indices−0.5
Irrigation water use coefficient (IWUC)Dividing water consumption in farmland by water use for irrigation in a yearResearch object (dependent variable) in this paper’s study1

Note: the economic indices in this table are calculated at comparable prices.

5. Calculation

Taking the data of 22 independent variables and IWUC of China in the period of 1997–2010 into consideration, we have conducted the stepwise regression and principal component analysis. The data used here come from National Bureau of Statistics of China, 2011–2014.

5.1. Stepwise Regression for Collinearity Diagnostics

The collinearity can be diagnosed by Tolerance. Tolerance is the ratio of residual sum of squares to total sum of squares after regression analyzing one independent variable (as the dependent variable) and the other independent variables. The smaller this index is, the more serious the collinearity is. That is to say this independent variable can be predicted by the remaining independent variables precisely. The criterion of Tolerance in this paper is 0.1: namely, if an independent variable’s Tolerance is less than 0.1, there will exist collinearity. Variance inflation factor (VIF) is the reciprocal of Tolerance.

The outcome of collinearity diagnosis by SPSS is shown in Table 2.


ModelAdjusted Std. error of the estimateCorrelationToleranceVIF

10.5350.2860.2270.007710.5351.01.0

First industrial output.

The first industrial output (FIO) is the only indicator which does not relate to other 21 indicators. However, the coefficient of determination indicating the data does not fit the statistical model very well; there remains which can be enhanced. Then we need to introduce other variables without collinearity.

5.2. Principal Component Analysis for Remaining Variables

As mentioned above, only one factor has been picked up; the remaining 21 variables related to each other. As such, the principal component analysis has been conducted to eliminate the collinearity among the 21 variables, figuring out the principal factors under the condition that the original data information must be retained.

At first, the data standardization of 21 variables is shown in Table 3. Then these standardized data have been used in the SPSS to conduct the principal component analysis. The total variance of interpretation for the 21 variables is shown in Table 4. And we got the principal component score coefficient matrix, shown in Table 5.


APATRFDIPRAAPALWPWPWCPIFPGDPRPAGCACPSSTWSIAHWSIAEIASAWPIWUC

1997−0.66−0.960.170.020.85−2.321.240.120.521.76−0.95−1.002.081.790.09−0.04−1.40−1.24−1.51−0.57−1.640.00
19982.181.43−1.55−2.290.83−2.321.052.492.57−0.22−0.89−0.961.351.690.63−0.41−1.23−1.10−1.11−0.10−1.342.28
1999−0.200.39−0.55−0.690.890.250.890.250.470.21−0.85−0.911.351.400.51−0.41−1.04−0.96−0.78−0.43−1.080.00
2000−0.08−1.78−0.28−0.280.870.250.740.060.23−0.65−0.82−0.820.370.18−0.870.02−0.85−0.80−0.53−0.55−0.880.00
2001−0.67−0.841.160.670.870.250.60−0.26−0.12−0.14−0.73−0.72−0.48−0.26−1.15−0.77−0.65−0.64−0.37−0.79−0.230.00
20020.670.46−1.05−0.460.870.250.470.270.29−0.82−0.64−0.62−0.84−0.72−1.02−0.46−0.43−0.47−0.33−0.66−0.580.00
20030.05−0.401.441.430.690.250.35−0.03−0.03−2.20−0.53−0.47−1.82−1.53−1.80−0.83−0.21−0.29−0.46−1.080.101.14
2004−1.010.350.940.780.260.250.24−1.31−1.20−0.91−0.27−0.24−0.48−1.14−0.65−0.990.02−0.09−0.280.08−0.081.14
20050.23−0.66−0.170.25−0.160.250.120.190.05−0.48−0.02−0.01−0.36−0.77−0.21−1.040.270.11−0.07−0.590.60−1.14
2006−0.721.020.720.73−0.590.250.02−0.85−0.890.380.240.29−0.230.010.20−0.830.540.350.20−0.640.98−1.14
2007−0.741.731.271.14−0.950.64−1.30−0.88−0.960.380.690.76−0.11−0.060.311.810.800.660.500.430.42−1.14
20080.53−0.14−0.72−0.63−1.190.67−1.39−0.04−0.290.731.181.22−0.23−0.251.121.281.040.901.240.791.15−1.14
2009−1.280.350.060.49−1.480.67−1.48−1.29−1.390.901.491.46−0.23−0.101.181.331.371.491.542.051.200.00
20101.68−0.96−1.44−1.16−1.760.67−1.561.290.751.072.112.03−0.36−0.231.651.331.762.101.962.051.380.00


Components Initial eigenvalue
TotalVariance (%)Cumulative (%)

110.51150.05450.054
25.66726.98577.039
32.38911.37888.417
41.0394.94793.364
50.4542.16095.524
60.4352.07397.597
70.2251.07098.667
80.1260.60099.267
90.0670.32099.588
100.0380.17999.767
110.0300.14299.909
120.0120.05799.965
130.0070.035100.00


Components
12345678910111213

AP−0.1110.705−0.6760.0990.1060.0360.067−0.039−0.0320.033−0.0470.0290.024
AT0.1060.0730.1850.969−0.084−0.0110.005−0.0330.0420.0170.0000.001−0.007
RF0.012−0.8540.3280.0920.3240.0900.0780.1620.036−0.018−0.0850.0220.006
DI0.196−0.9120.2530.0010.2190.0480.0470.011−0.0090.0770.079−0.0360.005
PR−0.965−0.180−0.074−0.049−0.1060.088−0.0270.0000.0850.001−0.013−0.0220.026
AA0.680−0.480−0.233−0.097−0.4060.0610.2450.1020.0410.055−0.016−0.001−0.005
PA−0.981−0.0680.015−0.063−0.024−0.146−0.0330.0290.0350.0240.0400.0470.013
LW−0.3680.762−0.4830.0220.1770.0410.0900.0700.0400.0260.013−0.022−0.022
PW−0.5570.715−0.3730.0150.1550.0590.0640.0700.0440.0090.018−0.024−0.014
PWC0.3380.5810.650−0.2050.088−0.1310.109−0.1650.1300.036−0.044−0.009−0.006
PIF0.9730.205−0.032−0.0310.078−0.030−0.0170.0100.0020.0200.0090.0070.025
PGDP0.9810.168−0.035−0.0150.080−0.024−0.005−0.004−0.0110.0140.0020.0110.023
RPA−0.4440.6430.595−0.077−0.061−0.0470.0250.074−0.1120.051−0.0330.013−0.033
GCA−0.4250.7070.5300.024−0.0150.0060.1270.1000.039−0.0560.0790.0300.013
CP0.5730.7350.2830.073−0.064−0.1530.0240.098−0.0600.003−0.016−0.0460.043
SST0.7070.3890.258−0.0170.0130.5190.067−0.073−0.041−0.0240.0170.0060.002
WSIA0.986−0.024−0.1140.0420.054−0.0800.004−0.014−0.0270.0330.0110.020−0.015
HWSIA0.9820.079−0.109−0.0190.065−0.057−0.0380.0240.0300.0620.0380.031−0.003
EIA0.9760.098−0.141−0.067−0.062−0.0030.0150.0150.072−0.0590.0050.015−0.005
SA0.8170.4360.101−0.053−0.0610.108−0.3030.1300.0770.011−0.013−0.010−0.013
WP0.928−0.164−0.1800.0270.086−0.2240.1020.010−0.021−0.0950.011−0.017−0.019

According to principal components score coefficient matrix, we introduced the original data forming new data of principal components. For example, the data of first component (FAC_1) was obtained according to the following equation:where is the new data of FAC_1 and is the original data of 21 variables; ; .

As such, the numerical matrix of 13 principal components was listed in Table 6. Because the first industrial output (FIO) has been picked up, its standardized data is also listed in Table 6.


FAC_1FAC_2FAC_3FAC_4FAC_5FAC_6FAC_7FAC_8FAC_9FAC_10FAC_11FAC_12FAC_13FIO

−13.03.764.73−1.190.77−0.32−0.11−0.050.00−0.130.02−0.04−0.031.82
−14.111.59−1.631.790.39−0.27−0.060.11−0.03−0.160.01−0.02−0.031.55
−8.922.901.710.22−0.82−0.280.230.260.03−0.060.01−0.03−0.021.15
−7.74−0.95−0.60−1.74−0.400.35−0.040.13−0.06−0.110.020.00−0.030.63
−5.80−5.100.07−0.810.170.05−0.050.150.22−0.05−0.05−0.02−0.020.38
−5.54−1.01−2.780.55−0.600.11−0.22−0.220.11−0.020.02−0.02−0.020.14
−4.17−8.75−3.370.220.590.50−0.360.240.070.010.02−0.020.00−0.20
−0.38−6.830.430.42−0.25−0.30−0.440.05−0.040.02−0.060.010.000.02
−0.12−1.87−1.89−0.500.05−0.45−0.150.06−0.040.09−0.01−0.02−0.01−0.45
3.60−3.421.400.950.01−0.64−0.040.030.080.080.010.030.01−0.82
9.73−2.422.601.610.220.880.33−0.07−0.050.07−0.010.020.01−0.95
12.123.49−0.49−0.31−0.210.250.29−0.25−0.110.04−0.050.020.04−0.96
16.91−0.132.58−0.09−0.220.090.13−0.22−0.090.050.060.030.05−1.11
17.478.74−2.75−1.140.300.030.49−0.21−0.070.160.010.050.06−1.19

5.3. Multivariate Linear Regression Analysis

Based on the stepwise regression and principal component analysis, the 14 independent variables which are not related to each other have been found. After principal component analysis, we could see that the first three components contribute about 88.42% to the total variance. Then combined with the variable of FIO, it is considered that the FAC_4 also could be included into the linear regression. Namely, the first four components and the FIO have been considered in multivariate linear regression analysis taking the IWUC as the dependent variable.

Then the results after multivariate linear regression analysis are shown in Table 7, and the coefficients of regression are shown in Table 8.


ModelAdjusted Std. error of the estimate

10.9020.8140.6980.549

First industrial output, FAC_1, FAC_2, FAC_3, FAC_4.

ModelUnstandardized coefficientsStandardized coefficientsSig.
Std. error

(Constant)−0.0030.147−0.0200.985
FAC_10.2870.0793.0163.6280.007
FAC_2−0.1440.049−0.816−2.9270.019
FAC_3−0.4320.097−1.033−4.4640.002
FIO3.7740.8823.7734.2770.003
FAC_40.5090.1650.5293.0950.015

Table 7 indicates that the introduction of principal components affected the dependent variable positively, because the determination coefficient increased along with the introduction of principal components step by step. And , which is big enough according to the criterion in Figure 1. So we got the regression equation:

6. Results

Table 6 represents the projections of original factors on the components and we get four principal components and one original driven factor according to the regression equation. As such, we need to get other original driven factors by formula conversion using Table 9, based on (7) and (8).


FAC_1FAC_2FAC_3FAC_4Regression coefficients
0.287−0.144−0.4320.509

AP−0.1110.705−0.6760.0990.209
AT0.1060.0730.1850.9690.433
RF0.012−0.8540.3280.0920.032
DI0.196−0.9120.2530.0010.079
PR−0.965−0.180−0.074−0.049−0.244
AA0.680−0.480−0.233−0.0970.316
PA−0.981−0.0680.015−0.063−0.310
LW−0.3680.762−0.4830.0220.005
PW−0.5570.715−0.3730.015−0.094
PWC0.3380.5810.650−0.205−0.372
PIF0.9730.205−0.032−0.0310.248
PGDP0.9810.168−0.035−0.0150.265
RPA−0.4440.6430.595−0.077−0.516
GCA−0.4250.7070.5300.024−0.441
CP0.5730.7350.2830.073−0.026
SST0.7070.3890.258−0.0170.027
WSIA0.986−0.024−0.1140.0420.357
HWSIA0.9820.079−0.109−0.0190.308
EIA0.9760.098−0.141−0.0670.293
SA0.8170.4360.101−0.0530.101
WP0.928−0.164−0.1800.0270.381

Then five major driven factors have been selected in Table 10, because they have the large regression coefficients relatively, affecting the regression equation significantly.


IndicatorsRegression coefficientsDriven factor level

First industrial output (FIO)3.774Very important
Rice planting area proportion (RPA)−0.516Important
Grain crop area proportion (GCA)−0.441Normal
Average temperature per year (AT)0.433Normal
Water price (WP)0.381Normal

7. Conclusions

The enforcement of “Efficient Red Line” is the core of “the most strict water management institution.” This paper made a research on the irrigation water use efficiency serving for the dynamic management of water resources in China. The main aim of this paper is to improve the sustainable water resources use so that China’s economy and society can develop smoothly.

Through the literature data research and survey, there are 22 indicators that have been selected as the driven factors. By the experts discussion and correlation analysis, the index system of irrigation water use efficiency has been established. And this paper analyzed a variety of mathematical models in order to pick up the most appropriate methods. After analysis, the MSR and PCA were selected and modified to fulfill our requirements for research. At last, the major driven factors influencing China’s irrigation water use efficiency mostly were found out. The lessons learned from the application of the methods, in view of the results and the process, are shown in the following, as well as the consideration that how the proposed approach can be reused in similar problems and how it can be improved.(1)We improve the agricultural irrigation water use efficiency index system. Delphi method could reasonably determine the order of all the indicators, so we used it to get the effective indicators. This method is feasible because we need to know the order of indicators rather than their weighting values for simplifying index system. After the applying of Delphi method, the useful indicators were obtained and then we tried our best to collect the indicators and their data. Members of the investigation team collected the indicators and data sharing out the work and cooperating with one another. Through a variety of channels, means, and mathematical prediction models, each indicator has its own data source for construction of agricultural irrigation water use efficiency control system.(2)Quantitative analysis is in a strong basis of mathematical theory for scientific research. We used the method of correlation coefficient and modified PCA method to analyze the indicators. For eliminating the limitations of PCA, we introduced the MSR to reduce the invalid results, and we abstracted all the principal components to retain the original data. We combine the multiple linear regression with principal components analysis to eliminate the multicollinearity within indicators. The indicators without multicollinearity have been found out by the stepwise regression; then the principal components analysis has been conducted to figure out the principal components of the factors with multicollinearity on the basis of national data in China. As such, the regression equation of irrigation water use efficiency has been constructed. And the major driven factors have been found out because the principal components preserved the original data information.(3)In this paper, up to different districts and datum years, the authors gave their objective assessment scientifically about the national irrigation development model, saving irrigation style, and saving input and output. Meanwhile, this paper made a conclusion of the processing of irrigation water use efficiency development. Then, the “principal component analysis” (PCA) was applied to eliminate the interfering factors of irrigation water use efficiency; the information included 31 provinces, crossing the period of 1997 to 2010. The study provided the zoning of national irrigation water use efficiency with reference, eliminating the linear interference between indicators and summing up five principal factors.(4)Outlook about the proposed approach’s use in similar problems and its improvement was as follows. The utilization efficiency of irrigation water is in the process of continuous dynamic change, limited or influenced by the national policies and resource endowment. So it is very hard to collect the data of indicators, which causes the subjective index selection inevitably. The proposed approach in this paper will solve this issue effectively and it can be reused in similar problems because of the combination of subjective and objective weighting methods. For example, the ecological problems draw the public attention increasingly nowadays, so it is inevitable to introduce ecological indicators in the assessment of irrigation water use efficiency, such as groundwater depth and fertilization frequency in irrigation district. However the popular study methods of ecological indicators are limited to qualitative methods, if this proposed approach in this paper can be adopted, the research results of ecological indicators of irrigation water use will be more convincing.(5)The improvement of the proposed approach in the future is as follows. Path analysis, a statistical method of testing cause/effect relationships, can effectively explain the impact path of independent variable parameters on the variable factors. Through the analysis of the relationship between the direct and indirect effects on the study, the impact mode of factors can be figured out. As such, we can apply the path analysis to study the mechanism of the effect of driven factors on irrigation water use efficiency. Then the development of irrigation water use efficiency control system is our ultimate goal. Combined with software development technology, we can further research and develop the irrigation water use efficiency control system; the research results of the computation information system also can be applied in the agricultural water-saving management. Through the computer network, the agricultural water-saving information collection, storage, and calculation can be conducted. The agricultural water use monitoring and scheduling also can constantly improve our agricultural water-saving management level; the modern agricultural water-saving management will come true finally.

Additional Points

(i)Analysis of driven factors affecting irrigation water use efficiency in China.(ii)Presentation of the process of driven factors screening.(iii)PCA employed to eliminate the multicollinearity in the modeling.(iv)Presentation of an integrated approach combining MSR and PCA.(v)Major driven factors found out using MSR and PCA.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This paper was sponsored by the Ministry of Water Resources Special Fund for Scientific Research in the Public Interest (no. 201001020), Ministry of Water Resources Special Fund for Scientific Research in the Public Interest (no. 201201020), and Key laboratory of Water Science and Engineering of Ministry of Water Resources Open Fund for Scientific Research (no. YK914012). The efforts from the reviewers into making positive suggestions and helpful comments to improve the paper are greatly appreciated. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the Zhejiang University.

References

  1. J. B. Braden and J. S. Shortle, “Agricultural sources of water pollution,” in Encyclopedia of Energy, Natural Resource, and Environmental Economics, vol. 3, pp. 81–85, Elsevier, 2013. View at: Google Scholar
  2. X.-J. Hu, Y.-C. Xiong, Y.-J. Li et al., “Integrated water resources management and water users' associations in the arid region of northwest China: a case study of farmers' perceptions,” Journal of Environmental Management, vol. 145, pp. 162–169, 2014. View at: Publisher Site | Google Scholar
  3. G. F. Wang, J. C. Chen, F. Wu, and Z. H. Li, “An integrated analysis of agricultural water-use efficiency: a case study in the Heihe River Basin in Northwest China,” Physics and Chemistry of the Earth Parts A/B/C, vol. 89-90, pp. 3–9, 2015. View at: Publisher Site | Google Scholar
  4. J. S. Wallace, “Increasing agricultural water use efficiency to meet future food production,” Agriculture, Ecosystems and Environment, vol. 82, no. 1–3, pp. 105–119, 2000. View at: Publisher Site | Google Scholar
  5. B. Nunes, D. Bennett, and S. Marques Jr., “Sustainable agricultural production: an investigation in Brazilian semi-arid livestock farms,” Journal of Cleaner Production, vol. 64, pp. 414–425, 2014. View at: Publisher Site | Google Scholar
  6. C. J. C. Blanco, Y. Secretan, and A. L. A. Mesquita, “Decision support system for micro-hydro power plants in the Amazon region under a sustainable development perspective,” Energy for Sustainable Development, vol. 12, no. 3, pp. 25–33, 2008. View at: Publisher Site | Google Scholar
  7. S. J. Sun, C. Q. Chen, and Z. C. Qiu, “Current situation of agricultural water use in China and the way of water saving,” Liaoning Agricultural Sciences, vol. 1, pp. 35–37, 1999 (Chinese). View at: Google Scholar
  8. National Bureau of Statistics of China, China Statistical Yearbook 2011, Statistics Press, Beijing, China, 2011 (Chinese).
  9. National Bureau of Statistics of China, China Statistical Yearbook 2012, Statistics Press, Beijing, China, 2012 (Chinese).
  10. National Bureau of Statistics of China, China Statistical Yearbook 2013, Statistics Press, Beijing, China, 2013 (Chinese).
  11. X. Y. Wang and L. G. Zhao, “Agricultural water efficiency and the causal factors,” Issues in Agricultural Economy, vol. 3, pp. 10–17, 2008 (Chinese). View at: Google Scholar
  12. H. Omrani, R. Gharizadeh Beiragh, and S. Shafiei Kaleibari, “Performance assessment of Iranian electricity distribution companies by an integrated cooperative game data envelopment analysis principal component analysis approach,” International Journal of Electrical Power & Energy Systems, vol. 64, pp. 617–625, 2015. View at: Publisher Site | Google Scholar
  13. L. L. Fan, Structural Health Monitoring Base on Principal Components Analysis Implemented on a Distributed and Open System, Department of Building & Construction, City University of Hong Kong, Hong Kong, 2006.
  14. W.-J. Wang, S.-H. Yu, S.-X. Wang, J. Chen, and L. Deng, “Multilevel comprehensive evaluation and decision making of ergonomics,” Discrete Dynamics in Nature and Society, vol. 2015, Article ID 689203, 9 pages, 2015. View at: Publisher Site | Google Scholar | MathSciNet
  15. J. Liu, L. Yan, and C. Ma, “Valuing convertible bonds based on LSRQM method,” Discrete Dynamics in Nature and Society, vol. 2014, Article ID 301282, 9 pages, 2014. View at: Publisher Site | Google Scholar | MathSciNet
  16. P. Y. Han, A. T. Jin, and L. H. Siong, “Eigenvector weighting function in face recognition,” Discrete Dynamics in Nature and Society, vol. 2011, Article ID 521935, 15 pages, 2011. View at: Publisher Site | Google Scholar | MathSciNet
  17. Z. Wang, Y. Sun, and P. Li, “Functional principal components analysis of Shanghai stock exchange 50 index,” Discrete Dynamics in Nature and Society, vol. 2014, Article ID 365204, 7 pages, 2014. View at: Publisher Site | Google Scholar
  18. E. E. Abusham and E. K. Wong, “Locally linear discriminate embedding for face recognition,” Discrete Dynamics in Nature and Society, vol. 2009, Article ID 916382, 8 pages, 2009. View at: Publisher Site | Google Scholar | MathSciNet
  19. K. Pearson, “LIII. On lines and planes of closest fit to systems of points in space,” PhilMag, vol. 2, no. 11, pp. 559–572, 1901. View at: Publisher Site | Google Scholar
  20. K. C. Lam, T. S. Hu, and S. T. Ng, “Using the principal component analysis method as a tool in contractor pre-qualification,” Construction Management and Economics, vol. 23, no. 7, pp. 673–684, 2005. View at: Publisher Site | Google Scholar
  21. R. R. Hocking, “The analysis and selection of variables in linear regression,” Biometrics, vol. 32, no. 1, pp. 1–49, 1976. View at: Publisher Site | Google Scholar
  22. N. Draper and H. Smith, Applied Regression Analysis, John Wiley & Sons, New York, NY, USA, 2nd edition, 1981.
  23. P. L. Flom and D. L. Cassell, “Stopping stepwise: why stepwise and similar selection methods are bad, and what you should use,” in Proceedings of the NorthEast SAS Users Group Inc 20th Annual Conference (NESUG '07), Baltimore, Md, USA, November 2007. View at: Google Scholar
  24. F. E. Harrell, Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis, Springer, New York, NY, USA, 2001.
  25. C. Chatfield, “Model uncertainty, data mining and statistical inference,” Journal of the Royal Statistical Society Series A: Statistics in Society, vol. 158, no. 3, pp. 419–466, 1995. View at: Publisher Site | Google Scholar
  26. B. Efron and R. J. Tibshirani, An Introduction to the Bootstrap, Chapman & Hall/CRC, 1998.
  27. M. A. Efroymson, “Multiple regression analysis,” in Mathematical Methods for Digital Computers, A. Ralston and H. S. Wilf, Eds., Wiley, 1960. View at: Google Scholar
  28. D. P. Foster and E. I. George, “The risk inflation criterion for multiple regression,” The Annals of Statistics, vol. 22, no. 4, pp. 1947–1975, 1994. View at: Publisher Site | Google Scholar
  29. D. L. Donoho and J. M. Johnstone, “Ideal spatial adaptation by wavelet shrinkage,” Biometrika, vol. 81, no. 3, pp. 425–455, 1994. View at: Publisher Site | Google Scholar
  30. S.-H. Yoo, J.-Y. Choi, and M.-W. Jang, “Estimation of design water requirement using FAO Penman-Monteith and optimal probability distribution function in South Korea,” Agricultural Water Management, vol. 95, no. 7, pp. 845–853, 2008. View at: Publisher Site | Google Scholar
  31. J. C. Poussin, A. Imache, R. Beji, P. Le Grusse, and A. Benmihoub, “Exploring regional irrigation water demand using typologies of farms and production units: an example from Tunisia,” Agricultural Water Management, vol. 95, no. 8, pp. 973–983, 2008. View at: Publisher Site | Google Scholar
  32. Y. Ito, K. Momii, and K. Nakagawa, “Modeling the water budget in a deep caldera lake and its hydrologic assessment: Lake Ikeda, Japan,” Agricultural Water Management, vol. 96, no. 1, pp. 35–42, 2009. View at: Publisher Site | Google Scholar
  33. M. O. Abu-Madi, “Farm-level perspectives regarding irrigation water prices in the Tulkarm district, Palestine,” Agricultural Water Management, vol. 96, no. 9, pp. 1344–1350, 2009. View at: Publisher Site | Google Scholar
  34. T. R. Green, Q. Yu, L. Ma, and T.-D. Wang, “Crop water use efficiency at multiple scales,” Agricultural Water Management, vol. 97, no. 8, pp. 1099–1101, 2010. View at: Publisher Site | Google Scholar
  35. M. A. Kahlown, A. Raoof, M. Zubair, and W. D. Kemper, “Water use efficiency and economic feasibility of growing rice and wheat with sprinkler irrigation in the Indus Basin of Pakistan,” Agricultural Water Management, vol. 87, no. 3, pp. 292–298, 2007. View at: Publisher Site | Google Scholar
  36. I. Hussain, H. Turral, D. Molden, and M.-U. Ahmad, “Measuring and enhancing the value of agricultural water in irrigated river basins,” Irrigation Science, vol. 25, no. 3, pp. 263–282, 2007. View at: Publisher Site | Google Scholar
  37. A. Z. Salman and E. Al-Karablieh, “Measuring the willingness of farmers to pay for groundwater in the highland areas of Jordan,” Agricultural Water Management, vol. 68, no. 1, pp. 61–76, 2004. View at: Publisher Site | Google Scholar
  38. M. Huang, T. Dang, J. Gallichand, and M. Goulet, “Effect of increased fertilizer applications to wheat crop on soil-water depletion in the Loess Plateau, China,” Agricultural Water Management, vol. 58, no. 3, pp. 267–278, 2003. View at: Publisher Site | Google Scholar
  39. F.-M. Li, P. Wang, J. Wang, and J.-Z. Xu, “Effects of irrigation before sowing and plastic film mulching on yield and water uptake of spring wheat in semiarid Loess Plateau of China,” Agricultural Water Management, vol. 67, no. 2, pp. 77–88, 2004. View at: Publisher Site | Google Scholar
  40. S. Kaneko, K. Tanaka, T. Toyota, and S. Managi, “Water efficiency of agricultural production in China: regional comparison from 1999 to 2002,” International Journal of Agricultural Resources, Governance and Ecology, vol. 3, no. 3-4, pp. 231–251, 2004. View at: Publisher Site | Google Scholar
  41. H. Jin, W. Qingjie, L. Hongwen, L. Lijin, and G. Huanwen, “Effect of alternative tillage and residue cover on yield and water use efficiency in annual double cropping system in North China Plain,” Soil and Tillage Research, vol. 104, no. 1, pp. 198–205, 2009. View at: Publisher Site | Google Scholar
  42. R. J. Hutton and B. R. Loveys, “A partial root zone drying irrigation strategy for citrus—effects on water use efficiency and fruit characteristics,” Agricultural Water Management, vol. 98, no. 10, pp. 1485–1496, 2011. View at: Publisher Site | Google Scholar
  43. A. Wahbi and A. S. A. Shaaban, “Relationship between carbon isotope discrimination (Δ), yield and water use efficiency of durum wheat in Northern Syria,” Agricultural Water Management, vol. 98, no. 12, pp. 1856–1866, 2011. View at: Publisher Site | Google Scholar
  44. R. Boelens and J. Vos, “The danger of naturalizing water policy concepts: water productivity and efficiency discourses from field irrigation to virtual water trade,” Agricultural Water Management, vol. 108, pp. 16–26, 2012. View at: Publisher Site | Google Scholar
  45. B. Bluemling, H. Yang, and C. Pahl-Wostl, “Making water productivity operational—a concept of agricultural water productivity exemplified at a wheat-maize cropping pattern in the North China plain,” Agricultural Water Management, vol. 91, no. 1–3, pp. 11–23, 2007. View at: Publisher Site | Google Scholar

Copyright © 2016 Renfu Jia et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


More related articles

 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder
Views1575
Downloads682
Citations

Related articles

We are committed to sharing findings related to COVID-19 as quickly as possible. We will be providing unlimited waivers of publication charges for accepted research articles as well as case reports and case series related to COVID-19. Review articles are excluded from this waiver policy. Sign up here as a reviewer to help fast-track new submissions.