Abstract

Entropy weight method (EWM) is a commonly used weighting method that measures value dispersion in decision-making. The greater the degree of dispersion, the greater the degree of differentiation, and more information can be derived. Meanwhile, higher weight should be given to the index, and vice versa. This study shows that the rationality of the EWM in decision-making is questionable. One example is water source site selection, which is generated by Monte Carlo Simulation. First, too many zero values result in the standardization result of the EWM being prone to distortion. Subsequently, this outcome will lead to immense index weight with low actual differentiation degree. Second, in multi-index decision-making involving classification, the classification degree can accurately reflect the information amount of the index. However, the EWM only considers the numerical discrimination degree of the index and ignores rank discrimination. These two shortcomings indicate that the EWM cannot correctly reflect the importance of the index weight, thus resulting in distorted decision-making results.

1. Introduction

The entropy weight method (EWM) is an important information weight model that has been extensively studied and practiced [1, 2]. Compared with various subjective weighting models, the biggest advantage of the EWM is the avoidance of the interference of human factors on the weight of indicators, thus enhancing the objectivity of the comprehensive evaluation results [3, 4]. Therefore, the EWM has been widely used in decision-making in recent years [57]. For example, Wu et al. [8] made a comprehensive assessment on lake water quality in Shahu Lake to provide valuable information about present lake water quality for decision-making [8]. Based on the EWM, Zhang and Wang [9] evaluated stress factors and the efficiency of water management measures in the Chongqing city of China [9]. Yu et al. [10] studied the water characteristics of Gucheng Lake, such as eutrophication, health, and spatial distribution by the EWM [10].

The EWM evaluates value by measuring the degree of differentiation. The higher the degree of dispersion of the measured value, the higher the degree of differentiation of the index, and more information can be derived. Moreover, higher weight should be given to the index, and vice versa. According to the traditional literature, the results of the EWM are always reliable and effective [11, 12]. However, based on engineering practice, we have found that the weighted result of the EWM cannot always accurately reflect the information amount and importance of the index [13]. Subsequently, the decision-making result is distorted [14, 15].

In this study, we consider the site selection of water source as an example and discuss the multi-index decision-making process with the help of Monte Carlo Simulation. The simulation reveals the distortion phenomenon in the calculation of the EWM and its influence on decision-making.

2. Methods and Materials

2.1. EWM

In this method, m indicators and n samples are set in the evaluation, and the measured value of the ith indicator in the jth sample is recorded as xij.

The first step is the standardization of measured values [16, 17]. The standardized value of the ith index in the jth sample is denoted as pij, and its calculation method is as follows:

In the EWM, the entropy value Ei of the ith index is defined as [18]

In the actual evaluation using the EWM, is generally set when pij = 0 for the convenience of calculation.

The range of entropy value Ei is [0, 1]. The larger the Ei is, the greater the differentiation degree of index i is, and more information can be derived. Hence, higher weight should be given to the index. Therefore, in the EWM, the calculation method of weight is [1, 19]

2.2. Water Quality Index

In the decision-making of a water source location, several alternative water sources are selected according to the water quantity. Then, according to the evaluation results for the water quality of each water source, the region with the best water quality will be selected as the final water source.

Water quality index is the most common evaluation model for water quality in water resource management. The representative water quality indexes in the study area are assumed to be permanganate (CODMn), ammonia nitrogen (NH3), and sulfide. According to China’s Environmental Quality Standards for Surface Water, the environmental quality of each index is divided into five levels, and the corresponding threshold is illustrated in Table 1.

Given that CODMn, NH3, and sulfide are the indicators with smaller and better values than other indicators, their water quality index can be calculated as follows:where WQIij is the water quality index of the ith pollutant in the jth water area. lik and rik are the lower and upper thresholds of the ith index in the kth evaluation grade, respectively.

The comprehensive water quality index SWQIj of the jth water area is defined as [20]

The domain of SWQIj is [0, 100]. When , the comprehensive water environment quality of the jth water area is judged at the k level.

3. Results and Discussion

3.1. Measured Values

In this section, we present a set of typical data generated by Monte Carlo Simulation to discuss the inauthenticity of the EWM and its influence on the location of a water source. The measured values generated by Monte Carlo Simulation are shown in Table 2.

According to Table 2, sulfide was the least dispersed, which only concentrated in the narrow range of [0, 0.045 mg/L], and all the samples were excellent. The information content of CODMn and NH3 was higher than that of sulfide, regardless of the dispersion degree of the measured value or the different degree of the grade. Therefore, sulfide should be the least weighted index. The measured values of NH3 could divide the five groups of evaluation samples into five grades, whereas CODMn could only divide the groups into three grades. The information content of NH3 was higher than that of CODMn and should be given the highest weight. To sum up, the reasonable ranking of weights should be NH3 > CODMn > sulfide.

Of the five participating water sources, only Water 1 was rated excellent in all three criteria. In addition, by comparing CODMn and NH3, which was of great importance in determining water quality, the pollution degree of Water 1 was the lowest. In the least-significant sulfide index comparison, the concentration of pollutants in Water 1 was higher than that of the other four samples but was still at excellent level. Therefore, the water quality of Water 1 was the best among all the participating samples and should be selected as the water source.

3.2. Entropy Weight Results

Table 3 shows the weights of the calculated indexes based on the EWM.

Conclusions were easily drawn based on Table 3. In the weighted result of the EWM, the weight of sulfide with the lowest dispersion degree of measured data and no grade discrimination was as high as 0.743, far more than any other pollutants. However, the weight of NH3, the smallest among all the indicators, had the highest degree of discrimination and grade discrimination, which was only 0.119. The weights in Table 3 were opposite of the reasonable weight ranking discussed above.

By comparing Tables 2 and 3, the weight distortion of the EWM was found to have come from two aspects:(1)The measured value of sulfide contained too many zero values. After standardization, the zero value in the measured value was converted into the zero value in the normalized value. In the calculation process of the EWM, when the normalized value is pij = 0, let . Thus, excessive zero values led to low entropy value and high weight of sulfide.(2)When the threshold of the evaluation index was divided differently, such as the NH3 and permanganate index in this example, no necessary relationship emerged between the numerical and the grade differentiation degrees. However, the EWM only considered the numerical discrimination degree and ignored the rank discrimination degree of the index. However, in the multi-index decision-making involving classification, the classification degree more accurately reflected the information amount of the index.

3.3. Comprehensive Evaluation Results

The weight calculation results in Table 3 were substituted into the water quality index model. The water quality index of each evaluated water area was calculated, as shown in Table 4.

Combined with Table 4, the weight distortion led to the following problems in the comprehensive evaluation results:(1)As the EWM gave much weight to sulfide without grade differentiation and too little weight to NH3 with the highest grade differentiation, the comprehensive water quality of the five water areas was rated excellent without any difference. This result led to difficult choices.(2)Given that the EWM gave too much weight to sulfide with the lowest pollution degree and ignored NH3 with the most serious pollution degree, the evaluation result of water quality was too optimistic. By comparing Tables 2 and 4, NH3 pollution in Water 2 had the worst water level and Water 3 had a bad level. These areas had extremely high nutrient content and eutrophication risk. Given the low weight of NH3 given by the EWM, both water sources were rated excellent.(3)According to the discussion in Section3.1, all the indicators of Water 1 could reach excellent level. Thus, Water 1 had the best water quality and should be selected as the water source. However, given the distortion of the weight result of the EWM, Water 1 had the worst water quality sample in the final decision-making and should not be selected as the water source.

3.4. The Protentional Solutions of the Distortions of EWM

As is illustrated in Section 3.2, the EWM has the following two distortions:(i)When the measured data set contains too many zero values, its entropy value may be undervalued, which makes the weight overexaggerated(ii)When the threshold of the evaluation index is divided differently, the numerical discrimination degree cannot correctly reflect the grade differentiation degree

The first distortion is a technical problem instead of a theoretical background problem. As is discussed in Section 3.2, the proximate cause for the first distortion is that the zero values in the measured data set correspond to the zero normalized value. As a result, it may be solved by modifying the standardization method to avoid the zero values in the normalized data set. For example, a protentional substitution formula of equation (1) iswhere C is a constant which should at least satisfy

Obviously, for the zero values in the measured data set, the larger the C is, the farther its normalized value exceeds zero, which reduces the influence of the first distortion. However, for the measured data set, the larger the C is, the less its discrimination degree is. Therefore, the concrete selecting method of the constant C still needs to be further studied.

The second distortion is a theoretical background problem rather than a technical problem because the EWM only considered the numerical discrimination degree and ignored the rank discrimination degree of the index. Considering that the classification degree more accurately reflected the information amount of the index, we thought that the theoretical basis of the EWM is partial in the multi-index decision-making problem. A protentional solution method is used to introduce new variables which represent rank discrimination degree into the weighting process. However, the concrete combination method between the numerical discrimination degree and the rank discrimination degree also needs to be further studied.

4. Conclusion

The rationality of the EWM in decision-making is questionable. First, when too many zero values are in the measured values, the standardized results of the EWM are prone to distortion. Subsequently, this outcome will lead to the excessive weight of the index with low actual differentiation degree. Second, the classification degree can accurately reflect the information amount of the index in multi-index decision-making involving classification. However, the EWM only considers the numerical discrimination degree and ignores the rank discrimination degree of the index. These two shortcomings indicate that the EWM is unable to reflect the importance of the index weight correctly, thus resulting in distorted decision-making results.

The protentional solutions of these two distortions are modifying the standardization method and introducing new rank discrimination degree variables, respectively. However, the concrete algorithms of these solutions still need to be further studied.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China under the contract no. 51709142.