Abstract

Dempster-Shafer (D-S) evidence theory has been widely used in various fields. However, how to measure the degree of conflict (similarity) between the bodies of evidence is an open issue. In this paper, in order to solve this problem, firstly we propose a modified cosine similarity to measure the similarity between vectors. Then a new similarity measure of basic probability assignment (BPAs) is proposed based on the modified cosine similarity. The new similarity measure can achieve the reasonable measure of the similarity of BPAs and then efficiently measure the degree of conflict among bodies of evidence. Numerical examples are used to illustrate the effectiveness of the proposed method. Finally, a weighted average method based on the new BPAs similarity is proposed, and an example is used to show the validity of the proposed method.

1. Introduction

Uncertainty information modeling and processing are still an open issue. To address this issue, many mathematical tools are presented like fuzzy sets theory [15], evidence theory [69], rough sets theory [1012], numbers [1316], numbers theory [17, 18], and so on [19]. Decision making and optimization under uncertain environment are heavily studied [2023]. Due to the efficiency modeling and fusion of information, evidence theory is widely used [24, 25]. And sometimes, the methods with mixed intelligent algorithms are used for decision making or related problems [20, 25, 26].

Dempster-Shafer (D-S) evidence theory [6, 7] provides a natural and powerful way for the expression and fusion of uncertain information, so it has been widely used in various fields of information fusion [9, 27, 28], target recognition [29, 30], decision making [31], image processing [32], uncertain reasoning [33], and risk analysis [25, 26, 34, 35]. However, the counterintuitive results may be obtained by conventional combination rule when collected bodies of evidence highly conflict with each other [36]. For example, in the real battle field, because of hard natural factors and human interference, the information obtained by sensor tends to have high uncertainty and high conflict with other pieces of information. If this problem cannot be solved effectively, it will greatly limit the application of D-S evidence theory. Many scholars have conducted in-depth research and put forward a number of enhancements to the evidence combination algorithm [8, 24, 3739]. In general, the existing methods of the conflict information can be divided into two categories. One method is to modify Dempster’s combination rule. When , the rule is not applicable. It solves the problem that how to redistribute and manage the conflict by modifying the rule. Another method is to modify the data model. The conflict evidence is preprocessed first and then is combined by combination rules.

Obviously, it is a critical issue to determine the degree of conflict between bodies of evidence. Researchers usually use conflict coefficient to indicate the degree of conflict. Whereas only reflects the noninclusive between bodies of evidence. For example, if two bodies of evidence are in certain conflict, the value of may be 0. Obviously, the result is incorrect. The classical conflict coefficient is not a pretty good conflict measure. In recent years, many works on this issue have emerged [40, 41]. In order to avoid a wrong claim made by using only , in [41], pignistic distance is introduced by Liu and two-tuple variable is proposed to measure the degree of conflict among BPAs. But Liu did not propose a formula to unify the two factors, since we often need a measured value of conflict when we process the conflict of evidence. Moreover, Jousselme distance [40] is also commonly used to measure the conflict of evidence, which reflects the difference of BPAs. Nevertheless it captures only one aspect of the dissimilarity among BPAs mainly associated with a distance metric. In [42] Deng et al. proposed a new relative coefficient to characterize the conflict between bodies of evidence. Although this coefficient solves some problems of , there still exist some deficiencies. For example, it lacks physical meaning during the definition of relative coefficient and it is not equal to 0 when two BPAs are totally contradictory with each other. In order to indicate the conflict between bodies of evidence precisely, a new method to measure the degree of conflict is proposed in this paper.

In previous works, distance and angle are often considered in the similarity measurement between vectors [43]. However, both the angle-based similarity measurement and the distance-based similarity measurement cannot effectively indicate the similarity among vectors. If the two factors and other factors can be used jointly, better performance can be expected. According to such an idea, we proposed a modified cosine similarity based on distance, cosine angle, and ratio of vector modes to measure the similarity between vectors.

Furthermore, a new similarity of BPAs is proposed based on the modified cosine similarity, which is used to measure the degree of conflict. If two bodies of evidence are consistent with each other, we think that the similarity between them is high; it means that the degree of conflict among them is low. When the proposed similarity is close to , the value indicates that there is little contradiction between bodies of evidence. When the proposed similarity is close to , this value indicates that the two bodies of evidence are in high conflict.

This paper is organized as follows. Section 2 describes some basic concepts. Section 3 details the problem of existing conflict measurements between BPAs. In Section 4, we will investigate the new similarity of BPAs. Section 5 presents some examples and analysis is presented. In Section 6, an example is shown to illustrate the effectiveness of our method. Finally, some conclusions are summarized in Section 7.

2. Preliminaries

2.1. The Basic Concept of D-S Evidence Theory [6, 7]

Definition 1. Let be a nonempty finite set which is called the frame of discernment; its power set containing all the possible subsets of is denoted by , where the elements are mutually exclusive and exhaustive. Define the function as the basic probability assignment (BPA) (also called a belief structure or a basic belief assignment), which satisfieswhere is defined as the BPA of , representing the strength of all the incomplete information sets for .

If , then is called the focal element. The degree of one’s belief to a given proposition is represented by a two-level probabilistic portrayal of the information set: the belief level and the plausibility level. They are defined as follows:where is the sum of for all subsets contained in , representing all the bodies of evidence that support the given proposition . which is the sum of for all subsets has a nonnull intersection of and represents all the bodies of evidence that do not rule out the given proposition . Absolutely, . The belief interval represents the uncertainty of . When , this means absolute confirmation to .

Definition 2. Let and be two BPAs on the same frame of discernment . Dempster’s combination rule is expressed as follows:withwhere is a normalization constant, which is called conflict coefficient of two BPAs.

According to the above formula, we can obtain that corresponds to the absence of conflict between and , whereas implies complete contradiction between and . The above rule is meaningful only when ; otherwise the rule cannot be applied.

2.2. Liu’s Method [41]

Definition 3. Let be a BPA on the frame of discernment . Its associated pignistic probability function : is defined as follows:where is the cardinality of proposition .

Definition 4. Let and be two BPAs on frame and let and be the results of two pignistic transformations from them, respectively. Then is defined as follows:which is called the distance between betting commitments of the two BPAs.

Value is the difference between betting commitments to from the two sources. The distance of betting commitments is the maximum extent of the differences between betting commitments to all the subsets. is simplified as .

Definition 5. Let and be two BPAs on frame and let be a two-dimensional measurement. and are defined as in conflict, if both and hold, where is conflict coefficient and is the distance between betting commitments in Definition 4 and where is the threshold of conflict tolerance.

2.3. Correlation Coefficient [45]

Definition 6. Let the distribution of random variables , beThen the partial entropy of on is defined as follows:The entropy of random variables is defined as follows:

Definition 7. The relative entropy of random variables is defined as the sum of their partial entropy:

Definition 8. The partial correlation coefficient and correlation coefficient of random variables are defined as follows:

In [45], the authors have proved that the correlation coefficient has the following property:If and only if the distribution of is the same, thenThis property indicates that the correlation coefficient is the characteristic metrics of identity and consistency of the distribution of random variables and .

In order to measure the degree of conflict between bodies of evidence, Deng et al. in 2011 [42] proposed a similar partial entropy, and it is defined as follows:According to checking, the coefficient in (14) is relatively close to log.

2.4. Jousselme Distance [40]

Definition 9. Let and be two BPAs on the same frame of discernment . and are focal elements of and , respectively. The Jousselme distance, denoted by , is defined as follows:where ; ; represents the scalar product of two vectors. It is defined as follows:where and are the elements of framework , is the cardinality of common objects between elements and , and is the number of subsets of union of and .

2.5. New Conflict Coefficient [46]

Definition 10. The new conflict coefficient of two bodies of evidence is defined as follows:where is classical conflict coefficient and is the Jousselme distance. The larger the is, the larger the degree of conflict will be.

When both measurements are 0, it is safe to say that there is no contradiction between and . Only when both and have a relatively high value, this pair of values indicates a strong conflict between and . When both measurements have a relatively low value, this indicates that the two BPAs have little contradiction.

2.6. Cosine Similarity

There are two vectors , , , and , , . Due to the complexity in fault diagnosis [47, 48], evidence theory is used to handle sensor fusion to obtain the reasonable result [24]. It is necessary to develop the similarity function to measure the similarity between the collected bodies of evidence. The formulation of cosine similarity of vectors [43, 47] is defined as follows:where .

If angle between and is , then their similarity is 1. On the contrary, if the angle between them is , then the similarity is 0. The geometric explanation of cosine similarity measurement is shown in Figure 1.

3. The Problem of Existing Conflict Measurements between BPAs

Example 11. Let be a frame of discernment with five elements , , , , . We assume the BPAs from three distinct sources, and they are defined as follows: If classical conflict coefficient is used to measure the degree of conflict among bodies of evidence, we get , which indicates that the degree of conflict between and is equal to the degree of conflict between and , and they are both in high conflict. In fact, since is identical to , they are not in conflict. Moreover, the degree of conflict between and intuitively should be higher than the degree of conflict between and . With Liu’s method, we get , which considers that and have no apparent severe different beliefs and Dempster’s combination rule should be used with caution.
This example shows that classical conflict coefficient cannot measure the degree of conflict between bodies of evidence. Liu’s method is not good enough to measure the degree of conflict.

Example 12. Let and be two BPAs from two distinct sources on . The five pairs of BPAs are shown as follows:1st pair: , ;2nd pair: , ;3rd pair: , ;4th pair: , ;5th pair: , .The summary of and values of the five pairs is given in Table 1. From the 1st pair of BPAs to the 5th pair of BPAs, we can know that two BPAs always have only one incompatible element and the number of compatible elements increases from two to six. Intuitively, every pair of BPAs is in certain conflict and the degree of conflict between and should be lower than the similarity between and . In addition, the degree of conflict should decrease and the similarity should increase from the pair of BPAs to the pair of BPAs.

As can be seen from Table 1, is constant for five pairs of BPAs and it indicates that there is no conflict. Although the Jousselme distance decreases from the 1st pair of BPAs to the 5th pair of BPAs, the value of is higher than 0.5 which means that there is high conflict between BPAs. Obviously, both and are counterintuitive.

This example shows that cannot reveal the conflict between BPAs and is not good enough to characterize conflict precisely.

Example 13. Let , be two BPAs from two distinct sources on frame , such that
isand isIn this example, the two BPAs totally contradict with each other since and support the different hypothesis. In Song et al.’s method [49], a new measurement of conflict was presented based on the definition of correlation coefficient. This method is reasonable and effective. But in this example, it is unreasonable. When , we get . When , we get . When , we get . According to the value of , it is unreasonable because of the high conflict. But in our method, it is reasonable. Based on (4), we can obtain that classical conflict coefficient for . The other different measurements of conflict between and are shown in Figure 2.

As can be seen from Figure 2, we can know that , and are , when , which are consistent with the above analysis. But when , they are all against the above intuitive analysis. The values of and tend toward 0, tends toward 0.5, and tends toward 1, which indicates that and are closer and closer with the increase of . The results are counterintuitive and reasonable. Therefore, they cannot be used to measure the conflict of BPAs in this example.

4. New Similarity of BPAs

To determine whether there is conflict between bodies of evidence, the similarity between bodies of evidence can be considered. If there is little conflict between bodies of evidence, then the similarity is high. If the two bodies of evidence are in high conflict, then the similarity is low. Therefore the similarity of BPAs can be used to measure the degree of conflict between bodies of evidence.

4.1. Modified Cosine Similarity of Vectors

Cosine similarity in [43, 50] is a measurement of similarity between two vectors of an inner product space that measures the cosine of the angle between them. The angle-based cosine similarity is a direction-based similarity measure. Therefore, it measures the similarity between two vectors only based on the direction but ignoring the impact of the distance of two vectors. In [50], Zhang and Korfhage proposed a new similarity measurement (integrated similarity measurement) based on distance and angle. The integrated similarity measurement is defined as follows:where is a distance-based similarity measurement [7]. is the Euclidean distance between vectors and , and is a constant whose value is greater than 1. is a modifier based on angle. More detailed information can be found in [50].

Although the integrated similarity measurement takes the strengths of both the distance and direction of two vectors into account, there still exist deficiencies. For example, in Figure 3, the angle between vector and is and , .

According to (22) and the same parameters in [50], we can get the conclusion: . But we think that the similarity between and should be different from the similarity between and , since from Figure 3, we can get that and , but is much smaller than . Intuitively, it is expected that is more similar to than to . Therefore, we consider that the similarity between and should be higher than the similarity between and .

From the above example, we think that the vector norm (vector magnitude) should be an important factor for the similarity between vectors. The bigger the ratio of vector norm (the smaller vector norm is numerator) is, the higher the similarity between two vectors will be. In order to measure precisely the similarity between two vectors, vector norm should be taken into account besides distance and angle. From the above analysis, the modified cosine similarity based on angle, distance, and vector norm is proposed in Definition 14.

Definition 14. Let and be two vectors of . The modified cosine similarity between vectors and is defined below:where is the distance-based similarity measurement [7].

is the Euclidean distance between vectors and , and is a constant whose value is greater than 1. The larger the is, the greater the distance impact on vector similarity will be. is the minimum of and . is the cosine similarity which is defined in (18).

The modified cosine similarity satisfies the following properties:(1) (symmetry);(2) (nonnegativity);(3)when and , ;(4)when and , ;

Proof of Property (1). Based on (18) and (23), it is easy to get .

Proof of Property (2). If and , then , , and . Hence when . Then, based on (23), we can obtain . In addition, if or , . In summary, .

Proof of Property (3). When and , if , namely, , and , we can get the following formulas based on (18) and (23):It is obvious that . When and , we can see that , , and . Hence, if , we can obtain ; namely, , , and ; therefore . In summary, when and , .

Proof of Property (4). If and , then . Hence, based on (23), if , then . Therefore, based (18), we can get .
If , we can obtain based on (18). Furthermore we can obtain based on (23). In summary, when and , .

4.2. New Similarity of BPAs Based on the Modified Cosine Similarity

Under the frame of discernment , there are two evidence sources and . Let and be the BPAs, respectively. and can be expressed, respectively, as , , and , , where , are the singleton subsets [6].

According to the above confidence intervals, BPAs are in the form of two vectors on the singleton subsets. The two vectors are expressed, respectively, below:Then we can calculate the belief function vector similarity and the plausibility function vector similarity based on the modified cosine similarity.

Finally, the new similarity of BPAs is defined below:withwhere is the total uncertainty of BPAs, which is defined asBecause of and , we get that if , then will obtain the minimum value 0; if , then will obtain the maximum value 1.

The belief in the confidence interval represents the minimum trust that explicitly supports , and the plausibility expresses the potential support for . indicates the uncertainty of . Obviously, the larger the uncertainty is, the higher the influence on the similarity of BPA will be, but the influence of belief function similarity on the similarity of BPA should be lower. The new similarity of BPAs satisfies the following properties:(1);(2);(3);(4), if and only if and have no compatible element.Some proofs are detailed as follows.

Proof of Property (3). Based on (23), (25), and (28), we can obtain that So, we get based on (26). Moreover, if , then will get the maximum value 1 when . In summary, .

Proof of Property (4). Assuming that , according to (28), we can obtain , , ; then . Hence, based on (26), we can get If , then ; namely, . means that and have no compatible element.
Assuming that , if , we can easily obtain based on (26), which means that and have no compatible element.
If , based on (26), we can easily see that which means that and have no compatible element.
If and are not compatible, it is not difficult to obtain ; then we could obtain . In summary, if and only if and have no compatible elements, .

The establishment of the above four properties shows that the new similarity is the characteristic metrics of similarity of evidence and is consistent with the distribution of bodies of evidence and . Therefore, can be used to measure the conflict (similarity) between bodies of evidence and . Since the modified cosine similarity takes the impact of the ratio of vector norm into account, overcomes the deficiency of the integrated similarity measurement in [50]. Moreover, the new similarity of BPA based on the modified cosine similarity takes the impact of belief function, plausibility function, and uncertainty into account, so can effectively measure the degree of conflict between bodies of evidence.

5. Example and Analysis

In this section, four examples are presented to illustrate the effectiveness of the proposed similarity of BPAs.

Example 15. Considering the three BPAs which we defined in Example 11. With the proposed similarity of BPAs , the following results are obtained: which indicates that and completely agree with each other, and the degree of conflict between and is lower than the degree of conflict between and . The result is consistent with intuitive analysis.

Example 16. Consider the five pairs of BPAs in Example 12. With the proposed similarity of BPAs , the results are listed in Table 2.
From Table 2, the value of increases from 0.5909 to 0.7597 and the similarity is higher than others. The result is consistent with intuitive analysis.

Example 17. Consider the two BPAs which we defined in Example 13. With the proposed similarity of BPAs, we obtain for . It indicates that and are completely different. That is to say, they are in total conflict which is consistent with the analysis in Example 13.

Example 18. Let be a frame of discernment with 20 elements (or any number of elements that is predefined). For convenience, we use 1, 2, and so forth to denote , and so forth in the frame.
The first BPA, , is defined as follows: where is a subset of .
And the second BPA, , is defined as follows: There are 20 cases where subset increases by one more element at a time, starting from case 1 with and ending with case 20, where as shown in Table 3. The comparison of the aforementioned conflict measurements between and for these 20 cases is detailed in Table 3 and graphically illustrated in Figure 4.

In this example, the conflict should be lower from case 1 to case 5, since the supported hypothesis of and gets more and more similar. In case 5, is identical to hypothesis of ; hence the degree of conflict between and should obtain the minimum value. The conflict should increase from case 5 to case 20, since becomes more and more uncertain.

From Figure 4, we know that conflict measurement , and go up and down consistently when the size of changes. Although and present a similar behavior, the value of is always smaller than the corresponding values. So, we get the conclusion that is not better than to measure conflict among bodies of evidence. from case 2 to case 20 and its values even decrease from case 6 to case 8 which is abnormal. Furthermore, indicates that and have no conflict which is opposite to our analysis. Because both and are low, Liu’s method indicates that there is little contradiction between and from case 2 to case 20. Therefore cannot reflect the conflict changes when the size of changes. Whereas and are used to measure the similarity among BPAs. The bigger the and is, the lower the degree of conflict will be. The curves of , , and are consistent with the above analysis; therefore we think that can also be used to characterize the degree of conflict.

According to the above four examples, we get a conclusion that the new similarity of BPAs can efficiently characterize the degree of conflict between BPAs.

6. A New Combination Method and Application

In this section, we proposed a new combination method based on the new similarity of BPAs. If there are bodies of evidence (BOEs) , , the calculation process is listed as follows.

Step 1. Calculate the similarity between the bodies of evidence; then construct a similarity measure matrix (SMM), which gives us insight into the agreement between evidence, where is the similarity between and :

Step 2. The credibility of body of evidence is defined as follows:We know that the lower the distance between the bodies of evidence is, the higher the similarity is.

Step 3. The credibility degree of the body of evidence is defined as follows:

Step 4. The weighted average evidence is given by the following formula:

Step 5. When the number of original bodies of evidence is , we should combine the for () times. Then the result can be calculated. We can effectively deal with the information fusion problems in high-conflict evidence by this method.
In general, if a body of evidence is supported by others greatly, the evidence should be more important and has more effect on the final combination result [44]. This method has a faster convergence speed and improves the reliability and rationality when the evidence is contradictory. An example of target recognition is presented to show the validity of the proposed method.

Example 19. There is a target that is detected in combat air domain and the target is identified as enemy by military Identification Friend or Foe. Five different sensors such as airborne early warning radar (AEW radar) and electronic warfare support measure (EMS) detect the air targets, and the frame of discernment is . At a certain moment, five bodies of evidence have been collected from these five different sensors. The BPAs , , , , are given as follows:
:::::In this example, it is clear that is in high conflict with others. The results by different combination rules are shown in Table 4.

As can be seen from Table 4, Dempster’s rule produces counterintuitive result, though more bodies of evidence collected later support target . Although Murphy’s method, Deng’s method, and the proposed method can recognize the true target, the performance of convergence of the proposed method is better than others. The main reason is that the proposed method has better performance in conflict management. By making use of the new similarity of BPAs among bodies of evidence, the modified average approach decreases the weight of the “bad” evidence, so the “bad” evidence has less effect on the final results.

In conclusion, the weighted average method based on the proposed similarity measure can solve the problem of combination with high-conflict evidence. It illustrates the efficiency and the validity of the new method to deal with the high-conflict evidence.

7. Conclusion

In the actual battle field, due to the harsh environment and human interference, the high conflict often exists among the sensor reports. In order to solve the problem of conflicting information fusion, the existing methods pay more attention to the evidence combination algorithm while ignoring the study of conflict itself. Are there any other parameters or factors that can reflect the conflict between bodies of evidence? Is there a better conflict model that can comprehensively describe the conflict between bodies of evidence? These issues are the basis for resolving the fusion of conflicting information and are very worthy of research.

In this paper, a new similarity of BPAs is proposed to measure the degree of conflict, which is constituted by the belief function and plausibility function in evidence theory. By expressing the BPAs in the form of vectors on singleton subsets and calculating two similarities based on the modified cosine similarity, a reasonable measurement of similarity (conflict) among bodies of evidence can be achieved. The similarity can meet the basic nature of requiring for similarity. If two bodies of evidence are consistent, the two bodies of evidence support the same hypothesis.

Therefore, the proposed similarity of BPAs should be high; on the contrary, the degree of conflict is low. If the proposed similarity is low, we consider that the two bodies of evidence are in high conflict. The lower the similarity is, the higher the conflict is. Hence, the proposed similarity can be used to measure the degree of conflict. The example of target recognition shows that the similarity of BPAs could be used in sensor data fusion. In the later research, the similarity will be applied as a constraint condition in decision making based on BPAs.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Authors’ Contributions

Wen Jiang and Xiyun Qin contributed equally to this work.

Acknowledgments

The work is partially supported by the National Natural Science Foundation of China (Grant no. 60904099), the Natural Science Basic Research Plan in Shaanxi Province of China (Program no. 2016JM6018), and the Seed Foundation of Innovation and Creation for Graduate Students in Northwestern Polytechnical University (Program no. Z2016122).