Abstract

Dempster-Shafer evidence theory (D-S theory) has been widely used in many information fusion systems since it was proposed by Dempster and extended by Shafer. However, how to determine the basic probability assignment (BPA), which is the main and first step in D-S theory, is still an open issue, especially when the given environment is in an open world, which means the frame of discernment is incomplete. In this paper, a method to determine generalized basic probability assignment in an open world is proposed. Frame of discernment in an open world is established first, and then the triangular fuzzy number models to identify target in the proposed frame of discernment are established. Pessimistic strategy based on the differentiation degree between model and sample is defined to yield the BPAs for known targets. If the sum of all the BPAs of known targets is over one, then they will be normalized and the BPA of unknown target is assigned to ; otherwise the BPA of unknown target is equal to minus the sum of all the known targets BPAs. IRIS classification examples illustrated the effectiveness of the proposed method.

1. Introduction

As one of the most important tools in multisources information fusion, Dempster-Shafer evidence theory (D-S theory) [1, 2] has been widely used since it has abilities to deal with uncertainty and unknown information and requires fewer conditions than probability theory. D-S theory has a significant advantage over the traditional probabilistic approach; that is, it allows for the allocation of probability mass to sets or intervals. And it can hence handle both aleatory uncertainty and epistemic (or subjective) uncertainty.

Although D-S theory is widely used in many application systems such as information fusion systems [36], evidential reasoning [7, 8], risk and reliability analysis [912], and decision making [1315], some basic problems are still not completely clarified. One typical open issue is dependent evidence combination [1618]. The other open issue is how to determine the basic probability assignment (BPA). So far, there is no general method to obtain BPA.

In the years, many researchers have investigated different approaches to solve this problem. Xu et al. [19] put forward a nonparametric method to determine BPA based on the relationship between the test sample and the probability models. In their work, a nonparametric probability density function (PDF) is calculated firstly using the training data. Suh and Yook [20] presented a method to determine BPA through sensor data. Their study showed that a pedestrian could recognize a moving object with its method of determining BPA through multisensor data fusion. Yoon et al. [21] proposed a novel way to obtain BPA using sensor signals. The method enabled context inference even when there was no advanced information of the situation. Recently, with the generalized fuzzy numbers [22], we presented a fuzzy method to generate BPA [23].

Another vital issue is that the result of BPAs combination is always contradictory to common sense, using D-S theory of evidence to fuse highly conflicting evidences. This problem often appears in engineering applications because environmental noises and human disturbances often lead to conflict among the reports of multiple sensors.

Zadeh, the author of fuzzy mathematics, gives an example to analyze the deficiency of the combination rule of evidence theory, which has aroused the interest of many scholars [24]. How to handle conflicting evidence is heavily studied [25, 26]. Yager [27] holds on to the fact that the normalization of the combination rule is the main reason which leads to irrational fusion results. He advocates assigning the conflict coefficient to the unknown items in the frame of discernment (FOD) directly, without normalization step. Yager’s method and the method in [28] only satisfy commutativity, but not associativity. This limits the real application of evidence theory. Smets and Kennes [29] present another ingenious approach. They argue that the conflict is mainly due to the incomplete knowledge base when the reports of sensors are all reliable, so they put forward the concepts of the closed world and the open world. In the transferable belief model (TBM), the so-called closed world means that the FOD consists of all the possible propositions, and the open world refers to the incomplete FOD due for lack of knowledge and various uncertainty. They point out that the methods of Dempster et al. combine BPAs on the premise of assumptions that the information fusion environment is in a closed world.

Based on the conception of closed world and open world, Deng [30] proposed the basic frame of generalized evidence theory (GET). In GET, is regarded as an element with the same properties as the other elements. It represents unknown, but not a common empty. For example, in military applications, suppose there are three targets (, , and ) on the FOD. Then, the sensors can only recognize the different unions of these three targets. However, if there exists an unknown target (), the sensors cannot distinguish whether it is one of the previous three targets. In GET, can be interpreted as the unknown targets and . GET is a proper tool to build the real world in a rational way. In the open world, the generalized combination rule (GCR) can solve the problem that the result of BPAs combination is counterintuitive because of the incompleteness of FOD. In the closed world, GET degenerates to D-S theory.

As mentioned above, there are already various methods in the closed world. However, there’s no method to determine BPA in the open world. Determining BPA in the open world is a question worth to be explored. This paper proposed a new method to determine BPA in the open world. This method can be applied to classification problems and identification problems, especially when the FOD is not complete.

The remainder of this paper is organized as follows. Section 2 starts with a brief presentation of D-S theory and some necessary related concepts. The proposed method to construct BPA function is presented in Section 3. Section 4 investigates the effectiveness of the proposed method through a classical classification problem. Conclusions are presented in Section 5.

2. Preliminaries

2.1. Dempster-Shafer Evidence Theory

D-S theory, introduced by Dempster [1] and extended later by Shafer [2], is concerned with the question of belief in a proposition and systems of propositions. It mainly focuses on the epistemic uncertainty, but it is also valid for aleatoric uncertainty. D-S theory has many advantages, compared to probability theory. For instances, it can handle more uncertainty in real world. In probability theory, the belief can be only assigned to singleton subsets, while in D-S theory the belief can be assigned to any subsets of FOD. And in D-S theory, prior distribution is not necessary before information fusion. Due to its ability to handle uncertainty or imprecision embedded in the evidence, D-S theory has been increasingly applied in many fields [3135]. Formally, D-S theory concerns the following preliminary notations.

2.1.1. Frame of Discernment and Mass Function

Evidence theory first supposes the definition of a set of hypotheses called the frame of discernment, defined as . The set is composed of exhaustive and exclusive hypotheses. Denote , the power set composed of propositions of , aswhere denotes the empty set. The subsets containing only one element each are called singletons.

When the FOD is determined, the mass function is defined as a mapping of the power set to a number between 0 and 1; that is,which satisfies the following conditions:The mass function is also called the basic probability assignment (BPA) function. expresses the proportion of all relevant and available evidence that supports the claim that a particular element of belongs to the set but to no particular subset of . Any subset of such that is called a focal element.

2.1.2. Belief and Upper Probability Functions [2]

A function is called a belief function over if it is given by (4) for some basic probability assignment . ConsiderWhenever is belief function over a frame , the function defined by (5) is called the upper probability function of :

2.1.3. Dempster’s Combination Rule [2]

Suppose and are belief functions over the same frame , with BPA and and focal elements and , respectively. Then the function defined by andwithfor all nonempty is a BPA. The core of the belief function given by is equal to the intersection of the cores of and . It is also called the of and and is denoted by .

2.2. Jousselme Distance

Jousselme et al. [36] proposed a new distance to measure the difference between two bodies of evidence, which is also called the evidence distance.

Let and be two BPAs on the same FOD , containing mutually exclusive and exhaustive hypotheses. The distance between and iswhere and are the BPAs according to (3) in Subsection 2.1 and is a matrix whose elements are where are derived from and , respectively.

2.3. Fuzzy Sets Theory

Fuzzy sets theory was first proposed by Zadeh [37] in 1965. The theory is widely used in many uncertain environments such as decision making [3841] and optimization [4244]. Some relative notions on fuzzy sets are given as follows.

Definition 1 (fuzzy set). Let be a universe of discourse, where is a fuzzy subset of ; and for all , there is a number which is assigned to represent the membership degree of in and is called the membership function of [45].

Definition 2 (fuzzy number). A fuzzy number is a normal and convex fuzzy subset of [45].
Here, “normality” implies thatand “convex” means that

Definition 3 (generalized fuzzy numbers). A generalized fuzzy number is described as any fuzzy subset of the real line with membership function that possesses the following features: (1) is continuous, .(2), for all .(3) is strictly increasing in .(4), for , where is a constant and .(5) is strictly decreasing in .(6), for all .Particularly, a trapezoidal fuzzy number and triangular fuzzy number can be shown in Figures 1(a) and 1(b), respectively.

2.4. Generalized Evidence Theory [30]

Generalized evidence theory (GET), based on the classical Dempster-Shafer (D-S) theory, was proposed by Deng [30]. GET abolished the restriction on ; that is, is not necessarily zero. In GET, the empty set can be singleton subsets or multiple hypothesis sets. Moreover, GET can degenerate to the classical D-S theory if the value of is zero. In other words, GET is the extension of the classical D-S theory and can express and deal with more uncertain information in the open world, comparing with D-S theory.

2.4.1. Generalized Basic Probability Assignment [30]

Suppose that is a frame of discernment in an open world [30]. Its power set, , is composed of propositions, . If the function meets the condition then is the GBPA of the frame of discernment . The difference between GBPA and traditional BPA is the restriction of . Note that is not necessary in GBPA. If , the GBPA reduces to a traditional BPA.

2.4.2. Generalized Combination Rule (GCR) [30]

In GET, means that the intersection between two empty sets is still an empty set. Given two GPBAs ( and ), the GCR is defined as follows [30]:with if and only if .

3. Proposed Method to Determine BPA Function

3.1. Sample Difference Degree

In order to reflect the difference between the sample and the target model, we proposed a sample difference degree function, which performs well in the measurement of difference between the sample data and the model data. And now, it will be amended to define pessimistic function in a more reasonable way.

In our method, the triangular fuzzy numbers, which denote the target model and the sample data, should firstly be normalized into the interval as and , respectively. Four normalized triangular fuzzy numbers are defined as follows:(1)Triangular fuzzy number of target model .(2)Triangular fuzzy number of sample data .(3)Left standard triangular fuzzy number .(4)Right standard triangular fuzzy number ., . Then the Left and right average area, , , , and , are defined as follows and are shown in Figure 2. As can be seen from Figure 2, is the left membership degree curve of , and denotes inverse function of ; is the right membership degree curve of , and denotes inverse function of . Then the left adjacent area is the area enclosed by and left standard triangular fuzzy number ; the left far area is the area enclosed by and left standard triangular fuzzy number ; the right adjacent area is the area enclosed by and right standard triangular fuzzy number ; the right far area is the area enclosed by and right standard triangular fuzzy number . Obviously, the four kinds of area can be obtained by the following equations:Based on the four kinds of area, , , , and , the left average area and the right average area are defined, respectively, as follows:Figure 2 indicates that the larger , the closer the fuzzy number to ; the larger , the closer the fuzzy number to . That is to say, and can accurately represent the position and the state information of a triangular fuzzy number in the interval . Based on the fact that the shape and position of a fuzzy number can, to a large extent, be expressed as the credibility of the proposition, a conclusion can be made that the difference between the average area () of and the average area () of reflects the difference between the sample and the model . So, it is reasonable to define the difference degree to measure the difference between the sample and the model as follows:

3.2. Frame of Discernment in Open World

According to the basic framework of the generalized evidence theory [30], an open world is absolute and a closed world is relative. Assume the system to be concerned is not complete and the system FOD is constructed as , where denotes the unknown objects. could be one unknown object or the conjunction of several unknown objects. During the procedure of applying GET, new target representation model is generated by machine learning method along with the accumulation of sensor reports to revise the existing target model library until the system is judged to be complete.

3.3. Pessimistic Function

As can be seen from (18), the reflects the possibility that the sample data may be distributed into the interval built by the training samples. The larger the , the greater the deviation between the test sample and the training sample model, and vice versa. From this point of view, we use the as argument to define a pessimistic function and yield initial BPA, following a pessimistic strategy: when is greater than a threshold value, the incremental rate of BPA is less than the decreasing rate of ; on the contrary, when is less than the threshold value, the incremental rate of BPA is greater than the decreasing rate of . In this way, pessimistic function can effectively reflect the difference between test sample and training sample models to generate the initial BPA.

Definition 4 (pessimistic function). Considerwhere denotes the height of sample triangular fuzzy number, denotes the difference coefficient, which is used to revise the membership degree of unknown samples, and denotes the sample difference defined as (18).

3.4. Procedures to Determine BPA

A flow chart of the proposed method is shown in Figure 3 and details are as follows: Consider species , where denotes the unknown elements. Each species has attributes , so the test sample to be recognized also has attributes . We randomly choose instances for each species and build the model: where is a matrix and the th row denotes attribution value of each sample of species .

3.4.1. Step  1: Establish the Triangular Fuzzy Number Model Matrix

A triangular fuzzy number model for attribution of species is established according to the sample values, where Then the triangular fuzzy number models for each attribution of species can be represented as . Furthermore, all the triangular fuzzy number models for each attribution of species () can be acquired and denoted as a matrix , where each column of represents the triangular fuzzy numbers belonging to the different species but the same attribution. As can be seen from Figure 4, there is often some intersection between two triangular fuzzy numbers. In most cases, the intersection is a generalized fuzzy number. But in some cases, it is not a generalized fuzzy number, as shown in Figure 4. For this particular case, it can be processed by the method proposed by Xiao et al. [46] to construct a generalized triangular fuzzy number. Besides, if these is no intersection between two fuzzy numbers, a special fuzzy number could be used to represent this case. To do the same operation on each column of , we can get a fuzzy number matrix . The th attribution value of the test sample to be recognized should be converted to a special triangular fuzzy number . Doing this processing times, we can get a triangular fuzzy number matrix . Then and can be merged into a matrix . Now we need to normalize every triangular fuzzy number of the matrix to . Each element of the matrix is divided by times of the maximum element value, where represents normalization coefficient, which is adjustable according to different engineering application.

3.4.2. Step  2: Establish the Differentiation Matrix

In this step, we need to calculate the left and right average area () for each triangular fuzzy number in matrix according to (16) and (17). Based on the area, the sample differentiation matrix between each of the first rows in and the test sample could be calculated.

3.4.3. Step  3: Calculate the Similarity Matrix

For each element in matrix , a pessimistic function can be defined according to (19):now we could obtain a similarity matrix , which denotes the similarity degree between the test sample and the target species models. Assume FOD is incomplete; we first calculate the value of the th () column of matrix . Then if the result is larger than 1, the th column should be normalized and set which denotes the similarity degree between the test sample and ; else if the result is less than 1, is regarded as the similarity between the test sample and , where represents the sum of the first rows in similarity matrix . Finally, the modified similarity matrix could be regarded as the initial BPA matrix .

3.4.4. Step  4: Calculate Evidence Disctance

Taking each column in matrix as a piece of evidence or as a vector, we could calculate the distance between any two bodies of evidence based on (8) to obtain a distance matrix , where represents the distance between the th and th column of the initial BPA matrix . Obviously, equals ; that is, distance matrix is a symmetrical matrix, which can simplify the calculation to a great extent.

3.4.5. Step  5: Determine the Extent of Conflict of Evidence

First, we can sum each row in matrix and obtain a vector ; it can be normalized as a vector , where denotes the maximum element of . As debated in Section 1, classical D-S theory has a vital problem that combining two highly conflicting evidences could get even a wrong result. So a conflict threshold should be set first according to concrete engineering application. If the maximum element value of vector is less than conflict threshold , the conflict degree is acceptable and the initial BPAs could be combined with GCR (see (12) and (13)) directly; otherwise the initial BPAs should be adjusted. The specific process is in Section 3.4.6, with the discount coefficient method.

3.4.6. Step  6: Adjust the Conflicting Evidence by Using the Discount Coefficient Method

This step will be operated if evidence conflict is over threshold . In this step, conflict resolution between initial BPAs should firstly be done. Then GCR could be applied to combine the preprocessed BPAs. Computational procedure is summarized as follows:(1)Construct a comparative matrix with the average of each row () in distance matrix :where is the th row and th column element in matrix , is the th row of , and is the th row of .(2)Calculate eigenvector () corresponding to the maximum eigenvalue and normalize as discount coefficient .(3)Discount the initial BPAs with coefficient to obtain the final BPAs.(4)Combine the final BPAs with GCR and then the final recognized result will be acquired.

4. Experimental Analysis

In this paper, we take Iris dataset [47] to demonstrate the effectiveness of the proposed method. The Iris dataset, which is perhaps the best known database to be found in pattern recognition literature, involves classification of three species of the Iris flowers, Iris setosa (S), Iris versicolour (E), and Iris virginica (V), on the basis of four numeric attributes of the Iris flower: sepal length (SL) in cm, sepal width (SW) in cm, petal length (PL) in cm, and petal width (PW) in cm [47]. In the Iris dataset, there are 50 instances for each of three species. The data are obtained from the UCI repository (UCI Machine Learning Repository: http://archive.ics.uci.edu/ml/datasets/Iris) of machine learning databases. Among 50 instances of each species, 30 instances are randomly selected as the training set, and the remaining 20 instances serve as the test set. Each of four attributes is regarded as an information source, and correspondingly there are three training sets and three test sets.

4.1. To Recognize Known Species of Iris Dataset

Before the experiment, the fully optimized parameter settings should be obtained first by training the dataset. The conflict threshold can be set according to experts’ experience usually. In this experiment, . It means that the conflict is in the acceptable scale only if the maximum conflict between bodies of evidence is less than . Otherwise the initial BPAs should be adjusted. The difference coefficient can be obtained by an iterative program. The difference coefficient would be adjusted in each iterative step. When the average recognition rate of known species is up to an acceptable scale and the deviation of the average recognition rate between current step and last step was tiny enough, the parameter would be obtained. After the training process, we have obtained the optimized parameter settings as . Because these optimized parameters fit the dataset well, parameters are all set as the optimized settings in the following.

Following the steps in Section 3.4, triangular fuzzy numbers of the training samples are built, shown in Table 1 and in Figure 5.

According to Step  5 in Section 3.4.5, we can obtain BPAs for each attribute of a Setosa instance shown in Table 2.

After conflict resolution (if needed), these BPAs in Table 2 could be combined with GCR. The results are as follows: As can be seen, the combination result illustrates that the test instance belongs to species setosa, which is consistent with the actual situation.

4.2. To Recognize Unknown Species of Iris Dataset

In order to check the proposed method’s ability to recognize the unknown species of data in an open world, triangular fuzzy number models are constructed only by two species of Iris dataset selected randomly from the three species this time. Thus, the remaining species of Iris dataset could be regarded as test sets, which are unknown to the recognition system. For example, we randomly selected 30 instances from Setosa (S) and Versicolour (E), respectively, as training samples to construct triangular fuzzy number model and the remaining 20 instances in each species as test samples. And parameters should also be set as the optimized settings obtained in the training process. The training samples’ triangular fuzzy number model is shown in Table 3.

For a known Setosa instance , we can obtain BPAs for each attribute in Table 4, according to Step  5 in Section 3.4.5.

After conflict resolution (if needed), these BPAs in Table 4 should be combined with GCR. The results are as follows: As can be seen, the combination result illustrates that the test instance belongs to species setosa.

Similarly, to recognize an instance from the open world, we can obtain BPAs for each attribute of a Virginica (V) instance (unknown species) in Table 5. As can be seen from Table 5, the difference function (see (18)) and pessimistic function (see (19)) indeed recognized this instance as a member of species V.

After conflict resolution (if needed), these BPAs in Table 5 should be combined again with GCR. The results are as follows: And the recognition result suggests that the test instance is an unknown species of Iris datasets.

In order to further illustrate the validity and accuracy of the proposed method, the same experiments have been done 100 times. And to be satisfied, the average recognition rate about known-species Iris dataset (in closed world) is up to and the average recognition rate about unknown-species Iris dataset (in open world) is up to . Several unrecognized data in one experiment are shown in Table 6. As can be seen from Table 6, two instances of species E were recognized as species V, and two datasets of species V were recognized as species E and (that is, species S), respectively. And two instances of species (species S) were recognized as species E.

It is not difficult to explain the wrong results. From Table 1 and Figure 5 we can see that only attributes PL and PW of species setosa are totally separated from species E and species V, and the two remaining attributes are all intersected with others. At the same time, species E and species V intersect with each other for each of attributes. Especially since the length of interval overlapping in the SL and SW attribute is large, all the attributes of some data intersect with each other and it is difficult to distinguish these attributes using pessimistic function. Even so, the simulation examples show that the average recognition rate for the instances of the known species is up to and the average recognition rate of the unknown-species instances is up to . Moreover, during the process of recognition, the number of training samples is small (only 30 instances in each species), and test samples are totally separated from training samples. It scientifically proves that the proposed method to determine BPA has great effectiveness and could work well with GET.

5. Conclusion

In the application of data fusion, the generalized evidence theory (GET) has more advantages than the classical Dempster-Shafer evidence theory due to its ability to deal with evidence conflict when the frame of discernment is incomplete. How to determine the generalized basic probability assignment (GBPA) in an open world is still an open issue. A method to construct GBPA is proposed in this paper. This method uses training samples to build triangular fuzzy number models for each attribute of the multiattribute dataset. Then, the differentiation function and the similarity function are defined. The initial GBPAs are generated by the similarity function, and bodies of evidence are fused with Dempster’s rule or the generalized combination rule (GCR) according to the actual target environment. This method makes full use of the advantages of GET to deal with these targets in the open world. In order to reduce the impact of conflicting evidence on the fusion results, the distance between each body of evidence is calculated and conflict resolution is to be done in the initial stage of determining GBPAs to eliminate human interference and environment noise. Several numerical examples show that the method is concise and effective, and this method has a very significant data processing capacity of small samples based on a good theoretical foundation. The proposed method to obtain GBPA can effectively overcome the problem of subjectivity, which has strong generality. The classification of Iris data is used to illustrate the efficiency and the low computational complexity of the proposed method. This method will help to promote GET and use GCR effectively.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The work is partially supported by National Natural Science Foundation of China (Grant nos. 60904099 and 61501377) and Natural Science Basic Research Plan in Shaanxi Province of China (Program no. 2016JM6018).