Abstract

As an extension of Pawlak’s rough sets, rough fuzzy sets are proposed to deal with fuzzy target concept. As we know, the uncertainty of Pawlak’s rough sets is rooted in the objects contained in the boundary region, while the uncertainty of rough fuzzy sets comes from three regions (positive region, boundary region, and negative region). In addition, in the view of traditional uncertainty measures, the two rough approximation spaces with the same uncertainty are not necessarily equivalent, and they cannot be distinguished. In this paper, firstly, a fuzziness-based uncertainty measure is proposed. Meanwhile, the essence of the uncertainty for rough fuzzy sets and its three regions in a hierarchical granular structure is revealed. Then, from the perspective of fuzzy distance, we introduce a modified uncertainty measure based on the fuzziness-based uncertainty measure and present that our method not only is strictly monotonic with finer approximation spaces, but also can distinguish the two rough approximation spaces with the same uncertainty. Finally, a case study is introduced to demonstrate that the modified uncertainty measure is more suitable for evaluating the significance of attributes. These works are useful for further study on rough sets theory and promote the development of uncertain artificial intelligence.

1. Introduction

The rough sets introduced by Pawlak [1, 2] have been demonstrated as an effective tool to handle uncertain information by using the given information granulations [35]. An uncertain concept can be described by a pair of lower and upper approximation sets. And three disjoint regions are constructed in a rough approximation space. Generally speaking, the target concepts in Pawlak's rough sets are usually accurate and crisp. For a decision-making problem, there are only two states which are opposite and disjoint each other for a crisp concept. For example, in the decision-making problems of diagnosis analysis, there are only two states of Yes or No for a patient. That is, a patient is either diseased or not diseased. However, in many real decision-making applications, the states of the target concept may be uncertain and fuzzy in practice. To address this problem, the rough fuzzy sets (RFS) [68] are proposed to deal with the target concept which is usually fuzzy or uncertain. In rough fuzzy sets, the lower and upper approximation fuzzy sets are considered as two boundary fuzzy sets of the target concept.

Probabilistic rough fuzzy set model [9], based on the average membership degree, classifies an approximation space into positive region, boundary region, and negative region by setting a pair of threshold . Similar to the probabilistic rough sets, the rough fuzzy sets also have a good fault-tolerance. It is well known that the uncertainty of classical rough sets is only rooted in the boundary region, while the uncertainty of rough fuzzy sets not only comes from the boundary region, but also comes from the negative or positive regions, because the membership degrees of these objects in the negative or positive regions are not completely equal to 1 or 0. From the viewpoint of hierarchical quotient space [10, 11], with the increase of information, not only the objects in the boundary region, but also the objects in the positive region and negative region will be reclassified. Therefore, for rough fuzzy sets, the uncertainty comes from three regions and is not monotonic with finer knowledge spaces.

As a new methodology for simulating human cognitive mechanism, granular computing (GrC) is regarded as an umbrella covering the theories, methodologies, techniques, and tools in artificial intelligence [1217]. From the viewpoint of GrC, the certainty and uncertainty of knowledge can be transformed for each other at a certain granularity level [18]. As the main GrC models, fuzzy sets [19], rough sets [1], quotient space [10], and cloud model [20] realize the representation and transformation of uncertain knowledge from the different views. The uncertainty measure plays an important role in acquiring rules from information system and it is helpful in evaluating attribute weight in attribute reduction. Wierman [21] first introduced an axiomatic definition of knowledge granularity to measure the uncertainty in information system. Yao [22] presented a so-called expected granularity (EG) class that provides a family of uncertain measure. Besides, there are various uncertain measures, such as fuzziness [23, 24], rough entropy [25], inclusion degree [26], roughness [27], and knowledge distance [28, 29].

The uncertain measure of rough fuzzy sets model has drawn many researchers’ attention, because it can reflect the quality of acquiring rules from information system. Guo and Mi [30] proposed a fuzziness measure of rough fuzzy sets based on the conditional entropy. By using new lower and upper approximation operators, Beg and Rashid [31] proposed a modified soft rough fuzzy set model, which provide better approximations of undefinable sets. Based on rough entropy and information entropy, Wang [32] introduced a method address the uncertainty measure of rough fuzzy sets. Combining the rough degree with the rough entropy, Qin [33] proposed a new rough entropy that is strictly monotonic with finer approximation space. Sun [34, 35] introduced an uncertainty measure for generalized rough fuzzy sets based on the Shannon entropy, which is effective and suitable for evaluating the roughness and accuracy of a generalized rough fuzzy sets. From the perspective of distance, Hu [36] studied the roughness measure of rough fuzzy sets and applied it to incomplete information systems with fuzzy decision. However, there are still several shortcomings in the current research on uncertainty measure of rough fuzzy sets as follows: (i) lack of theoretical analysis on the change rules of uncertainty of RFS in a hierarchical granular structure. For example, in many practical applications, such as medical diagnosis [37], decision-making [38], and feature selection [39], the analysis on the uncertainty of three regions with changing knowledge spaces is helpful in improving the decision quality. (ii) In the view of traditional uncertainty measures, two rough approximation spaces with the same uncertainty are not necessarily equivalent and the difference between them is difficult to reflect. Actually, in some cases, such as granularity selection, attribute reduction, and multigranularity construction, we usually need to discriminate the two rough approximation spaces. To solve the above problems, in this paper, from the perspective of fuzzy distance, we propose a modified uncertainty measure based on fuzziness.

The rest of this paper is organized as follows. Section 2 presents many preliminary concepts such as rough fuzzy sets, step fuzzy set, average membership degree, average fuzzy set, and probabilistic rough fuzzy sets, etc. In Section 3, a basic uncertainty measure based on fuzziness is proposed to investigate the changing rules of uncertainty of RFS in a hierarchical granular structure. In Section 4, in the view of fuzzy distance, a modified uncertainty measure is presented. In Section 5, an illustrative example is conducted to verify the effectiveness of the proposed method. In Section 6, conclusions are drawn.

2. Preliminaries

In order to facilitate the description of this paper, many basic concepts are reviewed briefly in this section. In this paper, we denote an information system by , where is a nonempty finite domain, is the set of condition attributes, is the decision attribute, is the set of all attribute values, and is an information function.

Definition 1 (rough sets [1, 2]). Given an information system , , and , the lower and upper approximation sets of are defined as follows:where denotes the equivalence class induced by , namely, .

For simplicity, let in case of confusion. If , is a definable set; otherwise is a rough set. Universe is divided by positive region, boundary region, and negative region, then the three regions can be defined, respectively, as follows:

In this paper, a partition space is also called a knowledge space or granularity space. If ,   and are two knowledge spaces. If , then is finer than , denoted by . If , then is strictly finer than , denoted by .

Definition 2 (rough fuzzy sets [68]). Given an information system , and is a fuzzy set on , then the lower and upper approximation sets of can be defined as a pair of fuzzy sets, and . And their membership functions are defined as follows:where denotes the equivalence class induced by the equivalence relation . If , then is a definable fuzzy set; otherwise is a rough fuzzy sets.

Definition 3 (step fuzzy set [40]). Given an information system , denotes an attribute set. and is a fuzzy set on . is a partition space, where and , is a fuzzy set on . If , then is a step fuzzy set on .

Definition 4 (average membership degree [40]). Given an information system , and is a fuzzy set on , is a partition space. , where ,  , then we call that is the average membership degree.

In Definition 4, is the average membership degree that an object ,   belongs to the fuzzy concept . is the average membership degree that an object , , belongs to the fuzzy concept . From the perspective of probability statistics, can be understood as the probability that an equivalent class ,  , belongs to the fuzzy concept . Similarly, can be understood as the probability that an equivalent class , , belongs to the fuzzy concept .

Definition 5 (average fuzzy set [41]). Given an information system , and is a fuzzy set on , is a partition space, where and , is a fuzzy set on . If ,  , where is a fuzzy set on , then we call is the average fuzzy set of .

Example 6. Given an information system , , is a fuzzy set on ; , then

Definition 7 (probabilistic rough fuzzy sets [9]). Given an information system with a pair of threshold , and is a fuzzy set on , the lower and upper approximation sets of are defined as follows:Universe is divided by positive region, boundary region, and negative region, which can be defined as follows:

Definition 8 (granularity measure [21, 22, 42]). Suppose is finite and nonempty universe. A function is a granularity measure if it satisfies the following conditions for any :(1).(2).(3). where denotes there is a bijection from to .

Definition 9 (knowledge granulation [25]). Given an information system , , and is a fuzzy set on , . The knowledge granulation of is defined as follows:Obviously, for any .

Definition 10 (fuzziness [43]). Suppose be a finite universe, and be two fuzzy sets on , if satisfies the following conditions:(1)If and only if , .(2)If and only if and , .(3), if or , then .(4), . where is called the fuzziness of a fuzzy subset.

Definition 11 (distance between two fuzzy sets [44]). A real function satisfies the following conditions:(1),  .(2),  .(3), .(4), if , then and . Then, is a distance measurement.

3. A Basic Fuzziness-Based Uncertainty Measure

In the RFS model, the uncertainty usually comes from three regions at each granularity, because the objects in a positive or negative region are uncertain. Namely, the membership degrees of these objects may be not necessarily equal to 0 or 1. With the adding attributes, the objects in the negative or positive regions may be reclassified and the three disjoint regions will be changed. As a result, the uncertainty at each granularity in RFS model will be changed accordingly. In this section, we will pay attention to analyze the change rules of the uncertainty of RFS in the changing knowledge spaces.

Research on uncertainty of RFS is very useful in approximate knowledge acquisition. Based on Definition 10, there are various fuzziness formulas proposed by many researchers. In this paper, we choose the fuzziness formula presented in [30] as follows:

According to the analysis in Section 1, the uncertainty of a target concept in RFS comes from three regions: positive region, negative region. and boundary region; that is to say,

Obviously, given an information system , and is a fuzzy set on , the following properties hold for :(1).(2) does not decrease with respect to and does not increase with respect to .

Therefore, the advantage of this method is independent of thresholds .

Lemma 12. Given an information system , . is a fuzzy set on , and . If a granule which is subdivided into many finer subgranules by , and , then .

Proof. Obviously, , if , because , then .

From Lemma 12, if the granule on the coarser granularity is subdivided into many finer subgranules with the same membership degree on the finer granularity, then the membership degree of the equivalence class is equal to the one of its subequivalence classes.

Theorem 13. Given an information system , . is a fuzzy set on , and . If only granule is subdivided into many finer subgranules by , and , then .

Proof. From Lemma 12, Theorem 13 is easy to prove.

From Theorem 13, we can conclude that the uncertainty will remain unchanged if the granules on the coarser granularity which are subdivided into many finer subgranules by equal proportion on the finer granularity.

Theorem 14. Given information system , and . is a fuzzy set on . If ,   holds.

Proof. Let be a nonempty finite domain, and . Because , . According to the conditions, for simplicity, suppose only one granule can be subdivided into two finer subgranules (the more complicated cases can be transformed into this case, so we will not repeat them here). Without loss of generality, let ,  ,  ,    ; namely, . We will prove this theorem in the following:Because , .
We haveThus, , where when .

From Theorem 14, we know that the total uncertainty will monotonically decrease with finer granularity in RFS model.

Theorem 15. Given an information system , for any ,   and . is a fuzzy set on . If and only the granules containing in are subdivided into many finer equivalence classes by the attribute increment , then hold.

Proof. Let be a nonempty finite domain, , because ,  . According to the conditions, for simplicity, suppose only one granule can be subdivided into two finer subgranules by (the more complicated cases can be transformed into this case, so we will not repeat them here). Without loss of generality, let ,  ,  ,    ; namely, . We will prove this theorem in three cases in the following.
Case 1. If and , namely, and , . Because , ,   , .
Case 2. If and , namely, and , . Because ,  ,    , .
Case 3. If and , namely, and , then ,  .

Theorem 16. Given an information system , for any ,   and . is a fuzzy set on . If and only the granules containing in are subdivided into many finer equivalence classes by the attribute increment , then hold.

Proof. Similar to Theorem 15, Theorem 16 is easy to prove.

According to Theorems 15 and 16, the uncertainty of boundary region will increase only when the equivalence classes contained in negative region or positive region are subdivided with finer granularity in RFS model. This is not in line with human cognitive habits.

Theorem 17. Given an information system , for any ,   and . is a fuzzy set on . If and only the granules containing in are subdivided into many finer equivalence classes by the attribute increment , then hold.

Proof. Let be a nonempty finite domain, and . Because , . According to the condition, for simplicity, suppose only one granule can be subdivided into two finer subgranules by (the more complicated cases can be transformed into this case, so we will not repeat them here). Without loss of generality, let ,  ,  ,    , namely, . We will prove this theorem in four cases in the following.
Case 1. If and , namely, and , . Because ,  ,    , .
Case 2. If and , namely, and , . Because ,  ,    , .
Case 3. If and , namely, and , . Because ,  ,    , .
Case 4. and , namely, and , . Because ,  ,    , .

According to Theorem 17, the uncertainty of boundary region will decrease when only the equivalence classes contained in boundary region are subdivided with finer granularity in RFS model. This is in line with human cognitive habits.

Theorem 18. Given an information system , , is a fuzzy set on . If , then the following properties hold:(1).(2).(3).

Proof. Obviously, some of the equivalence classes in positive region will be assigned to the boundary region as parameter increases, so will decrease. Similarly, some of the equivalence classes in negative region will be assigned to the boundary region as parameter decreases, so will decrease. Some of the equivalence classes in boundary region will be assigned to the negative region or positive region as parameter increases and parameter decreases, so will decrease.

Theorem 18 indicates that the uncertainty of positive region monotonically decreases as parameter increases, the uncertainty of negative region monotonically decreases as parameter decreases, and uncertainty of boundary region monotonically increases with increasing and decreasing .

Note. As to the monotonicity of , it can be used for measuring uncertainty in RFS to a certain degree. However, there still exist some deficiencies about ; for example, it cannot provide enough information, which will be discussed in the following section.

4. Modified Uncertainty Measure of Rough Fuzzy Sets

According to the above discussion, is monotonic with finer approximation spaces in a hierarchical granular structure, which can reflect the change rules of uncertainty in RFS to a certain extent. However, there exist some deficiencies about . On the one hand, does not provide enough information that is caused by the uncertainty related to the granularity of partitions. This limitation will be revealed by Example 19 in this section. One the other hand, cannot supply enough information to characterize the two approximation spaces with the same fuzziness. In particualr, the latter is very important for evaluating the attribute significance in attribute reduct.

Example 19. Given an information system , where ,   and is a fuzzy set on . Suppose that ,  , and .

According to formula (8), we haveTherefore, cannot reflect the uncertainty caused by the granularity of partition.

To overcome this limitation, we modify formula (8) based on the idea of combination entropy and combination granulation [42, 45], and the definition of combined fuzziness-based uncertainty measure is as follows.

Definition 20. Given an information system , and is a fuzzy set on . The combined fuzziness-based uncertainty measure is defined as follows:

Theorem 21 (see [25]). Given an information system , and is a fuzzy set on . If ,   holds.

Theorem 22. Given an information system , is a fuzzy set on . If ,   holds.

Proof. If , we have and by Theorem 21 and Theorem 14, respectively. Then, we obtain .

According to Theorem 22, the combined fuzziness-based uncertainty in Definition 20 is more accurate than the basic fuzziness-based uncertainty, since it is equipped with strict monotonicity.

Example 19 (continued). We haveObviously, .

Thus, .

In some special cases, the two rough approximation spaces with the same uncertainty are not necessarily equal. Consequently, it is hard to distinguish them, which is important to evaluate the attribute significance in attribute reduction. Therefore, it is necessary to further modify formula (13). Firstly, based on Definition 11, we have the following definition.

Definition 23 (fuzzy distance). Given an information system ,  , and is a fuzzy set on . and are two rough approximation spaces on . Then, the fuzzy distance between and is defined as follows:whereand

Theorem 24. Given an information system ,  , , and , and is a fuzzy set on . If ,   holds.

Proof. Let be a nonempty finite universe; , , and . Because , . According to the condition, for simplicity, suppose only one granule can be subdivided into two finer subgranules and by and only one granule can be subdivided into two finer subgranules and by (the more complicated cases can be transformed into this case, so we will not repeat them here). Without loss of generality, let ,  ,    ,  ,  ,    , namely, ,  . Then, we haveBecause , .
From Definition 2, and .
Because , .
Thus, .
Then, we haveSimilarly, one can drawTherefore, we have

From Theorem 24, we can easily find that formula (15) in Definition 23 is a distance measurement between two fuzzy sets, because it satisfies all conditions of Definition 11. Thus, formula (15) is able to characterize the difference between two rough approximation spaces. Then, we have the definition of the improved uncertainty measure as follows.

Definition 25. Given an information system ,   and , and is a fuzzy set on . The improved fuzziness-based uncertainty measure is defined as follows:

Theorem 26. Given an information system , and is a fuzzy set on . If ,   holds.

Proof. If , we have by Theorem 22. According to Theorem 24, we have . Then, we obtain .

From Theorem 26, we learn that the modified fuzziness-based uncertainty in Definition 25 is also strictly monotonic with finer granularity.

Theorem 27. Given an information system ,  , and is a fuzzy set on . Then, .

Example 19(continued). We haveObviously, .

Thus, .

Theorem 28. Given an information system , for any ,  ,   is a fuzzy set on . If , then the following properties hold:(1).(2).(3).

Proof. Because and are independent of and , according to Theorem 18, Theorem 28 obviously holds.

In the next section, we will present the advantages of improved fuzziness-based uncertainty on evaluating the attribute significance for a fuzzy-target information system.

5. An Application to Fuzzy-Target Information System

The attribute significance is an important factor in attribute reduction. The significance of an attribute is obtained by the change value of uncertainty when the attribute is removed from the whole attribute set. The larger the change is, the more the significance of the attribute is.

Definition 29. Given an information system , , and is a fuzzy set on , the attribute significance of can be defined as

Definition 30. Given an information system , , and is a fuzzy set on , the attribute significance of can be defined as

Definition 31. Given an information system , , and is a fuzzy set on , the attribute significance of can be defined as

In this section, we will present an example to illustrate the application of the modified fuzziness-based uncertainty to a fuzzy-target information table shown in Table 1 which is artificial to address issues.

Example 32. From Table 1, we have and . Suppose that ,  ,  ,  , thenAccording to formula (8), we haveTherefore, we haveAccording to formula (13), we obtainTherefore,According to formula (22), we computeTherefore, we have

For convenience, the above results are listed in Table 2. From Table 2, we can obtain the following conclusions: (i) the three rough approximation spaces cannot be discriminated by ; (ii) the is different from and ; (iii) although , can distinguish and . Furthermore, the attribute significance of based on the above uncertainty measures is listed in Table 2.

The above conclusions can be further described intuitively by Figure 1. The dashed ellipse denotes the basic fuzziness-based uncertainty measure . The solid ellipse denotes the combined fuzziness-based uncertainty measure . The red, green, blue, and black dots denote the rough approximation spaces , , , and , respectively. Herein, , , and appear on the dashed ellipse, because ; however, only and appear in the solid ellipse, because . Furthermore, and will be distinguished as , leading to .

Hence, according to the above results, if we take to compute the attribute significance, the significance of and is equal, because . If we take to compute the attribute significance, the significance of and can be distinguished. Therefore, the deficiency mentioned in Section 4 of the traditional uncertainty measure of rough fuzzy sets is overcome effectively.

6. Conclusions

Rough fuzzy sets are able to deal with uncertain concepts, and the uncertainty of rough fuzzy sets usually comes from three regions. Related research on the uncertainty of rough sets and granular computing has been presented in our previous papers [4648]. Based on these researches, in this paper, a basic fuzziness-based uncertainty measure is introduced and the essence of the fuzziness for rough fuzzy sets and its three regions in a hierarchical granular structure is revealed. From the perspective of fuzzy distance, we present a modified fuzziness-based uncertainty measure for the rough fuzzy sets. The modified measure can not only measure the information caused by the uncertainty related to the granularity of partitions, but also provide the information to distinguish the two approximation spaces with the same uncertainty. A case study is adopted to show that our method is more suitable for evaluating the significance of attributes. These results can also be applied to the uncertainty measure of probabilistic rough sets. They will have a wide variety of applications, including measuring knowledge content, constructing decision trees and building a heuristic function in a heuristic reduct algorithm in rough set theory.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work is supported by the National Science Foundation of China (61472056), High-End Talent Project (no. RC2016005), Qian Education Cooperation (LH (2017) no. 7075), and the Key Disciplines of Guizhou Province (QXWB2013]18).