Abstract

In practice, some of information systems are based on dominance relations, and values of decision attribute are fuzzy. So, it is meaningful to study attribute reductions in ordered decision tables with fuzzy decision. In this paper, upper and lower approximation reductions are proposed in this kind of complicated decision table, respectively. Some important properties are discussed. The judgement theorems and discernibility matrices associated with two reductions are obtained from which the theory of attribute reductions is provided in ordered decision tables with fuzzy decision. Moreover, rough set approach to upper and lower approximation reductions is presented in ordered decision tables with fuzzy decision as well. An example illustrates the validity of the approach, and results show that it is an efficient tool for knowledge discovery in ordered decision tables with fuzzy decision.

1. Introduction

Rough set theory, which was first proposed by Pawlak in the early 1980s [1], can describe knowledge via set-theoretic analysis based on equivalence classification for the universe of discourse. It provides a theoretical foundation for inference reasoning about data analysis and has extensive applications in areas of artificial intelligence and knowledge acquisition.

A primary use of rough set theory is to reduce the number of attributes in databases thereby improving the performance of applications in a number of aspects including speed, storage, and accuracy. For a data set with discrete attribute values, this can be done by reducing the number of redundant attributes and find a subset of the original attributes that are the most informative. As is well known, an information system may usually has more than one reduct. It means that the set of rules derived from knowledge reduction is not unique. In practice, it is always hoped to obtain the set of the most concise rules. Therefore, people have been attempting to find the minimal reduct of information systems, which means that the number of attributes contained in the reduction is minimal. Unfortunately, it has been proven that finding the minimal reduct of an information system is an NP-hard problem.

Recently, some new theories and reduction methods have been developed. Many types of knowledge reduction have been proposed in the area of rough sets [28]. Possible rules and reducts have been proposed as a way to deal with inconsistence in an inconsistent decision table [9]. Approximation rules [10] are also used as an alternative to possible rules. On the other hand, generalized decision rules and reducts [9] provide a decision with more flexible selection of decision behavior. In [11], the notions of -reduct and -relative reduct for decision tables are defined. The -reduct allows occurrence of additional inconsistency that is controlled by means of a parameter. In [12], Slezak presented a new concept of attribute reduction that keeps the class membership distribution unchanging for all objects in the information system. It was shown by Slezak [13] that the knowledge reduction preserving the membership distribution is equivalent to the knowledge reduction preserving the value of generalized inference measure function. A generalized knowledge reduction was also introduced in [13] that allows the value of generalized inference measure function after the attribute reduction to be different from the original one by user-specified threshold. By eliminating the rigorous conditions required by distribution reduct, maximum distribution reduct was introduced by Zhang et al. in [14].

Partition or equivalence (indiscernibility relation) is an important and primitive concept in Pawlak's original rough set theory. However, partition or equivalence relation is still restrictive for many applications. To overcome this limitation, classical rough sets have been extended to several interesting and meaningful general models in recent years by proposing other binary relations, like tolerance relations [15], neighborhood operators [16], and others [1724]. However, the original rough set theory does not consider attributes with preference ordered domain, that is, criteria. Particularly, in many real situations, we are often faced with the problems in which the ordering of properties of the considered attributes plays a crucial role. One such type of problem is the ordering of objects. For this reason, Greco et al. [2531] proposed an extension rough set theory, called the dominance-based rough set approach (DRSA), to take into account the ordering properties of criteria. This innovation is mainly based on substitution of the indiscernibility relation by a dominance relation. In DRSA, condition attributes are criteria and classes are preference ordered; the knowledge approximated is a collection of upward, and downward unions of classes and the dominance classed are sets of objects defined by using a dominance relation. In recent years, several studies have been made about properties and algorithmic implementations of DRSA [23, 3235].

Nevertheless, only a limited number of methods using DRSA to acquire knowledge from in inconsistent ordered information systems have been proposed. Pioneering work on inconsistent ordered information systems with the DRSA has been proposed by Greco, Matarazzo, and Slowinski [1, 46, 17, 18], but they did not clearly point out the semantic explanation of unknown values. Shao and Zhang [36] further proposed an extension of the dominance relation in the IOIS. Yao [37] introduced the notion of high-order decision rules. While a standard decision rule expresses connections between attribute values of the same object, a high-order decision rule expresses connections of different objects in terms of their attribute values. Various types of relationships can be used, such as ordering relations, closeness relations, similarity relations, and neighborhood systems on attribute values. The introduction of semantics information on attribute values leads to information tables with added semantics. Depending on the decision rules to be mined, one can transform the original table into another information table. Yao et al. [38] provided a granular computing-based interpretation of rules representing two levels of knowledge, which was done by adopting and adapting the decision logic language for granular computing. The language provided a formal method for describing and interpreting conditions in rules as granules and rules as relationships between granules. As examples, they examined rules in the standard rough set analysis and dominance-based rough set analysis. For the modeling of ordering problems, Sai et al. [39] generalized the notion of information tables to ordered information tables by adding order relations on attribute values. For mining ordering rules, they first transformed an ordered information table into binary information.

Despite this, these contributions did not mention the underlying concept of attribute reduction in ordered decision table with fuzzy decision and only proposed an approach to attribute reduction in consistent ordered information systems. Therefore, the purpose of this paper is to study approaches to attribute reductions in ordered decision tables with fuzzy decision.

The rest of this paper is organized as follows. To facilitate our discussion, some preliminary concepts are briefly recalled in Section 2. In Section 3, the concept of ordered decision table with fuzzy decision are introduced, and some important properties are discussed. In Section 4, upper approximation reduction and lower approximation are proposed for the complicated decision table. Moreover, the judgement theorems and discernibility matrices associated with two reductions are obtained, from which we can provide an approach to attributes reductions in ordered decision tables with fuzzy decision. In Section 5, the practical approaches to upper and lower approximation reduction are provided in ordered decision tables with fuzzy decision. Finally, we conclude the paper with a summary and outlook for further research.

2. Preliminaries

The following recalls necessary concepts and preliminaries required in the sequel of our work. Detailed description of the theory can be found in the source papers [2531, 34]. A description has also been made in [14].

The notion of information system (sometimes called data tables, attribute-value systems, knowledge representation systems, etc.) provides a convenient tool for the representation of objects in terms of their attribute values.

An information system is an quadruple , where (i) is a nonempty finite set of objects, (ii), is a nonempty finite set of attributes, (iii) and is a domain of attribute , (iv) is a function such that , for every , called an information function.

An information system with decision, which is also called to a decision table (DT), is a special case of an information system and , where set and be condition attributes set and the decision attribute set, respectively.

In an information system, if the domain of an attribute is ordered according to a decreasing or increasing preference, then the attribute is a criterion.

An information system is called an ordered information system (OIS) if all condition attributes are criteria. Moreover, a decision table (ODT) is called an ordered decision table if all attributes are criteria (See [2531]).

In general, we denote an ordered information system by and denote an ordered decision table by .

Assumed that the domain of a criterion is complete preordered by an outranking relation , then means that is at least as good as with respect to criterion for . And we can say that dominates . In the following, without any loss of generality, we consider criterions having a numerical domain; that is, ( denotes the set of real numbers). Being of type gain; that is, (according to increasing preference) or (according to decreasing preference).

Without any loss of generality and for simplicity, in the following, we only consider condition attributes with increasing preference.

For a subset of attributes , we define , and that is to say that dominates with respect to all attributes in .

For an ordered information system with decision, we say that dominates with respect to if , and denote by . That is is called dominance relation of ordered information system .

If we denote then the following properties of a dominance relation are trivial by above definition.

Let be a dominance relation. The following hold ([2531]):(i) is reflexive, transitive, but not symmetric, so it is not an equivalence relation (ii)if , then , (iii)if , then , (iv)if , then and , (v) if and only if for all , (vi) for any , (vii) constitute a covering of , that is, for every we have that and ,

where denotes cardinality of the set.

For any subset and in , the lower and upper approximation of with respect to a dominance relation could be defined as following (see [2531]):

Unlike classical rough set theory, one can find easily that and do not hold.

3. Ordered Decision Table with Fuzzy Decision

Traditionally, decision tables (DTs) are crisp, indicating that the conditions are specified in an exact manner. In many real worlds, crisp decision tables may be stringent. A potential problem of such DTs is that any measurement error is not taken into account. Fuzzy decision tables (FDTs) offer a solution to this problem. A fuzzy decision table (FDT) is an extended version of a crisp DT in order to deal with imprecise and vague decision situations [40, 41]. The extension amounts to the introduction of fuzzy sets in the condition and decision attributes sets of the crisp DT; the crisp condition and decision states are replaced with fuzzy conditions and decisions. A membership function need be given, so the decision maker can judge the extent to which a particular attribute level meets a particular condition by using the membership function.

A familiar example of a fuzzy decision table is shown in order to illustrate the meaning in Table 1. This table is a fuzzy decision table which describes the investment program of a real estate company, where is the set of projects considered by the company, is the set of condition attributes, and is a decision attribute, which means the suitability of the project. Moreover, stand for the distance between site of the project and Hospital, School, and Central Business District, respectively. And interpretations of values of condition and objective attributes are represented as follows.

It is obvious that condition and decision attributes are fuzzy environment in the DT of Table 1, that is to say that the DT is a fuzzy decision table.

In this contribution, we focus on the DT, in which values of contribution attributes set is crisp and value of decision attribute set is fuzzy. Moreover, we will consider the increasing preference of condition and decision attributes. In other words, we will discuss approximation reduction of an ordered decision table with fuzzy decision based on the rough set theory.

Definition 3.1. Let be an ordered decision table. If the value of decision attribute is fuzzy, in other words, for all , then the ordered decision table is called ordered decision table with fuzzy decision.

From above definition, one can find easily that the set of decision attribute is a fuzzy set on , which reflects the degree of fuzzy decision in ordered decision table. So, these systems are different from the systems in which a decision is numeric value. For convenience, we denote set of decision attribute by and denote an ordered decision table with fuzzy decision by . Thus, that is to say for all in an ordered decision table with fuzzy decision .

Let be an ordered decision table and , then the lower and lower approximation sets of with respect to , which are fuzzy sets, are denoted by and , respectively. And their membership functions are defined as follows: It is obvious that these two approximation sets are fuzzy set.

Moreover, by the above definition and rough set theory, one can easily consider that the upper and lower approximation of have the following properties in an ordered decision table with fuzzy decision.

Proposition 3.2. Let be an ordered decision table with fuzzy decision, then the following propositions hold:(i)if , then , (ii)if , then , (iii)if , then , where and , (iv)if , then , where and .

Example 3.3. We still consider the fuzzy DT in Table 1. To depict the degree of decision attribute, we take the values as Table 2 according to opinions of some experts.
Naturally, the increasing preference of condition attribute is
So, from the table, one can find that
If take , then
Moreover, we can have that

4. Theory of Approximation Reduction of ODT with Fuzzy Decision

The approximation reduction proposed by Mi et al. is an important attribute reduction, which can be used to simplify an inconsistent classical decision table [4]. So far, however, there is not any practical approach to attribute reduction in ordered decision tables with fuzzy decision. In this section, we present the notions of a lower approximation reduction and an upper approximation reduction in an ordered decision table with fuzzy decision and then deduce their some important properties, from which we introduce the theory of approximation reductions to this complicated decision table.

Definition 4.1. Let be an ordered decision table with fuzzy decision and .(i)If for any , then we say that is upper approximation consistent set of this information system. Moreover, if any proper subset of is not the upper approximation set, then is called to one upper approximation reduction of this information system. (ii)If for any , then we say that is lower approximation consistent set of this information system. Moreover, if any proper subset of is not a lower approximation consistent set, then is called to one lower approximation reduction of this information system.

From above definition, one can find that upper and lower approximation consistent sets preserve the upper and lower approximations of the fuzzy decision, respectively.

Example 4.2 (Continued from Example 3.3). Let us consider the upper and under approximation reductions of the ordered decision table with fuzzy decision in Table 2.
If take , it can be easily checked that , for . Hence, and , that is to say that is one upper approximation consistent set, and it is also one under approximation consistent set.
Moreover, if take , then we have Hence, .
Thus, is also one upper approximation consistent set of the system.
Moreover, we can examine that and are not the upper approximation consistent sets by computing and is the upper approximation consistent set of the system. Hence, we conclude that this decision table has unique upper approximation reduction .
Furthermore, if we take , by computing, we have Hence, . Therefore, is also another lower approximation consistent set of the decision table. Moreover, we can find that is lower approximation consistent sets by computing. Hence, we conclude that is unique lower approximation reduction of this ordered decision table.
Detailed judgment theorems of upper approximation reductions will be proposed in the following.

Theorem 4.3. Let be an ordered decision table with fuzzy decision and . Attribute set is an upper approximation consistent set if and only if if such that , then there must exist such that .

Proof. ” Suppose that the conclusion does not holds, that is to say that if such that , then for any . So, we can obtain , which implies that . And by the upper approximation definition, we have known that Therefore, one can get .
On the other hand, is an upper approximation consistent set, then we have that
Hence, we can obtain . Obviously, this is a contradiction.
” Suppose that is not an upper approximation consistent set, then there exist certainly one such that . So, we have that according to Proposition 3.2.
Since , we take such that . At the same time, we easily observe that , then we can obtain .That is to say .
Therefore, from the above, the following inequality holds:
Thus, there exist certainly such that , which is a contradiction with .
The theorem is proved.

In the following, judgment theorems of lower approximation reductions will be presented.

Theorem 4.4. Let be an ordered decision table with fuzzy decision and . Attribute set is a lower approximation consistent set if and only if if such that , then there exist certainly such that .

Proof. ” Suppose that the conclusion does not holds, that is to say that if such that , then for any . So we can obtain , which implies that . And, by the lower approximation definition, we have known that Therefore, one can get .
On the other hand, is an lower approximation consistent set, then we have that
Hence, we can obtain . Obviously, this is a contradiction.
” Suppose that is not an lower approximation consistent set, then there exist certainly one such that . So we have that according to Proposition 3.2.
Since , we take such that . At the same time, we easily observe that , then we can obtain . That is to say .
Therefore, from the above, the following inequality holds:
Thus, there exist certainly such that , which is a contradiction with .
The theorem is proved.

Theorems 4.3 and 4.4 provide an approach to judge whether a subset of condition attributes is a lower and upper approximation consistent set or not, respectively.

5. Approach to Approximation Reduction of ODT with Fuzzy Decision

In this section, we can further obtain practical approaches to upper and lower reductions in ordered decision tables with fuzzy decision, and an illustrative example is also employed to show their mechanisms. We first give the following notions.

Definition 5.1. Let be an ordered decision table with fuzzy decision and . If we denote, for , where , , then and are said to be upper and lower approximation discernibility attributes set between objects and , respectively. And matrices and are referred as to upper and lower approximation discernibility matrix of the decision table , respectively.

Theorem 5.2. Let be an ordered decision table with fuzzy decision and . Subset is upper approximation consistent set if and only if for all .

Proof. ” From , we have . By Theorem 4.3, we can know that there exist certainly such that . So, according to the above definition. Hence, .
” For , if , then there exist certainly such that , which implies that . By Theorem 4.3, we can obtain that is an upper approximation consistent set of the decision table .

Theorem 5.3. Let be an ordered decision table with fuzzy decision and . Subset is lower approximation consistent set if and only if for all .

Proof. It is similar to Theorem 5.2.

Definition 5.4. Let be an ordered decision table with fuzzy decision and and upper and lower approximation discernibility matrices of , respectively. If denote then and are called discernibility formulas of upper and lower approximation of the decision table , respectively.

Theorem 5.5. Let be an ordered decision table with fuzzy decision. The minimal disjunctive normal form of discernibility formula of upper approximation is Denote , then is just set of all upper approximation reductions of .

Proof. For any , by the definition of minimum alternative normal form, we have that is upper approximation consistent set. If one element of is reduced in , without loss of generality and the result denoted by , then there exist certainly such that . So, is not an upper approximation consistent set. So, is an upper approximation reduction of the ordered decision table .
On the other hand, we have known that the discernibility formula of upper approximation includes all . Thus, there is not other upper approximation reduction besides of .
The proof is completed.

Theorem 5.6. Let be an ordered decision table with fuzzy decision. The minimal disjunctive normal form of discernibility formula of lower approximation is Denote , then is just set of all lower approximation reductions of .

Proof. It is similar to Theorem 5.5.

Example 5.7 (Continued from Example 3.3). Compute the upper approximation reduction and lower approximation reduction of the ordered decision table with fuzzy decision in Table 2.
By computing, we can easily obtain the upper and lower approximation discernibility matrices in Tables 3 and 4.
Therefore, by Theorems 5.5 and 5.6, we have
Thus, we can conclude that is the unique upper approximation reduction and is the unique lower approximation reduction of the ordered decision table with fuzzy decision, which accord with the result of Example 4.2.

Remark 5.8. By the example, one can easily find that is the upper approximation consistent set, but it is not a lower approximation consistent set, and is the lower approximation consistent set, but it is not a upper approximation consistent set. Hence, there is no static relationship between upper approximation consistent sets and lower approximation consistent set.

6. Conclusions

Attributes reduction, as one research problem, has played an important role in rough set theory. Many types of attribute reductions have been proposed based on the rough set theory, each of them aimed at a different requirement. In practise, some of information systems are based on dominance relations, and values of decision attribute are fuzzy. Therefore, it is meaningful to study the attribute reductions in ordered decision table with fuzzy decision. In this paper, upper approximation reduction and lower approximation reduction were proposed for this kind of complicated decision table. Some important properties were discussed. The judgement theorems and discernibility matrices associated with the two reductions have been obtained, from which we can provide an approach to attribute reductions in ordered decision tables with fuzzy decision. Then, the practical approaches to upper and lower approximation reduction in ordered decision tables with fuzzy decision have been provided as well. Finally, an illustrative example has been employed to explain the mechanism of this method.

Though ordered decision table with fuzzy decision are discussed here, they are all complete. Because incomplete decision table are more complicated than complete one, we will develop the proposed approaches to more generalized and more complicated ODTs such as incomplete ODTs with fuzzy decision and ordered decision tables with interval values or intuitionistic fuzzy values.

Acknowledgments

This work is supported by the Postdoctoral Science Foundation of China (no. 201004813 31) and the National Natural Science Foundation of China (no. 61105041, 71071124, and 11001227).