Abstract

Covering is a widely used form of data structures. Covering-based rough set theory provides a systematic approach to this data. In this paper, graphs are connected with covering-based rough sets. Specifically, we convert some important concepts in graph theory including vertex covers, independent sets, edge covers, and matchings to ones in covering-based rough sets. At the same time, corresponding problems in graphs are also transformed into ones in covering-based rough sets. For example, finding a minimal edge cover of a graph is translated into finding a minimal general reduct of a covering. The main contributions of this paper are threefold. First, any graph is converted to a covering. Two graphs induce the same covering if and only if they are isomorphic. Second, some new concepts are defined in covering-based rough sets to correspond with ones in graph theory. The upper approximation number is essential to describe these concepts. Finally, from a new viewpoint of covering-based rough sets, the general reduct is defined, and its equivalent characterization for the edge cover is presented. These results show the potential for the connection between covering-based rough sets and graphs.

1. Introduction

Covering is an extensively used form of data representation. As a generalization of classical rough set theory [1, 2], covering-based rough set theory [3, 4] was proposed to process this type of data. Due to its generality and universality, it has attracted much research interest. Covering reducts have been defined [59], approximation models [1013] have been established, axiomatic systems [1417] have been constructed, and generalization works [7, 1821] have been conducted. In application, covering-based rough sets have been used in rule learning [22, 23], attribute reduction [2426], feature selection [27, 28], and other fields [2932].

Graphs are important discrete structures consisting of vertices and edges that connect these vertices, and they can well describe the relationship among objects. Problems in almost every discipline can be addressed using graph models. However, some important problems in graphs are NP-hard optimization ones such as finding the minimal vertex cover [33] and edge cover [34]. Therefore, it is much necessary to equivalently characterize these problems with other approaches or forms in order to find other efficient solutions.

In this paper, some graph concepts including vertex covers, independent sets, edge covers, and matchings are equivalently formulated using covering-based rough sets. First, a graph is represented with a covering, and an isomorphism from simple graphs without isolated vertices to a special type of coverings is constructed. Second, vertex covers, edge covers and matchings are equivalently described through the upper approximation number and independent sets with the lower approximation. Third, edge covers are also characterized by general reducts which are generalizations of the covering reduct. Furthermore, some graph problems are transformed into ones in covering-based rough sets. For instance, finding a minimal edge cover of a graph is equivalently converted to finding a minimal reduct of a covering.

There are numerous applications concerning connections between covering-based rough sets and graphs. For example, chemical classification, job assignment, and production process arrangement can be well modeled by graphs. However, many of these practical problems are NP-hard. Due to equivalent characterizations of covering-based rough sets and graphs, these problems can be converted and addressed under the framework of covering-based rough sets. In this way, heuristic reduction algorithms [58] for them may be employed.

The rest of this paper is arranged as follows. Section 2 recalls some fundamentals related to covering-based rough sets and graph theory. In Section 3, the upper approximation number is presented, and its properties are studied. Section 4 represents graphs with coverings and establishes a one-to-one correspondence between a special type of coverings, and simple graphs without isolated vertices in an isomorphic sense. In Section 5, some important graph concepts including vertex covers, independent sets, edge covers, and matchings are formulated through the upper approximation number and lower approximation. Section 6 presents another equivalent characterization of edge covers by general reducts. In Section 7, this paper is concluded and further works are pointed out.

2. Basic Definitions

This section introduces some fundamental definitions related to covering-based rough sets and graphs.

2.1. Covering-Based Rough Sets

As a generalization of partitions, coverings are with strong applicability and high universality. For example, a course consists of a number of students, and all courses form a covering of all students. Since a student can choose several courses, the covering is not necessarily a partition of all students. All students compose the set of research objects, namely, universe of discourse.

Definition 1 (covering). Let be a universe of discourse and a family of subsets of . If none of subsets in is empty and , then is called a covering of .

Each subset in a covering is called a covering block also called a basic concept in knowledge discovery, and each subset of a universe is called a concept. An important idea of covering-based rough sets is to approximate a concept using some basic ones. This idea is implemented by a pair of lower and upper approximations.

Definition 2 (approximations [3]). Let be a covering of . For all , are called the lower and upper approximations of (with respect to ), respectively.

The lower approximation provides a certainty, and the upper one offers a probability. If we suppose that the universe is all students selecting courses and the covering is the set of all courses, then for any subset of students, its lower approximation is those courses which all students select in the subset, and its upper approximation is those courses which some students select in the subset.

Some covering blocks may be redundant; in other words, removing them has little effect on the approximation accuracy. For example, if a course is a required one and all students must select it, then removing it does not affect the lower approximation. The concept of the reducible element was proposed to describe those redundant covering blocks.

Definition 3 (reducible element [35]). Let be a covering of and . is called reducible in if can be expressed as a union of some elements in ; otherwise, is irreducible. All irreducible elements of are called the reduct of , denoted as .

The reducible element well reveals the relationship between coverings and their lower approximations. In fact, two coverings generate the same lower approximation if and only if their reducts are the same.

2.2. Graphs

Graphs are discrete structures to model the correlation between data. Theoretically, a graph is an ordered pair consisting of vertices and edges that connect these vertices.

Definition 4 (graph [36]). A graph consists of a nonempty set of vertices and a set of edges . Each edge has either one or two vertices associated with it, called its endpoints. Generally, we write or for an edge with endpoints and .

The simple graph is a main research objective of graph theory, and many practical problems can be represented with a simple graph.

Definition 5 (simple graph [36]). A simple graph is a graph without loops or multiple edges, where a loop is an edge whose endpoints are equal and multiple edges are edges having the same pair of endpoints.

Graphs are visual and efficient tools to reveal the interrelation between data. Different graphs may have a certain internal relation. For this reason, isomorphism is introduced to express the relationship between graphs.

Definition 6 (isomorphism [36]). An isomorphism from one simple graph , to another one , is a bijection such that if and only if . We say that is isomorphic to , denoted as , if there exists an isomorphism from to .

In a graph that represents a road network (with straight roads and no isolated vertices), we can interpret the problem of finding the minimal vertex cover as the problem of placing the minimal number of policemen to guard the entire road network. It is noted that an isolated vertex of a graph is a vertex that is not an endpoint of any edge.

Definition 7 (vertex cover [36]). Let be a graph. A vertex cover of is a set that contains at least one endpoint of any edge. The minimal size of vertex covers of is denoted by , simply by .

Definition 8 (independent set [36]). Let be a graph. An independent set of is a set , and any two vertices of are not adjacent. The maximal size of independent sets of is denoted by , simply by .

Definition 9 (edge cover [36]). Let be a graph. is called an edge cover of if for all , there exists such that is its endpoint. The minimal size of edge covers is denoted by , simply by .

Definition 10 (matching [36]). Let be a graph. A matching in is a set of pairwise no-adjacent edges. A maximal matching is a matching of a graph with the property that if any edge not in is added to , it is no longer a matching. The maximal size of matchings of is denoted by , simply by . A perfect matching is a matching which matches all vertices of the graph.

3. The Upper Approximation Number of Covering-Based Rough Sets

Covering-based rough sets are studied qualitatively, and they are short of quantitative approaches. The following definition presents a measure to conduct the quantitative analysis. This measure is also a bridge between covering-based rough sets and graphs.

Definition 11 (upper approximation number [37, 38]). Let be a covering of . One can define, for all , is called the upper approximation number of and , the upper approximation number function with respect to . When there is no confusion, we omit the subscript .

The above definition presents the upper approximation number with respect to a covering. Note that this concept can be defined similarly on any subcovering or family of subsets of a universe.

Example 12. Let , , , , and . If and , then since , , and . Similarly, .

The following proposition shows some properties of the upper approximation number and its connections with covering-based rough sets.

Proposition 13. Let be a covering of and the upper approximation number function. Then the following properties hold for all ,(1), ,(2)if , then ,(3)for all , ,(4).

When a covering is included in another one, their corresponding upper approximation numbers present a similar characteristic.

Proposition 14. Let and be two coverings of . If , then for all .

A partition can be equivalently characterized by the upper approximation number. A covering is a partition if and only if the upper approximation number of the set with only one element is equal to one.

Proposition 15. Let be a covering of  . Then is a partition of if and only if for all .

The upper approximation number plays an important role in both conducting quantitative analyses on covering-based rough sets and building connections between them and graphs.

4. Graphs Represented with Coverings

In this section, we convert a graph to a covering in order to construct a platform for solving graph problems using covering-based rough sets. The following definition points out an approach to representing a simple graph with a family of subsets of vertices.

Definition 16. Let be a simple graph. One can define a family of subsets of vertices of as follows: for all ,

According to Definition 16, a graph can be described with a family of subsets of its vertices. We say the family of subsets of vertices is induced by the graph. The following proposition presents the characteristics of the family of subsets of vertices.

Proposition 17. Let be a simple graph. Then is a covering of   if and only if has no isolated vertices.

Proof. (): If is a covering of , that is, , then for all , there exists such that ; that is, . Hence, has no isolated vertices.
(): If has no isolated vertices, then for all , there exists such that ; that is, . Thus, . Hence, ; in other words, is a covering of .

According to Proposition 17, a simple graph without isolated vertices can be characterized by a covering of its vertices. Therefore, the family of the vertices induced by a graph is denoted by . In fact, any edge subset of a graph can also be represented with a covering or a subcovering. For example, if is an edge subset of a graph, then we can define as follows:

In the rest of this paper, a graph is a simple one without isolated vertices unless otherwise stated. The following example illustrates the graph and its covering.

Example 18. Let be the graph as shown in Figure 1. Then ,,,,,,,,,,.

The following proposition explores the relationship between two graphs inducing the same covering. In fact, two different graphs induce the same covering if and only if they are isomorphic. Isomorphic graphs can be regarded as the same in graph theory. In other words, in an isomorphic sense, a one-to-one correspondence between coverings whose elements have only two objects and graphs is established.

Proposition 19. Let , be two graphs and , the coverings induced by , . Then if and only if .

5. Graph Problems by the Upper Approximation Number

Independent sets, vertex covers, matchings, and edge covers of a graph are important concepts, and they stem abstractly from some practical problems. For instance, the maximal matching is from the job assignment problem. But some of them such as finding a minimal vertex cover are typical examples of NP-hard optimization problems. Therefore, there is much necessity to equivalently characterize these concepts. The following proposition presents a sufficient and necessary condition for the vertex cover through the upper approximation number.

Proposition 20. Let be a graph and the covering induced by and . Then is a vertex cover of if and only if .

Proof. (): For all , according to Definition 16, . Since is a vertex cover of , or which imply . Conversely, is straightforward. To sum up, it proves that .
(): Since , for all , or . According to Definition 16, for all , or . Therefore, is a vertex cover of .

In fact, in Proposition 20, can be replaced with , since for any graph, where denotes the number of edges. Therefore, based on the relationship between covering-based rough sets and graphs, a vertex cover of a graph can be described equivalently through the upper approximation number. Naturally, the minimal size of independent sets can also be represented by covering-based rough sets.

Proposition 21. Let be a graph and the minimal size of vertex covers of . Then .

Proof. According to Definition 7 and Proposition 20, it is straightforward.

The above proposition shows that finding a minimal vertex cover of a graph is equivalently transformed into finding a minimal subset whose upper approximation number is equal to the cardinality.

Example 22. Let be the graph as shown in Figure 1. Suppose that , then . For all and , . Then the minimal vertex cover is , and a maximal vertex cover is presented in Figure 2.

The following proposition indicates that an independent set of a graph can also be formulated equivalently with the lower approximation operator. In fact, a subset of vertices of a graph is an independent set if and only if its lower approximation is empty.

Proposition 23. Let be a graph and . Then is an independent set of if and only if where .

Proof. (): Since is an independent set of , any two vertices of are not adjacent which implies that , for all . Hence, .
(): Since , then , which implies that no two elements of are adjacent. Hence, is an independent set of .

Proposition 24. Let be a graph and the maximal size of independent sets. Then , and .

Proof. According to Definition 8 and Proposition 23, it is straightforward.

The above proposition indicates that finding a maximal independent set is converted to finding a maximal subset keeping its lower approximation empty. The following example illustrates independent sets of a graph and its connection with the lower approximation.

Example 25. Let be the graph as shown in Figure 1. Suppose that , then . For all and , . Then the maximal size of independent sets is , and a maximal independent set is presented in Figure 3.

A matching of a graph can also be represented with the upper approximation number. In fact, an edge subset of the edges of a graph is a matching if and only if the upper approximation number of these sets having only one element is not less than one.

Proposition 26. Let be a graph and . Then is a matching of if and only if for all .

Furthermore, the perfect matching of a graph is concisely characterized by the upper approximation number.

Proposition 27. Let be a graph and . Then is a perfect matching of if and only if for all .

Proposition 28. Let be a graph and the maximal size of matchings of . Then and for all .

Example 29. Let be the graph as shown in Figure 1. Suppose that , then . Therefore, is a maximal matching of since for all . In fact, it is also a perfect matching. The matching is presented in Figure 4.

Proposition 30. Let be a graph and . Then is an edge cover of if and only if for all .

Proof. (): If is an edge cover of , then for all , there exists such that , that is, . Hence, .
(): If for all , , then there exists such that ; that is, . Thus, is an edge cover of .

Proposition 31. Let be a graph and the minimal size of edge covers of . Then and for all .

6. Graph Problems by General Reduct

This section presents another view to represent graph problems with covering-based rough sets. First of all, we define the generally reducible element of a covering, which is an extension of the reducible element in the literature [35].

6.1. General Reduct

Pawlak defined category reducts for knowledge reduction and rule extraction [39]. Zhu and Wang [35] proposed the reducible element to remove the redundancy and keep the essence. Some forms of reducible elements are defined and applied to rule learning [6] and other fields [12, 14]. The following definition proposes generally reducible elements to characterize graph concepts including edge covers and matchings.

Definition 32 (generally reducible element). Let be a covering of and . is called a generally reducible element in if is also a covering of ; otherwise, is called a generally irreducible element.

Definition 33 (general reduct). Let be a covering of and . is called a general reduct of , if and for all , is a generally irreducible element of .

Example 34. Let and , where , , , and . Then , and are generally reducible elements of ; however, and are irreducible elements of . And and are general reducts of .

The following proposition shows that the generally reducible element is an extension of the reducible element. In other words, if a covering block is reducible, then it is generally reducible.

Proposition 35. Let be a covering of   and . If is a reducible element of , then is a generally reducible element of .

Remark 36. The generally reducible element is an extension of the reducible element; however, the general reduct is not an extension of the reduct. The following counterexample illustrates this argument.
Let and where , , and . Therefore, Reduct; however, Reduct is not a general reduct of . In fact, the general reducts of are , , and .

6.2. Equivalent Characterization of Graph Problems by General Reducts

The following proposition explores the relationship between general reducts and edge covers of a graph. In fact, an edge subset of a graph is an edge cover if and only if it contains at least one general reduct.

Proposition 37. Let be a graph and . is an edge cover of if and only if there exists a general reduct of such that .

Proof. (): If is an edge cover, then for all , there exists , such that is its endpoint. This proves for all that there exists such that . Thus, . Then, . Hence, there exists , such that is a general reduct of . So is also a general reduct of . Therefore, there exists a general reduct of of such that .
(): If there exists a general reduct of such that , then is a covering of . Hence, for all , there exists , such that ; that is, . Hence, is an endpoint of . Therefore, is an edge cover of .

According to Proposition 37, edge covers of a graph can be characterized by general reducts. In other words, finding a minimal edge cover of a graph is transformed equivalently into finding a minimal reduct of a covering.

Proposition 38. Let be a graph. Then is a general reduct of .

Proof. According to Proposition 37, it is straightforward.

Matchings of a graph can also be described using the generally reducible element.

Proposition 39. Let be a graph and . If is a matching of , then for all , is a generally reducible element of .

Proposition 39 shows a necessary condition for a matching of a graph by the generally reducible element. The following proposition further presents the characteristic of matchings using covering-based rough sets. In fact, an edge subset is a matching if and only if there exists an order such that those elements in the set can be reducible one by one according to the order.

Proposition 40. Let be a graph, , and . Then is a matching of if and only if there is a permutation of elements of such that is a generally reducible element of for .

7. Conclusions and Further Works

In this paper, we presented equivalent characterizations for some important problems in graph theory from the viewpoint of covering-based rough sets. These problems included vertex covers, independent sets, edge covers, and matchings of a graph, where finding a minimal vertex cover was NP-hard. The equivalent characterizations indicated covering-based rough sets approaches to these problems. Moreover, graph concepts such as vertex connectivity and graph approaches such as shortest path algorithms were available to study covering-based rough sets. In future works, we will apply these interesting theoretical results not only to algorithm design for some graph problems but also to unsupervised learning, especially bipartite graph clustering and spectral clustering.

Acknowledgments

This work is supported in part by the National Natural Science Foundation of China under Grant no. 61170128, the Natural Science Foundation of Fujian Province, China, under Grants nos. 2011J01374 and 2012J01294, and the Science and Technology Key Project of Fujian Province, China, under Grant no. 2012H0043.