Abstract

Mathematical models, such as sets of equations, are used in engineering to represent and analyze the behaviour of physical systems. The conventional notations in formulating engineering models do not clearly provide all the details required in order to fully understand the equations, and, thus, artifacts such as ontologies, which are the building blocks of knowledge representation models, are used to fulfil this gap. Since ontologies are the outcome of an intersubjective agreement among a group of individuals about the same fragment of the objective world, their development and use are questions in debate with regard to their competencies and limitations to univocally conceptualize a domain of interest. This is related to the following question: “What is the criterion for delimiting the specification of the main identifiable entities in order to consistently build the conceptual framework of the domain in question?” This query motivates us to view the Yoneda philosophy as a fundamental concern of understanding the conceptualization phase of each ontology engineering methodology. In this way, we exploit the link between the notion of formal concepts of formal concept analysis and a concluding remark resulting from the Yoneda embedding lemma of category theory in order to establish a formal process.

1. Introduction

In the computer science context, the terms ontology and concept are defined in many various ways, and sometimes their use is a confused mixture of both terms. On the one hand, an ontology is defined as a common conceptualization of a general idea, or a domain of interest [1, 2], while, on the other hand, a concept is defined as a general idea, which is created by removing the uncommon characteristics from several particular ideas [3]. Thus, it is obvious that ontologies use concepts to convey their semantics and concepts are definitional structures that are explicitly encoded within ontologies.

In practice, an ontology consists of classes that are concepts of a particular domain, relations between classes, a hierarchy of classes, properties of classes that describe their attributes, instances of classes with specific values, and axioms that specify additional constraints over classes. So, the most crucial task of building an ontology is to consistently express its classes, that is, the concepts of a particular domain. An indispensable condition for formally expressing a concept that should carry a meaning, besides its name that does not carry any meaning, is to determine all its stakeholders, that is, other terms interrelated with the concept in question, properties attached to it, and specific values, directly or indirectly, affecting its noema, since, in natural language, a concept can be described by the objects belonging to it or by its determining features [4].

In order to make the ontological status of concepts apparent, independently of the language used to express them and of the techniques used for embedding them within ontologies, we have exploited the link between the notion of formal concepts of formal concept analysis (FCA) and a concluding remark resulting from the Yoneda embedding lemma of category theory. This enables the formalization of concepts by considering their semantics, since the notion of a concept in computer science is very subtle and does not rely on mathematical constructors. This step intends to eliminate any misunderstanding that occurs, due to ontological semantic heterogeneity, arising from synonymy (i.e., ontological concepts for a domain described with different terminologies) and polysemy (i.e., different meanings assigned to the same word in different contexts).

The basics and the role of formal concept analysis in defining ontology concepts are discussed in Sections 2 and 4, respectively, while the basics and a discussion of related work on category theory are presented in Sections 3 and 5, respectively. In Section 6, the main point of this work is illustrated, by means of a simple example. In Section 7, other possible meeting points of FCA and category theory, aiming at direct research into the critical path of formalizing tasks concerning ontology engineering and facilitating progress of the field, are discussed. The conclusion is drawn in the last section.

2. Formal Concept Analysis Background: Formal Context, Formal Concept, and Concept Lattice

This section introduces basic terms of formal concept analysis, among which are the fundamental notions of a formal context, formal concept, and concept lattice, as well as their basic properties.

Formal concept analysis (FCA), introduced by Wille [5], is a method for analyzing data which describe relationships between a set of objects and a set of attributes. From the input data, FCA produces a concept lattice, which is a collection of formal concepts in the data. Formal concepts are clusters of data drawn together by having common attributes [6] representing natural human-like concepts and are hierarchically ordered by a subconcept-superconcept relation. FCA also produces attribute implications which describe particular dependencies valid in the data. It integrates the discovery and reasoning with concepts in data, the discovery and reasoning with dependencies in data and the visualization of data, and concepts and dependencies with folding and unfolding capabilities, which are referred to as the reconstruction of a diagram of concepts and their dependencies, in order to be more simplified, but as informative as the original one. For more details see [7, 8].

In FCA, a cross-table with logical attributes is represented by a triplet (a so-called context), where the elements of the set are called objects and correspond to the rows of the table, the elements of the set are called attributes and correspond to the columns of the table, and is a binary (incidence) relation between and . , or indicates that object has attribute or that holds for , while indicates that object does not have attribute . An example of such a table is shown in Table 1.

A partially ordered collection of particular clusters of objects and attributes is obtained out of a given context cross-table, and it is called a concept lattice. The clusters of a concept lattice, called formal concepts, are pairs , where is a set of objects and is a set of attributes, such that is the set of all objects which have all attributes from and is the set of all attributes which are common to all objects from . For instance, is an example of a formal concept for the table of Table 1. Under suitable permutations of the rows and the columns of the cross-table, a formal concept is represented by a maximal rectangle full of crosses.

Moreover, formulas describing dependencies which are true in the table are obtained, which are called attribute implications. An attribute implication is an expression , with and being sets of attributes. is true in the context if each object having all attributes from has all attributes from as well. For instance, is true for the context of Table 1, while is not, being a counterexample.

In order to produce the formal concepts of a formal context , the so-called concept-forming operators are used, defined between the powersets of and : where is just the set of all attributes shared by all objects of and where is just the set of all objects sharing all attributes from .

A formal concept now in is just a pair of and , such that and . is called the extent and the intent of , respectively. The formal concepts are naturally ordered by using a subconcept-superconcept relation, based on inclusion relation on objects and attributes; that is, for formal concepts and of the context , if and only if (if and only if ).

By using basic mathematical structures behind FCA, it can be proven that the pair of concept-forming operators induced by forms a Galois connection between and , and the formal concepts are just the fixed points of this Galois connection.

More precisely, given a pair of sets and , a pair of operators, and , is a Galois connection if for , it satisfies the following:

For a Galois connection between the sets and , the fixpoint set of is defined as the set

Also, for a Galois connection between the sets and , we have

Now, given a set , a mapping , such that, for each , is called a closure operator on and its fixpoint set is given by

In addition, for a Galois connection between the sets and , the composition is the closure operator on and is the closure operator on .

Finally, since the pair of concept-forming operators induced by forms a Galois connection between and , the compositions and of the concept-forming operators are closure operators on and , respectively.

Another fundamental notion in FCA is that of the set of all formal concepts of , denoted by the so-called concept lattice, where

Then, the fixpoints of the composition operator give the set of all extents of the formal concepts of that is, and the fixpoints of the composition operator give the set of all intents of the formal concepts of ; that is,

The extents and intents are just the images under the concept-forming operators: Hence,

Now, a partial order ≤ can be defined on , since for , if and only if . Thus, one puts , meaning that is more specific than or, equivalently, that is more general than . Then is a complete lattice, which means that, for any collection of formal concepts , contains both their direct generalization (the supremum of , noted as ) and their direct specialization (the infimum of , noted as ). The main theorem of concept lattices states that the supremum and infimum of a collection of formal concepts ,  , are given by

3. Category Theory Background: Categories, Functors, Natural Transformations, and Yoneda Embedding Lemma

In this section, we briefly refer to the basic notions of category theory, such as categories, functors, and natural transformations that guide to the comprehension of the Yoneda embedding lemma.

Category theory [912], which can, in general, be considered as a mathematical theory of structures and of systems of structures, occupies a central position in contemporary mathematics and theoretical computer science, since (a) it focuses on relationships (categorical morphisms, functors, and natural transformations) and not on entities (categorical objects and categories), (b) it allows the coexistence of heterogeneous entities, since it provides the ability to define several categories, according to the kind of entities to be described, which can be related by the definition of special morphisms (categorical functors), (c) it offers a set of categorical constructors for creating new categories, by using predefined ones, (d) it provides a means for the combination of categorical objects (colimits can be used to compose them and limits to decompose them) and for the combination of categorical functors (natural transformations), and (e) it provides a multilevel study of its categorical notions, by defining three interrelated levels (the level of categories, of functors, and of natural transformations).

A category is a mathematical structure consisting of entities of some kind, together with the relationships between them that express the structure. Formally, a category consists of objects (entities) and morphisms (relationships), together with a law of composition for the morphisms, and a unique identity morphism for each object. More rigorously speaking, a category is anything comprising the following data.(i)A class of objects is denoted by .(ii)A class of morphisms between objects is denoted by . In order to denote a morphism from object to object , we write , where is the domain of ) and is the codomain of (). In general, there may exist more than one morphism from to . The collection of all morphisms from to is denoted by .(iii)A composition operator is denoted by , assigning to each pair of composable morphisms and a composite arrow , . The composition of morphisms is associative, that is, for any morphisms , , and , as depicted in Figure 1. (iv)For each object in , a unique morphism , which acts as the unit of the composition; that is, for any morphism , , as depicted in Figure 2.

A category is called small when the objects and the morphisms are sets and not proper classes. A less strict concept is that of a locally small category, when for all objects and in , the collection of morphisms between them, , is a set, called a Hom-set (in this case, we may still have “too many” pairs of objects).

An important category is , the category having sets as objects, total functions as morphisms, and identity functions as identity morphisms. The composition of total functions, , is the total function , mapping .

For each category , the dual category is defined as the category having the same objects as those of and as morphisms the opposites of the morphisms in ; that is, if is a morphism in , then is a morphism in .

Another essential feature of category theory is that of an isomorphism, which is mainly used in order to compare objects. An isomorphism is a morphism that has an inverse, that is, a morphism , such that and . If is an isomorphism, then its inverse is unique. By using this definition, two objects and are isomorphic () if there is a pair of inverse morphisms and between them. Two isomorphic objects are said to be identical up to an isomorphism if every object satisfying a property of an object is isomorphic to .

There are two special objects that are defined in a category: the initial and the terminal objects. More formally, an object of a category is called an initial object, denoted by in Figure 3, if for every object in , there exists exactly a unique morphism .

The symbol is used to characterize unique morphisms. On the other hand, an object of a category is called a terminal object, denoted by in Figure 4, if for every object in , there exists exactly a unique morphism .

In this mathematical theory, any part of a category can be pictured as a directed diagram, which is a set of objects and morphisms. A significant operation in category theory is the pushout, which is defined over a diagram. A pushout of a pair of morphisms, and , is an object , together with a pair of morphisms and , such that , and with the universal property stating that, whenever and are such that , there exists a unique morphism , such that and . In Figure 5, the pushout is indicated by drawing a square corner next to it.

The pushout is the categorical dual of pullback, which is depicted in Figure 6, which is obtained by reversing arrows in the graph of pushout.

These categorical constructions are characterized as universal constructions, describing a class of objects and the accompanying morphisms that share a common property.

Another important notion of category theory is functors, which are structure-preserving mappings between categories. Given two categories and , a functor maps(i)each object to an object ,(ii)each morphism to a morphism ,(iii)identity morphisms of to identity morphisms of ; that is, ,(iv)the composition of morphisms and in to the composition of their mappings in ; that is, with and composable in , .

For a given category , the identity functor maps each object and each arrow in to itself. Moreover, functors can be composed. Given two functors and , their composition is a functor mapping an object to the object and mapping a morphism to the morphism .

Let us now consider two locally small categories and . For every pair of objects , a functor defines a function in . Then, the functor is(i)faithful if the function is injective,(ii)full if the function is surjective,(iii)fully faithful if the function is bijective.

A functor is an embedding if it is full, faithful, and injective on objects. A functor from to is called a covariant functor, while a functor is called a contravariant functor from to if it reverses arrow; that is, it is a covariant functor from the opposite category to .

In order to comprehend the Yoneda embedding, the more elaborate categorical notions of representable functors are needed. The representable functors map a locally small category to the category . They refer either to incoming morphisms to a fixed object in (generalized elements) or to outgoing morphisms from a fixed object in (generalized properties).

Given a category , the covariant representable functor , which is depicted in Figure 7, can be defined, for each fixed object , and it takes each object to the set of morphisms from to ( is an object of ) and each morphism to the function

This last function takes a morphism (an element of the set in ) to the morphism (an element of the set in ). The covariant representable functor refers to the outgoing morphisms from a fixed object in and to their compositions with other morphisms in .

Given a category , the contravariant representable functor , which is depicted in Figure 8, is defined, for each fixed object , taking each object to the set of morphisms from to ( is an object of ) and each morphism to the function which takes a morphism (an element of the set in ) to the morphism (an element of the set in ). The contravariant representable functor refers to the incoming morphisms to a fixed object in and their compositions with other morphisms in .

Moreover, given a category , the Hom-set bifunctor , which is depicted in Figure 9, is defined, for each pair of fixed objects and , and it takes each such pair of fixed objects and to the set of morphisms from to ( is an object of ) and each pair of morphisms and to the function

This last function takes a morphism (an element of the set in ) to the morphism (an element of the set in ). The contravariant representable functor refers to the incoming morphisms to a fixed object in and to the outgoing morphisms from a fixed point in .

Another important notion of category theory is that of natural transformations, which relate two functors. Given two functors and from to , a natural transformation , which is depicted in Figure 10, is a map , assigning to each object a morphism , called the component of at , such that, for every morphism , it holds that . The components are morphisms in and can thus be composed.

Given two categories and , a functor category denoted by is defined, having as objects the functors from to and having as morphisms the natural transformations between them. In this functor category, is the set of natural transformations from functor to functor ( and are objects of ). The contravariant representable functor for is then denoted by and sends a functor to the set of natural transformations between and and a natural transformation to the function

The function sends a natural transformation to the natural transformation (which is a morphism in and an element of the set ).

An isomorphism can be defined in a functor category . Given two functors (objects of ), if a natural transformation (morphism in ) has an inverse natural transformation such that for every object in it is and , then is called a natural isomorphism, and and are said to be naturally isomorphic; that is, .

Given an object in a category , there is a functor from to the category of functors from to (called the category of presheaves) that sends to the functor . This functor is the Yoneda embedding. The Yoneda embedding also sends a morphism to the natural transformation between functors and , having a component at , which sends a morphism to the morphism .

The Yoneda lemma is an abstract result on functors of the type morphisms into a fixed object, which allows the embedding of any category into a category of functors defined on that category. It states that every functor is naturally isomorphic to the functor ; that is, for every , and this isomorphism is natural both in and . One instance of Yoneda’s lemma is the Yoneda embedding, which informally states that in order to determine an object of a category, it suffices to determine all its generated elements or all its generated properties.

4. The Interplay between FCA and Ontological Concepts

An ontology is the formal expression of relations between relevant notions in order to consistently describe a domain of interest, and in this perspective, it must clearly conceptualize these notions. At the conceptualization level, it is necessary to formally describe the relevant concepts, which are to be included in the ontology.

FCA, on the other hand, is a mathematical theory, which is based on the philosophical statement that a concept is constituted by two parts: its extension consisting of all objects belonging to the concept and its intension comprising all attributes shared by those objects [13]. In FCA, a formal concept is produced from a formal context such as of Section 2 by applying the concept-forming operators to its intension or to its extension; that is, a formal concept is a pair , where is a set of objects and is a set of attributes, such that is the set of all objects which have all attributes from and is the set of all attributes which are common to all objects from .

Based on the assumption that an isolated concept has no meaning, since meaning is a product of the relationships among concepts in a specific context, an extension of FCA has been designed, the so-called relational concept analysis (RCA) [14], in order to deal with multirelational data [15]. RCA provides a generalization of FCA in the case of many-valued contexts. A many-valued context is defined as a context consisting of three sets, namely, , whose elements are called formal objects, with many-valued attributes, and , whose elements are values, and a ternary relation between them, with the conditions and , with being read as “the attribute has the value for the object .” Thus, relational context schemas are useful for attributes that have not just only one but several values for an object.

Moreover, a triadic approach to FCA has been designed, the so-called triadic concept analysis (TCA) [16], as another FCA extension, which is based on a formalization of the triadic relation connecting objects, attributes, and conditions, where a concept is described by its extension, intention, and modus, respectively [17]. In this framework, instead of the dyadic relation between objects and attributes used in FCA, the triadic approach is based on a triadic relation saying that the object has the attribute under the condition . Similar to the dyadic form of FCA, a triadic concept is produced from a triadic context, which is defined as a quadruple , where , , and are sets of objects, attributes, and conditions, respectively, and is a ternary relation between them, by applying derivation operators to its intension, extension, or modus. Thus, triadic concepts can be considered, for example, as those formal concepts that have the same extent but differ according to their intent and modus.

The interplay between FCA and ontologies has been studied to a considerable extent [18]. It turns out that there are basically two ways concerning how FCA and ontological concepts can be combined. The most obvious way is to identify the ontology concepts with the formal concepts of FCA. However, in many applications, the ontology concepts correspond to the FCA attributes [19, 20]. Since the ontological status of concepts has an internal structure, within the framework of FCA, this is sometimes generated by tertiary relations within multivalued contexts [21] or by using distributional formal contexts [22].

The crucial point in specifically describing a concept within an ontology is to find out all the attributes that are shared by this concept. Thus, from an FCA-based point of view, for formally describing a concept, its intentional description is required. In the case where a concept is described by a set of attributes under specific conditions, the notion of triadic contexts is used for its representation [23]. Moreover, each concept within an ontology denotes a specific role emerging from adding specific values to its intentional structure, that is, its attributes. In this perspective, ontology concepts are considered as the relational structures that the RCA process provides [24, 25].

5. The Interplay between Category Theory and Ontological Concepts

Since ontology can be regarded as a categorization of that which exists, issues involving the ontological status of concepts and the notion of a category are important. According to Aristotle, a class or collection of things all of one type, where typicality is determined by a concept, is a category. Later on, a category, as a purely mathematical notion, was defined as having an internal structure given by relationships among members. Thus, the exact match between the terms category and concept has emerged because a concept can be defined as having internal structure, due to their explanatory capabilities [26].

Category theory has been established as an appropriate mathematical basis for the formalization and study of ontologies [27, 28], since it focuses on relationships (categorical morphisms) between entities (categorical objects) in such a way that an entity can be defined according to its interaction with other entities [29]. There have been various category theoretic approaches to ontology engineering, which involves tasks, such as development, mapping, and merging of ontologies. In particular, in the case of ontology mapping and merging, the category of ontologies is defined and the categorical constructions of pullback and pushout are used, respectively [3032].

Formally speaking, ontology is a theory over logic. But many ontologies are described in different languages, each one with its own logic. Thus, in order to integrate ontologies, many approaches using the theory of institutions [33] have emerged, such as in [34, 35] for mapping between different logics to deal with heterogeneous formalization. Moreover, the constructions of category theory have been combined with information flow theory [36], which describes how information about one thing can convey information about something else, for integrating distributed ontologies [37, 38]. Finally, in [39] a rigorous foundation for integrating heterogeneous information is provided, by unifying information flow theory, institutions, formal concept analysis, and category theory.

By using a category theoretic language, ontologies can be represented as categories with finite limits and finite coproducts, where classes (concepts) are represented by nodes and functional relationships by arrows of a directed graph [40], and if an ontology has a representation by a category, then one may apply homological and homotopical algebra constructions to study the ontology [41]. Another view of the category theoretic perspective of ontologies and their underlying classes (concepts) is given in [42], where an ontology is regarded as a mathematical category of theories and a concept is represented by the ideas of theories and models that are borrowed from FCA. Moreover, in [43] an ontology can be described by formal concepts and thus it can be viewed as a category.

6. From the Yoneda Embedding to Formal Concepts

Our main point is the fact that, from an ontological point of view, the formal determination of a concept is guided by the interconnection existing between a concluding remark of the Yoneda embedding lemma of category theory and the intents of the formal concepts of FCA. In order to illustrate this point, an example is used in this section, relying on the assumption that attributes can also be considered as concepts, according to the German standard DIN 2330, stating that attributes “are units of thought which are gained by abstraction, and hence they are also concepts” [19, 44].

Following this statement, we can represent binary relations between entities within an ontology, by using a suitable formal context. For this purpose, we consider the entities of the ontology, both as objects and as attributes of the context. A binary relation present in the context signifies that the respective entities are related. As an example, the ontology of Figure 11, where the concepts , , are hierarchically structured, is represented by the formal context of Figure 12, by using Lattice Miner [45], which is a formal concept analysis software tool for the construction of concept lattices.

In Figure 12, the concept lattice and the formal concepts associated with the formal context of Figure 11 are depicted. It is a representation of a formal context called cross-table. The context has six attributes, , . An “” in a particular location or cell in the cross-table indicates that an object has an attribute, so, for example, the object “” has the attribute “.” Conversely a blank location indicates that the object does not have that attribute.

The concept lattice of Figure 13 contains five concepts, the most general being . The bottom concept is because it is the concept of objects that have all attributes. The remaining concepts are marked with their intent and their extent.

The diagram shows that there is an ordering over the concepts, which is a specialization ordering, denoted by [46].

It is interesting to notice here that the formal concepts represent the incoming relations to an ontology entity; that is, the intents of each formal concept correspond to the incoming arrows to each respective ontological concept. For example, the formal concept represents the incoming relations (arrows) to ontological entity (concept) . Now, if we consider a categorical perspective of the ontology, where the ontology entities are the objects of a category and the relations between the entities are the morphisms (arrows), then the intent of a formal concept represents the incoming arrows to an entity (object). But, from a category theoretic point of view, this remark is equivalent to the remark concluded from the Yoneda embedding lemma, which states that an object is completely determined by the incoming arrows to this object.

This statement yields from the Yoneda lemma, which is interpreted as follows: an object of a category is determined by the network of relationships that the object has with all the other objects in [47]. Thus, a concept can be clearly specified only by identifying the set of all its attributes and a set of attributes that are common for a concept determines completely a concept. This reciprocal condition can guide the building of the ontological status of concepts completely and consistently, since it eliminates polysemy and synonymy.

In a more illustrative example, such as that of the ontology of Figure 14, following the same process and taking into account its category theoretical representation, where concepts are objects of a category and their attributes/relations are morphisms, we observe the alignment between the intent of the formal concept and the incoming arrows to the ontological entity “Europe,” that is, the Yoneda philosophy in unequivocally determining concepts in ontology engineering.

7. Equivalent Points of FCA and Category Theory

In this section, we refer to the possible meeting points of FCA and category theory aiming at direct research into the critical path of formalizing tasks concerning ontology engineering and facilitating progress of the field.

Due to the fact that we focus on the abstract notion of concepts, as well as their formal representation, and not on strict mathematical objects, it is incongruous to talk about equality between different points of mathematical theories, rather than equivalence, or isomorphism between them. Thus, the main question to be answered is not if two concepts are equivalent, but if they are isomorphic, that is, if they are interchangeable in the sense that they have the same relationships with all concepts within a context. In that sense, it will be interesting to investigate the correspondence between the basic notions and constructions of FCA and category theory, such as between (a) the supremum of any collection of formal concepts of a complete lattice and the initial object of a category, (b) the infimum of any collection of formal concepts of a complete lattice and the terminal object of a category, and (c) the application of the concept-forming operators to the intention or to the extension of a formal context and the fact that from an object of a category, one can define a specific functor that encodes the essence of these objects, as well as between other notions.

8. Conclusion

A formal view on ontologies or on ontological concepts can contribute a lot to the direction of achieving effective interoperability, since the coexistence of multiple heterogeneous ontologies within or among ontology-based applications is an objective reality. Thus, by carefully studying FCA and category theory, as the mathematical theories that provide frameworks for structuring, analyzing, and visualizing data to make them more understandable and for combining and maintaining heterogeneous collections of data to enable them to interoperate, respectively, we investigate some equivalent notions of both theories. This can lead, despite the abstract but quite accurate techniques of these theories, to create a unified framework for the ontological representation as well as their management.

Acknowledgments

The authors are indebted to the editor and the anonymous reviewers for their valuable suggestions and comments that greatly contributed to improving the paper.