Abstract
Aiming at enhancing the communication and information security between the next generation of Industrial Internet of Things (Nx-IIoT) sensor networks, it is critical to aggregate heterogeneous sensor data in the sensor ontologies by establishing semantic connections in diverse sensor ontologies. Sensor ontology matching technology is devoted to determining heterogeneous sensor concept pairs in two distinct sensor ontologies, which is an effective method of addressing the heterogeneity problem. The existing matching techniques neglect the relationships among different entity mapping, which makes them unable to make sure of the alignment’s high quality. To get rid of this shortcoming, in this work, a sensor ontology extraction method technology using Fuzzy Debate Mechanism (FDM) is proposed to aggregate the heterogeneous sensor data, which determines the final sensor concept correspondences by carrying out a debating process among different matchers. More than ever, a fuzzy similarity metric is presented to effectively measure two entities’ similarity values by membership function. It first uses the fuzzy membership function to model two entities’ similarity in vector space and then calculate their semantic distance with the cosine function. The testing cases from Bibliographic data which is furnished by the Ontology Alignment Evaluation Initiative (OAEI) and six sensor ontology matching tasks are used to evaluate the performance of our scheme in the experiment. The robustness and effectiveness of the proposed method are proved by comparing it with the advanced ontology matching techniques.
1. Introduction
In the research era of the Next generation of Industrial Internet of Things (Nx-IIoT), the network technology and intelligent computing has become a huge technical model for the government to establish a smart world [1, 2]. Security issues in the Internet of Things (IoT) have also sparked concern with researches rolling in. Particularly, Xiong et al. [3] proposed a LightPrivacy scheme to achieve the tradeoff between user’s personalization privacy protection and the availability of task data in mobile group awareness, whose computational efficiency was significantly improved. Later, they further presented an ATG framework, which was both effective and efficient, and suitable for IoT Mobile Edge Crowd Sensing (MECS) [4]. More recently, Lin et al. [5] proposed an Ant Colony Optimization (ACO) approach to protect information by the transaction deletion, which was able to reduce the side effects while keeping the overall computing cost low. In this fashion, a large number of physical objects embedded with sensors devices exchange information through heterogeneous networks in various applications such as the smart grid, electronic medical treatment, and smart cities [6–10]. To aggregate the information of systems efficiently, their entities should be able to interact with one another in meaningful ways without special effort by humans or machines. And it is worth mentioning that many diverse sensor data management application frameworks have been proposed for uniting and dealing sensors. At the same time, value-added information is provided by spatial data sources for public applications, including sensor networks such as Global Sensor Network (https://gsn.sourceforge.net/), Hourglass (https://www.eecs.harvard.edu/∼syrah/hourglass/), and IrisNet (https://www.intel-iris.net/). However, the heterogeneity of different sensor networks resulting in a lack of interoperability. Therefore, to build a secure Nx-IIoT, the way of aggregate heterogeneous sensor data in different sensor networks is prominent.
Specifically, there are two aspects of work to be done; one is to improve the expressive ability of sensor network models, and the other is to enhance the interaction between sensor networks to achieve data integration. To address the first aspect, an increasing number of sensor ontologies have appeared because of the preponderance of sensor ontology technique, which is able to model the corresponding networks integrally. And to address the other aspect, the ontology matching technique has been on the stage of history in recent years. Furthermore, one of the cutting-edge research institutions in this field is Ontology Alignment Evaluation Initiative (OAEI) (https://oaei.ontologymatching.org/). Recently, ontology alignment extraction technique has been used to strengthen the team. To advance the relevant work, we propose a mechanism for sensor ontology matching with the Fuzzy Debate Mechanism (FDM) based ontology alignment extraction technique, which aims to extract the correct sensor ontology matching pairs in different alignments generated by different basic matching measures. To be specific, we first express the similarity between two sensor entities in the three-dimensional vector space through a fuzzy membership function and then evaluate the similarities in multiple dimensions and the cosine theorem is introduced to evaluate the distance of similarity vectors and the golden one.
The following sections are arranged as follows. Section 2 is an overview of the related work. Section 3 presents a preliminary analysis of the relevant concepts. Details of FDM are provided by Section 4. Section 5 externalizes experiments’ results and makes the corresponding analysis, and Section 6 concludes the work.
2. Related Work
A growing number of sensor ontologies have appeared due to the sensor ontology possesses powerful sensor network model expression ability, i.e., SensorOntology 2009 ontology, SSN ontology and IoT-Lite ontology, and so on [11–13]. And to enhance the interaction between sensor networks to achieve data integration, the sensor ontology matching technique has been brought out these years [14].
In the research upsurge, there are two technical routes: ontology meta-matching (OMM) techniques and ontology entity matching techniques. The ontology entity matching techniques try to determine the entity correspondence set between two ontologies directly, while the OMM techniques try to solve the problem of aggregate different similarity measures with appropriate weights [15]. There are plenty of popular technical approaches in computing intelligence (CI) to solve OMM problems, e.g., machine learning (ML), evolutionary computing (EC), and swarm intelligence (SI). For example, many ML technologies [16–21] have been proposed to automatically determine ontology alignment, and experiments have shown that ML greatly improves matching efficiency, and the genetics for ontology alignment (GOAL), which was designed to optimize aggregate weight sets for different matchers [22–25]. To overcome the disadvantage of excessive reliance on reference alignment, Xue et al. put forward the partial reference alignment (PRA) and the unanimous improvement rate (UIR) [26]. Furthermore, Xue et al. proposed a series solution using compact algorithm (CA) and sensor ontology meta-matching technique to aggregate weight sets for different matchers [27].
However, in the existing ontology matching methods based on CI, various ontology matchers are regarded as tools with the same effect and try to aggregate their outputs by determining the optimal weights [28]. The matching quality will decrease if ignoring the influence of different entity mappings on the matching results of different matching devices. Furthermore, adjusting weights with this method can be problematic; that means they may not be reusable in different ontology matching scenarios. Hence, the influence of entity mappings on matching result cannot be ignored, which is addressed by ontology alignment extraction technique [29]. Recently, the context extraction technique has been widely applied in semantic field [26, 30], in which OntoLT uses terminology extraction, ontology structure mapping, the statistical method, and the language model of definition to extract ontology concept [31]. Besides, Gaeta et al. implemented several statistical and data mining algorithms to identify and extract the concepts as well as their relationships in ontologies [32]. In addition, the rule of extraction process is typically described as a series of agreement reaching processes, such as argumentation frameworks [33]. In concrete terms, the argumentation framework proposed by Laera et al. relies on preferences between a formal argument operation pattern and a particular type of argument, taking into account ontology-based arguments and propositions specific to the matching task [34]. Dos-Santos and Euzenat used argumentation as a supporting or rejecting parameter and proposed a computational strategy to remove inconsistencies in the result alignment and allow consistency in the argumentation system [35].
It is worth mentioning that the similarity of ontology matching and the relation between entities and the similarity threshold can all be regarded as uncertain problems, in which fuzzy logic is highly adept [36]. In the proposal of [37], the ontology matches are expressed by the fuzzy set of reference concepts or instances that makes the new ontology be directly compared with the original one. Todorov et al. proposed a fuzzy ontology alignment using background knowledge [38]. UFOM adopts fuzzy set theory as the general framework of fuzzy ontology matching, which represents many types of correspondences and describe the uncertainty in the process of correspondence discovery [39]. And Cross discusses how to extend the process of ontology concept matching by using similarity measure and integration of fuzzy sets [40].
3. Problem Definition
3.1. Sensor Ontologies
An ontology is composed of concept set, attribute set, and instance set, and the ontology refers to three tuples , among which , refer to concepts set, properties of datatype, and properties of object, respectively, called ontology entities. The semantic sensor network (SSN) (https://purl.oclc.org/NET/ssnx/ssn#) ontology can be regarded as the authoritative ancestor of sensor ontology [41]. It is an OWL 2 ontology put forward by the W3C Semantic Sensor Network Incubator group (SSN-XG) (https://www.w3.org/2005/Incubator/ssn), which models sensors and observation data and represents sensors in accordance with the function, measurement process, observation data, and so on. SensorOntology 2009 (https://www.w3.org/2005/Incubator/ssn/wiki/SensorOntology2009) ontology developed by Michael Compton, i.e., from CSIRO (Australia) has come into use as the source of the SSN Ontology. And OSSN (https://www.w3.org/ns/ssn) is an ontology established by SSN-XG in the year from 2009 to 2011 [42]. Furthermore, the resources, entities, and services in the Internet of Things (IoT) are summarized by the IoT-Lite (https://www.w3.org/Submission/2015/SUBM-iot-lite-20151126/) ontology outlines, which is a lightweight ontology and a case of an SSN ontology and the latest version was submitted in 2015 [43]. The Sensors, Observations, Samples, and Actuators (SOSA) (https://www.w3.org/ns/sosa) ontology is designed for a broad target audience and applications that have access to the ontology and was released in 2017. In addition, SOSA acts as a minimal interoperability fallback layer; that is, it defines those public classes and attributes whose data can be securely exchanged between the SSN, its modules, and all information used by SOSA.
3.2. Ontology Matching and Sensor Ontology Alignment Extraction
Ontology matching is the process of determining the entity correspondence between source ontology and target ontologies to bridge the semantic gap between them. As shown in Figure 1, the input of ontology matching process is a pair of ontologies to-be-matched, and the output is the final alignment. The research on ontology matching mostly involves the calculation and refinement of similarity measure, but the research on extracting the final alignment from similarity measure matrix, which is called ontology alignment extraction technique, is less.

To be specific, sensor ontology alignment extraction technique works by extracting entity correspondences from different matching suggestions generated from different similarity measures for the same sensor ontology matching task to form the final alignment [28]. The framework of ontology alignment extraction is shown in Figure 1, in which the systems to finish the matching work are regarded as ontology matchers. The set of entity correspondences determined by an ontology matcher is called an ontology alignment. In addition, a corresponding set is the alignment between two sensor ontologies, where the entity correspondence is referred to a 4-tuple , and are, respectively, two ontologies’ entities, and is their confidence value, while acts as the equivalence relation.
3.3. Fuzzy Similarity Measure
Since some of the wireless sensor network (WSN) domain concepts have not yet been incorporated into a common dictionary, there is no lexical tool to define the linguistic relationships between all concepts for mapping purposes. To conquer the difficulty, our proposal uses a variety of metrics as tools in similarity calculations. Each tool gives different matching suggestions; the matching results between entities become uncertain information. In this case, a unified framework for representing many different modes of inconclusive information is provided by the use of a fuzzy measure to depict inconclusive information [44]. For this reason, we decide to use the fuzzification process to combine them to get more accurate alignment. In other words, the work of fuzzy similarity measurement adopted here is supplemented by two parts; one is the basic similarity measurements, also known as the entity matching measure (EMM), and the other is the membership function of fuzzification.
An EMM is always described as the function to output similar values with information from two entities as input in the interval [0, 1], which always plays the role of a basic ontology matcher. EMM can be grouped into three types: first is the string-based type, second is the linguistics-based type, and the third one is called taxonomy-based EMM.
A string-based EMM outputs the edit distance between entities by considering their IDs, tags, comments, and so on. In this work, two well-known EMMs were used to improve the quality of the matching results, namely, the Levenshtein similarity [45] and the Jaro similarity metric [46]. Given two strings, and , Levenshtein-similarity is defined in the following:where and are, respectively, the length of se1and ; is the number of required operations to transform to ; given the strings and , define as a character common to in ; that is, they appear in the same order in and . Similarly define in . Now, assume that and are transposed at position when their th bits are not the same and the value of is 50% of the transpositions of and in number. According to the assumption above, the Jaro similarity measure of and is defined in the following:
Linguistics-based EMMs often use external corpus or dictionaries (e.g., WordNet [47]) to calculate similarity values through entity tags. Because of the effectiveness of the WordNet, we used WordNet-based distances. Besides, based on the distance WordNet-based on the WordNet electronic vocabulary, the equation is defined as follows:where and are a word derived from two entities and denotes the number of words ’s meanings.
The similarity values obtained by the taxonomy-based EMMs are based on the consideration of the entity structure information as well as the superentity and subentity information. To be specific, in our proposal, the taxonomy-based EMM makes use of the structure-based distance calculated by noted algorithm, which adopts a versatile graph matching method called similarity flooding (SF) [48], where an iterative fixpoint computation below is applied to produce congruent relationship between the two ontologies’ elements:
In this formula, is the normalized process, is the value of the last iteration that changes in each iteration, and is a function of increasing the similarity of pairs of elements based on the similarity of their adjacent elements.
The flowchart of the fuzzy similarity measure application process is depicted in Figure 2. The work in pre-processing generally includes conversion ontology format and ontology analysis. And the similarity matrixes are generated from the four basically similar measures adopted in this article. Usually, a matcher determines whether two entities are correct matching pairs by comparing the threshold value with the similarity value. But it is difficult to find an accurate threshold to make the matching result completely correct. Therefore, we introduce the membership function in fuzzy theory and consider the similarity value from the “low,” “medium,” and “high” dimensions. The fuzzy process changes the elements in the similarity matrixes and changes them into vectors through membership function before participating in the Debate Mechanism. In this work, the membership function of fuzzy process is defined as three subfunctions as follows:where is an element of a similarity matrix which stands for the similarity . According to previous practical experience, we believe that matching pairs with similarity less than 0.7 are mostly mismatched, and 0.5 is a medium similarity in similarity interval [0, 1] [28]. Therefore, is used to measure the degree of low similarity. describes the degree of medium size and then uses the direct proportionality function to describe the degree of high similarity as the . After that, we take the three function values as the coordinates of the 3D vector and input them into the Debate Mechanism.

4. Debate Mechanism
In this work, ontology alignment extraction process is carried out by the Debating Mechanism, in which the debating rules is utilised to extract the target information. The Debating Mechanism contains a classification module of correspondences, where a fuzzy measure is built to express the similarities of correspondences, and an argumentation framework, which is used to negotiate different matching suggestions between matchers to reach agreement. In addition, a fuzzy measure is built to express the similarities of correspondences as is shown in Figure 3 that depicts the framework of Debate Mechanism.

The classification module and argumentation framework are described in the following paragraphs.
In the classification module, assume that a matching task is working on two ontologies, and , by basic ontology matchers. Extend an entity correspondence in an ontology matcher as an argument , which is defined as follows:where , (), and (), respectively, express a correspondence, the artificially preset matcher number, and the measure factor of similarity value.
Assume that reference vector ; change the similarity value into similarity fuzzy vector in vector space. Then, we describe as follows:where is a similarity fuzzy vector and is set as the similarity’s threshold. Especially, while , the matcher rejects ; otherwise, it accepts it. Suppose that is allocated to one of the five groups , that are, respectively, defined as follows: is the number of matchers, is the number of matchers that accepts , is the number of matchers that rejects , . Next, classify into groups from to in category column according to the above situation. For detailed classification, see Table 1.
Since is regarded as a correct correspondence which is accepted by all the matchers, while is rejected oppositely, the correspondences in groups and are straightway judged as right correspondences or the false ones in the process of extraction without participating in the following process. Besides, correspondences of , , and groups are in list of the argumentation process.
In the process of argumentation, two arguments are given as and. Four relationships are defined between and , which are , , , and . To be specific, is marked as , and , support, and disprove are expressed as , , and . The details are listed in the following descriptions:(i)When , is united with , which is denoted as .(ii)When , attacks , which is denoted as.(iii)When or , , , , or when , , , supports , which is represented by .(iv)When , , , , or when , , , , disproves , that is depicted as .
The four relationships between arguments are depicted in Figure 4. There are three matchers, i.e., , , and including their arguments , , and . and happened between arguments from different matchers, but and from the same one.

The arguments set is defined as a 7-tuple: , where , , , are the relationships mentioned above, is defined as the set of matchers that contains basic ontology matchers, argument is related to correspondence , and an d acts as strength value of according to a matcher , which is defined as follows:
In this paper, is an element of corresponding arguments existing in every basic ontology matcher, and we need to calculate its judgment factor whose value is determined in argumentation process to determine whether it can be extracted into the final alignment. Consequently, an essential challenge is to improve the reliability of an entity mapping’s judgment element. To meet that challenge, as is defined in formula (9), each matcher’s correctness factor is adopted to evaluate the credibility of the matcher depending on how similar it is to the matching recommendations of other matchers. And the support strength and disprove strength in debating process are utilized to offer the evidence of right mappings. In addition, the can be obtained after full assessment of the factors mentioned above.
To be specific, the argumentation process is arranged as follows: Step 1. Apparently, is (or ) when belongs to (or ), and the similarity values of corresponding rows and columns of can be deleted from the similarity matrix. Step 2. Matcher ’s correctness factor is calculated in the following: Step 3. In every matcher, the debating process is brought forward in accordance with the relationships “” and “”:(1)In the group, most matchers support these correspondences. Therefore, the success of the supporting part in defeating the disproving part is calculated. The above situation is explained as follows. The support strength of matcher is defined as follows: where argument , argument , , and . When three matchers , , and support but disproves it, it is determined that the mapping can be established between and when , and , and is set to 1. Otherwise, is transformed into , performing step 4.(2)In a similar way, for the group of argument, it is necessary to calculate whether the attack of the opposing side against the supporting side is successful. Assuming that the matcher ’s disprove strength is defined as follows: where argument , argument , , and . In this situation, if the three matchers , , and disprove , but support, obviously, the mapping cannot be established between and when , and , and . Otherwise, is converted into group. Step 4. For the arguments in group, the number of matchers who took the opposite view was almost even, due to what they can be divided into two party called and . Therefore, the core challenge is to figure out which of the two parties defeat the other successfully. To end this, the power of each party is calculated, which is defined as follows: where are the members of the party. Assume that for , if , . Otherwise, . Step 5. Select the correspondences with , which are further used to decide the final alignment.
5. Experiment and Results
5.1. Alignment Evaluation Metric
The alignment can typically be assessed with two measures, often referred to as and [49], which are severally ruled in the following:where the alignment given is , and the reference alignment is . Particularly, when found all correct matching pairs,. And stands for that all the matching pairs found are correct. In order to combine the two metrics, the is further employed, which is regarded as a comprehensive measure of and [38]:
5.2. Experimental Testing Cases
In this experiment, we take advantage of the testing cases in Bibliographic track (https://oaei.ontologymatching.org/2016/results/benchmarks/index.html) from OAEI as well as six pairs of real sensor ontology matching tasks to verify the sensitivity and availability of our recommendation. Table 2 shows a brief description of OAEI’s Bibliographic track, where two ontologies to be mapped and a reference alignment to evaluate the effectiveness of ontology matcher are included by each test case. Table 3 depicts the main features of sensor ontologies.
In this experiment, the similarity threshold, set empirically to 0.85, guarantees the highest alignment quality achieved on average in all test cases.
5.3. Experimental Results
When performing testing cases from the Bibliographic track, we compare the result of our suggestion with OAEI’s participants, i.e., AML, edna, and LogMapLt from the standpoint of . Figure 5 shows the experimental results for all types of testing cases in the Bibliographic track of OAEI.

As can be seen from the presentation information in Figure 5, for most testing cases, our proposal outperformed other methods due to the application of a Debate Mechanism that integrates the advantages of various basic similarity measures; the matching problem can be considered synthetically from different angles in ontology matching. In some cases, the for all matching techniques is nearly zero, which is due to the complexity of testing cases. By contrast, our proposal does a better job in these cases as listed in Table 4, which masks a small step forward in our proposal on the basis of cutting-edge work.
When performing sensor ontology matching tasks, we compare the result of our proposal with four basic EMM that are mentioned above, i.e., Levenshtein-similarity, Jaro similarity metric, WordNet-based distances, and similarity flooding in terms of , , and .
Figure 6 depicts the result of our scheme for matching six pairs of real sensor ontologies and compares them with four basic EMMs, and the results show our proposal typically achieves very high capacity with the golden alignment. Furthermore, the application of fuzzy measure extends the single-dimensional evaluation on similarities judging by basic similarity measures to three-dimensional assessment, which fully express the similarity to gain a high-quality alignment.

In a word, FDM can significantly improve the accuracy of search results, and at the same time ensure a high recall rate, in all kinds of matching tasks that are superior to other competitors.
6. Conclusions
Semantic connections among different sensor ontologies are of great significance to Nx-IIoT’s communication quality and information security [50]. Therefore, a sensor ontology aggregating method based on Fuzzy Debate Mechanism is proposed, which extracts the ultimate alignment by performing arguments between different entity matching measures. A fuzzy similarity measure is presented to improve the alignment’s quality, which models two entities’ similarity in the vector space and their semantic distance is calculated by using cosine function. The Bibliographic tracks provided by OAEI and five real sensor ontologies were used to calculate the performance of the proposed method in this experiment. Compared with the most advanced ontology matching technology and four basic ontology matchers, the robustness and effectiveness of our proposal are verified.
Looking to the future, there are two challenges in sensor ontology alignment extraction technique: one is to measure entity similarity, and the other is how to develop extraction rules to tune the quality of alignment. In one hand, we need to further innovate the similarity measurement of domain-specific ontology to adapt to its fine-grained and complex structure. In the other hand, there is the need of approaches that can deal with the problem of uncertainty generated in the matching process. [50].
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors declare that they have no conflicts of interest in the work.
Acknowledgments
This work was supported by the Natural Science Foundation of Fujian Province (No. 2020J01875) and the National Natural Science Foundation of China (Nos. 61773415, 61801527, and 61103143).