Abstract

Efficient matching of incoming mass events to persistent queries is fundamental to complex event processing systems. Event matching based on pattern rule is an important feature of complex event processing engine. However, the intrinsic uncertainty in pattern rules which are predecided by experts increases the difficulties of effective complex event processing. It inevitably involves various types of the intrinsic uncertainty, such as imprecision, fuzziness, and incompleteness, due to the inability of human beings subjective judgment. Nevertheless, D numbers is a new mathematic tool to model uncertainty, since it ignores the condition that elements on the frame must be mutually exclusive. To address the above issues, an intelligent complex event processing method with D numbers under fuzzy environment is proposed based on the Technique for Order Preferences by Similarity to an Ideal Solution (TOPSIS) method. The novel method can fully support decision making in complex event processing systems. Finally, a numerical example is provided to evaluate the efficiency of the proposed method.

1. Introduction

Nowadays, there has been increasing interest in distributed applications which require processing continuously flowing data from geographically distributed sources to obtain timely responses to complex queries, such as data stream processing (DSP) systems [18] and complex event processing (CEP) systems [916]. In principle, DSP systems differ from CEP systems as the DSP systems focus on transforming the incoming flow of information, while the CEP systems focus on detecting patterns of information that represent the higher-level events [17, 18]. Because of the advantages such as expressive rule language and efficient event detection model, CEP systems have been highly concerned in academic circles and industry recently [1925].

In CEP systems, event streams are processed in or near real time for a variety of purposes, from wireless sensor networks to financial tickers and from traffic management to click-stream inspection [17, 18, 2628]. In those application domains, accurate and effective complex event processing is critical for dealing with real-world events, whereas there are two possible origins for uncertainty which increases the difficulties of accurate and effective complex event processing, such as uncertainty originating at the event source and uncertainty resulting from event inference [29]. For uncertainty originating at the event source (i.e., raw event), there may be uncertainty associated with either the event occurrence itself or the event’s attributes, due to a feature of the event source. For uncertainty resulting from event inference (i.e., derived events), they are based on other events and uncertainty can propagate to the derived events. Nevertheless, in DSP systems, many kinds of strategies have been developed to handle the above two types of uncertainty [2933].

However, most of the CEP systems are dealing with the flow of events where the basic premise underlying the design of the CEP queries proposed is that the associated pattern rules of events are predefined. Moreover, many of predefined pattern rules are based on human beings subjective experience so that it is not necessarily accurate in practice. While rule based reasoning system structure looks simple, the acquisition of knowledge is a big bottleneck in the system. Because the rules come from experts, there are no self-learning functions. Such a behavior gives rise to poor quality of query results. Furthermore, in conventional CEP systems, it inevitably involves various types of the intrinsic uncertainty, such as imprecision, fuzziness, and incompleteness, due to the inability of human beings subjective judgment. In these cases, it seems more reasonable to consider fuzzy situation in the CEP systems.

Due to the efficiency to handle uncertainty and fuse data, some math tools such as fuzzy sets, evidence theory, and probability method are widely used in decision making [3442], risk analysis [43], diagnosis [44], and optimization problem [45]. Needless to say, fuzzy sets theory introduced by Zadeh is a good approach to settle with the uncertain information [35, 4650]. The Technique for Order Preferences by Similarity to an Ideal Solution (TOPSIS) [51] method proposed by Hwang and Yoon (1981) is widely used in decision making [52]. Moreover, the extension of the TOPSIS [51] method was developed to handle fuzzy data [53, 54]. The TOPSIS method is applied widely in MCDM field. And many applications have proved its effectiveness,whereas there still exist some shortcomings. One of the open questions is that current MCDM based on the TOPSIS method cannot adequately handle these types of uncertainties, such as imprecision, fuzziness, and incompleteness, due to the inability of experts’ subjective judgment. D numbers proposed by Deng [55] is a new mathematic tool to model uncertainty, since it ignores the condition that elements on the frame must be mutually exclusive. Compared with existing methods, D numbers can efficiently represent uncertain information and are more tallied with the actual situation. There is an increasing number of applications about D numbers, such as FEMA analysis [56], failure mode analysis [57], environment assessment [5860], supplier selection [61], and product engineering [62].

To address these issues, we propose an intelligent complex event processing strategy with D numbers under fuzzy environment based on the TOPSIS method. The novel method can deal with pattern-rule based uncertainty under fuzzy environment to realize accurate and effective event stream processing. Effectiveness of the proposed method is evaluated through a numerical experiment.

The rest of this paper is organized as follows. Section 2 briefly introduces the preliminaries of this paper. After that, Section 3 proposes an intelligent complex event processing strategy with D numbers under fuzzy environment. Section 4 gives a numerical example to show the effectiveness of the proposed method. Finally, Section 5 gives a conclusion.

2. Preliminaries

2.1. Dempster-Shafer Evidence Theory

Dempster-Shafer evident theory [63, 64], also known as evidence theory, is used to handle uncertain information, belonging to the category of artificial intelligence. As a theory reasoning under the uncertain environment, it needs weaker conditions than the Bayesian theory of probability. When the probability is confirmed, Dempster-Shafer theory could convert into Bayesian theory, so it is often regarded as an extension of the Bayesian theory. Dempster-Shafer theory has the advantage of directly expressing the “uncertainty” by assigning the probability to the subsets of the set composed of multiple objects, rather than to an individual object. Besides, it has the ability to combine pairs of bodies of evidence or belief functions to derive a new evidence or belief function. Based on the Dempster-Shafer evident theory, Deng proposed the generalized evidence theory to extend the classical evidence theory [34]. For completeness of the explanation, some basic concepts are introduced as follows.

Definition 1 (frame of discernment). Let be a set of mutually exclusive and collectively exhaustive, indicated byThe set is called frame of discernment. The power set of is indicated by , whereand is an empty set. If , is called a proposition.

Definition 2 (mass function). For a frame of discernment , a mass function is a mapping from to , formally defined by which satisfies the following condition:

In the Dempster-Shafer evident theory, a mass function is also called a basic probability assignment (BPA). If , is called a focal element; the union of all focal elements is called the core of the mass function.

Definition 3 (Dempster’s rule of combination). Dempster’s rule of combination, also called orthogonal sum, denoted by , is defined as follows:withwhere and are also elements of and is a constant to show the conflict between the two BPAs. Note that Dempster’s rule of combination is only applicable to such two BPAs which satisfy the condition .

2.2. Number Theory

D number theory is a generalization of Dempster-Shafer evidence theory proposed by Deng [55]. In the classical Dempster-Shafer theory, there are several strong hypotheses on the frame of discernment and basic probability assignment and still some shortcomings, which limit the ability of Dempster-Shafer theory to represent some types of information and restrict the application in practice. D number theory, as an extension and development method, is defined as follows.

Definition 4 ( number). Let be a finite nonempty set; a D number is a mapping formulated bywithwhere is a subset of .

It seems that the definition of D numbers is similar to the definition of BPA. However, in D number theory, the elements of do not require to be mutually exclusive. In addition, being contrary of the frame of discernment containing overall events, is acceptable to incomplete information by .

Furthermore, for a discrete set , where and when , . A special form of D numbers can be expressed byor simply denoted as , where and .

Definition 5 (two D numbers’ rule of combination). Let , be two D numbers; the combination of and , indicated by , is defined bywithwhere and .

In the meanwhile, an aggregation operator is proposed on this special D number; it is defined as below.

Definition 6 (D numbers’ integration). For , the integrating representation of is defined as

2.3. TOPSIS Method with Fuzzy Data

Fuzzy set theory [65] provide an alternative and convenient framework for modeling of real-world fuzzy decision systems mathematically [6668]. A fuzzy set is any set that allows its members to have different grades of membership in the interval . It consists of two components: a set and a membership function associated with it.

Definition 7 (fuzzy set [69]). Let be a collection of objects denoted generally by ; a fuzzy subset of , , is a set of ordered pairs:where is called the membership function (generalized characteristic function) which maps to the membership space . Its range is the subset of nonnegative real members whose supremum is finite.

Definition 8 (triangular fuzzy number [66]). A fuzzy number is a fuzzy subset of . And a triangular fuzzy number can be defined by a triplet shown in Figure 1, in which , , and are real numbers with . Its membership function is defined as

Definition 9 (distance between two triangular fuzzy numbers [53]). Let and be two triangular fuzzy numbers; then the vertex method is defined to calculate the distance between them as

The main idea of TOPSIS is that the best compromise solution should have the shortest Euclidean distance from the positive ideal solution and the farthest Euclidean distance from the negative ideal solution.

The procedures of TOPSIS method with fuzzy data can be described as follows. Let be the set of alternatives, be the set of criteria, and be the performance ratings with the criteria weight vector .

Step 1 (calculate normalized ratings). Let and be the set of benefit criteria and cost criteria, respectively. The normalized value is calculated by

Step 2 (calculate weighted normalized ratings). In the weighted normalized decision matrix, the modified ratings are calculated bywhere is the weight of the sth criteria.

Step 3 (determine the fuzzy positive and negative ideal solutions). The elements , , are normalized positive triangular fuzzy numbers and their ranges belong to the closed interval . Then, the fuzzy positive ideal solution (FPIS, ) and the fuzzy negative ideal solution (FNIS, ) are derived as follows:where and , .

Step 4 (calculate the distance of each alternative from the FPIS and the FNIS). The distance of each alternative from and can be currently calculated aswhere is the distance measurement between two fuzzy numbers.

Step 5 (calculate the relative closeness coefficient to the positive ideal solution). The relative closeness coefficient for the alternative with respect to is

Step 6 (rank the alternatives). Obviously, an alternative is closer to the FPIS () and farther from FNIS () as approaches 1. Therefore according to relative closeness coefficient to the ideal alternative, larger value of indicates the better alternative .

3. The Proposed Method

Before introducing the system architecture of intelligent CEP, we first have a clear understanding of the definition of an event [10, 11]. An event that represents an atomic instance is an occurrence that is of interest at a point in time. Basically, events can be classified into primitive events and composite events. A primitive event instance is predefined as a single occurrence of interest that cannot be split into any small events. A composite event instance that occurs over an interval is created by composing primitive or composite events. A pattern rule is a template, specifying one or more combinations of events by the nesting of sequences (SEQ) and conjunctions (AND), which can have negative event type(s), and their combination. In the following, denotes an event type which can be either primitive or composite. Some details were presented in [70].

Definition 10. A SEQ operator [13] specifies a specific order according to the start time-stamps in which the event must occur, to match the pattern, and thus form a composite event.

For example, SEQ (History_taking, Physical_examination, Laboratory_examination) consists of SEQ operator that can be used to monitor the patients who have completed medical diagnosis.

Definition 11. An AND operator [13] takes a set of event types as input, and events occur within a specified time window without a specified time order.

For example, AND (Laboratory_examination, Surgery) consists of AND operator that can be used to monitor the patients’ preoperative and postoperative situations.

Currently, the pattern-rule base of CEP systems consisting of a set of pattern rules is predefined where it is based on human beings subjective experience so that it is not necessarily accurate in practice, whereas the acquisition of knowledge is a big bottleneck in the system and most researches on CEP systems seldom discuss how to handle such kinds of problems. As we discussed in Section 1, because the rules come from experts, there is no self-learning function. The CEP engine based on the inaccurate predefined pattern-rule base will generate poor quality of query results. Furthermore, in conventional CEP systems, it inevitably involves various types of the intrinsic uncertainty, such as imprecision, fuzziness, and incompleteness, due to the inability of human beings subjective judgment.

In this section, a novel intelligent complex event processing strategy with D numbers under fuzzy environment is proposed based on the Technique for Order Preferences by Similarity to an Ideal Solution (TOPSIS) method. The proposed method can fully support decision making in CEP systems.

The proposed system architecture of intelligent CEP is shown in Figure 2; it mainly involves two components: event collector engine and CEP engine under D number theory based on TOPSIS method. We will explain each component in detail.

Component 1: Event Collector Engine. The event collector engine, which collects events from data sources, first generates a unified formal definition of the event flow.

Component 2: CEP Engine under D Number Theory Based on TOPSIS Method. Then, the CEP engine processes the collected simple events through the pattern rules in which the events generated at this component are called complex events, whereas the D number theory based on TOPSIS method, which is the addition to the CEP engine, in view of the detected uncertainty event set, is used for pattern-rules analysis to select the best one from the decision candidates. Finally, the generated new events are sent out directly to particular users, or reused as input events, or used to do further analysis.

The main steps of the proposed method are shown as follows, and the basic flowchart is given in Figure 3 for the better understanding of the concept.

Step 1. Make certain of the detailed information of the decision-making problems, including the goal and criteria.

Step 2. After determining the goal and criteria, it forms the pattern-rule base by leveraging D number theory based on TOPSIS method.

Step 3. For the intelligent CEP, it then collects simple events by the event collector engine from the data sources.

Step 4. Based on the pattern-rule base, the intelligent CEP system can process the collected simple events and generate new complex events by the CEP engine.

4. Numerical Example

In this section, in order to illustrate the application of the proposed method, a simple numerical example is given. Suppose that there are three pattern-rule alternatives () which may be considered for further generating a composite event in the CEP system, namely, , , and . Meanwhile, there are four evaluation criteria and three experts need to select the most suitable pattern-rule candidate from them for better supporting decision making in CEP systems. Three experts give different assessment results to different alternatives in terms of different evaluation criteria. Four benefit criteria are considered:(1)Accuracy ().(2)Response Time ().(3)Cost ().(4)Operability ().The proposed method is now applied to solve this decision problem. Then, computational procedure is summarized as follows.

Step 1. The experts use linguistic weighting variables to assess the importance of the criteria, which is shown in Table 1.

Step 2. In this paper, the criteria weights for different experts determined by variation coefficient method are adopted and presented in Table 2.

Step 3. Using the linguistic rating variables of Table 1, the ratings of the three decision alternatives by different experts under four evaluation criteria are obtained as shown in Table 3. Furthermore, the fuzzy numbers of three candidates by experts under all criteria can be obtained as shown in Table 4.

Step 4. Based on the fuzzy numbers of Table 4, the fuzzy normalized decision matrix can be constructed as shown in Table 5.

Step 5. The closeness coefficient of each alternative can be calculated as shown in Table 6.

Step 6. Based on Table 6, the D number for can be represented as shown in Table 7.

Step 7. The results of can be represented as shown in Table 8.

Step 8. The integration representation of every is calculated by using (12). Table 9 shows these values of , , and . According to these values, the ranking of alternative is obtained; it is , where “” represents “better than.”

5. Conclusion

In this paper, we started off with identifying the uncertainty problems in terms of pattern rules of CEP systems. We proposed an intelligent complex event processing strategy with D numbers under fuzzy environment, which could fully support decision making for guaranteeing effective complex event processing. The main idea of the proposed strategy is that we applied D numbers based on TOPSIS method into the CEP systems. A numerical example was provided to evaluate the effectiveness of our proposed method.

Competing Interests

The author declares that he has no competing interests.

Acknowledgments

This work was supported in part by National Natural Science Foundation of China (no. 60904099), Foundation for Fundament Research of Northwestern Polytechnical University (no. JC20120235), Fundamental Research Funds for the Central Universities (no. XDJK2015C107), and the Doctoral Program of Higher Education (no. SWU115008).