Abstract

Many industrial processes and physical systems are spatially distributed systems. Recently, a novel 3-D FLC was developed for such systems. The previous study on the 3-D FLC was concentrated on an expert knowledge-based approach. However, in most of situations, we may lack the expert knowledge, while input-output data sets hidden with effective control laws are usually available. Under such circumstance, a data-driven approach could be a very effective way to design the 3-D FLC. In this study, we aim at developing a new 3-D FLC design methodology based on clustering and support vector machine (SVM) regression. The design consists of three parts: initial rule generation, rule-base simplification, and parameter learning. Firstly, the initial rules are extracted by a nearest neighborhood clustering algorithm with Frobenius norm as a distance. Secondly, the initial rule-base is simplified by merging similar 3-D fuzzy sets and similar 3-D fuzzy rules based on similarity measure technique. Thirdly, the consequent parameters are learned by a linear SVM regression algorithm. Additionally, the universal approximation capability of the proposed 3-D fuzzy system is discussed. Finally, the control of a catalytic packed-bed reactor is taken as an application to demonstrate the effectiveness of the proposed 3-D FLC design.

1. Introduction

Many industrial processes and physical systems such as industrial chemical reactor [1, 2], semiconductor manufacturing [3], and thermal processing [4] are “distributed” in space. They are usually called spatially distributed systems, or distributed parameter systems [1]. The states, controls, and outputs of such systems depend on the space position as well as on the time [2]. Traditionally, model-based methods are used to control such systems, where a good mathematical model is definitely required. However, the process model may not be easily obtained in many complex situations, and then, a model-free control method has to be used. This leads to the recent development of the novel three-dimensional fuzzy-logic control (3-D FLC) [58], which has the inherent capability to process spatiotemporal dynamic systems. The 3-D FLC uses one kind of three-dimensional (3-D) fuzzy set (shown in Figure 1), which is composed of the traditional fuzzy set and a third dimension for the spatial information, and executes a 3-D rule inference engine. It is actually a kind of spatiotemporal fuzzy-control system with the traditional model-free advantage.

To date, the 3-D FLC design has been focused on an expert-knowledge-based approach [5], that is, the fuzzy-rule design is from human experts’ knowledge. In this approach, human knowledge to the control solution must exist, and be structured. Practically, experts may have problems structuring the knowledge [9]. Sometimes, although experts have the structured knowledge, they may sway between extreme cases: offering too much knowledge in the field of expertise, or tending to hide their knowledge [9]. Thus, we often lack expert knowledge for control that is usually hidden in an input-output data set. Under this circumstance, a data-driven design becomes a good choice for the 3-D FLC, that is, extraction of fuzzy rules from a spatiotemporal input-output data set. Since the research on the 3-D FLC is just at the beginning stage, extracting 3-D fuzzy control rules from a spatiotemporal data set is still a challenging and open problem for spatially distributed systems.

Traditional data-driven FLC design methods have been developed in the past three decades. They are usually composed of three parts: rule generation, structure optimization, and parameter optimization [10]. For instance, grid partitioning of multidimensional space [11] and clustering technique [12] can be used to generate rules automatically; reducing redundancy variable [12], fusing similar clusters [13], and fusing similar fuzzy set [14] can be applied to reduce the rule number and realize the structure optimization; genetic algorithm [15] and gradient decent approach [16] can be adopted for fine tuning of membership function and realize the parameter optimization. For a complete review of data-driven fuzzy system design, one can further refer to [10]. These methods provide useful solutions to a traditional FLC design.

In this study, we aim at developing a new data-driven 3-D FLC design method based on clustering and SVM-regression learning. The initial 3-D rule base is first generated by a nearest-neighborhood-clustering method from a spatiotemporal data set via defining Frobenius norm as a distance. Then, the initial 3-D rule base is simplified based on similarity measure technique defined for 3-D fuzzy sets and 3-D fuzzy rules. Subsequently, an SVM-regression learning algorithm is used to learn the parameters of the rule consequent parts. In addition, the universal approximation capability of the proposed 3-D fuzzy system is discussed.

The paper is organized as follows. Preliminaries about 3-D FLC and SVM regression are addressed in Section 2. In Section 3, a clustering and SVM-regression learning-based 3-D fuzzy control design methodology is presented in detail. In Section 4, the universal approximation capability of the proposed 3-D fuzzy system is presented. In Section 5, a catalytic packed-bed reactor is presented as an example to illustrate the proposed design scheme of a 3-D FLC and validate its effectiveness. Finally, conclusions are given in Section 6.

2. Preliminaries

2.1. 3-D FLC

The 3-D FLC is designed to have the inherent capability to deal with spatial information and its basic structure is shown in Figure 2. It has a similar functional structure similar to the traditional FLC, which consists of three basic blocks: fuzzification, rule inference, and defuzzification. However, it will differ in the detailed operations because of the spatial processing requirement. Generally, the 3-D FLC will be involved with the following basic designs: 3-D membership function (MF), 3-D fuzzification, 3-D rule base, 3-D rule inference, and defuzzification. One can refer to [5] for detailed description. Once each component of a 3-D FLC is set, a precise mathematical formula of the 3-D FLC can be derived.

Assumed that we have 3-D fuzzy rules represented by the following expression: 𝑅𝑙IF𝑥1(𝑧)is𝐶𝑙1andand𝑥𝑠(𝑧)is𝐶𝑙𝑠,Then𝑢is𝐵𝑙,(2.1) where 𝑥𝑖(𝑧)=(𝑥𝑖(𝑧1)𝑥𝑖(𝑧2)𝑥𝑖(𝑧𝑝))𝑇 denotes the 𝑖th spatial input variable (1𝑖𝑠), 𝑥𝑖(𝑧𝑗) is the input of 𝑥𝑖(𝑧) from the sensing location 𝑧=𝑧𝑗(1𝑗𝑝); 𝑧 denotes one-dimensional space in a discrete space domain 𝑍={𝑧1,𝑧2,,𝑧𝑝}; 𝐶𝑙𝑖 denotes a 3-D fuzzy set, 𝑙=1,,𝑁; 𝑢 denotes output variable (the control action); 𝐵𝑙 denotes a traditional fuzzy set.

If Gaussian type 3-D membership functions (MF) are used to describe 3-D fuzzy sets in (2.1), then we have 𝜇𝑙𝐺𝑖𝑥𝑖𝑥(𝑧)=exp𝑖(𝑧)𝑐𝑙𝑖(𝑧)𝜎𝑙𝑖(𝑧)2,(2.2) where 𝜇𝑙𝐺𝑖 denotes the Gaussian type 3-D MF of the 𝑖th spatial input 𝑥𝑖(𝑧) in the lth rule; 𝑐𝑙𝑖(𝑧)=(𝑐𝑙𝑖(𝑧1),,𝑐𝑙𝑖(𝑧𝑝))𝑇 and 𝜎𝑙𝑖(𝑧)=(𝜎𝑙𝑖(𝑧1),,𝜎𝑙𝑖(𝑧𝑝))𝑇 are the center and width of 𝜇𝑙𝐺𝑖, respectively; 𝑐𝑙𝑖(𝑧𝑗) and 𝜎𝑙𝑖(𝑧𝑗) denote center and width of the Gaussian type 2D MF of the 𝑖th spatial input 𝑥𝑖(𝑧) at the sensing location 𝑧=𝑧𝑗. The Gaussian type 3-D MF 𝜇𝑙𝐺𝑖 can be regarded as an assembly of multiple Gaussian type 2-D MFs over the space domain 𝑍. Then, the Gaussian type 2-D MF of the 𝑖th spatial input 𝑥𝑖(𝑧) at the sensing location 𝑧=𝑧𝑗 is given as 𝜇𝐺𝑖𝑗𝑥𝑖𝑧𝑗𝑥=exp𝑖𝑧𝑗𝑐𝑙𝑖𝑗𝜎𝑙𝑖𝑗2,(2.3) where 𝑐𝑙𝑖𝑗=𝑐𝑙𝑖(𝑧𝑗) and 𝜎𝑙𝑖𝑗=𝜎𝑙𝑖(𝑧𝑗).

Furthermore, if we employ singleton fuzzification, “product” 𝑡-norm and “weighted aggregation” dimension reduction [6] in the 3-D rule inference, singleton fuzzy sets for the output variable, and “center of sets” defuzzification [17], the 3-D FLC can be mathematically expressed as 𝑢𝑥𝑧=𝑁𝑙=1𝜁𝑙𝑝𝑗=1𝑎𝑗𝑠𝑖=1𝜇𝐺𝑖𝑗𝑥𝑖𝑧𝑗𝑁𝑙=1𝑝𝑗=1𝑎𝑗𝑠𝑖=1𝜇𝐺𝑖𝑗𝑥𝑖𝑧𝑗=𝑁𝑙=1𝜁𝑙𝑝𝑗=1𝑎𝑗𝑠𝑖=1𝑥exp𝑖𝑧𝑗𝑐𝑙𝑖𝑗/𝜎𝑙𝑖𝑗2𝑁𝑙=1𝑝𝑗=1𝑎𝑗𝑠𝑖=1𝑥exp𝑖𝑧𝑗𝑐𝑙𝑖𝑗/𝜎𝑙𝑖𝑗2,(2.4) where 𝑥𝑧=(𝑥1(𝑧)𝑥2(𝑧)𝑥𝑠(𝑧))Ω𝑅𝑝×𝑠 is a spatial input vector with Ω as the input domain, 𝑝 as the number of sensors, and 𝑠 as the number of spatial inputs; 𝑁 is the number of rules; 𝑎𝑗 is the spatial weight from the 𝑗th spatial point [6]; 𝜁𝑙𝑈 is the nonzero value in the singleton fuzzy set of the output variable for the 𝑙th rule.

In (2.4), let 𝜙𝑙𝑥𝑧=𝑝𝑗=1𝑎𝑗𝑠𝑖=1𝜇𝐺𝑖𝑗𝑥𝑖𝑧𝑗𝑁𝑙=1𝑝𝑗=1𝑎𝑗𝑠𝑖=1𝜇𝐺𝑖𝑗𝑥𝑖𝑧𝑗,(2.5) then (2.4) can be rewritten as 𝑢𝑥𝑧=𝑁𝑙=1𝜁𝑙𝜙𝑙𝑥𝑧.(2.6) Similar to a traditional FLC [16], we define 𝜙𝑙(𝑥𝑧) as a spatial fuzzy basis function (SFBF). Each SFBF corresponds to a 3-D fuzzy rule, and all the SFBFs correspond to a 3-D rule base. Mathematically, a 3-D FLC is a linear combination of all the SFBFs.

Equation (2.6) shows that the 3-D FLC is a nonlinear mapping from the input space 𝑥𝑧Ω𝑅𝑝×𝑠 to the output space 𝑢(𝑥𝑧)𝑈𝑅. It provides us a way to understand and analyze the 3-D FLC from the point of view of function approximation. In Section 4, we will prove that the 3-D FLC has a universal approximation property based on the nonlinear mapping in (2.6).

2.2. Linear SVM Regression

An SVM is a learning algorithm that originated from theoretical foundations of the statistical learning theory [18] and has been widely used in many practical applications, such as bioinformatics, machine vision, text categorization, handwritten character recognition, time series analysis, and so on. The distinct advantage of the SVM over other machine learning algorithms is that it has a good generalization ability and can simultaneously minimize the empirical risk and the expected risk [19]. The SVM algorithms can be categorized into two categories: SVM classification and SVM regression. In this study, we are concerned with the SVM regression with 𝜀-insensitive loss function [20].

Suppose we have a training set 𝐷={[𝑥𝑖,𝑦𝑖]𝑅𝑠×𝑅,𝑖=1,,𝑞} consisting of 𝑞 pairs (𝑥1,𝑦1),(𝑥2,𝑦2),,(𝑥𝑞,𝑦𝑞), where the inputs are 𝑠-dimensional vectors, and the labels are continuous values. In 𝜀-SVM regression, the goal is to find a function 𝑓(𝑥,𝑤) so that for all training patterns 𝑥 has a maximum deviation 𝜀 from the target values 𝑦𝑖 and has a maximum margin. The 𝜀-insensitive loss function is defined as follows: ||||𝑦𝑓(𝑥,𝑤)𝜀=||||||||0,if𝑦𝑓(𝑥,𝑤)𝜀,𝑦𝑓(𝑥,𝑤)𝜀,otherwise.(2.7) The 𝜀-insensitive loss function defines an 𝜀 tube [9].

The regression problem can be formulated as a convex optimization problem as follows: min𝑤,𝑏,𝜉𝑖,𝜉𝑖=12𝑤2+𝐶𝑙𝑖=1𝜉𝑖+𝑙𝑖=1𝜉𝑖(2.8) subject to 𝑦𝑖𝑤𝑥𝑖𝑏𝜀+𝜉𝑖𝑤𝑥𝑖+𝑏𝑦𝑖𝜀+𝜉𝑖𝜉𝑖0,𝜉𝑖0,𝑖=1,,𝑞,(2.9) where 𝜉𝑖 and 𝜉𝑖 are slack variables, and the constant 𝐶 is a design parameter chosen by the user, which determines the trade off between the complexity of 𝑓(𝑥,𝑤) and the approximate error.

The above optimization problem can be solved in a dual space. By introducing the Lagrange multipliers, the primal optimization problem can be formulated in its dual form as follows: max𝛼𝑖,𝛼𝑖12𝑞𝑞𝑖=1𝑗=1𝛼𝑖𝛼𝑖𝛼𝑗𝛼𝑗𝑥𝑖𝑥𝑗𝜀𝑞𝑖=1𝛼𝑖+𝛼𝑖+𝑞𝑖=1𝛼𝑖𝛼𝑖𝑦𝑖(2.10) subject to 𝑞𝑗=1𝛼𝑖=𝑞𝑖=1𝛼𝑖,0𝛼𝑖𝐶,0𝛼𝑖𝐶,𝑖=1,,𝑞.(2.11) Solving the dual quadratic programming problem, we can find an optimal weight vector 𝑤 and an optimal bias 𝑏 of the regression hypersurface given as follows: 𝑤=𝑞𝑖=1𝛼𝑖𝛼𝑖𝑥𝑖,1𝑏=𝑞𝑞𝑖=1𝑦𝑖𝑤𝑥𝑖.(2.12) Then, the best regression hypersurface is given by 𝑓(𝑥,𝑤)=𝑞𝑖=1𝛼𝑖𝛼𝑖𝑥𝑥𝑖+𝑏=𝑖SV𝛼𝑖𝛼𝑖𝑥𝑥𝑖+𝑏(2.13) The training pattern 𝑥𝑖 with nonzero (𝛼𝑖𝛼𝑖) is called support vector (SV).

3. Clustering and SVM-Regression Learning-Based 3-D FLC Design

Clustering and SVM-regression learning-based 3-D FLC design is a novel design of a 3-D FLC by integrating a nearest-neighborhood-clustering and an SVM-regression. The design methodology can be depicted by Figure 3. Firstly, a nearest-neighborhood-clustering method with Frobenius norm defined as a distance is employed to mine the underlying knowledge of the spatiotemporal data set S and yield the initial structure, that is, antecedent part of 3-D fuzzy rules. Because the obtained input space partition may have redundancy in terms of highly overlapping MFs, it is necessary to optimize the obtained initial fuzzy partition. Then, a similarity measure technique is utilized to merge similar 3-D fuzzy sets and to merge similar 3-D fuzzy rules, and then to simplify the initial rule structure. Finally, a linear SVM-regression algorithm is used to learn the parameters of the consequent parts based on an equivalence relationship between a linear SVM regression and a 3-D FLC.

The spatiotemporal data set S from a spatially distributed system is composed of n spatiotemporal input-output data pairs given as follows: 𝑥𝑆=𝑘𝑧,𝑢𝑘𝑥𝑘𝑧𝑅𝑝×𝑠,𝑢𝑘,𝑅,𝑘=1,,𝑛(3.1) where 𝑥𝑘𝑧=(𝑥𝑘1(𝑧),,𝑥𝑘𝑠(𝑧)) denotes the value of s spatial input variables at the kth sampling time, 𝑥𝑘𝑖(𝑧)=(𝑥𝑘𝑖(𝑧1),,𝑥𝑘𝑖(𝑧𝑝))𝑇denotes the value of 𝑖th spatial input variable at the kth sampling time (𝑖=1,,𝑠), 𝑢𝑘 denotes the output value at the kth sampling time, n denotes the number of sampling time, and p denotes the number of sensors. Since infinite sensors are used, 𝑥𝑘𝑧 is a matrix with p rows and s columns.

3.1. Initial Structure Learning
3.1.1. Nearest Neighborhood Clustering Method

Clustering method is one of the data-driven learning tools for unlabeled data. It can mine underlying knowledge (or data structure) from a dataset that is difficult for humans to manually identify. One of the simplest clustering algorithms is the nearest-neighborhood clustering algorithm [16]. However, the existing nearest-neighborhood clustering algorithm has not the capability to deal with spatiotemporal data. In this study, we expand its capability to deal with spatiotemporal data set 𝑆, which is of matrix form. The key point is that the Frobenius norm given in (3.2) is used for defining a distance in a nearest neighborhood clustering algorithm. 𝑋𝐹=𝑋tr𝑇𝑋𝑋𝑅𝑝×𝑠.(3.2)

The nearest neighborhood clustering algorithm is summarized as follows.(i)Step 1: Begin from the first spatiotemporal data 𝑥1𝑧. Let the first cluster center 𝑐1𝑧 be 𝑥1𝑧, the number of data pairs 𝑚1 be 1, and the threshold be 𝜌0 for generating new fuzzy rules.(ii)Step 2: Suppose that the kth spatiotemporal data 𝑥𝑘𝑧(𝑘=2,,𝑛) is considered, when N clusters have been generated and their centers are 𝑐1𝑧,𝑐2𝑧,, and 𝑐𝑁𝑧 respectively. Firstly, compute the distance between 𝑥𝑘𝑧 and each center of N clusters using 𝑥𝑘𝑧𝑐𝑙𝑧𝐹(𝑙=1,,𝑁). Then, compute the threshold 𝜌 using 𝜌=max𝑙=1,,𝑁1𝑥1+𝑘𝑧𝑐𝑙𝑧𝐹(3.3) Hence, the corresponding cluster center 𝑐𝑙𝑘𝑧 is taken as the nearest neighborhood cluster of 𝑥𝑘𝑧.(iii)Step 3: (a) If 𝜌<𝜌0, then 𝑥𝑘𝑧 is taken as a new cluster center, and let 𝑁=𝑁+1,𝑚𝑁=1, and 𝑐𝑁𝑧=𝑥𝑘𝑧. (b) If 𝜌𝜌0,𝑥𝑘𝑧 belongs to the cluster with the center 𝑐𝑙𝑘𝑧. The center of 𝑙𝑘th cluster is tuned by introducing a learning rate 𝜂=𝜂0/(𝑚𝑙𝑘+1)(𝜂0[0,1]) as follows: 𝑐𝑙𝑘𝑧=𝑐𝑙𝑘𝑧𝑥+𝜂𝑘𝑧𝑐𝑙𝑘𝑧,(3.4) and let 𝑚𝑙𝑘=𝑚𝑙𝑘+1.(iv)Step 4: Let 𝑘=𝑘+1. If 𝑘𝑛+1, then quit. Otherwise, back to Step 2.

3.1.2. Rule Extraction and 3-D MF Construction

After clustering learning, we obtain an input space partition with 𝑁 cluster centers 𝑐1𝑧,𝑐2𝑧,,𝑐𝑁𝑧. Then, we will produce antecedent part of rule base and construct 3-D MFs in terms of the partition. Each cluster corresponds to a 3-D fuzzy rule. Assumed that we employ Gaussian type 3-D MF. Then, the cluster center corresponds to the center of Gaussian type 3-D MFs in the antecedent part. Thus, the number of fuzzy rules is equal to the number of clusters 𝑁. In addition, we determine the width of the Gaussian MFs in terms of the domain of variables. For instance, the width of the Gaussian type 3-D MFs from the same sensing location are defined as 𝜎𝑧𝑗=max1𝑖𝑠𝑥𝑖max𝑧𝑗𝑥𝑖min𝑧𝑗10,(3.5) where 𝑥𝑖max(𝑧𝑗) and 𝑥𝑖min(𝑧𝑗) are the maximum and the minimum bound values of the 𝑖th spatial input variable, respectively.

3.2. Structure Simplication

After the initial structure learning, the obtained fuzzy partition of the input space and fuzzy rules may have redundancy in terms of highly overlapping MFs. In this step, we will simplify the fuzzy partition and fuzzy rules. The crucial technique for simpification is similarity measure. The previous similarity measure techniques [14, 21, 22] developed for traditional fuzzy sets and traditional fuzzy rules are not suitable to 3-D fuzzy sets and 3-D fuzzy rules. In this study, we will define a new similarity measure technique.

3.2.1. Similarity Measure

Firstly, we define the similarity of two 3-D fuzzy sets 𝐴 and 𝐵 as below. 𝑆𝐴,𝐵=11+𝑑𝐴,𝐵,],𝑆()(0,1(3.6) where 𝑑(𝐴,𝐵) is a distance between 𝐴 and 𝐵. Since Gaussian type 3-D MFs are chosen, the following simple expression can be used to approximate the distance: 𝑑𝐴,𝐵=𝑐𝐴(𝑧1)𝜎𝐴(𝑧1)𝑐𝐴(𝑧𝑝)𝜎𝐴(𝑧𝑝)𝑐𝐵(𝑧1)𝜎𝐵(𝑧1)𝑐𝐵(𝑧𝑝)𝜎𝐵(𝑧𝑝)𝐹,(3.7) where 𝑐𝐴(𝑧𝑗)(𝑐𝐵(𝑧𝑗)) and 𝜎𝐴(𝑧𝑗)(𝜎𝐵(𝑧𝑗)) are center and width of the Gaussian type 3-D MF 𝐴(𝐵) at sensing location 𝑧=𝑧𝑗(𝑗=1,,𝑝), respectively.

Based on the similarity measure, we can merge similar 3-D fuzzy sets, or merge similar 3-D fuzzy rules.

(i) Merge of Two Similar 3-D Fuzzy Sets 𝐴 and 𝐵
Firstly, the similarity between 𝐴 and 𝐵 is computed according to (3.6). If 𝑆(𝐴,𝐵) is higher than a threshold, we can conclude that 𝐴 and 𝐵 are similar, and then merge them into a new 3-D fuzzy set 𝐶. The center and width of 𝐶 are viewed as the average values of 𝐴 and 𝐵, and are given as the following: 𝑐𝐶(𝑧𝑗)=𝑐𝐴(𝑧𝑗)+𝑐𝐵(𝑧𝑗)2,𝜎𝐶(𝑧𝑗)=𝜎𝐴(𝑧𝑗)+𝜎𝐵(𝑧𝑗)2.(3.8)

(ii) Merge of Two Similar 3-D Fuzzy Rules 𝑅𝑙1 and 𝑅𝑙2
The similarity 𝑅𝑙1 and 𝑅𝑙2 is inferred by measuring their similarity in the antecedent part. For instance, the similarity computation between 𝑅𝑙1 and 𝑅𝑙2 is given by 𝑆rule𝑅𝑙1,𝑅𝑙2=min1𝑖𝑠𝑆𝐶𝑙1𝑖,𝐶𝑙2𝑖,(3.9) where 𝑅𝑙1 and 𝑅𝑙2 have the same rule form as in (2.1), 𝐶𝑙1𝑖 (𝐶𝑙2𝑖) denotes the 3-D fuzzy set for the 𝑖th spatial input variable 𝑥𝑖(𝑧) in the 𝑙1th (𝑙2th) rule. If 𝑆rule(𝑅𝑙1,𝑅𝑙2) is higher than a threshold, we can conclude that 𝑅𝑙1 and 𝑅𝑙2 are similar, and then merge them into a new 3-D fuzzy rule 𝑅𝑙1𝑙2. The merging of two 3-D fuzzy rules is realized by merging the two fuzzy sets of each spatial input variable in the two 3-D fuzzy rules, respectively.

3.2.2. Similarity Measure-Based Structure Simplification

Based on the similarity measure, the simplification task includes removing 3-D fuzzy sets similar to the universal set, merging similar 3-D fuzzy sets, and merging similar rules. The detailed procedure of structure simplification is summarized as follows.(i)Step 1: Given a 3-D fuzzy rule base ={𝑅𝑙}𝐾𝑙=1. Firstly, set proper thresholds: 𝜆𝑢(0,1] for removing 3-D fuzzy sets that are similar to the universal set, 𝜆set(0,1] for merging similar 3-D fuzzy sets, and 𝜆rule(0,1] for merging 3-D fuzzy rules with similar antecedents.(ii)Step 2: Calculate 𝑠𝑗𝑘𝑖=𝑆(𝐶𝑗𝑖,𝐶𝑘𝑖) with 𝑗𝑘,𝑗=1,,𝐾,𝑘=1,,𝐾,and𝑖=1,,𝑠. Let 𝑠𝑟𝑚𝑞=max𝑗𝑘{𝑠𝑗𝑘𝑖} and select 𝐶𝑟𝑞 and 𝐶𝑚𝑞.(iii)Step 3: If 𝑠𝑟𝑚𝑞𝜆set, merge 𝐶𝑟𝑞 and 𝐶𝑚𝑞 into a new 3-D fuzzy set 𝐶𝑞𝑟𝑚, set 𝐶𝑟𝑞=𝐶𝑞𝑟𝑚 and 𝐶𝑚𝑞=𝐶𝑞𝑟𝑚, and back to step 2. If no more two 3-D fuzzy sets have the similarity with 𝑠𝑟𝑚𝑞𝜆set (𝑗𝑘), then go to step 4.(iv)Step 4: Remove the 3-D fuzzy set similar to the universal set and the rule with membership function that is always near zero over the space domain.(v)Step 5: Calculate the similarity of two rules 𝑠𝑙1𝑙2=𝑆rule(𝑅𝑙1,𝑅𝑙2) with 𝑙1𝑙2,𝑙1=1,,𝑁,𝑙2=1,,𝑁. Let 𝑠𝑟𝑚=max𝑙1𝑙2{𝑠𝑙1𝑙2}.(vi)Step 6: If 𝑠𝑟𝑚𝜆rule, merge the 𝑟th and the 𝑚th rules into a new rule 𝑅new and substitute them. Let 𝑁=𝑁1, and back to step 5. If no more rules have similarity with 𝑠𝑟𝑚𝜆rule(𝑟𝑚), then quit.

Generally speaking, the threshold 𝜆𝑢 is higher than the threshold 𝜆set, while the choice of a suitable threshold 𝜆rule depends on the application. The lower 𝜆set is set, the less fuzzy sets and less fuzzy rules are yielded in the resulting rule base. In this study, we set 𝜆𝑢=0.95,𝜆set=0.75, and 𝜆rule=1.

3.3. Parameter Learning

After the structure simplification, we obtain a rule base with optimized antecedent parts. For a complete rule base, the rest task is to determine the consequent part parameters. In this study, we employ an SVM regression algorithm to learn the consequent part parameter 𝜉𝑙(𝑙=1,,𝑁) in the 3-D FLC.

Firstly, the original input samples are transformed into new samples. Utilizing the spatial fuzzy basis functions 𝜙𝑙(𝑥𝑘𝑧)(𝑙=1,,𝑁) in (2.5), we can transform each spatial input sample 𝑥𝑘𝑧(𝑘=1,,𝑛) in 𝑆 into a new input sample 𝜙(𝑥𝑘𝑧)=(𝜙1(𝑥𝑘𝑧),𝜙2(𝑥𝑘𝑧),,𝜙𝑁(𝑥𝑘𝑧)). Then, the original data set 𝑆 in (3.1) can be transformed into a new data set 𝑆 as follows: 𝑆=𝜙𝑥𝑘𝑧,𝑢𝑘𝑥𝜙𝑘𝑧𝑅𝑁,𝑢𝑘.𝑅,𝑘=1,,𝑛(3.10)

Secondly, an equivalence relationship of an SVM regression and a 3-D FLC can be derived based on the new data set 𝑆. From (2.13), the final decision function 𝑓(𝜙(𝑥𝑘𝑧)) of an SVM can be described with the following form: 𝑓𝜙𝑥𝑘𝑧=𝑛𝑘=1𝛼𝑘𝛼𝑘𝜙𝑥𝑘𝑧𝑥,𝜙𝑧+𝑏,(3.11) where 𝛼𝑘 and 𝛼𝑘 are associated learning parameters in a SVM, The training pattern 𝜙(𝑥𝑘𝑧) with nonzero (𝛼𝑘𝛼𝑘) is called support vector (SV). Furthermore, (3.11) can further be expressed by 𝑓𝜙𝑥𝑘𝑧=𝑛𝑘=1𝛼𝑘𝛼𝑘𝑁𝑙=1𝜙𝑙𝑥𝑘𝑧𝜙𝑙𝑥𝑧=+𝑏𝑁𝑙=1𝑛𝑘=1𝛼𝑘𝛼𝑘𝜙𝑙𝑥𝑘𝑧𝜙𝑙𝑥𝑧=+𝑏𝑁𝑙=1𝜉𝑙𝜙𝑙𝑥𝑧𝑥+𝑏=𝑢𝑧(3.12) In (3.12), the bias term 𝑏 in a 3-D FLC can be realized by adding a fuzzy rule as follows: 𝑅0IF𝑥1(𝑧)is𝐶01andand𝑥𝑠(𝑧)is𝐶0𝑠,THEN𝑢is𝑏,(3.13) where 𝐶0𝑖 is a universal 3-D fuzzy set, whose fuzzy degree is 1 over the space domain for any spatial input 𝑥𝑖(𝑧), 𝑖=1,,𝑠. From (3.12), we can see that an SVM will be equivalent to a 3-D FLC if (3.14) holds. 𝜉𝑙=𝑛𝑘=1𝛼𝑘𝛼𝑘𝜙𝑙𝑥𝑘𝑧.(3.14)

Finally, a linear SVM regression is employed to learn the consequent part parameters. Using (3.14), the parameters 𝜉𝑙(𝑙=1,,𝑁) in consequent parts are obtained in terms of the SVM learning, that is, 𝜉𝑙=𝑘SV𝛼𝑘𝛼𝑘𝜙𝑙𝑥𝑘𝑧.(3.15)

4. Universal Approximation of Clustering and SVM-Regression Learning-Based 3-D FLC

In essence, the clustering and SVM-regression learning-based 3-D FLC design is a fuzzy modeling that extracts fuzzy control rules and constructs a 3-D FLC from spatiotemporal data hidden with effective control laws. In other words, the proposed 3-D FLC aims at approximating an unknown nonlinear control function. Thus, in this subsection, we are concerned with its universal approximation capability. The universal approximation capability of the SVM learning-based 3-D FLC can be described by the following theorem.

Theorem 4.1. Suppose that the input universe of discourse Ω is a compact set in 𝑅𝑝×𝑠. Then, for any given real continuous function 𝑔(𝑥𝑧) on Ω and arbitrary 𝜀>0, there exists a 3-D FLC 𝑢(𝑥𝑧) as described in (2.4) satisfying the following inequality: sup𝑥𝑧Ω||𝑢𝑥𝑧𝑥𝑔𝑧||<𝜀.(4.1)

The proof of the theorem is given in the appendix by using Stone-Weierstrass theorem [23]. Theorem 4.1 indicates that the clustering and SVM-regression learning-based 3-D FLC is a universal approximator, that is, it can approximate continuous control functions to arbitrary accuracy.

5. Application

5.1. A Catalytic Packed-Bed Reactor

We take a catalytic packed-bed reactor [1, 5] as an example. The reactor is long and thin as shown in Figure 4. It is fed with gaseous reactant 𝐶 from the right side, and the zero-order gas phase reaction 𝐶𝐷 is carried out on the catalyst. The reaction is endothermic, and a jacket is used to heat the reactor. A dimensionless model that describes this nonlinear tubular chemical reactor is provided as follows: 𝜀𝑝𝜕𝑇𝑔𝜕𝑡=𝜕𝑇𝑔𝜕𝑧+𝛼𝑐𝑇𝑠𝑇𝑔𝛼𝑔𝑇𝑔,𝑢𝜕𝑇𝑠=𝜕𝜕𝑡2𝑇𝑠𝜕𝑧2+𝐵0exp𝛾𝑇𝑠1+𝑇𝑠𝛽𝑐𝑇𝑠𝑇𝑔𝛽𝑝𝑇𝑠𝑏(𝑧)𝑢(5.1) subject to the boundary conditions 𝑧=0,𝑇𝑔=0,𝜕𝑇𝑠𝜕𝑧=0;𝑧=1,𝜕𝑇𝑠𝜕𝑧=0,(5.2) where 𝑇𝑔,𝑇𝑠, and u denote the dimensionless temperature of the gas, the catalyst, and jacket, respectively. The values of the process parameters are given as follows: 𝜀𝑝=0.01,𝛾=21.14,𝛽𝑐=1.0,𝛽𝑝𝐵=15.62,(5.3)0=0.003,𝛼𝑐=0.5,𝛼𝑔=0.5.(5.4)

The concerned control problem is to control the catalyst temperature 𝑇𝑠(𝑧,𝑡) throughout the reactor to track a spatial reference profile (𝑇sd(𝑧)=0.420.2cos(𝜋𝑧)) in order to maintain a desired degree of reaction rate using the measurements of catalyst temperature from five sensing locations 𝑧=[00.250.50.751] and manipulating one spatially distributed heating source (𝑏(𝑧)=1cos(𝜋𝑧)). The mathematical model (5.1)-(5.2) is only for the process simulation for evaluation of the control scheme. The method of lines [24] is used to simulate the model.

In this application, we aim at extracting 3-D fuzzy rules from a spatiotemporal data set using clustering and SVM regression learning algorithm and constructing a complete 3-D FLC without any prior knowledge.

5.2. Design of a Clustering and SVM-Regression Learning-Based 3-D FLC
5.2.1. Spatiotemporal Data Collection

The spatiotemporal input-output data set is collected from the catalytic packed-bed reactor controlled by expert-knowledge-based 3-D FLC [5], where pseudorandom quinary signal (PRQS) [25] with maximum length of 124 as perturbed signal is added to the control input. Each spatiotemporal input-output data pair consists of a spatial error input 𝑒(𝑧)=[𝑒1,,𝑒5]𝑇, a spatial error in change input 𝑟(𝑧)=[𝑟1,,𝑟5]𝑇, and an incremental output Δ𝑢, where 𝑒𝑖=𝑇𝑠(𝑧𝑖,𝑞)𝑇sd(𝑧𝑖)𝑟𝑖=𝑒𝑖(𝑞)𝑒𝑖(𝑞1); 𝑞 and 𝑞1 denotes the qth and q-1th sampling time, respectively. The detailed design of the expert-knowledge-based 3-D FLC, including fuzzification, 3-D rule inference, and defuzzification, can refer to [5]. The scaling factors for the spatial error, the spatial error in change, and the incremental output are set as 2.0, 0.001, and 0.8716, respectively. The parameters of PRQS are chosen with the following settings: the number of the levels is 5, the length of the period is 124, the sampling time is 0.2 s, and the minimum switching time (i.e., clock period) is 0.2 s.

Two groups of data sets are obtained by adding PRQS signal with different scaling factor (i.e., 0.447 and 0.1) to the control input. The first group with 150 data pairs is generated for training by adding PRQS perturbation signal with a scaling factor 0.447, and the other group with 150 data pairs is generated for test by adding PRQS perturbation signal with a scaling factor 0.1. To evaluate the performance, we employ the following root-mean-squared error (RMSE) as the criteria: RMSE=𝑛𝑘=1Δ𝑢𝑘Δ𝑢𝑘2𝑛,(5.5) where 𝑛 denotes the number of samples, Δ𝑢𝑘 denotes actual output, and Δ𝑢𝑘 denotes expected output.

5.2.2. Design of a Clustering and SVM-Regression Learning-Based 3-D FLC

The design procedure of the proposed 3-D FLC is given as follows: (i)Employ the nearest neighborhood clustering algorithm to deal with the spatiotemporal data set for the input space partition with 𝜌0=0.7 and 𝜂0=0, and then generate 16 3-D fuzzy rules with 32 3-D fuzzy sets, where the width of Gaussian type 3-D fuzzy sets is 𝜎𝑧=[0.0620,0.0902,0.1518,0.2008,0.2175]𝑇 from (3.5).(ii)Simplify the 3-D fuzzy sets and 3-D fuzzy rules based on similarity measure (as described in Section 3.2) with 𝜆𝑢=0.95, 𝜆set=0.75, and 𝜆rule=1, and then obtain 15 3-D fuzzy rules with 15 3-D fuzzy sets. The distributions of Gaussian type 3-D fuzzy sets are shown in Figure 5.(iii)SVM algorithm described in Section 3.2 is used to learn the consequent part parameters with 𝐶={1,10,100,1000} and 𝜀={0.00001,0.0001,0.001,0.01,0.1,0.2}. The RMSE in (5.5) for training and test are listed in Table 1. From Table 1, we can find that: (1) smaller 𝜀 yielded more support vectors and led to reasonable training and test performance; while larger 𝜀 yielded less support vectors and led to worse training and test performance. (2)  𝐶 almost had no influence on the training and test performance, once 𝜀 was fixed. In this study, we choose 𝐶=100 and 𝜀=0.0001. Finally, a complete 3-D FLC is constructed with 15 3-D fuzzy rules and 15 3-D fuzzy sets as shown in Figure 6. Using the linguistic hedges approach [14, 21], we can interpret these 3-D fuzzy rules using linguistic words. For instance, the first 3 fuzzy rules are interpreted as follows.(a)𝑅1 IF 𝑒(𝑧) is less than POSITIVE SMALL and 𝑟(𝑧) is more than POSITIVE SMALL, THEN Δ𝑢 is sort of POSITIVE MEDIUM. (b)𝑅2 IF 𝑒(𝑧) is very ZERO and 𝑟(𝑧) is very NEGATIVE SMALL, THEN Δ𝑢 is very ZERO. (c)𝑅3 IF 𝑒(𝑧) is sort of POSITIVE SMALL and 𝑟(𝑧) is more than POSITIVE MEDIUM, THEN Δ𝑢 is more than POSITIVE MEDIUM.

5.2.3. Control-Performance Validation

The designed clustering and SVM regression learning-based 3-D FLC is applied to the control of the catalytic packed-bed reactor, where simulation time is 10 s. We select the same quantitative performance criteria as in [5]: steady-state error (SSE), integral of the absolute error (IAE), and integral of time multiplied by absolute error (ITAE). The control performance is given in Table 2, and the control profile is given in Figures 7 and 8, where (a), (b), and (c) represent catalyst temperature evolution profile, manipulated input, and catalyst temperature profiles in steady state, respectively. We can find that the proposed 3-D FLC has comparable control performance to the expert-knowledge-based 3-D FLC in [5] both in ideal condition and in disturbed condition.

In addition, we do more control experiments when the SVM-learning algorithm adopts different 𝐶 and 𝜀. According to the experimental results (see the last three columns in Table 1), we can find that the proposed 3-D FLC shows good control performance when a smaller 𝜀 is chosen.

The above simulation results demonstrate that the proposed design method of a clustering and SVM-regression learning-based 3-D FLC is effective. It provides a beneficial complementary design method to 3-D FLCs.

6. Conclusions

In this paper, we have proposed a new 3-D FLC design methodology based on clustering and SVM regression learning from a spatiotemporal data set. The 3-D FLC design is divided into three steps. Firstly, an initial rule structure is extracted by a nearest neighborhood clustering method, which is modified to be suitable for spatio-temporal data. Secondly, the initial structure is simplified via using similarity measure technique, which is defined for 3-D fuzzy sets and 3-D fuzzy rules. Thirdly, the parameters of the rule consequent parts are learned by a spatial fuzzy basis function-based SVM regression learning algorithm. Besides, the universal approximation capability of the proposed 3-D fuzzy system is discussed. Finally, effectiveness of the proposed 3-D FLC design methodology is validated on a catalytic packed-bed reactor.

Appendix

A. Proof of the Clustering and SVM-Regression Learning-Based 3-D FLC as a Universal Approximator

Let Θ be a set of 3-D FLCs defined in Ω, which is a compact set in 𝑅𝑝×𝑠. Then, Preliminary 1 is given as follows.

Preliminary 1
Let 𝑑(𝑢,𝑔) be a semimetric [26] with the following definition 𝑑(𝑢,𝑔)=sup𝑥𝑧Ω||𝑢𝑥𝑧𝑥𝑔𝑧||(A.1) Therefore, (Θ,𝑑) is a metric space. Since there is at least one fuzzy rule in the rule base of a 3-D FLC, Θ is non-empty. Thus, (Θ,𝑑) is strictly defined.

Subsequently, we will prove that (Θ,𝑑) is dense in (𝐶[Ω],𝑑) using Stone-Weierstrass theorem, where 𝐶[Ω] is a set of real continuous functions defined in a compact set Ω. The Stone-Weierstrass theorem is first stated here as follows.

Stone-Weierstrass Theorem (see [16, 23])
Let 𝑍 be a set of real continuous functions on a compact set 𝑈. If (1)𝑍 is an algebra, that is, the set 𝑍 is closed under addition, multiplication, and scalar multiplication; (2)𝑍 separates points on 𝑈, that is, for every 𝑥,𝑦𝑈,𝑥𝑦, there exists 𝑓𝑍 such that 𝑓(𝑥)𝑓(𝑦); and (3)  𝑍 vanishes at no point of 𝑈, that is, for each 𝑥𝑈 there exists 𝑓𝑍 such that 𝑓(𝑥)0; then, the uniform closure of 𝑍 consists of all real continuous functions on 𝑈, that is, (𝑍,𝑑) is dense in (𝐶[𝑈],𝑑).

Proof. (1) Firstly, we prove (Θ,𝑑) is an algebra. Let 𝑢1,𝑢2Θ, then we can write them as 𝑢1𝑥𝑧=𝑁1𝑙1=1𝑢𝑙11𝑝1𝑗1=1𝑎𝑗1𝑠𝑖=1𝑥exp𝑖𝑧𝑗1𝑐𝑙1𝑖𝑗1/𝜎𝑙1𝑖𝑗12𝑁1𝑙1=1𝑝1𝑗1=1𝑎𝑗1𝑠𝑖=1𝑥exp𝑖𝑧𝑗1𝑐𝑙1𝑖𝑗1/𝜎𝑙1𝑖𝑗12+𝑏1,𝑢2𝑥𝑧=𝑁2𝑙2=1𝑢𝑙22𝑝2𝑗2=1𝑎𝑗2𝑠𝑖=1𝑥exp𝑖𝑧𝑗2𝑐𝑙2𝑖𝑗2/𝜎𝑙2𝑖𝑗22𝑁2𝑙2=1𝑝2𝑗2=1𝑎𝑗2𝑠𝑖=1𝑥exp𝑖𝑧𝑗2𝑐𝑙2𝑖𝑗2/𝜎𝑙2𝑖𝑗22+𝑏2.(A.2) Subsequently, we have three derivation procedures.

(i) Addition
𝑢1𝑥𝑧+𝑢2𝑥𝑧=𝑁1𝑙1=1𝑁2𝑙2=1𝑢𝑙11+𝑢𝑙22𝑝1𝑗1=1𝑝2𝑗2=1𝑎𝑗1𝑎𝑗2𝑠𝑖=1exp𝒵𝑁1𝑙1=1𝑁2𝑙2=1𝑝1𝑗1=1𝑝2𝑗2=1𝑎𝑗1𝑎𝑗2𝑠𝑖=1+𝑏exp𝒵1+𝑏2,(A.3) where 𝒵 denotes (((𝑥𝑖(𝑧𝑗1)𝑐𝑙1𝑖𝑗1)/𝜎𝑙1𝑖𝑗1)2((𝑥𝑖(𝑧𝑗2)𝑐𝑙2𝑖𝑗2)/𝜎𝑙2𝑖𝑗2)2). Equation (A.3) has the same form as (3.12), then 𝑢1(𝑥𝑧)+𝑢2(𝑥𝑧)Θ.

(ii) Multiplication
𝑢1𝑥𝑧𝑢2𝑥𝑧=𝑁1𝑙1=1𝑁2𝑙2=1𝑢𝑙11𝑢𝑙22𝑝1𝑗1=1𝑝2𝑗2=1𝑎𝑗1𝑎𝑗2𝑠𝑖=1exp𝒵𝑁1𝑙1=1𝑁2𝑙2=1𝑝1𝑗1=1𝑝2𝑗2=1𝑎𝑗1𝑎𝑗2𝑠𝑖=1exp𝒵+𝑏2𝑁1𝑙1=1𝑢𝑙11𝑝1𝑗1=1𝑎𝑗1𝑠𝑖=1𝑥exp𝑖𝑧𝑗1𝑐𝑙1𝑖𝑗1/𝜎𝑙1𝑖𝑗12𝑁1𝑙1=1𝑝1𝑗1=1𝑎𝑗1𝑠𝑖=1𝑥exp𝑖𝑧𝑗1𝑐𝑙1𝑖𝑗1/𝜎𝑙1𝑖𝑗12+𝑏1𝑁2𝑙2=1𝑢𝑙22𝑝2𝑗2=1𝑎𝑗2𝑠𝑖=1𝑥exp𝑖𝑧𝑗2𝑐𝑙2𝑖𝑗2/𝜎𝑙2𝑖𝑗22𝑁2𝑙2=1𝑝2𝑗2=1𝑎𝑗2𝑠𝑖=1𝑥exp𝑖𝑧𝑗2𝑐𝑙2𝑖𝑗2/𝜎𝑙2𝑖𝑗22+𝑏1𝑏2,(A.4) where 𝒵 denotes (((𝑥𝑖(𝑧𝑗1)𝑐𝑙1𝑖𝑗1)/𝜎𝑙1𝑖𝑗1)2((𝑥𝑖(𝑧𝑗2)𝑐𝑙2𝑖𝑗2)/𝜎𝑙2𝑖𝑗2)2). In terms of algebraic operation, the product of functions in Gaussian form is also a function in Gaussian form. Thus, (A.4) has the same form as (3.12), and 𝑢1(𝑥𝑧)𝑢2(𝑥𝑧)Θ.

(iii) Scalar Multiplication
For arbitrary 𝑐𝑅, we have 𝑐𝑢1𝑥𝑧=𝑐𝑁1𝑙1=1𝑢𝑙11𝑝1𝑗1=1𝑎𝑗1𝑠𝑖=1𝑥exp𝑖𝑧𝑗1𝑐𝑙1𝑖𝑗1/𝜎𝑙1𝑖𝑗12𝑁1𝑙1=1𝑝1𝑗1=1𝑎𝑗1𝑠𝑖=1𝑥exp𝑖𝑧𝑗1𝑐𝑙1𝑖𝑗1/𝜎𝑙1𝑖𝑗12+𝑐𝑏1.(A.5) Equation (A.5) has the same form as (3.12), and 𝑐𝑢1(𝑥𝑧)Θ.
Finally, by combining (A.3)~(A.5) together, we can conclude that (Θ,𝑑) is an algebra.
(2) Secondly, we will prove that (Θ,𝑑) separates point on Ω by constructing a simple 3-D FLC 𝑢(𝑥𝑧) as in (3.12), namely, 𝑢(𝑥0𝑧)𝑢(𝑦0𝑧) holds for arbitrarily given 𝑥0𝑧,𝑦0𝑧Ω with 𝑥0𝑧𝑦0𝑧.
We choose two fuzzy rules, that is, 𝑁=2.
Let 𝑥0𝑧=𝑥01𝑧1,,𝑥01𝑧𝑝𝑇𝑥,,0𝑠𝑧1,,𝑥0𝑠𝑧𝑝𝑇𝑦0𝑧=𝑦01𝑧1,,𝑦01𝑧𝑝𝑇𝑦,,0𝑠𝑧1,,𝑦0𝑠𝑧𝑝𝑇,𝑎𝑗=1𝑝,𝜎1𝑖𝑗=𝜎2𝑖𝑗=1,𝑐1𝑖𝑗=𝑥0𝑖𝑧𝑗,𝑐2𝑖𝑗=𝑦0𝑖𝑧𝑗,𝑥1𝑧=𝑥0𝑧,𝑥2𝑧=𝑦0𝑧(𝑗=1,,𝑝).(A.6) We have 𝑢𝑥0𝑧=𝑢1+𝑢2(1/𝑝)𝑝𝑗=1𝑠𝑖=1𝑥exp0𝑖𝑧𝑗𝑦0𝑖𝑧𝑗21+(1/𝑝)𝑝𝑗=1𝑠𝑖=1𝑥exp0𝑖𝑧𝑗𝑦0𝑖𝑧𝑗2+𝑏=𝜁𝑢1+(1𝜁)𝑢2𝑢𝑦+𝑏,0𝑧=𝑢2+𝑢1(1/𝑝)𝑝𝑗=1𝑠𝑖=1𝑥exp0𝑖𝑧𝑗𝑦0𝑖𝑧𝑗21+(1/𝑝)𝑝𝑗=1𝑠𝑖=1𝑥exp0𝑖𝑧𝑗𝑦0𝑖𝑧𝑗2+b=𝜁𝑢2+(1𝜁)𝑢11+𝑏,𝜁=1+(1/𝑝)𝑝𝑗=1𝑠𝑖=1𝑥exp0𝑖𝑧𝑗𝑦0𝑖𝑧𝑗2.(A.7) Since 𝑥0𝑧𝑦0𝑧, there must be some 𝑖 and 𝑗 such that 𝑥0𝑖(𝑧𝑗)𝑦0𝑖(𝑧𝑗). Thus, we have 𝑠𝑖=1exp((𝑥0𝑖(𝑧𝑗)𝑦0𝑖(𝑧𝑗))2)1. For arbitrary 𝑗, 𝑠𝑖=1exp((𝑥0𝑖(𝑧𝑗)𝑦0𝑖(𝑧𝑗))2)1 holds, therefore, we have𝑝𝑗=1𝑠𝑖=1exp((𝑥0𝑖(𝑧𝑗)𝑦0𝑖(𝑧𝑗))2)𝑝. If we choose 𝑢1=0 and 𝑢2=1, then 𝑢𝑥0𝑧𝑦=1𝜉+𝑏𝜉+𝑏=𝑢0𝑧.(A.8) Therefore, (Θ,𝑑) separates point on Ω.
(3) Finally, we prove (Θ,𝑑) vanishes at no point of Ω.
For any 3-D FLC 𝑢(𝑥𝑧) expressed as in (3.12), if we choose 𝜁l0(l=1,,𝑁) and 𝑏>0, then for any 𝑥𝑧Ω, we have 𝑢(𝑥𝑧)>0.
Therefore, (Θ,d) vanishes at no point of Ω.
By combining the results from (1) to (3) together, Theorem 4.1 is proven.

Acknowledgments

This work was supported partly by the National Specialised Research Fund for the Doctoral Programme of Higher Education under Grant 20113705120003 and by Shandong Natural Science Foundation under Grant ZR2010FM018 and under Grant ZR2010FM022.