Abstract

A new fault-relevant kernel principal component analysis (KPCA) algorithm is proposed, and a fault detection approach is then developed based on it. The proposed method further decomposes both the KPCA principal space and the residual space into two subspaces. In contrast to traditional statistical techniques, the fault subspace is separated according to the fault-relevant influence. The method identifies the fault-relevant principal directions and principal components of the systematic subspace and the residual subspace for process monitoring. The proposed monitoring approach is applied to the Tennessee Eastman process and the penicillin fermentation process, and the simulation results demonstrate its effectiveness.

1. Introduction

Process monitoring and fault diagnosis are important for the safety and reliability of industrial processes [1–12]. As data-driven process monitoring methodologies, multivariate statistical analysis techniques, such as principal component analysis (PCA) and partial least squares (PLS), have been widely used for the detection and diagnosis of abnormal operating situations in many industrial processes over the last few decades [5, 13–16]. The major advantage of these methods is their ability to handle a large number of highly correlated variables and to reduce the high-dimensional process measurements into a low-dimensional latent space. Monitoring based on these methods is straightforward.

PCA is one of the most widely used linear techniques for transforming data into a new space. It divides the data information into the significant patterns, such as linear tendencies or directions in the model subspace, and the uncertainties, such as noise or outliers located in the residual subspace. The $T^2$ statistic and the $Q$ statistic, represented by Mahalanobis and Euclidean distances, are used to elucidate the pattern variations in the model and residual subspaces, respectively [17–19]. PLS decomposition methods are used similarly to PCA for process monitoring and are more effective in supervising the variations in the process variables that are more influential on the quality variables [20–22]. The $T^2$ statistic and the $Q$ statistic are also employed in the PLS monitoring system. These methods develop a normal operating model with data gathered from the normal process and define the normal operation regions. New process behaviors can thus be compared with the predefined ones by the monitoring system to determine whether they remain normal. When the process moves out of the normal operation regions, it is concluded that an “unusual and faulty” change in the process behavior has occurred. Many extensions of the conventional PCA/PLS algorithms have been reported [15, 23–31]. Recently, Li et al. proposed total projection to latent structures (T-PLS) and discussed the policy of process monitoring and fault diagnosis based on the new structure [32, 33]. They analyzed the problem faced by the conventional PLS-based process monitoring policy, which only divides the measured variable space into two subspaces and uses two monitoring statistics for the PLS scores and residuals, respectively. They indicated that output-irrelevant variations are also included in the PLS scores and that the PLS residuals do not necessarily cover only small variations in $\mathbf{X}$. The proposed T-PLS algorithm further decomposes the PLS systematic subspace to separate the output-orthogonal part from the output-correlated part, and the PLS residual subspace to separate large variations from noise. A T-PLS based monitoring system was then developed on the four resulting process subspaces.

KPCA is a nonlinear version of PCA. It can efficiently compute PCs in a high-dimensional feature space using nonlinear kernel functions. The core idea of KPCA is to first map the data space into a feature space using a nonlinear mapping and then carry out the PCA operation in the feature space. KPCA divides the data into a systematic subspace and a residual subspace and uses the $T^2$ statistic and the $Q$ statistic to monitor these two subspaces, respectively [13, 15, 16, 34].
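As a brief illustration of this map-then-project idea (kept separate from the monitoring scheme developed below), a minimal sketch with scikit-learn's KernelPCA is given here; the data, the number of components, and the kernel width are arbitrary stand-ins.

import numpy as np
from sklearn.decomposition import KernelPCA

# Arbitrary stand-in for normal operating data (200 samples, 10 variables).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))

# Nonlinear mapping + PCA in the induced feature space (RBF/Gaussian kernel).
kpca = KernelPCA(n_components=5, kernel="rbf", gamma=1e-3)
T = kpca.fit_transform(X)          # systematic-subspace scores, shape (200, 5)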

In this paper, to improve the KPCA model, a fault-relevant KPCA algorithm is proposed, and a process monitoring approach based on the new algorithm is developed for fault detection. The proposed method further decomposes both the KPCA principal space and the residual space into two subspaces by checking how they are influenced by process disturbances. The basic objective of the further subspace decomposition is to separate the part that is greatly influenced by the fault from the part that is not clearly fault relevant, that is, to find the fault-relevant directions and fault-relevant principal components. A new monitoring method is then proposed based on the fault-relevant directions. In contrast to traditional statistical techniques, the fault subspace is separated according to the fault-relevant influence.

The remaining sections of this paper are organized as follows. Section 2 revisits the KPCA model and then presents the fault-relevant KPCA algorithm. Section 3 introduces model development and the on-line monitoring strategy based on fault-relevant KPCA. In Section 4, simulation results are given to illustrate the effectiveness of the new method. Finally, conclusions are drawn in Section 5.

2. Algorithm of Fault-Relevant KPCA

For the traditional PCA algorithm, some faults may not influence all the principal directions; that is, for a given fault, some principal directions are not relevant. The KPCA algorithm extends PCA to nonlinear data, so it has the same characteristic [35]. The proposed fault-relevant KPCA algorithm finds the principal directions that are relevant to, or affected by, the disturbances and then measures the changes of the variation along these principal directions. Therefore, the proposed algorithm has higher sensitivity and accuracy for process monitoring and can detect faults faster.

The purpose of the proposed algorithm is to obtain the fault-relevant principal directions of the systematic subspace and those of the residual subspace. With the obtained fault-relevant principal directions and a new set of data, the scores of the new data can be computed, and the $T^2$ and $Q$ statistics can then be calculated to monitor the process.

In KPCA, the training samples $x_j \in \mathbb{R}^m$, $j = 1, \dots, N$, obtained from the normal process are mapped into a feature space $F$ using a nonlinear mapping $\Phi: \mathbb{R}^m \to F$. The covariance matrix in the feature space can be calculated as
$$ C^{F} = \frac{1}{N}\sum_{j=1}^{N}\Phi(x_j)\Phi(x_j)^{T}, \tag{2.1} $$
where it is assumed that $\sum_{j=1}^{N}\Phi(x_j) = 0$, and $\Phi$ is the nonlinear mapping function that projects the input vectors from the input space to $F$. Principal components in $F$ can be obtained by finding the eigenvectors of $C^{F}$, which is analogous to the PCA procedure in the input space, using
$$ \lambda v = C^{F} v, \tag{2.2} $$
where $\lambda$ denotes the eigenvalues and $v$ the eigenvectors of the covariance matrix $C^{F}$.

For $\lambda \neq 0$, the solution (eigenvector) $v$ can be regarded as a linear combination of $\Phi(x_1), \dots, \Phi(x_N)$, that is, $v = \sum_{i=1}^{N}\alpha_i\Phi(x_i)$.

Using the kernel trick $k(x_i, x_j) = \langle\Phi(x_i), \Phi(x_j)\rangle$, the eigenvalue problem can be expressed in the simplified form
$$ N\lambda\alpha = K\alpha, \tag{2.3} $$
where $\alpha = [\alpha_1, \dots, \alpha_N]^{T}$ and $K \in \mathbb{R}^{N\times N}$ is the Gram matrix composed of $[K]_{ij} = k(x_i, x_j)$.

The calculation is then equivalent to solving the eigenproblem of (2.3), with $K$ replaced by its centered version. To satisfy the assumption $\sum_{j=1}^{N}\Phi(x_j) = 0$, the mapped data must be mean-centered before the calculation. The centered Gram matrix can be easily obtained as $\tilde{K} = K - 1_N K - K 1_N + 1_N K 1_N$, where each element of $1_N \in \mathbb{R}^{N\times N}$ is equal to $1/N$. Also, the coefficient vector $\alpha_j$ should be normalized to satisfy $\lambda_j\langle\alpha_j, \alpha_j\rangle = 1$, where $\lambda_j$ is the corresponding eigenvalue of $\tilde{K}$; this corresponds to the normality constraint $\langle v_j, v_j\rangle = 1$ of the eigenvector.

The scores of a vector $x$ are then extracted by projecting $\Phi(x)$ onto the eigenvectors $v_j$ in $F$, and the number of scores is $N$. According to a selection principle, $k$ scores are selected to be the principal components, and the corresponding $k$ directions are obtained at the same time [36, 37]. The $k$ selected directions span the systematic subspace and the remaining $N - k$ directions span the residual subspace. The PCs of the training data $X$ are
$$ T = [t_1, \dots, t_k] = \tilde{K}A_k, \qquad A_k = [\alpha_1, \dots, \alpha_k], $$
where $k$ is the number of principal components.
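A minimal NumPy sketch of the computations above (Gram matrix, centering, eigendecomposition, normalization, and score extraction) is given below. The Gaussian kernel width c and the number of retained components k are user-chosen assumptions, and the helper names are hypothetical.

import numpy as np

def gaussian_kernel(X, Y, c=100.0):
    """Kernel matrix with entries exp(-||x_i - y_j||^2 / c)."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / c)

def kpca_train(X, k, c=100.0):
    """Fit KPCA on normal data X (N x m) and keep k principal components."""
    N = X.shape[0]
    K = gaussian_kernel(X, X, c)
    one_N = np.ones((N, N)) / N
    K_c = K - one_N @ K - K @ one_N + one_N @ K @ one_N    # centered Gram matrix
    lam, alpha = np.linalg.eigh(K_c)                       # ascending order
    idx = np.argsort(lam)[::-1]
    lam, alpha = lam[idx], alpha[:, idx]
    keep = lam > 1e-12                                     # drop numerically zero modes
    lam, alpha = lam[keep], alpha[:, keep]
    alpha = alpha / np.sqrt(lam)                           # lam_j * <alpha_j, alpha_j> = 1
    T = K_c @ alpha[:, :k]                                 # training scores (PCs)
    return dict(X=X, K=K, alpha=alpha, lam=lam, k=k, c=c, T=T)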

Now that the PCs of the training data have been obtained, the PCs that are relevant to faults are found as follows.

First, the fault process space is separated into a systematic subspace and a residual subspace following the separation rule of the normal process space. A data set $X_f$ collected from a fault case is projected into $F$ with the same mapping function to obtain
$$ T_f = \tilde{K}_f A_k, $$
where $\tilde{K}_f$ is the centered kernel matrix between the fault data and the normal training data, $A_k = [\alpha_1, \dots, \alpha_k]$, and the corresponding eigenvectors $[v_1, \dots, v_k]$ span the systematic subspace of $F$.

Then, the fault-relevant PCs of the fault data, $T_{f,\mathrm{re}}$, can be obtained via (2.7); from (2.7), (2.8) can be obtained, where the subscript $\mathrm{re}$ denotes fault relevant.

The fault-relevant PCs of the normal data, $T_{\mathrm{re}}$, can be calculated in the same way, by projecting the normal-data PCs onto the same fault-relevant directions.
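To make these steps concrete, the sketch below projects a fault data set through the normal KPCA model and then extracts candidate fault-relevant directions. Since (2.7) and (2.8) are not reproduced here, the direction-extraction step is only an assumption made for illustration (an eigendecomposition of the fault-score covariance); gaussian_kernel and the model dictionary are the hypothetical helpers from the previous sketch.

import numpy as np

def kpca_project(model, X_new):
    """Systematic-subspace scores of new data under the normal KPCA model.
    Uses gaussian_kernel and the model dict from the previous sketch."""
    X, K, alpha, k, c = model["X"], model["K"], model["alpha"], model["k"], model["c"]
    N, N_t = X.shape[0], X_new.shape[0]
    K_t = gaussian_kernel(X_new, X, c)                        # cross kernel (N_t x N)
    one_N, one_t = np.ones((N, N)) / N, np.ones((N_t, N)) / N
    K_tc = K_t - one_t @ K - K_t @ one_N + one_t @ K @ one_N  # centered with training mean
    return K_tc @ alpha[:, :k]

def fault_relevant_rotation(T, T_f):
    """Illustrative stand-in for (2.7)-(2.8): principal directions of the
    fault-score covariance, and both score sets rotated onto them."""
    _, _, Vt = np.linalg.svd(T_f - T_f.mean(axis=0), full_matrices=False)
    theta = Vt.T                                # candidate fault-relevant directions
    return theta, T @ theta, T_f @ theta        # directions, T_re, T_f_re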

In this way, the largest fault-relevant directions of the normal data and the fault data are revealed, respectively. Define the ratio of the fault-relevant PC variances between the fault case and the normal case as
$$ R_i = \frac{\operatorname{var}(t_{f,\mathrm{re},i})}{\operatorname{var}(t_{\mathrm{re},i})}, \qquad i = 1, \dots, k, $$
where $\operatorname{var}(\cdot)$ denotes the variance of the PCs and the subscript $i$ denotes the $i$th column vector of $T_{f,\mathrm{re}}$ as well as of $T_{\mathrm{re}}$.

The largest value of $R_i$ indicates the direction along which the process variation changes most from the normal status to the fault case. If the index is smaller than 1, the concerned variations in the fault status are smaller than those in the normal case. The directions with values of $R_i$ larger than 1 are kept; these are the fault-relevant directions with increased variations. The number of retained principal directions is denoted by $k_{\mathrm{re}}$. The fault-relevant principal directions compose $P_{\mathrm{re}}$ and the remaining $k - k_{\mathrm{re}}$ directions, which are fault irrelevant, compose $P_{\mathrm{ir}}$. $T_{\mathrm{re}}$ are the fault-relevant PCs of the normal data, with $k_{\mathrm{re}}$ components, and $T_{\mathrm{ir}}$ are the fault-irrelevant PCs, with $k - k_{\mathrm{re}}$ components.
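A small sketch of this selection rule, assuming the rotated score matrices from the previous sketch are available; the larger-than-1 threshold follows the rule stated above.

import numpy as np

def split_by_variance_ratio(T_re, T_f_re):
    """Split systematic directions by the variance ratio R_i (fault over normal).
    T_re:   fault-relevant PCs of the normal data, shape (N, k)
    T_f_re: fault-relevant PCs of the fault data,  shape (N_f, k)"""
    R = T_f_re.var(axis=0, ddof=1) / T_re.var(axis=0, ddof=1)
    rel = np.where(R > 1.0)[0]      # increased variation: fault-relevant directions
    irr = np.where(R <= 1.0)[0]     # fault-irrelevant directions
    return R, rel, irr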

The directions $v_j$, $j = 1, \dots, k$, span the systematic subspace and the directions $v_j$, $j = k+1, \dots, N$, span the residual subspace. Let $k_e$ denote the number of principal directions in the residual subspace and define
$$ P_e = [v_{k+1}, \dots, v_{k+k_e}], $$
with the corresponding coefficient matrix $A_e = [\alpha_{k+1}, \dots, \alpha_{k+k_e}]$.

Then, the PCs of the normal case in the residual subspace can be calculated as
$$ T_e = \tilde{K}A_e. $$

Similarly, the PCs of the fault case in the residual subspace can be calculated as
$$ T_{f,e} = \tilde{K}_f A_e. $$

Following the procedure of (2.7), the fault-relevant principal directions and the corresponding principal components in the residual subspace of the fault case, $T_{f,e,\mathrm{re}}$, can be obtained as in (2.15). Then the principal components in the residual subspace of the normal case, $T_{e,\mathrm{re}}$, can also be worked out with the fault-relevant principal directions obtained in (2.15).

The fault-relevant residual PCs of the fault case are thus $T_{f,e,\mathrm{re}}$ and those of the normal case are $T_{e,\mathrm{re}}$. Define the ratio of the squared errors between the fault case and the normal case along each direction of the fault-relevant residual subspace as
$$ R_{e,i} = \frac{\left\| t_{f,e,\mathrm{re},i} \right\|^{2}}{\left\| t_{e,\mathrm{re},i} \right\|^{2}}, \qquad i = 1, \dots, k_e, $$
where the subscript $i$ denotes the $i$th column of the corresponding score matrix.

The largest value of $R_{e,i}$ indicates the direction along which the squared errors change most from the normal status to the fault case. The fault-relevant residual directions with values of $R_{e,i}$ larger than 1, that is, the directions with increased squared errors, are kept. The final number of dimensions of the fault-relevant residual subspace is $k_{e,\mathrm{re}}$. Correspondingly, the fault-relevant residual subspace is spanned by $P_{e,\mathrm{re}}$, which is composed of the sorted directions extracted from $P_e$. The remaining directions of $P_e$, which are fault irrelevant, compose $P_{e,\mathrm{ir}}$. The fault-relevant residual PCs of the normal case compose $T_{e,\mathrm{re}}$, which has $k_{e,\mathrm{re}}$ components, and the fault-irrelevant PCs compose $T_{e,\mathrm{ir}}$, with $k_e - k_{e,\mathrm{re}}$ components.
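A parallel sketch for the residual subspace, assuming the residual-subspace score matrices of the normal and fault cases are available; if the two data sets differ in size, normalizing each sum of squares by its sample count is a reasonable adjustment (an assumption, not stated in the text).

import numpy as np

def split_residual_by_squared_error(T_e, T_f_e):
    """Split residual-subspace directions by the squared-error ratio R_e,i.
    T_e:   residual-subspace scores of the normal data, shape (N, k_e)
    T_f_e: residual-subspace scores of the fault data,  shape (N_f, k_e)"""
    R_e = (T_f_e ** 2).sum(axis=0) / (T_e ** 2).sum(axis=0)
    rel = np.where(R_e > 1.0)[0]    # increased squared errors: fault-relevant
    irr = np.where(R_e <= 1.0)[0]   # fault-irrelevant
    return R_e, rel, irr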

There exist a number of kernel functions. According to Mercer’s theorem of functional analysis, there exists a mapping into a space where a kernel function acts as a dot product if the kernel function is a continuous kernel of a positive integral operator. Hence, the requirement on the kernel function is that it satisfies Mercer’s theorem. Theoretically, all functions that satisfy Mercer’s theorem can be utilized; among the most widely used kernel functions are the Gaussian kernel $k(x, y) = \exp\left(-\|x - y\|^{2}/c\right)$, the polynomial kernel $k(x, y) = \langle x, y\rangle^{d}$, and the sigmoid kernel $k(x, y) = \tanh\left(\beta_{0}\langle x, y\rangle + \beta_{1}\right)$, where $c$, $d$, $\beta_{0}$, and $\beta_{1}$ are specified a priori by the user. The Gaussian kernel is selected in this paper for its good performance.
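For reference, the three kernels listed above written as plain functions; c, d, beta0, and beta1 are the user-specified parameters.

import numpy as np

def gaussian(x, y, c=100.0):
    return np.exp(-np.sum((x - y) ** 2) / c)

def polynomial(x, y, d=2):
    return np.dot(x, y) ** d

def sigmoid(x, y, beta0=1.0, beta1=0.0):
    return np.tanh(beta0 * np.dot(x, y) + beta1)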

3. On-Line Monitoring Strategy of Fault-Relevant KPCA

The fault-relevant KPCA-based monitoring method is similar to that using KPCA. Hotelling's $T^2$ statistic and the $Q$-statistic in the feature space can be interpreted in the same way. The two systematic subspaces each have their own $T^2$ statistic, and the two residual subspaces each have their own $Q$-statistic. Denote the $T^2$ statistic of the fault-relevant systematic subspace by $T^2_{\mathrm{re}}$ and that of the fault-irrelevant systematic subspace by $T^2_{\mathrm{ir}}$; denote the $Q$-statistic of the fault-relevant residual subspace by $Q_{\mathrm{re}}$ and that of the fault-irrelevant residual subspace by $Q_{\mathrm{ir}}$. For a new data sample $x_{\mathrm{new}}$, the four statistics are obtained as
$$ T^{2}_{\mathrm{re}} = t_{\mathrm{re}}\Lambda_{\mathrm{re}}^{-1}t_{\mathrm{re}}^{T}, \tag{3.1} $$
$$ T^{2}_{\mathrm{ir}} = t_{\mathrm{ir}}\Lambda_{\mathrm{ir}}^{-1}t_{\mathrm{ir}}^{T}, \tag{3.2} $$
$$ Q_{\mathrm{re}} = \sum_{i=1}^{k_{e,\mathrm{re}}}\left(t_{e,\mathrm{re},i}\right)^{2}, \tag{3.3} $$
$$ Q_{\mathrm{ir}} = \sum_{i=1}^{k_{e} - k_{e,\mathrm{re}}}\left(t_{e,\mathrm{ir},i}\right)^{2}. \tag{3.4} $$

In (3.1) and (3.3), $P_{\mathrm{re}}$ denotes the fault-relevant directions of the systematic subspace, with $k_{\mathrm{re}}$ components, and $P_{e,\mathrm{re}}$ denotes the fault-relevant directions of the residual subspace, with $k_{e,\mathrm{re}}$ components; $t_{\mathrm{re}}$ is the score vector of $x_{\mathrm{new}}$ along $P_{\mathrm{re}}$, $\Lambda_{\mathrm{re}}$ is the diagonal matrix of the corresponding eigenvalues, and $t_{e,\mathrm{re},i}$ is the $i$th component of the score vector along $P_{e,\mathrm{re}}$.

In (3.2) and (3.4), $P_{\mathrm{ir}}$ denotes the fault-irrelevant directions of the systematic subspace, with $k - k_{\mathrm{re}}$ components, and $P_{e,\mathrm{ir}}$ denotes the fault-irrelevant directions of the residual subspace, with $k_{e} - k_{e,\mathrm{re}}$ components; $t_{\mathrm{ir}}$, $\Lambda_{\mathrm{ir}}$, and $t_{e,\mathrm{ir},i}$ are defined analogously.
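The sketch below evaluates the four statistics for one new sample whose systematic and residual scores have already been split into fault-relevant and fault-irrelevant parts; $T^2$ uses the Mahalanobis form with the training score variances and $Q$ sums the squared residual scores, which is the usual KPCA convention and only an assumed reading of (3.1)–(3.4).

import numpy as np

def t2_statistic(t_new, T_train):
    """Hotelling-type T^2 of one score vector, scaled by the training PC variances."""
    lam = T_train.var(axis=0, ddof=1)      # eigenvalue estimates of each retained PC
    return float(np.sum(t_new ** 2 / lam))

def q_statistic(t_e_new):
    """Q (SPE-like) statistic: sum of squared residual-subspace scores."""
    return float(np.sum(t_e_new ** 2))

# Example wiring (index arrays rel/irr and rel_e/irr_e come from the splits above):
# T2_re = t2_statistic(t_new[rel],   T_re_train[:, rel])
# T2_ir = t2_statistic(t_new[irr],   T_re_train[:, irr])
# Q_re  = q_statistic(t_e_new[rel_e])
# Q_ir  = q_statistic(t_e_new[irr_e])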

The confidence limit of $T^2$ is obtained using the $F$-distribution [38]:
$$ T^{2}_{\lim} = \frac{k(N^{2}-1)}{N(N-k)}F_{k, N-k, \alpha}, $$
where $N$ is the number of samples in the model and $k$ is the number of PCs.

The confidence limit of $Q$ can be computed from its approximate distribution [39]:
$$ Q_{\lim} = g\chi^{2}_{h,\alpha}, $$
where $g$ is a weighting parameter included to account for the magnitude of $Q$ and $h$ accounts for the degrees of freedom. If $m$ and $v$ are the estimated mean and variance of $Q$, then $g$ and $h$ can be approximated by $g = v/(2m)$ and $h = 2m^{2}/v$.
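Both limits can be computed with scipy.stats; N, k, the training-phase Q values, and the significance level are inputs assumed to be available from the modeling step.

import numpy as np
from scipy import stats

def t2_limit(N, k, alpha=0.01):
    """T^2 control limit k(N^2 - 1)/(N(N - k)) * F_{k, N-k} at level alpha."""
    return k * (N ** 2 - 1) / (N * (N - k)) * stats.f.ppf(1 - alpha, k, N - k)

def q_limit(Q_train, alpha=0.01):
    """Q control limit from the weighted chi-square approximation g * chi2_h."""
    m, v = np.mean(Q_train), np.var(Q_train, ddof=1)
    g, h = v / (2 * m), 2 * m ** 2 / v
    return g * stats.chi2.ppf(1 - alpha, h)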

3.1. Developing the Different Fault-Relevant Models

(1) Acquire normal operating data and several data sets of different known faults.
(2) Given a set of $m$-dimensional normal operating data $X = [x_1, \dots, x_N]^{T}$ and a set of $m$-dimensional fault data $X_f = [x_{f,1}, \dots, x_{f,N_f}]^{T}$, compute the kernel matrix $K \in \mathbb{R}^{N\times N}$ with $[K]_{ij} = k(x_i, x_j)$ and the kernel matrix $K_f \in \mathbb{R}^{N_f\times N}$ with $[K_f]_{ij} = k(x_{f,i}, x_j)$.
(3) Carry out centering in the feature space for $K$ and $K_f$ as follows:
$$ \tilde{K} = K - 1_N K - K 1_N + 1_N K 1_N, \qquad \tilde{K}_f = K_f - 1_{N_f} K - K_f 1_N + 1_{N_f} K 1_N, $$
where each element of $1_N \in \mathbb{R}^{N\times N}$ and of $1_{N_f} \in \mathbb{R}^{N_f\times N}$ is equal to $1/N$. For different faults, different $K_f$, that is, different models, can be obtained.
(4) Solve the eigenvalue problem (2.3) and normalize $\alpha_j$ such that $\lambda_j\langle\alpha_j, \alpha_j\rangle = 1$.
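A schematic assembly of the modeling steps, one model per known fault class; it relies on the hypothetical helpers sketched in Section 2 (kpca_train, kpca_project, fault_relevant_rotation, split_by_variance_ratio) and is not the authors' exact implementation. The residual-subspace split would be handled in the same way.

def build_fault_relevant_models(X_normal, fault_datasets, k, c=100.0):
    """fault_datasets: dict mapping a fault name to its data matrix (N_f x m)."""
    base = kpca_train(X_normal, k, c)                   # normal-data KPCA model
    models = {}
    for name, X_f in fault_datasets.items():
        T_f = kpca_project(base, X_f)                   # fault scores (systematic subspace)
        theta, T_re, T_f_re = fault_relevant_rotation(base["T"], T_f)
        R, rel, irr = split_by_variance_ratio(T_re, T_f_re)
        models[name] = dict(base=base, theta=theta, R=R, rel=rel, irr=irr,
                            T_re=T_re)                  # keep normal scores for T^2 scaling
    return models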

3.2. On-Line Monitoring

The main idea of on-line monitoring is that different models are developed with different fault data sets. Monitoring statistics are calculated, respectively, in these models with the on-line data at the same time. When the monitoring statistic of one model goes beyond its confidence limit, the abnormality is detected and the fault is identified at the same time, that is, as the type of fault with which that model was developed. The specific steps are as follows.
(1) Obtain the new data $x_{\mathrm{new}}$ at each sample instant.
(2) Given the $m$-dimensional test data $x_{\mathrm{new}}$, compute the kernel vector $k_{\mathrm{new}} \in \mathbb{R}^{1\times N}$, $[k_{\mathrm{new}}]_j = k(x_{\mathrm{new}}, x_j)$, where the $x_j$ are the normal operating data.
(3) Mean-center the test kernel vector as follows:
$$ \tilde{k}_{\mathrm{new}} = k_{\mathrm{new}} - 1_t K - k_{\mathrm{new}} 1_N + 1_t K 1_N, $$
where $K$ and $1_N$ are obtained from steps (2) and (3) of the modeling procedure and $1_t = \frac{1}{N}[1, \dots, 1] \in \mathbb{R}^{1\times N}$.
(4) For the test data $x_{\mathrm{new}}$, compute the scores $t_{\mathrm{re}}$, $t_{\mathrm{ir}}$, $t_{e,\mathrm{re}}$, and $t_{e,\mathrm{ir}}$ with $P_{\mathrm{re}}$, $P_{\mathrm{ir}}$, $P_{e,\mathrm{re}}$, and $P_{e,\mathrm{ir}}$ in the different models, respectively.
(5) Calculate the monitoring statistics of the four subspaces of the test data in the different models.
(6) Monitor whether $T^2_{\mathrm{re}}$, $T^2_{\mathrm{ir}}$, $Q_{\mathrm{re}}$, or $Q_{\mathrm{ir}}$ exceeds its control limit calculated in the modeling procedure.
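Finally, a hedged on-line monitoring loop over the per-fault models, following steps (1)–(6); for brevity only the systematic-subspace $T^2$ checks are written out, the residual-subspace $Q$ checks being analogous, and the limits dictionary is assumed to hold control limits computed with t2_limit above. kpca_project and t2_statistic are the hypothetical helpers from the earlier sketches.

def monitor_sample(x_new, models, limits):
    """Check one new sample against every fault-relevant model; return the
    names of the models whose statistics exceed their control limits."""
    alarms = []
    for name, mdl in models.items():
        base, theta = mdl["base"], mdl["theta"]
        t_new = (kpca_project(base, x_new.reshape(1, -1)) @ theta).ravel()  # rotated scores
        T2_re = t2_statistic(t_new[mdl["rel"]], mdl["T_re"][:, mdl["rel"]])
        T2_ir = t2_statistic(t_new[mdl["irr"]], mdl["T_re"][:, mdl["irr"]])
        if T2_re > limits[name]["T2_re"] or T2_ir > limits[name]["T2_ir"]:
            alarms.append(name)      # abnormality detected; fault type identified
    return alarms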

4. Simulation Study

The proposed fault-relevant KPCA method was applied to fault detection and diagnosis in benchmark simulations of the Tennessee Eastman (TE) process and the penicillin fermentation process and compared with the conventional KPCA model.

4.1. Tennessee Eastman Benchmark

The well-known TE process has been widely used for testing various process monitoring and fault diagnosis methods [11, 12] since it was first introduced by Downs and Vogel [40]. The process consists of five major operation units: a reactor, a product condenser, a vapor-liquid separator, a recycle compressor, and a product stripper. It contains two blocks of process variables: 41 measured variables and 11 manipulated variables. Process measurements are sampled at an interval of three minutes. Details of the process description can be found in Downs and Vogel's work.

As a complex chemical process, the TE process provides a suitable simulation platform for validating the proposed method. In this study, fifty-two variables, including the 41 process measurement variables and 11 manipulated variables, are used. Four hundred and eighty normal samples are used for model identification. Fifteen known faults, as described in Downs and Vogel's work, are considered. Faults 1–7 are associated with step changes in different process variables, for example, in the feed ratio and feed temperature. Faults 8–12 are associated with random variations in certain variables, for example, an increase in the variability of the reactor cooling water inlet temperature. For Fault 13, there is a slow drift in the reaction kinetics. For Faults 14 and 15, two cooling water valves are stuck.

Based on the KPCA algorithm, the normal process space is first decomposed into a systematic subspace and a residual subspace. Then some fault-relevant directions or principal components are picked out of the systematic subspace with the help of the information extracted from the fault data. In this article, Fault 1, Fault 7, and Fault 13 are used to develop different monitoring models. In the models built with these faults, all the principal components in the residual subspace are fault relevant, so the $Q$ charts are the same as those of KPCA.

For Fault 1, Figure 1(a) shows the $T^2$ statistic calculated with the fault-relevant principal components obtained by the fault-relevant KPCA method, and Figure 1(b) shows the KPCA $T^2$ statistic. Both statistics give alarm signals, but the fault-relevant $T^2$ goes out of control at an earlier sample; that is, the fault-relevant $T^2$ statistic detects the fault earlier than the KPCA $T^2$ statistic.

For Fault 7, the results in Figure 2 show that the fault-relevant $T^2$ statistic notices the fault earlier than the KPCA $T^2$ statistic. The real $T^2$ chart for this fault goes down when it detects the fault; therefore, it was inverted so that the conventional chart confidence limit could be used to detect the fault. The fault-relevant method detects the fault at an earlier sample than the KPCA method.

For Fault 13, as shown in Figure 3, the two statistics have the same monitoring result. Both statistics detect the fault at the same sample.

In summary, the proposed method pays more attention to the fault-relevant process variations and separates them from the fault-irrelevant variations for monitoring, whereas the KPCA model treats them together. For Fault 1 and Fault 7, the monitoring results show that the fault-relevant KPCA based monitoring performance is better than that based on the KPCA model. For Fault 13, the monitoring performance of the proposed method is not worse than that based on KPCA.

The choice of the kernel parameter is important for KPCA and other kernel methods and affects their performance. Similarly, the kernel parameter is also an influential factor in the proposed method and its monitoring results. As the kernel parameter changes, the shape of the monitoring chart changes. For some faults, a kernel parameter that works well for the $T^2$ statistic of KPCA may not be appropriate for the fault-relevant $T^2$ statistic, which may be sensitive to the fault with another kernel parameter. For some faults, the fault-relevant principal components are also sensitive, but the $T^2$ statistic calculated from these components is not satisfactory. Therefore, for some faults, the proposed method does not achieve a satisfactory performance.

4.2. Penicillin Fermentation

In this section, the proposed method is applied to the monitoring of a well-known benchmark process, the penicillin fermentation process. A flow diagram of the penicillin fermentation process is given in Figure 4. Trajectories of nine variables from a nominal batch run are shown in Figure 5. The production of secondary metabolites such as antibiotics has been the subject of many studies because of its academic and industrial importance. Here, we focus on the process for producing penicillin, which has nonlinear dynamics and multiphase characteristics. In the typical operating procedure for the modeled fed-batch fermentation, most of the necessary cell mass is obtained during the initial preculture phase. When most of the initially added substrate has been consumed by the microorganisms, the substrate feed begins. Penicillin starts to be generated in the exponential growth phase and continues to be produced until the stationary phase. A low substrate concentration in the fermentor is necessary for achieving a high product formation rate due to the catabolite repressor effect. Consequently, glucose is fed continuously during fermentation instead of being added all at once at the beginning. In the present simulation experiment, a total of 60 reference batches are generated using a simulator (PenSim v2.0). A detailed process description is available at http://www.chee.iit.edu/~cinar/software.htm. These simulations are run under closed-loop control of pH and temperature, while glucose addition is performed in open loop. Small variations are added automatically to mimic real normal operating conditions under the default initial setting conditions. The duration of each batch is 400 h, consisting of a preculture phase of about 45 h and a fed-batch phase of about 355 h [41, 42].

The models are constructed using the proposed method and KPCA and are then tested on the monitoring of fault batches. Fault 1 is implemented by introducing a 10% step increase in the aeration rate at 100 h, retained until 300 h. Fault 2 is implemented by introducing a 2% step increase in the aeration rate at 100 h, retained until 300 h. Fault 3 is implemented by introducing a 10% step increase in the agitator power at 100 h, retained until 300 h. The monitoring results are shown in Figures 6, 7, and 8, respectively. As shown in Figure 6, both the proposed fault-relevant KPCA method and KPCA can detect faults that vary over a large range. In our study, when the faults vary over a small range, the proposed method can still detect the faults successfully, whereas the $T^2$ statistic of KPCA cannot, as shown in Figures 7 and 8. Therefore, the proposed method can detect small faults and is more sensitive than KPCA for these faults.

5. Conclusions

In this article, the fault-relevant KPCA algorithm is proposed to decompose the process variations from the fault-relevant perspective. By further decomposing the KPCA subspaces, the underlying process information can be examined more comprehensively, which is helpful for the detection of abnormal changes. Fault-relevant principal components extracted from the KPCA systematic subspace and residual subspace are used to monitor the process. With the fault-relevant principal components, instead of all principal components, which may not be influenced by the disturbances, better monitoring results are obtained. Case studies on the TE process and the penicillin fermentation process are performed to show the performance of the fault-relevant KPCA algorithm for process monitoring. In general, swifter and more sensitive fault detection is achieved in comparison with the conventional KPCA method.

Acknowledgments

The work is supported by China’s National 973 program (2009CB320602 and 2009CB320604) and the NSF (60974057 and 61020106003).