An Efficient Quality-Related Fault Diagnosis Method for Real-Time Multimode Industrial Process

Peng, Kaixiang; Wang, Bingzheng; Dong, Jie

doi:https://doi.org/10.1155/2017/9560206

Journal of Control Science and Engineering

On this page

Abstract Introduction Conclusion Acknowledgments References Copyright Related Articles

Special Issue

Fault Diagnosis and Application to Modern Systems

View this Special Issue

Research Article | Open Access

Volume 2017 | Article ID 9560206 | https://doi.org/10.1155/2017/9560206

An Efficient Quality-Related Fault Diagnosis Method for Real-Time Multimode Industrial Process

Kaixiang Peng,¹Bingzheng Wang,¹and Jie Dong¹

Academic Editor: Zhijie Zhou

Received22 Dec 2016

Accepted13 Feb 2017

Published12 Mar 2017

Abstract

Focusing on quality-related complex industrial process performance monitoring, a novel multimode process monitoring method is proposed in this paper. Firstly, principal component space clustering is implemented under the guidance of quality variables. Through extraction of model tags, clustering information of original training data can be acquired. Secondly, according to multimode characteristics of process data, the monitoring model integrated Gaussian mixture model with total projection to latent structures is effective after building the covariance description form. The multimode total projection to latent structures (MTPLS) model is the foundation of problem solving about quality-related monitoring for multimode processes. Then, a comprehensive statistics index is defined which is based on the posterior probability of the monitored samples belonging to each Gaussian component in the Bayesian theory. After that, a combined index is constructed for process monitoring. Finally, motivated by the application of traditional contribution plot in fault diagnosis, a gradient contribution rate is applied for analyzing the variation of variable contribution rate along samples. Our method can ensure the implementation of online fault monitoring and diagnosis for multimode processes. Performances of the whole proposed scheme are verified in a real industrial, hot strip mill process (HSMP) compared with some existing methods.

1. Introduction

With modern industrial processes getting increasingly complex and large, prevention monitoring and fault diagnosis have become the key to ensure safe operation, improve product quality, and gain economic benefits. Due to the complex operation mechanism, sheer size, complex conditions, chaotic environment, and vague boundary conditions in complex industrial systems, it is quite tough to implement effective process monitoring. As a result, the data-driven process monitoring technology has become one of the research hotspots in the field of fault diagnosis. The core idea of this technique is to establish the data model by means of using historical data, mining useful information, and getting the features of normal and fault operation mode, so as to realize process monitoring. In the last decades, basic multivariate statistical monitoring techniques, such as principal component analysis (PCA) and partial least squares (PLS), have been established and successfully applied in practice [1].

However, PCA or PLS model is established with data which follow the basis hypothesis of data subject to stable single Gaussian mode. Due to the reasons of fluctuation of raw materials, product specifications, and differences among batches, process data show the characteristic of multimode in actual industrial processes especially for batch processes. Considering the problems existing in the multimode process, traditional fault detection methods and their improved algorithms are difficult to be applied directly; otherwise, the performance of data model in process monitoring will be reduced.

Many scholars have studied a lot and made some progress on those problems [1]. Hwang and Han proposed a hierarchical clustering based on the PCA modeling method [2]. Lane et al. proposed a pooled principal component analysis method [3]. However, the ensemble modeling methods, in which the common feature of subspace in each mode is extracted as a unified model, are unable to fully or accurately depict all operation models. Particularly, when there are many differences among various modes, the model characterization in their methods is often biased. Chen and Liu used the heuristic smoothing clustering algorithm to classify data automatically, which can get multiple operating modes [4]. Zhao et al. applied multiple PCA and multiple PLS method to fault monitoring for multimode processes [5], in which the similarity index between different operating models was established and used to analyze the shift between the models. In view of stage division, Doan and Srinivasan modeled different stages of the process, respectively, for fault monitoring [6]. Dealing with the multimode problem of the process, the former divided the process data using the clustering method and then established independent models, so as to make fault monitoring more targeted. However, the above independent modeling methods are often complex, have large calculating quantity, and are usually based on the experience of mode division. Whether the division is reasonable or not will directly affect the quality of monitoring results. All the above increase the difficulties of online monitoring.

Considering the unique advantages in dealing with non-Gaussian data, the Gaussian mixture model (GMM) has not been explored in multimode process monitoring until recently. Choi et al. integrated PCA and DA with GMM to detect and isolate the faults in a process with nonlinearity, multistates, or dynamics [7]. Yoo et al. applied a similar strategy into multiway PCA to monitor biological batch processes [8]. However, these methods ignore the possibility that the monitored sample may come from other Gaussian components of lower posterior probabilities, which may lead to biased monitoring results. Yu and Qin proposed a new method that combines finite mixture Gaussian models with Bayesian inference to characterize different operation modes through Gaussian components and then realized fault detection [9]. In recent years, many scholars had proposed different methods to solve multimode monitoring [9].

The main contribution of this paper is summarized as follows. An efficient method for multimode process monitoring based on finite Gaussian mixture models is proposed. A gradient contribution rate is proposed to measure the contribution to the combined index and find out the variable which should be in charge of the fault in quality. This rate can better show the changes of variables contribution rate over time after fault occurrence.

The remainder of this paper is organized as follows. In Section 2, the descriptions of traditional PCA and PLS models in covariance form are provided, and then the covariance description form of the total projection to latent structures (TPLS) model is derived. Multimode information is extracted from the principal component space by GMM and a new multimode total projection to potential structure (MTPLS) model is established in Section 3. A unified monitoring framework based on MTPLS in combination with Bayesian inference is constructed and quality-related fault monitoring is implemented using a combined index in Section 4. In Section 5, a hot strip mill process is taken as an example to verify the superiority of our new method in fault monitoring and diagnosis over traditional methods. The conclusions and future works are given in Section 6.

2. Multivariate Statistical Theory

2.1. PCA and Covariance Description Form

Principal component analysis model is one of the most basic projection models in multivariate statistical analysis. Let be the dataset of -dimensional process variables, where stands for the number of samples. Matrix can be decomposed into a score matrix and a loading matrix as follows [10]:where and stand for score matrix and loading matrix, respectively, and is the number of principal components. The covariance matrix of normalized data can be defined as follows:

The PCA loading matrix can be obtained by eigenvalue decomposition on the covariance matrix .

Based on the projection model, monitoring statistics indexes and SPE can be constructed. Let ; the indexes can be designed as follows:where denotes the principal component covariance matrix and and are the control limit with the confidence level of .

When the residual error is subject to normal distribution, Jackson and Mudholkar pointed out that the control limit can be calculated as follows:where , , represents the threshold of standard normal distribution under the confidence level of , and represents the eigenvalue of covariance matrix .

Similarly, in order to apply the sample covariance information into the monitoring index, the principal component covariance matrix can be expressed as

2.2. PLS and Covariance Description Form

In the actual industrial production, the changes of quality variables are of more concern, especially for the faults which can cause the change of quality variables. PLS model uses the quality variables to guide the decomposition of sample space.

PLS decomposition of and results in the following:where , , and score matrix can be formulated with as .

Parameter matrix can be obtained by the loading matrix and weight matrix in PLS iterative calculation, .

According to the iterative process of the PLS model, Peng et al. proposed a model construction method using data covariance information [11], in which the covariance matrix of data was introduced into the iterative process, and model parameter matrices can be obtained at the same time. Compared with conventional PLS, the model construction method using data covariance information reduced the calculation amount although the intrinsic properties essence was not changed.

Different from the PCA projection model, the decomposition structure of space in PLS is defined by two matrices and , and an oblique projection structure is induced in input space. It is the quality that guides the decomposition of sample space, so that the principal component space is changed. The covariance matrix of the principal component space can be expressed as

Similar to PCA model monitoring, the monitoring sample statistics can be constructed by using the covariance matrix of the above formula as follows:

The control limit of the residual statistic can be calculated as follows:where , , represents the sample mean of residual statistic , represents the sample variance of , and is the threshold of variables with scale factor and free degree .

3. TPLS Monitoring Model

3.1. TPLS

PLS algorithm uses two variable spaces to describe process change. However, the main component of samples contains the part which is orthogonal to , and this part cannot reflect the variations related to . On the other hand, PLS decomposition structure makes the residual in remain very large, which is not suitable to be monitored by index . Therefore, Li et al. proposed a kind of total projection algorithm [12], which is based on traditional PLS decomposition. The original latent variable space is decomposed into one subspace relevant to quality variables directly and another subspace orthogonal to quality variables. At the same time, the residual space is decomposed into subspaces with large variance and residual subspace containing noise only, using the PCA orthogonal projection technique.

By further decomposition, we can model and as follows:where , , and . stands for the part which is relevant to directly in , stands for the part which is orthogonal to in , and stands for the part with large variance component in .

At the same time, based on the structure of PLS projection, Li et al. also performed a detailed analysis of the space structure of TPLS and drew a good conclusion [12]. Similar to PLS, TPLS also exhibits an oblique projection, but TPLS projects to four different spaces, which reflect different relationship among quality variables.

For a new measurement of sample , the corresponding score and residual part can be calculated as follows [12]:

Compared with PLS, TPLS model is easy to be explained and suitable for process monitoring. Similar to PLS in monitoring strategy, TPLS uses two statistic indexes and in process monitoring. In TPLS, , , and represent the main variation in the process, and thus they are suitable for statistic, and represents the residual part of the process which is suitable to be monitored by using statistic .

3.2. The Covariance Description of TPLS

The four spaces in TPLS can get a more detailed description of the different relationships between and quality variables . Based on the covariance matrix of the PLS model, the parameter matrices , , and will be obtained. Then, parameter matrix is calculated by .

Combining with the covariance description form of PCA and PLS model, we can do space decomposition in the following form.

In PCA decomposition of , characteristic vectors of the covariance matrix are extracted to construct . can be expressed as

Similarly, in PCA decomposition of and , we can extract characteristic vectors of each covariance matrix to form a loading matrix in corresponding space. Covariance matrices can be expressed aswhere

According to the score and the residual structure model of new measurement samples, let

It can be easily proved that this form is equivalent to the standard one.

The following part shows the calculation process of TPLS model using covariance information.

Covariance Description Form of TPLS Algorithm. Obtain and :(1)Use GMM-PLS algorithm, and obtain parameter matrix: , , , .(2)Calculate PCA decomposition of : do an eigenvalue decomposition on ; obtain the loading matrix and principal component number: .(3).(4)Calculate PCA decomposition of : do an eigenvalue decomposition on ; obtain the loading matrix and principal component number: .(5)Calculate PCA decomposition of : do an eigenvalue decomposition on ; obtain loading matrix and principal component number : based on the PCA method.

4. Multimode Process Monitoring and Fault Diagnosis

4.1. Mode Division of Principal Components

According to industrial process data with the characters of multimode, we need to determine a mixed model based on historical data firstly and then design a monitoring framework. Considering covariance information required for the statistical model, multimode modeling data can be processed by GMM. It is the assumption that data are made up of different Gaussian distributions. That is, for any sample data , it is possible to take a certain probability from different Gaussian distributions. As a result, global probability distribution can be expressed by the mixed model of the Gaussian elements. It can be expressed aswhere is the number of mixture components, denotes the weight of the th Gaussian component, and , represents the statistical parameters. Parameters estimation usually adopts EM iterative algorithm. The corresponding multivariate Gaussian density function for the th component is given by

According to the rule of Bayes inference, the posterior probability of belonging to the th Gaussian component is

However, due to factors such as production flow, batch, and specification, the quality variables of the final products have some certain degree of difference in real production processes. It may be the root cause that process data is with multimode and multistage features. Therefore, considering that the PLS algorithm is with the space decomposition under guidance of quality variables, this paper first performs mode division with principal component space and acquires the mode label of . This method carried out with the projection of training data can highlight the influence of quality variables better.

Based on advantages of GMM in processing multimode problems, we deal with principal components matrix with GMM for acquiring and . The total number of estimated parameters is , where is the number of the principal components. Usually, is far less than process variables number , which can reduce the number of estimated parameters greatly and speed up the calculation.

After mode division, principal component space model based on GMM is established, where each Gauss component corresponds to different mode characteristics. For training samples, can be divided into the modes whose principal variable belongs to

Taking process variables and output variables into account, we construct a new vector which stands for the process information as follows:

Assuming that variable is satisfied with mixed Gauss distribution, the distribution parameters can be acquired by mean and cov directly; prior probabilities are the same as principal space distribution .

Divide and into the forms of [11]

As above, it can be noted that mode classification will be under the guidance of quality variables. Then, because the number of principal components is far less than that of process variables, this has a great advantage in the treatment of estimated parameters calculation. In addition, after the mode division of original training data, multimode information such as covariance matrices can be directly calculated, which reduce the amount of calculation and improve calculation accuracy.

4.2. Multimode TPLS Based Fault Detection

According to the principle of building PLS and TPLS, the essence is to use data information, variance, and covariance to represent process characteristics. As far as PLS is concerned, the modeling process is to maximize the covariance of linear combinations of process variables and quality variables, so the modeling process can be converted into a covariance form through the initial data and . Therefore, in order to adapt to the multimode characteristic of industrial process data better, we can extend multivariate statistical methods to multimode scope by covariance strategy which will improve the performance of the monitoring model.

Based on the above analysis, we can make a rational division of training data to obtain the multimode information in the process of fault monitoring. When the sample is collected and is ready for being monitored, it can be divided into corresponding models with the probability, using Bayes classification ability under the data pretreatment. Then, we can calculate the monitoring statistic of the sample to justify which mode it belongs to. We treat the posterior probability of the monitoring sample belonging to each Gauss component as the membership degree of the corresponding model.

By using data information of probability and parameters to monitor the process, the comprehensive monitoring index is constructed, which can be used to monitor the fault reasonably.

For a new monitoring sample , the probability of sample data belonging to different modes is .

4.3. Comprehensive Monitoring Index

According to PCA decomposition of in TPLS, , the covariance matrix of principal components in space can be expressed as [13]

Available by and ,

In the same way, the covariance matrices of principal components in spaces and can be done as in the above proof:

In order to realize the multimode fault monitoring, the monitoring index based on the MTPLS model is obtained by using the probability information and Bayesian inference:

Similarly,

The threshold can be inferred by the setting in standard TPLS.

In summary, we make use of covariance information mainly to calculate and then to achieve process monitoring in MTPLS. Compared with standard TPLS, the covariance model is more suitable for monitoring multimode processes and making full use of data information in the process of model construction and fault monitoring. Avoiding direct classification on data, the covariance model reduces the effect of classification on the final performance monitoring of the process.

4.4. Quality-Related Combined Index

In TPLS based process monitoring, space represents the change part related to quality variable, while space represents the uncertain parts related to quality variable. They reflect two different kinds of quality-related faults. Therefore, it is necessary to observe two subspaces at the same time. In practice, a unified monitoring index is more popular than the two separate ones. In PCA based fault detection, Yue and Qin proposed a combined index [14]. Li et al. proposed a combined one for TPLS based process monitoring [12]. Similarly, a combined index which incorporates and is proposed in a way as follows: where is the threshold of this combined index which can be obtained by approximate distribution . It is supposed that there is no fault in the process when the monitoring result is .

Scale factor and free degree are calculated in where , which is the covariance matrix of process variable . Using this combined index, we can simultaneously monitor the anomalies in the two subspaces and thus monitor the faults associated with the quality variables .

4.5. Gradient Contribution Rate for Fault Diagnosis

It is necessary to isolate the faulty variables after a fault is detected. As a common fault separation method, the contribution plot assumes that the variables which have greater contribution to the monitoring statistics are very likely to be faulty variables. According to the description framework of complete decomposition of contribution proposed by Alcala and Qin, contribution to the combined index can be described as the following form [15]:where represents the th row of matrix , represents the th row of identity matrix, and represents the number of variables in one sample.

Traditional contribution plot method is used for analyzing a specific sample when the fault is detected, which shows the contribution value of each variable to one monitoring index in bar chart. After that, the variables with greater contribution will be selected as the possible cause of fault. Westerhuis et al. put forward a generalized contribution to statistics form and a method of obtaining the control limits for variable contributions [16]. Choi et al. proposed specific statistical methods to set the upper limit of the variable contribution to the four monitoring statistics [7]. Li et al. proposed a kind of contribution plot based on TPLS, which describes the contribution of all variables to monitoring index and in a unified way [12].

For the fault diagnosis method based on traditional contribution figure for one single sample after fault occurrence, there are some flaws that cannot well describe fault source and the change of other malfunction variables caused by fault source. In order to combine the idea of analyzing the contribution rate of faulty variables along the time coordinates with the change of the variable itself, reducing the impact of variable magnitude of value on the contribution rate, we refer to the gradient contribution rate to solve the fault variable analysis.

First, we introduce a mathematical symbol and a scale factor vector , where indicates element product. , and indicates the change of variable . As can be segmented, if , then ; if , then ; if , then . So, equation can be established.

It can be seen from the first-order Taylor series expansion of near that

Based on the above conclusion, the contribution rate may be defined as follows.

For a monitoring sample , indicates the contribution rate of the th variable to index .

As described above, the contribution rate represents the gradient of each variable to detection index under the same abnormal changes. Variables which are with great contribution will be considered with great influence to index , the same to quality variable.

For a new monitoring sample , the contribution rate of the th variable can be calculated as

As a result, the gradient contribution rate based on comprehensive monitoring index can be expressed as follows:where represents the value of the th variable in monitoring sample .

Due to the diffusion effect of fault, the method of setting absolute control limits using absolute value of variable contribution for fault diagnosis is not with good effect. Therefore, we use relative contribution rate; namely,where relative contribution rate satisfies

As described above, in index based quality-related fault diagnosis, the contribution rate can reflect contribution gradients of variables to the monitoring index. Therefore, those variables which have a larger contribution rate are able to affect combined index and quality variables significantly.

4.6. Framework of Fault Detection and Diagnosis

The schematic diagram of the proposed process monitoring and diagnosis is shown in Figure 1. Detailed procedures for multimode process detection can be summarized below:(1)Collect a set of historical training data under all possible operating modes and determine the number of modes.(2)Use EM algorithm to learn the Gaussian mixture model of principal component space and estimate the model parameter set based on the iterative steps.(3)Do multimode division and multimode information acquisition of process data according to . Then, for each monitored sample , compute its posterior probabilities belonging to all Gaussian components through Bayesian inference strategy.(4)Calculate local monitoring statistics for the monitored sample within each Gaussian component and integrate them into the comprehensive index with probabilities.(5)Integrate the quality-related monitoring statistics into a quality-related combined index .(6)Specify a confidence level for determining control threshold and generate the monitoring plot for all the monitored samples.(7)Detect the abnormal operating condition at the monitored samples satisfying which is helpful for fault diagnosis.(8)Calculate the relative contribution rate of variables to the combined index before and after fault occurrence and generate the contribution rate plot for fault diagnosis analysis.

5. Application to HSMP

5.1. Hot Strip Mill Process

HSMP (hot strip mill process) is an extremely complex industrial production process. In the process of production, improving the quality of products can bring about higher economic and social benefits for the factory. Typical HSM machine production line is mainly composed of reheating furnace, roughing mill, transfer table, crop shear, finishing mill group, run-out table cooling, and coiler. Figure 2 shows the whole process flow chart. The reheating furnace can ensure the temperature of the strip reaches 1200 degrees Celsius before roughing mill. A slab of thickness of 100~200 mm is sent to the roughing mill group after cutting off the scales, eventually forming 28~45 mm thick middle slab through several times of rolling. Through the transport of transfer table and in turn with insulation cover, crop shear, and high pressure water descaling, the slab runs into seven stands of finishing mill group. In order to enhance the performance of the final product, the steel plate needs to go through laminar cooling. This paper focuses on fault monitoring in the part of finishing mill process (FMP).

As shown in Figure 2, FMP consists of seven stands. Every stand contains two working rolls and backup rolls, which are driven by their own power drive units. The distance between two working rolls is called roll gap, which can be adjusted by the hydraulic device. A detailed structure diagram of the finishing roll is shown in Figure 3. This means that the strip will go through all the seven stands during the finishing mill process.

In whole FMP, it is noted that the stands are actually not working independently but are coupled with each other by different control schemes. The thickness in the exit of the last stand is the key factor which directly affects the quality of products. Whole finishing mill process is controlled by automatic thickness control system. It can be seen that there is an obvious hysteresis control of the exit thickness. Not until the abnormal value of the exit thickness is detected, caused by some fault of front stands, can the thickness control system be started. Therefore, establishing real-time acquisition of the relationship between the process variables and exit thickness and then monitoring the thickness by real-time measuring process variables become very meaningful [14].

5.2. Fault Detection Simulation Analysis

Production specification can be determined by different thicknesses of the steel strip in HSMP which should meet different industrial demands. We select the steel plate data of two specifications for modeling: one is the thickness of 2.70 mm and the other is 3.95 mm. The sampling interval for the variable is 0.01 s and 4000 samples are used for training modeling.

In the actual finishing mill process, we can collect the data information including roll gap, milling force between working rolls, and bending force in every stand. Generally speaking, the exit thickness has more relationships with roll gap and milling force than with bending force. Using data collected under normal operating conditions, GMM iterative learning is performed in principal component space which is under the guidance of quality variables. With the model division result, the process of multimode parameters calculation of the original data is carried out. Figure 4 shows the clustering distribution of two kinds of normal production. In this part, the clustering numbers are and which are fixed according to Yu et al. The -means algorithm is applied to roughly calculate . Randomly initialize the value of before GMM iterative learning. Then, we establish the proposed PLS-MTPLS model. Variables concerned in FMP are as follows.

(a)

(b)

Process Variables : average gap of stand, , μm : the force between supporting and working roll in stand, , KN : the bending force in the working roll of stand, , KN

Quality Variable : thickness of strip at the exit of FMP, mm

For different types of faults that may occur in FMP, we select three encountered faults as a detected object in this section which are shown in Table 1.

According to the exit thickness value of the strip steel under three types of fault condition, it is obvious that fault 1 and fault 3 are quality-related, while fault 2 is quality-unrelated. As Figures 6 and 9 show, the method based on PLS-MTPLS gives a higher fault detection rate for fault 1 and fault 3. And for the quality-unrelated fault 2, PLS-MTPLS inherits the effect of space division in traditional TPLS method, making the monitoring index which is directly related to the quality have a relatively low rate of false alarm.

To examine the advantages of our proposed approach, a comparison research has been done using two evaluating indexes: FDR and FAR.

False detection rates and false alarm rates are counted for three types of fault and statistical results are shown in Table 2. It shows that PLS-MTPLS method performs better.

Fault 1 represents the failure of hydraulic roll gap control structure. Fault occurs at about 20 s, namely, the 2000th monitoring sample. The values of roll gap in the fourth stand are directly affected, and then the sampling values of milling force in the fourth stand have also been affected. Because of the influence of feedback control system, roll gaps and milling forces will be changed in the following stands, and then finally the exit thickness is affected. As shown in Figure 5, there is a delay for change of exit thickness value, namely, quality variables with respect to fault occurrence. But for fault detection results, as shown in Figure 6, there is almost no delay. From the point of view of this analysis, detection results can be a good reference for field staffs, in order to take response measurements timely.

Figure 7 gives the observation of change of relative contribution rate for fault 1. As is shown, we can clearly see that many contribution rate values of related variables have changed since the 2000th monitoring sample. When the fault is detected, according to the observation of relative contribution rate, variable which has the largest relative contribution rate is diagnosed firstly. As a result, we can conclude that roll gap of stand is the source of fault. At the same time, variables and ~ are subsequently diagnosed which are affected by the fault source, thereby causing fault propagation. Figure 7 shows curves of variables change in real data of fault source variable and variables affected. The diagnostic analysis is in accordance with the actual production situation. As a result, the relative contribution rate can not only diagnose fault variables, but also show the order of fault variables transmission. Then, it can help in finding out the real source of fault with causal relationship among these variables.

Fault 2 represents the fault of sampling value of bending force in stand, which is a kind of step transition. When the fault occurs, the value of variable will increase greatly. Then, with feedback regulation of automatic control system, the bending force value in and stands will be changed correspondingly. But this kind of fault will only cause the change of strip plate, not thickness as shown in Figure 8.

Fault 3 is a kind of fault in cooling valve between and stands, which is usual in the process of finishing mill. It will make the rolling force and roll gap in stands following stand be changed. Based on the monitoring results of comprehensive index, fault 3 can be detected timely as shown in Figure 9. The change of relative contribution rate is shown in Figure 10. It can be noted that variable is affected firstly, followed by variable and others. From the above analysis, we can draw a conclusion that the fault diagnosis method based on the relative contribution rate can be applied to FMP effectively.

In this section, we focus on the research of exit thickness of the strip. Twenty variables among measured variables in FMP were selected for building PLS-MTPLS model. Based on it, a kind of comprehensive monitoring index and a kind of relative contribution rate were established for fault monitoring and diagnosis, respectively, for three common faults. Results of monitoring and diagnosis verified that PLS-MTPLS has higher FDR and lower FAR than traditional multivariate statistics methods shown in Table 2. In addition, compared with MTPLS which clusters with original data directly, this method is with better monitoring effects in statistics of principal component which can be seen in Table 2.

6. Conclusion

In this paper, a new PLS-MTPLS method is proposed on the basis of covariance descriptions of PCA and PLS algorithm for multimode process monitoring. After mode division of quality-related principal components, multimode information is embedded into the monitoring model by integrating GMM with TPLS, which avoids the direct use of process training data for modeling. Based on the quality-related multimode monitoring model PLS-MTPLS, a kind of comprehensive monitoring index is applied to execute real-time online monitoring. Then, a combined index is constructed for improving monitoring efficiency and extended to fault diagnosis by relative gradient contribution rate calculation.

The efficiency and superiority of PLS-MTPLS are demonstrated through application to the monitoring of HSMP. As can be seen from the comparison and analysis, the proposed approach can reduce computational complexity and be more suitable for multimode processes.

Competing Interests

The authors declare that there are no competing interests regarding the publication of this paper.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (NSFC) under Grants 61473033 and 61673032 and by Beijing Natural Science Foundation (4142035), China.

References

L. H. Chiang, R. D. Braatz, and E. L. Russell, Fault Detection and Diagnosis in Industrial Systems, Springer, London, UK, 2001.
D.-H. Hwang and C. Han, “Real-time monitoring for a process with multiple operating modes,” Control Engineering Practice, vol. 7, no. 7, pp. 891–902, 1999.
View at: Publisher Site | Google Scholar
S. Lane, E. B. Martin, R. Kooijmans, and A. J. Morris, “Performance monitoring of a multi-product semi-batch process,” Journal of Process Control, vol. 11, no. 1, pp. 1–11, 2001.
View at: Publisher Site | Google Scholar
J. Chen and J. Liu, “Mixture principal component analysis models for process monitoring,” Industrial and Engineering Chemistry Research, vol. 38, no. 4, pp. 1478–1488, 1999.
View at: Publisher Site | Google Scholar
S. J. Zhao, J. Zhang, and Y. M. Xu, “Monitoring of processes with multiple operating modes through multiple principle component analysis models,” Industrial and Engineering Chemistry Research, vol. 43, no. 22, pp. 7025–7035, 2004.
View at: Publisher Site | Google Scholar
X.-T. Doan and R. Srinivasan, “Online monitoring of multi-phase batch processes using phase-based multivariate statistical process control,” Computers and Chemical Engineering, vol. 32, no. 1-2, pp. 230–243, 2008.
View at: Publisher Site | Google Scholar
S. W. Choi, J. H. Park, and I.-B. Lee, “Process monitoring using a Gaussian mixture model via principal component analysis and discriminant analysis,” Computers and Chemical Engineering, vol. 28, no. 8, pp. 1377–1387, 2004.
View at: Publisher Site | Google Scholar
C. K. Yoo, K. Villez, I.-B. Lee, C. Rosén, and P. A. Vanrolleghem, “Multi-model statistical process monitoring and diagnosis of a sequencing batch reactor,” Biotechnology and Bioengineering, vol. 96, no. 4, pp. 687–701, 2007.
View at: Publisher Site | Google Scholar
J. Yu and S. J. Qin, “Multiway gaussian mixture model based multiphase batch process monitoring,” Industrial and Engineering Chemistry Research, vol. 48, no. 18, pp. 8585–8594, 2009.
View at: Publisher Site | Google Scholar
R. A. Johnson and D. W. Wichern, Applied Multivariate Statistical Analysis, Prentice-Hall, New York, NY, USA, 1998.
K. Peng, K. Zhang, B. You, and J. Dong, “Quality-related prediction and monitoring of multi-mode processes using multiple PLS with application to an industrial hot strip mill,” Neurocomputing, vol. 168, pp. 1094–1103, 2015.
View at: Publisher Site | Google Scholar
G. Li, S.-Z. Qin, Y.-D. Ji, and D.-H. Zhou, “Total PLS based contribution plots for fault diagnosis,” Acta Automatica Sinica, vol. 35, no. 6, pp. 759–765, 2009.
View at: Publisher Site | Google Scholar
K. Zhang, H. Hao, Z. Chen, S. X. Ding, and K. X. Peng, “A comparison and evaluation of key performance indicator-based multivariate statistics process monitoring approaches,” Journal of Process Control, vol. 33, pp. 112–126, 2015.
View at: Publisher Site | Google Scholar
H. H. Yue and S. J. Qin, “Reconstruction-based fault identification using a combined index,” Industrial and Engineering Chemistry Research, vol. 40, no. 20, pp. 4403–4414, 2001.
View at: Publisher Site | Google Scholar
C. F. Alcala and S. J. Qin, “Reconstruction-based contribution for process monitoring,” Automatica, vol. 45, no. 7, pp. 1593–1600, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
J. A. Westerhuis, S. P. Gurden, and A. K. Smilde, “Generalized contribution plots in multivariate statistical process monitoring,” Chemometrics and Intelligent Laboratory Systems, vol. 51, no. 1, pp. 95–114, 2000.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2017 Kaixiang Peng et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1501

Downloads

965

Citations