A Hybrid ICA-SVM Approach for Determining the Quality Variables at Fault in a Multivariate Process

Shao, Yuehjen E.; Lu, Chi-Jie; Wang, Yu-Chiun

doi:https://doi.org/10.1155/2012/284910

Mathematical Problems in Engineering

On this page

Abstract Introduction Conclusion References Copyright Related Articles

Research Article | Open Access

Volume 2012 | Article ID 284910 | https://doi.org/10.1155/2012/284910

A Hybrid ICA-SVM Approach for Determining the Quality Variables at Fault in a Multivariate Process

Yuehjen E. Shao,¹Chi-Jie Lu,²and Yu-Chiun Wang¹

Academic Editor: Alexei Mailybaev

Received23 Mar 2012

Accepted30 Jul 2012

Published13 Sept 2012

Abstract

The monitoring of a multivariate process with the use of multivariate statistical process control (MSPC) charts has received considerable attention. However, in practice, the use of MSPC chart typically encounters a difficulty. This difficult involves which quality variable or which set of the quality variables is responsible for the generation of the signal. This study proposes a hybrid scheme which is composed of independent component analysis (ICA) and support vector machine (SVM) to determine the fault quality variables when a step-change disturbance existed in a multivariate process. The proposed hybrid ICA-SVM scheme initially applies ICA to the Hotelling T² MSPC chart to generate independent components (ICs). The hidden information of the fault quality variables can be identified in these ICs. The ICs are then served as the input variables of the classifier SVM for performing the classification process. The performance of various process designs is investigated and compared with the typical classification method. Using the proposed approach, the fault quality variables for a multivariate process can be accurately and reliably determined.

1. Introduction

In recent years, considerable concern has arisen over the multivariate statistical process control (MSPC) charts in monitoring a multivariate process [1–6]. The MSPC chart is one of the most effective techniques to detect the occurrence of a multivariate process disturbance. An out-of-control signal implies that disturbances have been occurred in the process. When a signal is triggered by the MSPC chart, the process personnel should begin to search for the root causes of the underlying disturbance. Once the root causes have been determined, the process personnel would significantly decrease the effects of the disturbance and then bring the underlying process back in a state of statistical control.

When the root causes have been determined, the necessary remedial actions can be properly taken in order to compensate for the effects of the underlying disturbance. Also, the identification and fixing of the root causes would mainly depend on the accurate identification of the quality variables at fault. As a consequence, the identification of the quality variables at fault in a multivariate process is a very important research issue.

However, the use of the MSPC charts typically encounters a major problem in the interpretation of the signal. Although the MSPC chart’s signal will indicate that the underlying process is out of control, the quality variables at fault are very difficult to determine. The degree of difficulty increases when the number of quality variables () in the multivariate process increases. Typically, there are possible sets of quality variable at fault in an out-of-control multivariate process which has quality variables. For example, there are 31 possible sets of quality variables at fault in a multivariate process with 5 quality variables. When a MSPC signal is triggered, it is not straightforward to determine which one of the 31 possible combinations is responsible for this signal.

Runger et al. [1] introduced a decomposition method to overcome this problem. They computed an approximate chi-square statistic to determine which of the monitored quality variables invoked the MSPC signal. However, their method has some limitations in certain situations [2]. Specifically, their approach may not be able to offer an accurate identification rate (AIR) when a small magnitude of process disturbance exists in a multivariate process. Some classification techniques are therefore developed to overcome the drawback of their approach [2, 3]. Shao and Hsu [2] used the Artificial Neural Networks (ANNs) and support vector machine (SVM) approaches to determine the quality variables at fault in the case of process mean shifts. C. S. Cheng and H. P. Cheng [3] also studied the ANN and SVM techniques to determine the quality variables at fault in the case of process variance shifts.

Huang et al. [4] demonstrated that performance of hierarchical support vector machine technique is better than the traditional SVM. Also, Shao et al. [5] proposed decomposition schemes and developed useful statistics to estimate the quality variables at fault in the case of variance shifts that have occurred in a multivariate process. However, in their approach, the sample size needed was very large, which may be different from what is encountered in practice.

Many studies on the utilization of one-shot or one-step classifiers’ approach have been conducted [1–4, 6]. However, very little is known about the hybrid scheme for determining the quality variables at fault in a manufacturing process [7, 8]. In this paper, we present the use of a hybrid mechanism, which integrates independent component analysis (ICA) and SVM as processing methods to improve the results in determining the quality variables at fault in an out-of-control multivariate process. The basic concept of the proposed hybrid approach is that the most useful information to determine the quality variables at fault may be embedded in the monitor statistics, for example, the Hotelling T² statistics in the Hotelling T² control chart. We could enhance the AIR if we decompose the monitor statistics and input the decomposed factors to the classifiers.

Due to its frequent use in real applications [2, 9, 10], this study uses the Hotelling T² control chart to detect the process mean shifts in a multivariate process. In addition, since the ICA has been reported to have the capability of distinguishability [11–19], this study uses the ICA as the first-step technique to extract the independent components (ICs) from Hotelling T² statistics. The hidden useful information of the quality variables at fault would be embedded in these ICs. In the second step of classification, those ICs are then used as the input variables of the classifiers. This study considers the SVM as a classifier for the reason of its great potential and superior performance in practical applications [20–27].

This study is organized as follows. Section 2 discusses the individual components of the proposed hybrid mechanism. Section 3 addresses the appropriate models for determining the quality variables at fault when the process mean shifts are introduced in a multivariate process. In this section, the various experimental settings and the simulation results are also discussed. The final section summarizes the research findings and presents our conclusions.

2. Methodologies

There are two components in our proposed hybrid scheme, and they include independent component analysis and the support vector machine. The following section addresses the applications and the use of these two techniques.

2.1. Independent Component Analysis

The present study employs ICA to enhance the accurate identification rate (AIR) of the proposed hybrid scheme. There are some ICA applications for process monitoring. Lu et al. [11] successfully combined the ICA and SVM to identify the control chart patterns. Kano et al. [12] applied the ICs, instead of the original measurements, to monitor a process. In their study, a set of devised statistical process control charts have been developed effectively for each IC. Lee et al. [13] used the utilization of kernel density estimation to define the control limits of ICs that do not satisfy Gaussian distribution. In order to monitor the batch processes which combine independent component analysis and kernel estimation, Lee et al. [14] extended their original method to multiway ICA. Xia and Howell [15] developed a spectral ICA approach to transform the process measurements from the time domain to the frequency domain and to identify major oscillations.

Let be a matrix of size , , consisting of observed mixture signals of size , . In the basic ICA model, the matrix can be modeled as follows: where is the th column of the unknown mixing matrix ; is the th row of the source matrix . The vectors are latent source signals that cannot be directly observed from the observed mixture signals . The ICA model aims at finding an demixing matrix such that where is the th row of the matrix , . The vectors must be as statistically independent as possible and are called independent components (ICs). ICs are used to estimate the latent source signals . The vector in (2.2) is the th row of the demixing matrix , . It is used to filter the observed signals to generate the corresponding independent component , that is, , .

The ICA modeling is formulated as an optimization problem by setting up the measure of the independence of ICs as an objective function and using some optimization techniques for solving the demixing matrix [28, 29]. The ICs with non-Gaussian distributions imply the statistical independence [28, 29], and the non-Gaussianity of the ICs can be measured by the negentropy [28]: where is a Gaussian random vector having the same covariance matrix as . is the entropy of a random vector with density defined as .

The negentropy is always nonnegative and is zero if and only if has a Gaussian distribution. Since the problem in using negentropy is computationally very difficult, an approximation of negentropy is proposed [28] as follows: where is a Gaussian variable of zero mean and unit variance, and is a random variable with zero mean and unit variance. is a nonquadratic function and is given by in this study. The FastICA algorithm proposed by [28] is adopted in this paper to solve for the demixing matrix . Two preprocessing steps are common in the ICA modeling, centering and whitening [28]. Firstly, the input matrix is centered by subtracting the row means of the input matrix, that is, . The matrix with zero mean is then passed through the whitening matrix to remove the second-order statistic of the input matrix, that is, . The whitening matrix is twice the inverse square root of the covariance matrix of the input matrix, that is, , where is the covariance matrix of . The rows of the whitened input matrix , denoted by , are uncorrelated and have unit variance, that is, . In this study, it is assumed that the training and testing process datasets are centered and whitened.

2.2. Support Vector Machine

The use of SVM algorithm can be described as follows. Let , , be the training set with input vectors and labels. Here, is the number of sample observations and is the dimension of each observation, is known target. The algorithm is to seek the hyperplane , where is the vector of hyperplane and is a bias term, to separate the data from two classes with maximal margin width , and all the points under the boundary are named support vector. In order to obtain the optimal hyperplane, the SVM was used to solve the following optimization problem [30]:

It is difficult to solve (2.5), and we need to transform the optimization problem to be dual problem by Lagrange method. The value of in the Lagrange method must be nonnegative real coefficients. Equation (2.5) is transformed into the following constrained form [30]: In (2.6), is the penalty factor and determines the degree of penalty assigned to an error. It can be viewed as a tuning parameter which can be used to control the tradeoff between maximizing the margin and the classification error.

In general, it could not find the linear separate hyperplane in all application data. For problems that cannot be linearly separated in the input space, the SVM uses the kernel method to transform the original input space into a high-dimensional feature space where an optimal linear separating hyperplane can be found. The common kernel function is linear, polynomial, radial basis function (RBF), and sigmoid. In this study, we used multiclass SVM method proposed by Hsu and Lin [31].

3. The Proposed Approach and the Example

3.1. The ICA-SVM Scheme

This study integrates ICA and SVM for determining the quality variables at fault of an out-of-control multivariate process. In the training phase, the aim of the proposed scheme is to obtain the proper parameter setting for the SVM model. Since the RBF kernel function is adopted in this study, the performance of SVM is primarily affected by the setting of parameters and . There are no general rules for the choice of those two parameters. This study uses the grid search proposed by Hsu et al. [32] for these two parameters setting. The trained SVM model with proper parameter setting is preserved and employed in the testing phase.

The proposed model first collects two sets of Hotelling T² statistics from an out-of-control process. The ICA model is used to generate the two estimated ICs from the observed Hotelling T² statistics. Subsequently, the proposed approach considers those two ICs and 3 averaged quality variables, 4 averaged quality variables, and 5 averaged quality variables as the inputs for SVM in the case of processes with 3 quality characteristics, 4 quality characteristics, and 5 quality characteristics, respectively.

3.2. The Simulated Example

This study employs a simulated example to demonstrate the use of our proposed approach. In our simulation, we assume that a multivariate process is initially in control, and the sample observations come from a multivariate normal distribution with known mean vector and covariance matrix . This study assumes that a disturbance has intruded into the underlying process at time . It results in a mean vector change which is shifted from to .

This study applies Hotelling T² control chart to monitor a multivariate process in the cases of 3, 4, and 5 quality characteristics. For each type of process, this study considers the following types of correlation, , between any two quality variables: (1) no correlation (i.e., ), (2) moderate correlation (i.e., ), and (3) high correlation (i.e., ). Now, consider a case of out-of-control multivariate normal process with 3 quality characteristics. Since the process has 3 quality characteristics (i.e., ), the possible sets of quality variables at fault would be . In our study, we use the following notations: (1,0,0), (0,1,0), (0,0,1), (1,1,0), (1,0,1), (0,1,1), and (1,1,1) to represent the 7 possible sets, in which “0” stands for the “in-control” state and “1” stands for the “out-of-control” state. The meaning of (1,1,0) stands for the first and second quality variables (i.e., and ) that are at fault while the third quality variable (i.e., ) is not at fault.

Without loss of generality, we assume that each quality characteristic for an in-control process is sampled from a normal distribution with zero mean and one standard deviation. We also assume that the out-of-control process has a mean shift of 1 standard deviation, and, thus, the out-of-control control process is sampled from a normal distribution with a mean of one and one standard deviation. The sample size () is assumed to be 5.

The sample averages (, , and 3) are used to calculate the Hotelling T² statistics. The Hotelling T² statistics are computed as follows: where : the sample size, : the mean vector at the time , : the grand mean vector of the quality characteristics, and : the inverse of variance and covariance matrix.

This study generates 100 data sets of observations (each of sample size 5) for every possible combination of fault sets. Since there are 7 possible sets of quality variables at fault in the case of , we have 700 data sets in a simulation run. Those 700 data sets are initially used to serve as the training data. This study generates another 700 data sets for the purpose of the testing. Figure 1 displays the 700 data sets of , , and in the cases of , , and , respectively. In the first step of classification, we also use the data set of out-of-control Hotelling T² statistics which is shown in Figure 2. Figure 3 displays the two ICs which are generated by using ICA technique.

(a)

(b)

(c)

(a)

(b)

(c)

(a)

(b)

(c)

3.3. The Results

Consider the case of a multivariate process with a three-quality characteristics (i.e., ). The typical approach directly uses four variables, , , , and the Hotelling T² statistics as inputs for SVM. Different from the typical approach, the proposed approach initially decomposes the Hotelling T² statistics as two ICs, and then the proposed approach uses those two ICs as the inputs for SVM classifier. Therefore, the proposed approach employs five variables, , , , and the two ICs, as the inputs for the classifier SVM. Tables 1, 2, and 3 report the accurate identification rates (AIRs) when the typical and proposed approaches apply to the multivariate process when , , and . In Table 1, in the case of , we notice that the AIRs are 79.6% and 78.2%, respectively, for the typical and proposed approaches. The same AIR interpretations apply to the remaining conditions for Tables 1, 2, and 3.

Observing Table 1, one is able to conclude that the AIR for the proposed approach is almost larger (or better) than the cases of typical approach except for the case of . This implies that the proposed approach has a better performance. Also, in the case of , the difference in performance between the two approaches is not significant. Those findings are displayed in Figure 4.

Observing Tables 2 and 3 for the cases of and , respectively, we can be very sure that the proposed approach outperforms the typical approach. The AIR values for the proposed approach are always larger. In addition, it is apparently that the AIR values become larger when the values of become larger. The values of AIR are smaller when the number of quality characteristics increases. Those research findings are demonstrated in Figures 5 and 6.

4. Conclusion

Determination of the quality variables at fault for an out-of-control multivariate process is very important in practice. While most of the studies use the single step of classification, this study proposes a hybrid or a two-step approach, ICA-SVM, to enhance the performance of the typical approach. Accordingly, our proposed approach has two more extra inputs, two ICs, for the SVM classifier models. Again, those two ICs are obtained from running the ICA models as the first-step modeling in our proposed scheme. The two ICs are then served as inputs for the second-step modeling in our proposed scheme. The proposed ICA-SVM hybrid mechanism is able to enhance the accurate identification rate for the determination of quality variables at fault in a multivariate process.

In this study, a multivariate process with 2, 3, and 5 quality variables and various correlations structures are considered for evaluating the performance between the typical one-step and proposed hybrid approaches. Experimental results strongly agreed that the proposed hybrid ICA-SVM scheme is able to produce the better accurate identification rate for the testing datasets. Observing the experimental results, we can strongly conclude that the proposed hybrid approach is able to effectively determine the quality variables for a multivariate process.

Our approach requires several steps and to total is quite complicated; therefore, we have not attempted analytic evaluation. However, we believe that our simulation example is generically applicable for monitoring real manufacturing processes when the circumstances of the processes resemble to the simulation conditions of this study. To make the proposed method more applicable, a multivariate process with 6 to 10 quality characteristics and a different set of correlations between quality characteristics will be discussed in future research.

Acknowledgment

This work is partially supported by the National Science Council of the Republic of China, Grant nos. NSC 99-2221-E-030-014-MY3 and NSC 101-2221-E-231-006.

References

G. C. Runger, F. B. Alt, and D. C. Montgomery, “Contributors to a multivariate statistical process control chart signal,” Communications in Statistics. Theory and Methods, vol. 25, no. 10, pp. 2203–2213, 1996.
View at: Publisher Site | Google Scholar | Zentralblatt MATH
Y. E. Shao and B. S. Hsu, “Determining the contributors for a multivariate SPC chart signal using artificial neural networks and support vector machine,” International Journal of Innovative Computing, Information and Control, vol. 5, no. 12, pp. 4899–4906, 2009.
View at: Google Scholar
C. S. Cheng and H. P. Cheng, “Identifying the source of variance shifts in the multivariate process using neural networks and support vector machines,” Expert Systems with Applications, vol. 35, no. 1-2, pp. 198–206, 2008.
View at: Publisher Site | Google Scholar
H. Y. Huang, Y. E. Shao, C. D. Hou, and M. D. Hsieh, “Identifying the contributors of the multivariate variability control chart using hierarchical support vector machines,” ICIC Express Letters, vol. 5, pp. 3543–3547, 2011.
View at: Google Scholar
Y. E. Shao, C. D. Hou, C. H. Chao, and Y. J. Chen, “A decomposition approach for identifying the sources of variance shifts in a multivariate process,” ICIC Express Letters, vol. 5, no. 4 A, pp. 971–975, 2011.
View at: Google Scholar
C. C. Chiu, Y. E. Shao, T. S. Lee, and K. M. Lee, “Identification of process disturbance using SPC/EPC and neural networks,” Journal of Intelligent Manufacturing, vol. 14, no. 3-4, pp. 379–388, 2003.
View at: Publisher Site | Google Scholar
Y. E. Shao and H. D. Hou, “Change point determination for a multivariate process using a two-stage hybrid scheme,” Applied Soft Computing. In press.
View at: Publisher Site | Google Scholar
C. D. Hou, Y. E. Shao, and S. Huang, “A combined MLE and generalized P chart approach to estimate the change point of a multinomial process,” Applied Mathematics & Information Sciences. In press.
View at: Google Scholar
R. L. Mason, N. D. Tracy, and J. C. Young, “Decomposition of T² for multivariate control chart interpretation,” Journal of Quality Technology, vol. 27, no. 2, pp. 99–108, 1995.
View at: Google Scholar
R. L. Mason and J. C. Young, “Improving the sensitivity of the T² statistic in multivariate process control,” Journal of Quality Technology, vol. 31, no. 2, pp. 155–165, 1999.
View at: Google Scholar
C. J. Lu, C. M. Wu, C. J. Keng, and C. C. Chiu, “Integrated Application of SPC/EPC/ICA and neural networks,” International Journal of Production Research, vol. 46, no. 4, pp. 873–893, 2008.
View at: Publisher Site | Google Scholar | Zentralblatt MATH
M. Kano, S. Tanaka, S. Hasebe, I. Hashimoto, and H. Ohno, “Monitoring independent components for fault detection,” AIChE Journal, vol. 49, no. 4, pp. 969–976, 2003.
View at: Publisher Site | Google Scholar
J. M. Lee, C. Yoo, and I. B. Lee, “Statistical process monitoring with independent component analysis,” Journal of Process Control, vol. 14, no. 5, pp. 467–485, 2004.
View at: Publisher Site | Google Scholar
J. M. Lee, C. Yoo, and I. B. Lee, “On-line batch process monitoring using different unfolding method and independent component analysis,” Journal of Chemical Engineering of Japan, vol. 36, no. 11, pp. 1384–1396, 2003.
View at: Publisher Site | Google Scholar
C. Xia and J. Howell, “Isolating multiple sources of plant-wide oscillations via independent component analysis,” Control Engineering Practice, vol. 13, no. 8, pp. 1027–1035, 2005.
View at: Publisher Site | Google Scholar
L. Wang and H. B. Shi, “Application of kernel independent component analysis for multivariate statistical process monitoring,” Journal of Donghua University, vol. 26, no. 5, pp. 461–466, 2009.
View at: Google Scholar
C. J. Lu, Y. E. Shao, and P. H. Li, “Mixture control chart patterns recognition using independent component analysis and support vector machine,” Neurocomputing, vol. 74, no. 11, pp. 1908–1914, 2011.
View at: Publisher Site | Google Scholar
C. H. Wang, T. P. Dong, and W. Kuo, “A hybrid approach for identification of concurrent control chart patterns,” Journal of Intelligent Manufacturing, vol. 20, no. 4, pp. 409–419, 2009.
View at: Publisher Site | Google Scholar
C. C. Hsu, M. C. Chen, and L. S. Chen, “Integrating independent component analysis and support vector machine for multivariate process monitoring,” Computers and Industrial Engineering, vol. 59, no. 1, pp. 145–156, 2010.
View at: Publisher Site | Google Scholar
Y. E. Shao, C. J. Lu, and C. C. Chiu, “A fault detection system for an autocorrelated process using SPC/EPC/ANN and SPC/EPC/SVM schemes,” International Journal of Innovative Computing, Information and Control, vol. 7, no. 9, pp. 5417–5428, 2011.
View at: Google Scholar
K. I. Kim, K. Jung, S. H. Park, and H. J. Kim, “Support vector machines for texture classification,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 11, pp. 1542–1550, 2002.
View at: Publisher Site | Google Scholar
K. S. Shin, T. S. Lee, and H. J. Kim, “An application of support vector machines in bankruptcy prediction model,” Expert Systems with Applications, vol. 28, no. 1, pp. 127–135, 2005.
View at: Publisher Site | Google Scholar
X. Wang, “Hybrid abnormal patterns recognition of control chart using support vector machining,” in Proceedings of the International Conference on Computational Intelligence and Security (CIS'08), pp. 238–241, December 2008.
View at: Google Scholar
S. Y. Lin, R. S. Guh, and Y. R. Shiue, “Effective recognition of control chart patterns in autocorrelated data using a support vector machine based approach,” Computers and Industrial Engineering, vol. 61, no. 4, pp. 1123–1134, 2011.
View at: Publisher Site | Google Scholar
P. Chongfuangprinya, S. B. Kim, S.-K. Park, and T. Sukchotrat, “Integration of support vector machines and control charts for multivariate process monitoring,” Journal of Statistical Computation and Simulation, vol. 81, no. 9, pp. 1157–1173, 2011.
View at: Publisher Site | Google Scholar
W. Gani, H. Taleb, and M. Limam, “An assessment of the kernel-distance-based multivariate control chart through an industrial application,” Quality and Reliability Engineering International, vol. 27, no. 4, pp. 391–401, 2011.
View at: Publisher Site | Google Scholar
J. Park, I. H. Kwon, S. S. Kim, and J. G. Baek, “Spline regression based feature extraction for semiconductor process fault detection using support vector machine,” Expert Systems with Applications, vol. 38, no. 5, pp. 5711–5718, 2011.
View at: Publisher Site | Google Scholar
A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis, John Wiley & Sons, 2001.
V. D. A. Sánchez, “Frontiers of research in BSS/ICA,” Neurocomputing, vol. 49, pp. 7–23, 2002.
View at: Publisher Site | Google Scholar | Zentralblatt MATH
V. N. Vapnik, The Nature of Statistical Learning Theory, Statistics for Engineering and Information Science, Springer, New York, NY, USA, 2nd edition, 2000.
C. W. Hsu and C. J. Lin, “A comparison of methods for multiclass support vector machines,” IEEE Transactions on Neural Networks, vol. 13, no. 2, pp. 415–425, 2002.
View at: Publisher Site | Google Scholar
C. W. Hsu, C. C. Chang, and C. J. Lin, “A practical guide to support vector classification,” Tech. Rep., Department of Computer Science and Information Engineering, National Taiwan University, 2003.
View at: Google Scholar

Copyright

Copyright © 2012 Yuehjen E. Shao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1427

Downloads

935

Citations