Fault Diagnosis of Batch Reactor Using Machine Learning Methods
Fault diagnosis of a batch reactor gives the early detection of fault and minimizes the risk of thermal runaway. It provides superior performance and helps to improve safety and consistency. It has become more vital in this technical era. In this paper, support vector machine (SVM) is used to estimate the heat release () of the batch reactor both normal and faulty conditions. The signature of the residual, which is obtained from the difference between nominal and estimated faulty values, characterizes the different natures of faults occurring in the batch reactor. Appropriate statistical and geometric features are extracted from the residual signature and the total numbers of features are reduced using SVM attribute selection filter and principle component analysis (PCA) techniques. artificial neural network (ANN) classifiers like multilayer perceptron (MLP), radial basis function (RBF), and Bayes net are used to classify the different types of faults from the reduced features. It is observed from the result of the comparative study that the proposed method for fault diagnosis with limited number of features extracted from only one estimated parameter () shows that it is more efficient and fast for diagnosing the typical faults.
Batch and semibatch reactors are extensively used in fine chemicals, biochemicals, pharmaceuticals, and food industries for the production of small amounts of products with high added value. As these processes have become more automated and more flexible, the demands on their efficiency have increased, creating more complexity in operation and control. However, the frequency of accidents increased involving important consequences on the human, natural, and economic environment [1, 2]. Therefore, fault diagnosis has become a major research topic. Batch processes usually involve a lot of variables that interrelate with one another. When any of these variables diverge away from their specified limits, a fault occurs. There are a plenty of studies on fault diagnosis varying from analytical methods to artificial intelligence and statistical approaches. The approaches of fault detection are based on the threshold checking in the past. Venkatasubramanian and Chan  proposed fault diagnosis to the continuous stirred tank reactors with neural network and identified six kinds of faults. The application of neural networks in fault diagnosis of chemical process focuses on the following aspects such as using as a classifier, using as a dynamic forecast model, and combining with other diagnostic methods. Later, pattern classification and model identification , EKF based fault detection , and fault diagnosis of ball bearing using machine learning method  were used.
In this paper, SVM model is used to generate the residual images. Fault classification has been done from the extracted image features. SVM is a novel machine learning method based on statistical learning theory. The SVM utilizes a hyperplane with maximum margin to produce a good generalization performance by separating different classes. As a result, SVM has been widely used for many applications, such as time series forecasting, fault detection, modelling of nonlinear dynamic systems, one-class SVM for machine fault detection and classification , SVM for copper clad laminate defects classification , and SVM for fault diagnosis of a steam turbine .
For this study, different major fault types like actuator fault, sensor fault, and process fault are considered. Hence the fault residuals are generated from the output of the nominal model and this faulty model. Data based fault diagnosis method requires a large amount of historical data. To make this method be more efficient, the first step is feature extraction . From the residuals signature, the most appropriate statistical and geometrical features are extracted and among these 15 features are selected. Through this characteristic extraction, the data can be transformed to the prior information of fault diagnosis system. SVM attribute filter is employed to rank the features, in order to reduce the input data dimension which makes the better performance of the classifier. And its performance is compared based on PCA feature reduction also.
This paper is mainly focused on identifying fault classification of batch reactor from the residual features using artificial intelligent classifiers such as multilayer perceptron (MLP), radial basis function (RBF), and Bayes net. The paper is organized as follows: Section 2 describes machine learning methods used in this work; Section 3 explains the case study of the well-known batch reactor; Section 4 gives fault identification method; Section 5 shows and discusses the simulation results. Finally, the conclusion from this work is presented in Section 6.
2. Machine Learning Methods
In this paper, the machine learning methods like SVM, ANN like MLP, and RBF are used. SVM is a supervised learning method, which is motivated in maximizing the ability to generalize well from a small number of training samples by mapping the original space into a high dimensional inner product space called feature space via a kernel. The SVM formulation follows structural risk minimization (SRM) principle in which an upper bound on the generalization error is minimized, whereas the error risk minimization (ERM) minimizes the prediction error on the training data. This equips the SVM with a greater potential to generalize the input-output relationship learnt during its training phase for making good predictions for new input data .
SVM’s solutions are characterized by convex optimization problems. So it can be applied in settling pattern recognition problems with small samples, nonlinearity, and higher dimensions. SVM can easily be introduced into learning problems such as function estimation.
In the batch reactor, the heat released by the reaction affects the dynamics of the reactor temperature. This term depends on the initial concentration, which is usually measurable at a very low sampling rate, not suitable for real time control or estimation. So any fault occurring in the batch reactor will have the impact on the heat release of the reactor. So a model developed based on the heat release of the reactor which can predict the type of fault occurred. But the quantity of is not directly measurable one. So an estimator is required to estimate it. Here SVM is used to build the estimator model.
Similarly, for classification of different faults, the usual ANN classifiers are used in this work. There are various neural network architectures but the application considered in this work has used MLP with back propagation learning algorithm, RBF, and the Bayes net [12–16].
3. Case Study
The proposed fault diagnosis scheme is applied to the batch reactor by Cott and Macchietto , Aziz et al. , Mujtaba et al. , and Sujatha and Pappa . The complex reaction scheme of a batch reactor is a representative of many industrial reactions as shown in Figure 1. The batch reactor is inherently a dynamic process. The reactions I and II of the batch reactor are given in the following equation: where , are the raw materials, is the desired product, and is the waste product.
This benchmark model is given on the basis of reaction equation (1), competent of simulating the reactions like the reactor temperature, jacket temperature, and heat release of reactor under nominal operating conditions and also under various faulty conditions.
These simulations are run under closed loop control with generic model controller (GMC) [20, 21]. The total running time of the batch process is 120 minutes (2 hours). The batch reactor model equations are given below:
The initial values of the above mentioned process parameters of [, , , , , ] are [12.0, 12.0, 0.0, 0.0, 20.0, 20.0] at . The reactor temperature is used as the control variable and is bounded between 20° and 100°C and the jacket temperature is the manipulated variable and it is bounded between 20° and 120°C. All the nominal parameters and constant values used in the model equations (2) are given in Table 1.
4. Fault Diagnosis of the Batch Reactor
Fault can be defined as any nonpermitted deviation of process behaviour from an acceptable one. So fault diagnosis is the problem of identifying and isolating unanticipated changes in a process. Diagnosis is a challenging problem due to several factors such as monitoring the number of variables, occurrences of the process complexity, and the variety of process failures. The failures can be broadly categorized into three types such as actuator faults, sensor faults, and process faults. The fault types are inspired from the relevant literature  presented in Figure 2.
Faults can be defined as follows.(i)Process faults: processes that occur below a certain level of detail are generally represented as lumped parameters in process models. An example of such a lumped parameter in this batch reactor is the fouling factor. Changes in these parameters are termed as process or parametric faults.(ii)Sensor failure: while all sensors have random errors, sensor failures refer to gross errors such as bias, measurement with noise, and frozen sensors.
Controller and actuator fault: the actuator faults are mostly caused by the nonlinear characteristics of the control valve by hysteresis, stiction, friction, and poor controller tuning. Actuator action in the presence of fault can be represented as where is the actuator fault vector. Abrupt constant bias has been given via the vector , so that the actuator action becomes as shown in Figure 3. Similarly the freezing of the actuator at certain time has been shown as , so that the actuator action is as shown in Figure 4. The faults that occurred in the batch reactor have a relationship with the heat release of the reactor.
4.1. SVM Estimator Model Based Fault Detection
Model based fault detection method is developed based on the assumption that the developed model is replica of the real plant dynamics. The input-output data are obtained by simulating the batch reactor with nominal operating conditions. The different faults have been introduced in the reactor through simulation by using MATLAB software. From the simulated input and output data, SVM estimator model is developed using LIBSVM toolbox. The heat release of the reactor () which is not a measurable parameter is estimated through the SVM model.
4.2. Training and Testing of the Estimator
Training the SVM estimator as shown in Figure 6 is an iterative process in which the SVM is given inputs along with the desired outputs. In this work, the SVM estimates the heat release of the reactor . The input and output mapping of the estimator is shown in Figure 5 where the past and present values of the reactor temperature and jacket temperature and the past values of the are considered as the input data. The estimator model is developed by selecting the SVM parameters such as , and the Radial Basis Function (RBF) as kernel to build and train the estimator as shown in Figure 6. Without any fault, the response of the estimator is shown in Figure 7.
The estimator models are developed under both normal and faulty conditions. The difference between faulty and nominal model is called residual (in terms of ) which is the important part of the fault diagnosing method as shown in Figure 8. Based on the residual patterns, the faults are identified through ANN classifiers.
4.3. Different Faults
Three different types of faults such as process fault, sensor fault, and actuator fault are introduced and data is collected for estimation of heat release of the reactor (). Each fault is introduced through simulation and the respective plant input and output data are collected every time. Table 2 shows the assigned fault for this work.
4.3.1. Process Fault
(i)Change in heat transfer coefficient : because of the fouling effect present in the heat exchanger, changes from its nominal value from batch to batch. Variation in residual for 10%, 20%, 30%, and 40% increase and decrease in are shown in Figure 9.(ii)Change in heat of reaction : the actual value of heat of reaction may not be available in the open literature and subsequently it can be the basic for model mismatch due to change in unmeasured parameter. The variation in residual for the heat of the reaction is reduced and increased by 10% and 25% from the nominal value as shown in Figure 10.(iii)Change in initial charge of reactants (, ): a change in product demand and accidental failure of the charging system or scale-up issues at the design stage will be the causes for the change in the operating conditions. The variation in residual for 10%, 20%, 25%, and 30% decrease and increase in initial charge are as shown in Figure 11.
In the heat release of the reactor residual patterns for the process fault, the differences in sign of magnitude, sluggishness of the response, decrease of the magnitude, and change of the starting position of the curve are observed.
4.3.2. Actuator Fault
The actuator fault considered here is as follows.(i)The addition of bias in the actuator shows the sticking nature of the actuator as shown in Figure 3.(ii)The actuator freezing occurred in the time interval 80 to 100 minutes as shown in Figure 4.
From the actuator fault of both freezing and biasing signatures, abrupt change is identified at the moment of fault occurring and based on the duration; the magnitude and the pattern of the residual vary.
4.3.3. Sensor Faults
The following sensor faults are considered in this work:(i)jacket temperature and reactor temperature measurements with the white noise;(ii)abrupt bias at sensor;(iii)abrupt zero at sensor.
Variation in residual for the sensor faults is shown in Figure 14.
The signature of the sensor fault pattern is varying with respect to the sources of different components and at the time of fault.
From the fault signatures, the relevant statistical and geometrical features are extracted. Here, 15 features such as area, mean, standard deviation, skew, kurtosis, fractional area, Feret’s diameter, integrated density, and raw integrated density are extracted by using Image J software. Few of the features are explained below.(i)Mean: average value of a signal is termed as mean value as given in (ii)Standard deviation: it is a measure of energy content in the fault signature shown in (iii)Skewness: it is a measure of symmetry or more precisely the lack of symmetry as expressed in (iv)Kurtosis: it is a measure of whether the data are peaked or flat relative to the normal distribution as given in where is the sample size and is the standard deviation.
5. Results and Discussion
The extracted features from the 24 simulated fault instances are listed in Table 3. The instance means the number of data points in the input data. The listed features are reduced to decrease the computational complexity. The feature reduction as well as fault classification is carried out by using WEKA software.
5.1. SVM Attribute Filter Based Feature Selection
SVM attribute filter is used to reduce the number of features by ranking method. Intuitively, the SVM model is a representation of the points in space mapped, so that the features of separate categories are divided by a clear gap. The new features are then mapped into the same space and predicted to belong to a category based on which side of the gap they fall on. The features are given as the input to the SVM attribute filter. The attributes are ranked by the square of the weight assigned by the SVM. An attribute selection for multiclass problem is handled by ranking attributes for each class separately using a one-versus-all method. Based on the attribute ranking Table 4, the number of input features is selected for the classifier to improve the performance of it.
5.2. Classifier Performance
After selecting the fault as an attribute for class, the classification is carried out with the ANN classifiers. The output of the WEKA software gives the confusion matrix, which shows a detailed accuracy of the classifier based on the class and evaluation of the success of the numeric prediction. In the multiple classification, the outcome of this test data displays a two-dimensional confusion matrix with a row and column for each class. Each matrix element shows the number of test examples for which the actual class is the row and the predicted class is the column. The confusion matrix shows each classifier’s ability to achieve in the classification of faults as shown in Table 5. From this, the radial basis function and multilayer perceptron have given better results as compared with the Bayes net classifier.
The training parameters for various classifiers for varying the number of features as 15 features, 9 features, and 5 features set as the input for the different classifiers are listed in Table 6, which denotes the values of the true positive (TP rate), false positive (FP rate), precision, recall, F-measure, and ROC area. The values of the TP rate, precision, recall, F-measure, and ROC area are “1” for RBF and MLP as compared with the Bayes net.
Classifiers performance criteria are shown in Table 7 for three sets of training pattern. From this table, MLP and RBF classified all instances correctly (100%) compared with Bayes net. Comparing MLP and RBF, the mean absolute error, root mean square error, relative absolute error, and root relative square error are less for RBF than MLP. So RBF classifier performance is better than MLP for this fault classification study.
From the SVM ranked classifier’s performance, it has been realised that the RBF is performing well even for less number of features and the MLP and Bayes performances are poor for less data. From the results, it could be observed that 9 features are enough to make all the classifiers’ performance good. The SVM attribute filter is used to reduce the number of input data.
The features are reduced by using PCA technique also. The correlation is the measure of similarity and the value for correlation is 1 and then the patterns are similar. If the correlation is “0” or negative then the patterns are dissimilar. The diagonal values of the correlation matrix shown in Table 8 are “1” which indicates that the similarity measure is calculated for the image with itself (autocorrelation). The other entries denote the cross correlation that is computed.
The eigenvectors of a square matrix are the nonzero vectors after being multiplied by the matrix, remaining proportional to the original vector (i.e., change only in magnitude, not in direction). The eigenvectors corresponding to the features are displayed in Table 9. For each eigenvector, the corresponding eigenvalue is the factor by which the eigenvector changes when multiplied by the matrix as they represent the characteristic values shown in Table 10.
Table 11 shows the classifier performance based on the SVM with 9 features and PCA with full data as two attributes are more or less same for the RBF classifier. But for MLP and also for Bayes net, the SVM attribute filter based classification results show its better nature than PCA.
This paper proposes a method for classification of various faults of the batch reactor by using artificial neural network (ANN) classifiers such as multilayer perceptron (MLP), radial basis function (RBF), and Bayes net from fault signatures. It is based on the relationship of each fault’s impact on heat release () of the reactor, which is the immeasurable parameter, estimated by SVM estimator. The fault signature trend is varying from fault to fault. Most appropriate features are selected from the variety of fault signatures and the numbers of attributes are reduced based on the SVM attribute filter by using ranking method and PCA, to reduce the computational difficulty of classifiers. It is observed that the classification accuracy of RBF and MLP is better than the Bayes net from the classifiers performance during training. Based on the performance criteria, the RBF performs well compared to MLP and Bayes net within 24 instances under the assumption that in practical situation less historical data are available. The results verify that the capability of the proposed fault diagnosis scheme is efficient and fast by using general ANN classifiers. It is based on the limited number of features extracted and selected from only one estimated parameter () itself. In future, it can be applied to develop a knowledge based system which is useful for early diagnosis of fault, to minimize the risk of thermal runaway for safety purpose of the batch reactor as well as to reduce the operating cost.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
V. Venkatasubramanian and K. Chan, “Neural network methodology for process fault diagnosis,” AIChE Journal, vol. 35, no. 12, pp. 1993–2002, 1989.View at: Google Scholar
S. Saludes and M. J. Fuente, “Support vector based novelty detection for fault tolerant control,” in Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference (CDC-ECC '05), pp. 5820–5825, Seville, Spain, December 2005.View at: Publisher Site | Google Scholar
K. Salahshoor, M. Kordestani, and M. S. Khoshro, “Fault detection and diagnosis of an industrial steam turbine using fusion of SVM (support vector machine) and ANFIS (adaptive neuro-fuzzy inference system) classifiers,” Energy, vol. 35, no. 12, pp. 5472–5482, 2010.View at: Publisher Site | Google Scholar
J. A. K. Suykens and J. Vandewalle, “Least squares support vector machine classifiers,” Neural Processing Letters, vol. 9, no. 3, pp. 293–300, 1999.View at: Google Scholar
J. S. R. Jang, C. T. Sun, and E. Mizutani, Neuro-Fuzzy and Soft Computing Computational Approach to Learning and Machine Intelligence, Prentice Hall of India, 2002.
R. E. Neapolitan, Probabilistic Reasoning in Expert Systems: Theory and Algorithms, John Wiley & Sons, 1989.
J. Pearl, Probabilistic Reasoning in Intelligent Systems, Morgan Kaufmann, San Francisco, Calif, USA, 1988.
H. Simon, Neural Networks—A Comprehensive Foundation, Pearson Education, San Francisco, Calif, USA, 2005.
M. J. Zurada, Introduction to Artificial Neural Systems, Delhi Jaico Publishing House, 1999.
B. J. Cott and S. Macchietto, “Temperature control of exothermic batch reactors using generic model control,” Industrial and Engineering Chemistry Research, vol. 28, no. 8, pp. 1177–1184, 1989.View at: Google Scholar
P. L. Lee and G. R. Sullivan, “Generic model control (GMC),” Computers and Chemical Engineering, vol. 12, no. 6, pp. 573–580, 1988.View at: Google Scholar