International Conference on Advances in Mechanical Engineering and Mechanics 2010
View this Special IssueResearch Article  Open Access
Fault Diagnosis of Rotating Machinery Based on Multisensor Information Fusion Using SVM and TimeDomain Features
Abstract
Multisensor information fusion, when applied to fault diagnosis, the timespace scope, and the quantity of information are expanded compared to what could be acquired by a single sensor, so the diagnostic object can be described more comprehensively. This paper presents a methodology of fault diagnosis in rotating machinery using multisensor information fusion that all the features are calculated using vibration data in time domain to constitute fusional vector and the support vector machine (SVM) is used for classification. The effectiveness of the presented methodology is tested by three case studies: diagnostic of faulty gear, rolling bearing, and identification of rotor crack. For each case study, the sensibilities of the features are analyzed. The results indicate that the peak factor is the most sensitive feature in the twelve timedomain features for identifying gear defect, and the mean, amplitude square, root mean square, root amplitude, and standard deviation are all sensitive for identifying gear, rolling bearing, and rotor crack defect comparatively.
1. Introduction
Typical rotating machinery systems such as water turbine, steam turbine, wind turbine, and rotary kiln are critical core equipment support of the important industries of the national economy [1, 2]. The safety, reliability, efficiency, and performance of rotating machinery are major concerns in industry, so, the task of condition monitoring and fault diagnosis of rotating machinery is significant [3]. The common mechanical defects of rotating machinery are divided into three categories: rotor body defects, such as unbalance, misalignment, rubbing, and rotor crack; rotor supportbearing defects, such as inner race, outer race or ball defect of rolling bearing, and oil whirl or oil whip of sliding bearing; transmission gear defects, such as chipped tooth defect or missing tooth defect. Inprocess monitoring and diagnostics of rotating machinery require reasoning about defect and process states from sensor readings. Often the relationship between the sensor readings and the process states is complex and nondeterministic. For a complex system, a single sensor is incapable of collecting enough data for accurate condition monitoring and fault diagnosis. Multiple sensors are needed in order to do a better job. When multiple sensors are used, data collected from different sensors may contain different partial information about the same machine condition. The diagnostic object can be described more comprehensively [4–6]. Compared with single sensor, the timespace scope and the quantity of information are expanded. The diagnostic accuracy and reliability can be improved. Multisensor information fusion can be categorized into three levels [7, 8]: datalevel fusion, featurelevel fusion, and decisionlevel fusion.
At datalevel fusion, all sensor data from a measured object are combined directly and features are then calculated from the fused data. Fusion of data at this level contains most information and can deliver good results. However, the sensors used in this level must be commensurate. That means the measurement has to be the same or has similar physical quantities or phenomena. During the most popular datalevel fusion methodology, such as weighted fusion [9], the weighted value of multisensor signals is difficult to determine. As a consequence, datalevel applications are limited in real environment. At featurelevel fusion, the features are calculated from each sensor according to the type of raw data. Then, these noncommensurate sensors features are combined at the feature level. All features are combined in turn into a bigger single feature set, which are then used in a special classification model such as artificial neural network (ANN), support vector machine (SVM), and cluster algorithm for decisions [10]. The featurelevel fusion is a compromise form of datalevel fusion and decisionlevel fusion. Its data alignment requirements are not strict as the datalevel fusion that heterogeneous sensors are allowed, and its information loss is less serious than the decisionlevel fusion but still achieved a better information compression. As a consequence, featurelevel applications are flexible and popular. At decisionlevel fusion, the processes of features calculation and pattern recognition are applied in sequence for singlesource data obtained from each sensor. The decision vectors are then fused using decisionlevel fusion techniques such as voting strategy, Bayesian method, behaviorknowledge space, and DempsterShafer theory [11]. Relatively speaking, there is maximum amount of information loss at decisionlevel.
This paper proposes a featurelevel fusion method for rotating machinery fault diagnosis. Generally, heterogeneous information fusion is executed at featurelevel fusion for mechanical condition monitoring and fault diagnosis in the present literature. For example, Barad et al. put forward the development of an ANN based model for condition monitoring of a power turbine that blends parameters belonging to performance, vibration, and lubrication [8]; Loutas et al. combined use of vibration, acoustic emission, and oil debris monitoring of rotating machinery [6]. The condition of mechanical system may be described in more detail by using heterogeneous information fusion, but this process needs multiclass sensors and its matching data acquisition systems, which would lead to higher monitoring costs and inconvenient operation of data acquisition in the real environment. ANN and SVM are the most popular classification models to execute decision at featurelevel fusion [12, 13]. The main difference between ANN and SVM is in their risk minimization. SVM is based on structural risk minimization principle, whereas ANN is based on traditional empirical risk minimization principle. The difference in risk minimization leads to a better generalization performance for SVM than that of ANN [14, 15]. SVM is powerful for solving the problem with small sampling, nonlinear and high dimension in machinery condition classification. In this paper, the proposed featurelevel fusion method belongs to homologous information fusion that the raw data all come from vibration sensors, so only a vibration testing system is needed for raw signal collected, which makes the process simpler. In this method, timedomain features are calculated from each vibration signal to compose a multidimensional feature set, and the SVM is selected as the classification model to process information fusion. In order to verify the effectiveness of the proposed method, fault diagnostic cases are tested, which include fault diagnosis of rolling bearing (identifying normal, inner race defect, outer race defect, and ball defect), fault diagnosis of gear (identifying normal, chipped tooth, and missing tooth), and fault diagnosis of rotor crack (identifying normal, crack depth of 3 mm, and crack depth of 5 mm). For each case study, the sensibilities of the features are analyzed.
2. Theory
2.1. Support Vector Machine (SVM)
The SVM is a machine learning method based on the statistical learning theory and structural risk minimization principle. Given two category sample sets (; ; ), is the number of samples. The optimal hyperplane separating the data can be obtained as a solution to the following optimization problem [15, 16]: where is weight vector, is scalar, is slack variable, and is error penalty.
The dual quadratic optimization description can be obtained by converting the problem with KuhnTucker condition into the equivalent Lagrangian dual problem: where is Lagrange coefficient, which must meet the following equation:
The support vector is the sample which satisfies the equation at the time of the nonzero . It reveals that the samples at the edge of distribution are essential for classification. This leads to the optimal classification decision function: where is the number of support vectors.
In linear inseparable condition, the samples (; ; ) in input space are mapped into high dimensional space where the optimal classification surface can be established through the nonlinear mapping . The nonlinear mapping is usually difficult to be solved while kernel functions meeting Mercer conditions can be used to solve this problem dexterously. The kernel function is described as follows:
The optimal classification decision function of linear inseparable samples is obtained using (5) into (4):
The common kernel functions include linear kernel function, poly kernel function, radial basis function (RBF) kernel function, and sigmoid kernel function.
The traditional SVM was originally designed for binary classification problems. However, many practical problems in fault diagnosis field are multiclassification. Now some effective multiclass support vector machines were proposed which include “oneagainstone,” “oneagainstall,” directed acyclic graph (DAG), and so on [15]. Hsu et al. have given a comparison of these methods and pointed out that the “oneagainstone” method is more suitable for practical use than other methods [17, 18].
2.2. TimeDomain Features
When the running conditions of the rotating machinery deviate from the normal condition, the timedomain statistical features of the vibration signal will be different from the normal condition. Furthermore, the timedomain statistical features will be also different under different defect models. Therefore, the timedomain statistics contain abundant defect information, and they can be used as sensitive character applied to fault diagnosis of rotating machinery. The timedomain statistical features used in this study are shown in Table 1.
 
in the table is discrete time series signal. 
2.3. Multisensors Information Fusion Model
The model of multisensor information fusion is used in this study and shown in Figure 1. The same character of different sensors is extracted to constitute a multidimensional vector and the SVM is used for pattern recognition. Twelve different timedomain features are analyzed one by one.
3. Case Studies
3.1. Data Acquisition
Experiments were performed on the machinery fault simulator (MFS) from SpectraQuest, Inc., shown in Figure 2. It can simulate most of faults that commonly occur in rotating machinery, such as rotor body defects, bearing defects, and gearbox defects. The shaft rotating speed was obtained by a laser speedometer. Acceleration signals were collected using the Dewetron 16 channels data acquisition system and IMI 608A11 accelerometers.
(a) The front view
(b) The side view
In the vibration testing experiments for roller bearing fault diagnosis, the simulator is composed of a motor, a coupling, a testing roller bearing fitted on the left of the shaft near the motor, a working roller bearing on the other side, a bearing load, and a shaft. The MFS provides a rolling bearing fault kit consisting of one normal, one inner race defect, one outer race defect, one with ball defect, and one combination of defects for performing experiments and studying bearing fault diagnosis. The acquisition frequency rate is 10 kHz. The sensors layout is depicted schematically in Figure 2(a) that a total of 8 sensors from to are used.
In the vibration testing experiments for gear fault diagnosis, the drive from the motor transmits to the gearbox through bearingrotor system and belt. The gearbox consists of a twostage parallel shaft with rolling bearings, helical gears, and a magnetic brake. The simplified diagram of gearbox transmission is shown in Figure 3, where is the testing gear. The MFS provides a gear fault kit consisting of one normal, one chipped tooth, and one missing tooth for performing experiments and studying gear fault diagnosis. The acquisition frequency rate is 20 kHz. The sensors layout is depicted schematically in Figure 2(b) that a total of 8 sensors from to are used.
In the vibration testing experiments for rotor crack fault diagnosis, the rotorbearing system is driven by the motor. In order to simulate the expanding of crack, crack faults were introduced to the test rotor by using the electrodischarge machining. The defect with crack width of 0.12 mm and crack depth of 3 mm represents slight defect, and that with crack width of 0.12 mm and crack depth of 5 mm represents serious defect. The acquisition frequency rate is 10 kHz. The sensors layout is depicted schematically in Figure 2(a) that a total of 4 sensors from to are used.
3.2. Fault Diagnostic Case of Gear
Vibration signals of gear with three fault models including normal, chipped tooth, and missing tooth are taken for analysis. A certain timedomain feature is calculated from eight sensors ( to ) to constitute an eightdimensional vector as a fault sample. One hundred and ten fault samples from each model, a total of three hundred and thirty samples, are used to constitute the fault sample sets. Sixty fault samples from each model, a total of one hundred and eighty samples, are selected randomly as training samples and the others are used as testing samples. Twelve timedomain statistics are analyzed one by one.
LibSVMmat2.9 is chosen for SVM calculation. LibSVM is developed by Lin ChihJen from Taiwan [19]. It is a simple and easytouse SVMs tool for classification. RBF kernel function is chosen as kernel function shown as follows: The crossvalidation combination with network search method is used to search the best parameters: the error penalty of SVM and of RFB. Oneagainstone multiclassification is chosen for pattern recognition. The diagnostic results of gear by using different timedomain features are listed in Table 2.

It can be found from Table 2 that the highest diagnostic accuracy is 93.33% by using the peak factor as feature to constitute fusional vector for gear fault diagnosis. Sensitivity of the features can be indicated by diagnostic accuracy when using the same classifier SVM, so, the peak factor is the most sensitive feature in the twelve timedomain features for identifying gear defect, followed by the amplitude square, root amplitude, mean, root mean square, standard deviation, and peak. The diagnostic accuracy is all above 80% by using these features. The skewness, kurtosis, waveform factor, and margin factor are less sensitive comparatively. The diagnostic accuracy is all under 70% by using these features.
It also can be found from Table 2 that the accuracy of normal testing samples is all above 90% by using any feature. During the analysis, we also found that the samples of defect with chipped tooth and defect with missing tooth are easy to be misclassified with each other, but defect samples are seldom mistakenly regarded as normal samples, so it can be deduced that normal and defect gear are always easy to distinguish.
In order to compare with single sensor for gear fault diagnosis, take eight features from a single sensor to constitute an eightdimensional vector as a fault sample. The eight features are the peak factor, amplitude square, root amplitude, mean, root mean square, standard deviation, peak, and pulse factor, which are the first eight sensitive features for identifying gear defect selected on the basis of the above analysis result. In order to avoid the orders of magnitude difference of different features, normalized eigenvector is processed before inputting SVM. In fact, during the proposed multisensors information analysis, the fault sample is constituted by the same feature from multisensors, so the orders of magnitude difference are nonexistent and normalized eigenvector is not needed. The sensors to are analyzed one by one. The diagnostic results of gear by using different single sensors are listed in Table 3.

Comparing with Tables 2 and 3, it can be found that there is higher diagnostic accuracy by using multisensors information fusion method than using single sensor method as a whole.
3.3. Fault Diagnostic Case of Rolling Bearing
Vibration signals of rolling bearing with four fault models including normal, inner race defect, outer race defect, and ball defect are taken for analysis. A certain timedomain feature is calculated from eight sensors ( to ) to constitute an eightdimensional vector as a fault sample. One hundred and ten fault samples from each model, a total of four hundred and forty samples, are used to constitute the fault sample sets. Fifty fault samples from each model, a total of two hundred samples, are selected randomly as training samples and the others are used as testing samples. Twelve timedomain statistics are analyzed one by one.
LibSVMmat2.9 is chosen for SVM calculation. Gaussian kernel function is chosen as kernel function. The crossvalidation combination with network search method is used to search the parameters and . Oneagainstone multiclassification is chosen for pattern recognition. The diagnostic results of rolling bearing by using different timedomain features are listed in Table 4.

It can be found from Table 4 that the mean, amplitude square, root mean square, root amplitude, and standard deviation are the first five sensitive features for identifying rolling bearing defect. The diagnostic accuracy is all 100% by using these features. Comparing with Tables 4 and 2, it can be found that there is a higher diagnostic accuracy for rolling bearing fault diagnosis than for gear fault diagnosis by using the proposed information fusion method as a whole. The main cause is that the way from the defect position of rolling bearing to the sensor installation position is shorter and simpler than the way from the defect position of gear.
In order to compare with single sensor for rolling bearing fault diagnosis, take eight features from a single sensor to constitute an eightdimensional vector as a fault sample. The eight features are the mean, amplitude square, root mean square, root amplitude, standard deviation, peak, kurtosis, and waveform factor, which are the first eight sensitive features for identifying rolling bearing defect selected on the basis of the above analysis result. In order to avoid the orders of magnitude difference of different features, normalized eigenvector is processed before inputting SVM. The sensors to are analyzed one by one. The diagnostic results of rolling bearing by using different single sensor are listed in Table 5.

Comparing with Tables 4 and 5, it can be found that there is higher diagnostic accuracy by using multisensors information fusion method than using single sensor method as a whole.
3.4. Fault Diagnostic Case of Rotor Crack
Vibration signals of rotor crack with three fault models including normal, crack depth of 3 mm, and crack depth of 5 mm are taken for analysis. A certain timedomain feature is calculated from four sensors ( to ) to constitute a fourdimensional vector as a fault sample. One hundred fault samples from each model, a total of three hundred samples, are used to constitute the fault sample sets. Fifty fault samples from each model, total of one hundred and fifty samples, are selected randomly as training samples and the others are used as testing samples. Twelve timedomain statistics are analyzed one by one.
LibSVMmat2.9 is chosen for SVM calculation. Gaussian kernel function is chosen as kernel function. The crossvalidation combination with network search method is used to search the parameters and . Oneagainstone multiclassification is chosen for pattern recognition. The diagnostic results of gear by using different timedomain features are listed in Table 6.

It can be found from Table 5 that the mean, amplitude square, root mean square, root amplitude, and standard deviation are the first five sensitive features for identifying rotor crack defect. The diagnostic accuracy is all 90% by using these features. The result is similar to fault diagnostic case of rolling bearing.
4. Conclusion
In this paper, a featurelevel information fusion methodology is proposed that all the features are calculated using vibration data in time domain to constitute fusional vector and the SVM is used for classification. Only a vibration testing system is needed for raw signal collected in this method, so the process is simpler. The effectiveness of the proposed methodology is tested with examples of gear, rolling bearing, and rotor crack fault diagnosis. Sensitivities of the twelve timedomain features are discussed in each case study. The analyzed results indicate that the peak factor is the most sensitive feature in the twelve timedomain features for identifying gear defect, but it is not very sensitive for identifying rolling bearing and rotor crack defect. The mean, amplitude square, root mean square, root amplitude, and standard deviation are all sensitive for identifying gear, rolling bearing, and rotor crack defect comparatively.
The features used and discussed in this paper are all in time domain; however, features in frequency domain also can be used for fault diagnosis of rotating machinery and the sensibilities of the features for identifying rolling bearing, gear, and rotor defect are also worth studying in the future.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This work is supported by the National Natural Science Foundation of China (51105138 and 51175169), the National High Technology Research and Development Program Items (2012AA041805), the Preresearch Project (813040302), the CEEUSRO special plan of Hunan province (2010XK6066), and the Aid Program for Science and Technology Innovative Research Team in Higher Educational Institutions of Hunan province.
References
 V. T. Tran and B.S. Yang, “An intelligent conditionbased maintenance platform for rotating machinery,” Expert Systems with Applications, vol. 39, no. 3, pp. 2977–2988, 2012. View at: Publisher Site  Google Scholar
 K. P. Kumar, K. V. N. S. Rao, K. R. Krishna, and B. Theja, “Neural network based vibration analysis with novelty in data detection for a large steam turbine,” Shock and Vibration, vol. 19, no. 1, pp. 25–35, 2012. View at: Publisher Site  Google Scholar
 L. L. Jiang, Y. L. Liu, X. J. Li, and S. Tang, “Using bispectral distribution as a feature for rotating machinery fault diagnosis,” Measurement, vol. 44, no. 7, pp. 1284–1292, 2011. View at: Publisher Site  Google Scholar
 C. Z. Han, H. Y. Zhu, and Z. S. Duan, MultiSource Information Fusion, Tsinghua University Press, Beijing, China, 2006.
 G. F. Bin, J. J. Gao, X. J. Li, and B. S. Dhillon, “Early fault diagnosis of rotating machinery based on wavelet packets—empirical mode decomposition feature extraction and neural network,” Mechanical Systems and Signal Processing, vol. 27, no. 1, pp. 696–711, 2012. View at: Publisher Site  Google Scholar
 T. H. Loutas, D. Roulias, E. Pauly, and V. Kostopoulos, “The combined use of vibration, acoustic emission and oil debris online monitoring towards a more effective condition monitoring of rotating machinery,” Mechanical Systems and Signal Processing, vol. 25, no. 4, pp. 1339–1352, 2011. View at: Publisher Site  Google Scholar
 G. Niu, T. Han, B.S. Yang, and A. C. C. Tan, “Multiagent decision fusion for motor fault diagnosis,” Mechanical Systems and Signal Processing, vol. 21, no. 3, pp. 1285–1299, 2007. View at: Publisher Site  Google Scholar
 S. G. Barad, P. V. Ramaiah, R. K. Giridhar, and G. Krishnaiah, “Neural network approach for a combined performance and mechanical health monitoring of a gas turbine engine,” Mechanical Systems and Signal Processing, vol. 27, no. 1, pp. 729–742, 2012. View at: Publisher Site  Google Scholar
 Q. Tan and Y.H. Xiang, “Application of weighted evidential theory and its information fusion method in fault diagnosis,” Journal of Vibration and Shock, vol. 27, no. 4, pp. 112–116, 2008. View at: Google Scholar
 Y.Y. Liu, Y.F. Ju, C.D. Duan, and X.F. Zhao, “Structure damage diagnosis using neural network and feature fusion,” Engineering Applications of Artificial Intelligence, vol. 24, no. 1, pp. 87–92, 2011. View at: Publisher Site  Google Scholar
 O. Basir and X. Yuan, “Engine fault diagnosis based on multisensor information fusion using DempsterShafer evidence theory,” Information Fusion, vol. 8, no. 4, pp. 379–386, 2007. View at: Publisher Site  Google Scholar
 G. Niu and B.S. Yang, “Intelligent condition monitoring and prognostics system based on datafusion strategy,” Expert Systems with Applications, vol. 37, no. 12, pp. 8831–8840, 2010. View at: Publisher Site  Google Scholar
 A. Ghasemloonia and S. Esmaeel Zadeh Khadem, “Gear tooth failure detection by the resonance demodulation technique and the instantaneous power spectrum method—a comparative study,” Shock and Vibration, vol. 18, no. 3, pp. 503–523, 2011. View at: Publisher Site  Google Scholar
 Z. S. Chen and Y. M. Yang, “Fault diagnostics of helicopter gearboxes based on multisensor mixtured hidden Markov models,” Journal of Vibration and Acoustics, Transactions of the ASME, vol. 134, no. 3, Article ID 031010, 2012. View at: Publisher Site  Google Scholar
 A. Widodo and B.S. Yang, “Support vector machine in machine condition monitoring and fault diagnosis,” Mechanical Systems and Signal Processing, vol. 21, no. 6, pp. 2560–2574, 2007. View at: Publisher Site  Google Scholar
 B.S. Yang, T. Han, and W.W. Hwang, “Fault diagnosis of rotating machinery based on multiclass support vector machines,” Journal of Mechanical Science and Technology, vol. 19, no. 3, pp. 846–859, 2005. View at: Google Scholar
 J. Y. Yang, Y. Y. Zhu, Y. S. Zhang, and Q. Wang, “Intelligent fault diagnosis of rolling element bearing based on SVMS and statistical characteristics,” in Proceedings of the ASME International Conference on Manufacturing Science and Engineering, pp. 525–536, Atlanta, Ga, USA, October 2007. View at: Google Scholar
 C.W. Hsu and C.J. Lin, “A comparison of methods for multiclass support vector machines,” IEEE Transactions on Neural Networks, vol. 13, no. 2, pp. 415–425, 2002. View at: Publisher Site  Google Scholar
 C. Hsu, C. C. Chang, and C. J. Lin, “A practical guide to support vector classification[EB/OL],” 2009, http://www.csie.ntu.edu.tw/~cjlin/. View at: Google Scholar
Copyright
Copyright © 2014 Lingli Jiang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.