#### Abstract

This paper presents an improved gearbox fault diagnosis approach by integrating complementary ensemble empirical mode decomposition (CEEMD) with permutation entropy (PE). The presented approach identifies faults appearing in a gearbox system based on PE values calculated from selected intrinsic mode functions (IMFs) of vibration signals decomposed by CEEMD. Specifically, CEEMD is first used to decompose vibration signals characterizing various defect severities into a series of IMFs. Then, filtered vibration signals are obtained from appropriate selection of IMFs, and correlation coefficients between the filtered signal and each IMF are used as the basis for useful IMFs selection. Subsequently, PE values of those selected IMFs are utilized as input features to a support vector machine (SVM) classifier for characterizing the defect severity of a gearbox. Case study conducted on a gearbox system indicates the effectiveness of the proposed approach for identifying the gearbox faults.

#### 1. Introduction

Gears can be considered as significant subassembly in machines for power or rotation transmission from one shaft to another. Their fault may cause unexpected breakdown of the machine systems and lead to significant economic loss or even personnel casualties [1, 2]. Since structural defect-caused vibration signals often reflect changes of the dynamic characteristics related to the gearbox, many researches focus on transient feature extraction of the vibration signal and fault recognition of the defective gearboxes using vibration signal analysis [3]. Nonetheless, a number of factors related to structural transformation, friction, velocity shear, and strike affect the vibration-oriented signal study and reduce the effectiveness of defective diagnosis. Consequently, a number of conventional linear approaches might not operate well in detection of dynamic changes [4, 5].

Aiming at avoiding restrictions of conventional techniques, permutation entropy (PE) is used to characterize vibration signals for the purpose of fault diagnosis. The PE only uses the order of entropy for signal characterization and can overcome nonlinear distortion which existed in the signal. It has been applied in various applications. For instance, permutation entropy is proved to offer an efficient evaluation to monitor rolling bearings [5]. By integrating support vector machine (SVM) with multiscale PE, the operating condition of rolling bearing can be identified [6, 7]. Another study combined optimized SVM, ensemble empirical mode decomposition, and PE to detect and classify motor bearing faults [8]. The effectiveness of the PE has also been proved in detecting dynamic changes in rotating machines when comparing with that of other features, like Lyapunov exponent and fractal dimensions [9]. Furthermore, background noise which existed in real world applications always disturbs the result of the fault diagnosis. Therefore, performing noise reduction in the vibration signals is necessary before the PE method is executed.

Empirical mode decomposition (EMD), as an approach of adaptive signal treatment in the field of time frequency analysis, can decompose a signal into sets of intrinsic mode functions (IMFs) based on its features [10, 11]. The approach of EMD could be applied in pretreatment of the signals of vibration. For instance, a novel approach for extracting fault feature with combined AR model and EMD algorithm has been applied in processing bearing vibration signals [5]. Nonetheless, when the EMD is used to process a signal with intermittent components, the signal could not be fully decomposed because of the existence of mode mixture phenomenon [12]. Further development with ensemble empirical mode decomposition (EEMD) was proposed by Huang et al. for EMD performance improvement. EEMD becomes more precise and efficient for decomposition of signals in comparison with the original EMD by adding noise to the original signal and continuously calculating the IMF means [12]. Though the approach of EEMD has efficiently solved the issue of mode-mixing, it takes lots of time to implement the large amount of ensemble mean. In other words, the efficiency of algorithm will be decreased. In order to resolve this issue, the complementary approach of EEMD (CEEMD) has been put forward [13]. Through complementary integration of IMFs and both positive and negative added white noises in the CEEMD, residual of the noises could be extracted out from the combination of white noises and data. The approach of CEEMD demonstrates similar effectiveness to that of the EEMD with improved computational efficiency.

By making full use of characteristics of the PE and CEEMD, this paper proposes a hybrid approach to diagnose gearbox faults. The CEEMD is utilized as the preprocessing to filter signals and extract IMFs that are closely associated with the filtered signal. Subsequently, PE value of each chosen IMF would be calculated. The PE value of the chosen IMFs is utilized as the feature vector to a classifier in which the support vector machine (SVM) is applied for identifying gearbox defect. The remaining parts of this paper are arranged as follows. Overview of the gearbox fault diagnosis approach is shown in Section 2. Experimental verification is conducted on automobile transmission gearbox system in Section 3. Last but not least, the last section presents the summary and comment.

#### 2. Theoretical Framework

##### 2.1. Complementary Ensemble Empirical Mode Decomposition

CEEMD is developed based upon EEMD. Originally, the EMD approach deals with a given signal into the form presented in (1) through recursive elimination of the mean of the lower and upper envelope related to the maximum and minimum of the signal [14]:where refers to the number of IMFs, refers to the component of IMF which covers a certain frequency band, and refers to the mean trend of the signal residue. The EMD can be considered as adaptive local analysis approach for processing both nonlinear and nonstationary signals. However, the decomposition of EMD would generally undergo mixture of modes, which is defined as either a single IMF covering widely disparate scales or a signal existing in different IMF components.

Later, Huang et al. have proposed a noise-guided statistical approach to resolve the mode mixture issue, which is the ensemble empirical mode decomposition. However, the effect of the additional noise could only be restricted by a large amount of ensemble mean computation, causing high computational load.

Complementary ensemble mode decomposition, as an improved and noise enhanced data analysis approach, has been developed for reducing computational burden [13]. The procedure of CEEMD for the signal is illustrated in the following steps.

*Step 1. *A pair of white Gaussian noises with the same amplitude is added to . Thus, two signals, and , are generated.

*Step 2. *Decompose and by EMD for a number of times; then referring to ensemble means of IMF from and referring to those from are obtained.

*Step 3. *The final IMF which is the ensemble of and is calculated as the decomposition results of CEEMD as follows:

Specifically, a simulated signal composed of , , and has been adopted as an instance. is a Gaussian impulse interference signal, is a cosine signal with the frequency of 500 Hz, and is a trend term. Figure 1 illustrates the waveform of the simulated signal and Figure 2 illustrates the decomposed results by CEEMD.

Through comparing the result in Figure 2 with the signal waveforms in Figure 1, it is shown that there is no mode mixture. That is to say, CEEMD is more suitable for the study of signal.

##### 2.2. Permutation Entropy

PE is a nonlinear dynamic parameter that characterizes a signal’s complexity. Based on the principle of Takens-Maine, the phase space of time series , can be expressed as where refers to the embedded dimension, while refers to the delay of time. Furthermore, sample points of data contained in every could be sorted in an incremental order as

If , the original positions could be classified as , . Thus, vector could be shown in a set of symbols as [15, 16] where and . refers to symbol permutation which has been shown in number symbols . If are applied in denoting the possibility distribution of each symbol sequence and , the permutation entropy of for the time series of , could be considered as the entropy of Shannon for symbol sequence as follows:

If all the symbol sequences appear with the same possibility distribution as , the maximum value of could be described as . Thus, the permutation entropy of order can be standardized as

value shows the randomness level of the time series. A large value of indicates high randomness of the time series. On the contrary, a small value of means the time series has more regular characteristics.

To demonstrate the validity of the PE algorithm, sample vibration signals of a gearbox under three different conditions are shown in Figure 3, and the corresponding single factor analysis result is shown in Figure 4. Figure 4 shows that defect severity of the gearbox could be efficiently recognized by the value of PE.

**(a)**

**(b)**

**(c)**

##### 2.3. Fault Diagnosis Based on CEEMD and Permutation Entropy

In this study, a gearbox fault diagnosis method has been developed using the CEEMD and PE, and Figure 5 shows the flow chart of the method. Particularly, the procedure to implement the proposed fault diagnosis method is as follows.

*Step 1. *The sampled vibration signal measured on gearbox is decomposed using CEEMD.

*Step 2. *The product is calculated using (8), and the parameter is calculated by (9). The signal is filtered through comparison of the proposed threshold value and the parameter [17]. In other words, when , it can be assured that of the th IMF can be enhanced for a number of times in comparison with the mean value of which can be calculated based on the former IMFs. Thus, the previous IMFs with the term of trend can be eliminated as noise and the residue IMFs can be considered as filtered signal: where refers to the th IMF’s energy density, refers to the mean period of the th IMF, represents the length of each IMF, denotes the th IMF’s amplitude, and refers to the overall number of extreme points in the th IMF.

*Step 3. *The correlation coefficients between each IMF and filtered signal are calculated by (10). IMFs closely associated with the filtered signal are chosen to calculate the PE value [18]:

*Step 4. *The PE values of all the chosen IMFs are calculated to generate a feature vector which can be utilized to train the SVM for identification of gearbox operating condition.

*Step 5. *The PE feature vector from test gearbox vibration signal is extracted and utilized as input to the well-trained SVMs. In this way, the result of classification can be realized [19, 20].

#### 3. Experimental Evaluation

A series of gearbox fault signals acquired from LC5T81 type transmission were used to verify the effectiveness of the presented approach. The data was measured from the testbed presented in Figure 6. One backward speed and five forward speeds could be load on the tested gearbox. The vibration signals were collected at 3000 samples per second using the accelerometer fixed on the gearbox case. The tested gearbox is operated with the third speed of 1600 rpm and the meshing frequency of 500 Hz.

**(a) Structure of the gearbox**

**(b) The gearbox setup**

The waveforms of the vibration signals collected from the test gearbox under three conditions are shown in Figure 7. Figure 7(a) shows the signal under the normal condition, Figure 7(b) shows the signal under the light fault condition, and Figure 7(c) shows the signal from the severe fault condition.

**(a) Normal condition**

**(b) Light fault condition**

**(c) Severe fault condition**

Figure 8 illustrates the decomposed IMFs of these signals and Table 1 shows the correlation coefficients between the filtered signal and each of the IMFs.

**(a) Normal condition**

**(b) Light fault condition**

**(c) Severe fault condition**

It can be seen from the table that correlation coefficients for the first 5 IMFs are all more than 0.1. They can describe the main features of the signal and thus are selected for further analysis. According to the main steps of the presented fault diagnosis approach, the permutation entropy values of these IMFs are calculated, as listed in Table 2.

In the experiment, 120 feature vectors in total were gained from three different circumstances. 50% of the feature vectors were applied into classifier training, while the rest of them were used in classification of fault. Table 3 shows the results of classification. It shows that various working conditions can be efficiently identified. Among all the 60 groups of feature vectors, 57 groups have been classified correctly, while 3 groups have failed. The overall classification accuracy is up to 95%.

For purpose of comparison, the values of approximate entropy (ApEn) from the chosen IMFs are also calculated and applied in the SVM classifier. Table 4 shows the classification results. It can be summarized that the method is actually efficient for differentiating the gearbox faults. Furthermore, the effectiveness of the approach is compared with that of the EEMD-PE approach. It can be seen that the rates of classification in these two approaches are very similar. However, computational load of the developed approach is lower than that of the EEMD-PE approach.

To further study the effectiveness of the developed approach, a 10 × 10-fold cross validation procedure is employed with the selected 120 samples. The average classification rate of the 10 × 10-fold cross validation is 94.82%. The result is close to the classification result in Table 3.

#### 4. Conclusions

This study develops an integrated approach by combining PE algorithm with CEEMD to diagnose gearbox faults. With the CEEMD, gearbox vibration signals can be decomposed into sets of IMFs with low computational load. Then PE method can efficiently extract fault characteristic from the selected IMFs. Without mathematical model and the study of the fault mechanism of the system, this developed approach can directly recognize gearbox fault severity. Furthermore, the CEEMD, as a preprocessing step, can be utilized to purify the signal for PE calculation, leading to increased classification rate (e.g., 95% for experimental data). It is envisioned that the approach developed in this study could be used in a wide range of applications in the field of fault diagnosis.

#### Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

#### Acknowledgments

This paper is supported by the Nature Science Foundation of Jiangsu Province of China (no. BK2012739) and the National Natural Science Foundation of China (no. 61101163 and no. 51175080).