Abstract

Vibration analysis is the most used technique for defect monitoring failures of industrial gearboxes. Detection and diagnosis of gear defects are thus crucial to avoid catastrophic failures. It is therefore important to detect early fault symptoms. This paper introduces signal processing methods based on approximate entropy (ApEn), sample entropy (SampEn), and Lempel-Ziv Complexity (LZC) for detection of gears defects. These methods are based on statistical measurements exploring the regularity of vibratory signals. Applied to gear signals, the parameter selection of ApEn, SampEn, and LZC calculation is first numerically investigated, and appropriate parameters are suggested. Finally, an experimental study is presented to investigate the effectiveness of these indicators and a comparative study with traditional time domain indicators is presented. The results demonstrate that ApEn, SampEn, and LZC provide alternative features for signal processing. A new methodology is presented combining both Kurtosis and LZC for early detection of faults. The results show that this proposed method may be used as an effective tool for early detection of gear faults.

1. Introduction

Gearboxes play an important role in industrial applications and unexpected failures often result in significant economic losses. Numerous papers considering gear condition monitoring through vibration measurements were published over the years. Compared to classical techniques such as statistical time indicators or Fast Fourier Transform, advanced signal processing techniques like time-frequency analysis (STFT, Wigner-Ville) [14] or wavelet transform [5, 6] have shown to be more efficient for gear defect detection. Baydar et al. [710] proposed various methods such as the instantaneous power spectrum, Wigner-Ville distribution, and the wavelet transform method for local tooth fault detection from vibration and acoustic signals. Yesilyurt [11] applied the spectrogram and scalogram approach for gearbox fault detection.

Various researches are focused on the understanding of dynamic gears behaviour. Parey and Tandon [12] give a review of the dynamic modelling of spur gears including defects. The effects of various nonlinearities, such as variable mesh stiffness, mesh damping, gear errors, effect of friction, and backlash, on gear dynamic behaviour have been discussed. Due to instantaneous variations of these effects, gear systems are often characterised by nonlinear behaviours.

Randall [13] classified the vibrations generated by gear meshing as being caused by deviations from the ideal tooth profile producing a vibration signal at the tooth-meshing frequency. These variations in tooth loading generate amplitude modulations and fluctuations in the rotational speed. Nonuniform tooth spacing produces frequency modulation effects and local tooth faults generate additional impulses. Imperfections (errors) in the gear during manufacture will also be a source of vibration. The instantaneous variations of the effects cited above may be considered as the first steps of degradation of the gear. At this stage, a modification of the characteristics of dynamics responses may be observed. However, in the early stages of fault development, main frequencies have low amplitudes and can be masked by other vibration sources or buried in background noise.

A large number of statistical metrics have been applied for monitoring gear defects, such as the root mean square, Crest Factor, Kurtosis, FM0, FM4, FM, M6, M6A, M6, NA4, NA, NB4, NB [1416]. Most of these measures characterize the statistical properties of vibration and give a global view to provide any recognition of localized phenomena. Stevens et al. [17] noted that these measures are suitable for detection and diagnosis when mechanical faults take the form of impulses which impose periodic pulses of short time duration. However, Feng et al. [18] noted that these parameters cannot give a perception into the gear dynamic system generating the vibration signals. Wu et al. [19] used the standard deviation for monitoring tooth cracks of gear, but it has been proven to be ineffective for tooth crack detection.

Therefore, it is interesting to use others techniques as nonlinear estimations of parameters which may be a good alternative to monitoring hidden faults in the measured signals. Yan and Gao [20] give a review of a large number of nonlinear parameter identification techniques. Wang et al. [21] researched the application of some nonlinearity analysis methods, including the pseudophase portrait, singular spectrum, and correlation dimension, for monitoring and diagnosis of rotating machinery. Jiang et al. [22] used the correlation dimension for monitoring gearbox. Feng et al. [18] investigated the irregularity of the gear signal using the regularisation dimensions.

As said before, the effects of various nonlinearities cause a variation in the dynamic behaviour of the gear. As the default develops, more contact pressure between two mating parts changes. The contact pressure variation results in amplitude and frequency modulations and hence more frequency will appear in the frequency domain with their harmonics and an increase of the noise can be observed. Therefore, the fault progress leads to complexity (randomness) changes because of a variation of the contact pressure between two mating parts. The complexity of a signal increases when more frequency components exist. On the other hand, when the defect increases, we observe more frequency components in the signal and we can use the complexity of the signal as a quantitative measurement to evaluate the severity of the defect. The complexity of the signal can be described by the approximate entropy (ApEn), the sample entropy (SampEn), and the Lempel-Ziv complexity (LZC). These parameters are nonlinear parameters which can be used to characterise the regularity of the signal.

The Lempel-Ziv complexity (LZC), sample entropy (SampEn), and approximate entropy (ApEn) thus present alternative tools for signal analysis involving nonlinear dynamics. These methods are becoming more popular and have found wide applications in various disciplines, especially in the field of biomedical engineering. ApEn has recently received more attention. Yan and Gao [20] investigated the application of ApEn for the health monitoring of rolling element bearings. He and Zhang [23] applied the ApEn for monitoring signal of the acoustic emission from defects in rolling element Bearings. Fu et al. [24] used the ApEn method to fault signal analysis in electric power system. Using ApEn, Xu et al. [25] detected the looseness of the bearing bushing in turbo generator. In all these work, ApEn is used as a nonlinear feature parameter for analysing the vibration signal and effectively identifying the conditions of the mechanical system.

On the other hand, Yan and Gao [26] investigated the application of Lempel-Ziv complexity (LZC) for the health monitoring of rolling element bearings. Wang et al. [27] compare and analyse quantitative diagnosis methods based on Lempel-Ziv complexity for bearing faults, using continuous wavelet transform (CWT), Empirical Mode Decomposition (EMD) method, and wavelet packet method for decomposition of vibration signal. Kedadouche et al. [28] combined LZC and EMD for early detection of gears cracks.

The use of SampEn was introduced by Dennis Wong et al. [29] which was the improvement of a previous method, namely, approximate entropy (ApEn). However, sample entropy was not often used in monitoring of rotating machine. Dennis Wong et al. [29] used sample entropy for classification of ball bearing faults. Zhang et al. [30] used multiscale entropy and adaptive neurofuzzy inference for bearing fault classification. The sample entropy was discussed and implemented in many areas such as physiological time series analysis [31], heart rate variability [32], and neural respiratory signal [33]. Sun et al. [34] proposed a health auxiliary method based on SampEn for lead-acid battery unit. Widodo et al. [35] proposed intelligent prognostics for battery health based on sample entropy (SampEn) features of discharge voltage. The SampEn feature can depict degradation condition of battery and represents the health condition of battery.

As illustrated above, LZC, SampEn, and ApEn are becoming more and more attractive in the field of detection and fault diagnosis. However, no work has been found to apply ApEn, LZC, or SampEn measurement for diagnosis of gear faults. Therefore, in this paper, ApEn, SampEn, and LZC are introduced to analyse vibration signals from gear and investigate their efficiency for the defect detection and severity evaluation of gears faults.

2. Theoretical Background

2.1. Approximate Entropy

Consider a time series , . Its “regularity” may be measured by ApEn in a multiple dimensional space so that series of vectors are constructed and expressed as follows:Each vector is composed of consecutive and discrete point data of the time series . The distance between two vectors and can be defined as the maximum difference between their respective corresponding elements: where , , and is the number of data points contained in the times series.

For each vector , a measurement that describes the similarity between the vectors and all other vectors can be defined as where The symbol represents a predetermined tolerance value. , where is a positive constant and is the standard deviation of the time series .

A finite time series consisting of data points is used to estimate the ApEn value of the time series, which is defined as

2.2. Sample Entropy

Sample entropy was developed because it has better representation of the entropy in the analysed signals as comparing with the original approximate entropy (ApEn) [31]. The motivation for this method is the classification of complex systems that includes both deterministic and stochastic features from time series with a limited number of data points [32]. Higher values of sample entropy describe more irregularities in the time series.

For a given total number of data points, we form the vectors as where is the length of sequences to be compared.

The distance between such vectors is defined as For to , let:where is the tolerance for accepting matrices and is the number of , for .

Similarly, we define another function: where is the number of , for .

The symbol represents a predetermined tolerance value, where is a positive constant and is the standard deviation of the time series: We can determine the expression of probability matching points: where represents the probability that two sequences will match for points, whereas is the probability that two sequences will match for points. The sample entropy is then defined as

2.3. Complexity Analysis

The complexity analysis is based on the Lempel-Ziv definition [36]. This approach transforms the analysed signal into a data sequence. To illustrate the procedure, consider a gear vibration signal with a known mean value. A new sequence () is reconstructed by comparing the value of each sample of the previous sequence within the mean value. If the value of the sample is larger, it is set to one , otherwise to zero (0). Therefore, only two binary symbols are present in the new data sequence. This is subsequently scanned from its first sample to its end. When a subsequence that is not encountered in the previous scanning process is discovered, the complexity value is increased by one . Thus, the Lempel-Ziv complexity reflects the number of all different subsequences contained in the original sequence. Figure 1 (reproduced from [26]) described the algorithm.

For generality sake, normalized complexity is often used to obtain a measure independent of the sequence length:

2.4. Parameters Selection of ApEn, SampEn, and LZC for Gear Signals

From the above described algorithm of ApEn and SampEn, it can be seen that the calculated ApEn and SampEn values depend on two parameters which are the embedded dimension and the tolerance .

However, no guidelines exist for optimizing their values. In order to simulate the vibratory signals of gearbox, a gear multiplicative model whose meshing is modulated in amplitude has been used. The gear model as defined in [37] is used: where , , and represent the meshing period and the rotational periods. , , and represent the meshing signal and its modulation.

Table 1 gives the parameters simulation of the model described in (14).

Figures 2 and 3 represent the simulated signal and its spectrum, respectively.

For a given dimension , the key step of the ApEn algorithm is to calculate the distance between two vectors and to count the number of distances less than (where standard deviation of the time series), as discussed in [20].

The selections of and are based on a convergence analysis [20, 2326]. The choice of and depends on the nature of the time series treated. The best parameters to be chosen are those who present little fluctuation or on the other hand a good reproducibility.

From Figure 4, it can be seen that when , ApEn has the poorest convergence property (a big fluctuation). With increasing (more than 2), the convergence property of ApEn becomes better for greater than 0.4.

Like ApEn, sample entropy (SampEn) depends on the two parameters and . Therefore, the determination of these two parameters is critical. The same procedure was undertaken for the selection of and . The same characteristic was obtained as ApEn. From Figure 5 we can see that, for , the SampEn presents a good convergence when is greater than 0.5.

However, a larger will lead to much higher computational cost. Actually, when , the convergence property of ApEn and SampEn is already good enough. Of course, the value should also be a compromise to avoid losing much detailed system information.

By this investigation, and value = 0.5 times the standard deviation are selected for ApEn and SampEn calculation of the vibration signal in this paper.

The relationship between the ApEn, SampEn, and LZC values and the data length are illustrated in Figure 6. In Figures 7 and 8, seven simulated signals are comparatively displayed under sampling rates of 4, 8, 12, 16, 20, and 24 kHz, respectively. It is seen, in three cases (ApEn, SampEn, and LZC), that when the data length is greater than 5000 points, the variation of ApEn, SampEn, and LZC with respect to each frequency sampling rate becomes insignificant.

However, the sampling frequency and the data length are linked. So we must take consideration of the component defining the signal. The sample frequency must be more than 2 times the highest frequency presented in the signal, for Shannon respect. In our case, the sample frequency must be 2 times greater than the third meshing frequency (900 Hz). The simulation was conducted with greater than 4 kHz which is sufficient enough. On the other hand, for a better detection of the modulations, we must choose a good resolution frequency which is defined by . For the simulated signal, the modulations are about  Hz. So, a frequency resolution of 1 Hz is enough. This means that the minimum length of data to be analysed is equal to the sample frequency (4000 Hz).

Figure 9 shows a comparison of computing time between LZC, ApEn, and SampEn.

It is clear that the LZC is the best method for time computing. The sample entropy algorithm is simpler than the ApEn algorithm, requiring approximately one-half time to calculate.

2.5. Influence of Noise

It is well known that a white noise contains most abundant frequency components compared with other kinds of signal. If a signal is contaminated by a white noise, calculated ApEn, SampEn, and LZC values will also be. Using the simulated signal, the ApEn, SampEn, and LZC values corresponding to different SNRs are calculated, as listed in Table 2.

It may be noticed that the ApEn, SampEn, and LZC values increase as the SNR decreases, which corresponds to a degradation of the data quality. As discussed in [20], the working condition of machine system deteriorates due to the initiation and/or propagation of the defects. The number of frequency components contained in the signal will increase and the SNR decreases, resulting in a decrease in its regularity. This allows a decrease of its ApEn, SampEn, and LZC values. The simulation results listed in Table 2 confirm that the ApEn, SampEn, and LZC values provide a quantitative measurement for characterising a dynamic signal which can be represented by the deterioration of a machine’s health condition.

From Table 2, it can be seen that ApEn and SampEn are very sensitive to large noises; however, ApEn and SampEn are nearly unaffected by the noise which stay low. Otherwise, we can say that the LZC is more sensitive to noise than ApEn and SampEn.

3. Experimental Study

The recordings of vibration signal were carried out at CETIM, France, on a gear system with a train of gearing, with a ratio of 20/21 functioning continuously until its destruction. The sample frequency is equal to 20 kHz. Table 3 gives the details of the gear test rig parameters. The test duration was 13 days with a daily mechanical appraisal; measurements were collected every 24 h except on the first day. Table 4 gives a description of the state of the gear at each 24 h. Figure 10 presents some picture of the chipping observed in days 11 and 12. The acceleration signals for days 2, 5, 7, 9, 10, and 12 are shown in Figure 17.

Pareya et al. [38] use the same signal for their own researches. In time domain, a multitude of indicators exists, more or less efficient for the observation of certain failures. The most commonly used indicators are scalars from time analysis described by the equations below: The RMS, Peak, Kurtosis, Crest Factor, Factor, and Skewness of the experimental signal were considered and calculated from day 2 to day 13 (on day 1 no signal was taken).

The evolutions of these descriptors are displayed in Figures 1116.

It can be seen that the all indicators increase greatly after day 11. After day 11 the signal becomes impulsive. The Crest Factor (CF) observes a little growth after day 9 (17% of evolution compared to day 9).

The same observations were observed for the Peak values (26%), the Factor (35%), and the Skewness (20%). This is due to the evolution of the chipping of the teeth 15/16 observed in day 9. A visual chipping was observed in day 7. The time indicators are only able to detect the deterioration after day 9.

Consequently, the classical time domains are ineffective for early detection of gear defects. These measures are appropriate for detection and diagnosis when mechanical faults are in advanced state of the degradation. As discussed in [13], at an early stage of gear degradation, the wear is preceded by a variation of different phenomena like variations in tooth loading, nonuniform tooth spacing, elastic deformation of the gear teeth, and so forth. The classical indicators cannot track these variations. For this reason, it is interesting to investigate other methods which can give a perception into the gear dynamic behaviour caused by the effects cited above.

Therefore, ApEn, SampEn, and LZC of the experimental signal were computed from day 2 to day 13 (on day 1, no signal was taken). The values of ApEn and SampEn are plotted in Figure 18. The ApEn method does not present a significant increase except at day 12. So, the characteristic ApEn seems to have equivalent performance as the classic Kurtosis. After day 11, the ApEn decreases significantly. This is due to the fact that the signal becomes impulsive and the SNR becomes higher as explained in Section 2. The SampEn presents the same evolution as ApEn, but it presents a higher value after day 11 than ApEn. On the other hand, as explained and exhibited in Figure 9, the SampEn algorithm requires approximately one-half time to calculate than ApEn. So, it is preferable to use SampEn than ApEn. The LZC for all signals is plotted in Figure 19. The LZC shows a significant increase after day 4, revealing a clear anomaly due to gear wears. Table 4 showed that the chipping was observed clearly on day 7. That means that the beginning of the processes of the degradation was inevitably initiated before day 7. Figure 20 compares the spectrum evolution between days 4 and 5. An increase of the amplitude at day 5 may be clearly noticed as compared with day 4. Effectively, this day presents the beginning of the degradation due to wear. According to LZC measurements, the beginning of the chipping was in fact initiated at day 5. This initiation is characterised by the growth of the frequency components related to the meshing. Consequently, these results show that LZC measurement is better than approximate entropy and sample entropy for early detection of the gear defects, in this history case.

Figure 21 presents both Kurtosis and LZC of all days. Kurtosis is reported in the abscised and LZC on . It can be seen that this plot divides the twelve days into three regions. The first region contains the three first days (day 2, day 3, and day 4). The second encloses all days from 6 to 11. The second region is separated from the first by day 5. Day 5 presents the change in the characteristics of the gear signal.

The last region is marked by an increase of Kurtosis and LZC stays in the same level as the second region. At this stage, the signal becomes impulsive and the gear is damaged.

Using this representation, we can easily distinguish the different steps of the process of the degradation. This representation gives both information on the impulsiveness of the signal and on the effect of the number of frequency components and noise into the signal. Consequently, this representation combining Kurtosis and LZC may be used as an efficient tool for early detection of faults.

4. Conclusion

This paper introduces ApEn, SampEn, and LZC metrics to analyse the vibration signal recorded from defected gears. With respect to gear signals, the parameter selection of ApEn and SampEn was investigated and the results show that and value = 0.5 times the standard deviation were suitable and a good compromise for the ApEn and SampEn calculation. In addition, the influence of white noise on ApEn, SampEn, and LZC calculation was also investigated. The results show that ApEn and SampEn are nearly unaffected by the noise when staying at a small level. However, ApEn and SampEn are very sensitive to the noise at high levels. LZC is more sensitive to noise as compared with ApEn. An experimental study was conducted to evaluate the effectiveness of these parameters. The results show that sample entropy is better than approximate entropy. However, LZC can detect the defect of the gears earlier than SampEn and ApEn. The classical time indicators seem to be ineffective for early detection compared to LZC. The representation in a plan (Kurtosis, LZC) is proposed as a new tool for effectively monitoring gear defects. Although experimental results look promising, the proposed vibration methodology has to be tested on the other test rig also. This research is thus being continued to analyze vibration signals from different defect types and on different types of gears, to systematically validate the efficiency of this technique.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The financial support of NSERC (Natural Sciences and Engineering Research Council of Canada), FQRNT (Fonds Québecois de la Recherche sur la Nature et les Technologies), MITACS Canada, Pratt & Whitney Canada, and CETIM which provided the experimental results are gratefully acknowledged.