Abstract

The fault signals of planetary gears are nonstationary and nonlinear signals. It is difficult to extract weak fault features under strong background noise. This paper adopts a new filtering method, fractional Wavelet transform (FRWT). Compared with the traditional fractional Fourier transform (FRFT), it can improve the effect of noise reduction. This paper adopts a planetary gear fault diagnosis method combining fractional wavelet transform (FRWT) and two-dimensional convolutional neural network (2D-CNN). Firstly, several intrinsic mode component functions (IMFs) are obtained from the original vibration signal by AFSA-VMD decomposition, and the two components with the largest correlation coefficient are selected for signal reconstruction. Then, the reconstructed signal is filtered in fractional wavelet domain. By analyzing the wavelet energy entropy of the filtered signal, a two-dimensional normalized energy characteristic matrix is constructed and the two-dimensional features are input into the two-dimensional convolution neural network model for training. The simulation results show that the training effect of this method is better than that of FRFT-2D-CNN. Through the verification of the test set, we can know that the fault diagnosis of planetary gears can be realized accurately based on FRWT and 2D-CNN.

1. Introduction

As an important part of rotating machinery and equipment, planetary gears usually operate in a high-speed and high-power environment. They are widely used in aircraft manufacturing, coal mining machinery, wind power generation, ship manufacturing, and other industries. It is very easy to appear in the long-term operation process, smooth vibration phenomenon. Since the 1980s, many serious accidents have been caused by the fault of rotating equipment around the world, causing huge economic losses. About 80% of the faults occurred on the planetary gears [1]. Therefore, how to accurately diagnose the fault of planetary gears has important research significance.

At present, many achievements have been made in the research on fault diagnosis of planetary gears. Yu Jun and others proposed a planetary gear fault identification method that combines a stacked denoising autoencoder (SDAE) and a gated recurrent unit neural network (GRUNN) to solve the problem of low planetary gear fault recognition rate [1]. Gao Hongying and others proposed a planetary gear fault identification method combining complementary set empirical mode decomposition (CEEMD) and chaotic particle swarm kernel extreme learning machine (CPSO-ELM), which reduces the influence of external disturbances on planetary gear fault diagnosis [2]. Wang Zhenya and others proposed a fault diagnosis method based on optimized variational modal decomposition and multidomain manifold learning of the salvia group, which solved the problem of difficult feature extraction and identification of planetary gears [3]. Li Haiping proposed an intelligent diagnosis method combining Fast Fourier Transform (FFT) and Deep Confidence Network (DBN) to improve the accuracy of planetary gear fault diagnosis [4]. Li Yuheng proposed a fault diagnosis method that combines the ensemble empirical mode (EEMD) and the symmetrical differential energy operator to achieve accurate diagnosis of planetary gears and accurately obtain the fault characteristic frequency value of planetary gears [5]. Zhang et al. proposed a fault diagnosis method based on time-frequency characteristics and PSO-SVM, and verified that the method can quickly and accurately identify the fault type of planetary gears from nonstationary signals [6]. Wang et al. proposed a gear fault diagnosis method based on multicriteria fault feature selection and heterogeneous integrated learning classification, which improved the accuracy and robustness of diagnosis [7]. Aiming at a kind of multimode process with hidden degenerate faults, a fault prediction algorithm based on the combination of multi-PCA model and fault reconstruction technology is proposed, which can well solve the fault prediction problem of multimode process data [8].

In order to realize the planetary gear fault diagnosis under strong background noise, this paper adopts the planetary gear fault diagnosis method combining fractional wavelet transform and two-dimensional convolutional neural network. Firstly, the planetary gear fault signal is denoised by fractional wavelet transform. Secondly, use wavelet packet to extract the one-dimensional normalized energy value of the filtered signal, and convert the obtained one-dimensional energy value into a two-dimensional energy feature map. Finally, use a two-dimensional convolutional neural network to establish a fault diagnosis model to achieve accurate identification of different faults under different working conditions.

2. The Theoretical Basis of Fractional Wavelet Transform

2.1. Discrete Wavelet Transform

In signal processing, the continuous wavelet is discretized. After the discretization, the continuous wavelet and its corresponding wavelet transform become the discrete wavelet transform. The discrete wavelet transform [8] is the second of the displacement and scale of the continuous wavelet transform. The power is discretized, which is essentially binary wavelet transform. In order to reduce the complexity of wavelet coefficients, the wavelet coefficients are taken at some discrete points, and the scale is discretized first. In order to reduce the wavelet transform coefficients of the remainder, we set the wavelet system. In order to reduce the wavelet transform coefficients of the remainder, we limit the values of and of the wavelet coefficient to some discrete points and first discretize the scale, that is, let . At this time, the corresponding wavelet function is , where , is a uniform discrete value. At scale , since the width of is times , the interval can be used to expand without changing . At this time, is changed to

The discrete wavelet transform is

The continuous wavelet transform at is called discrete binary wavelet transform, and its expression is

2.2. Discrete Fractional Fourier Transform

The fractional Fourier transform is is called the kernel function of FRFT, is integers.

According to the definition given by formula (4), the formula of Ozaktas sampling fractional Fourier transform can be obtained as

In formula (5),

When the order , formula (6) is decomposed into the calculation process of the following formulas:

Here, and are just two intermediate results , . Discretize equations (7)–(9) to obtain the numerical calculation method of discrete fractional Fourier transform [9].

2.3. Fractional Wavelet Transform

The scale factors , (where ) and the time shift factor in the continuous fractional wavelet transform expression are discretized and sampled in the displacement domain, and the value corresponding to the sampling point can be expressed by the discrete fractional wavelet transform formula.

Discretize the scale factors , to get

When , the expression of discrete fractional wavelet transform is

The reconstruction of the fractional wavelet transform is the inverse process of the decomposition process of the fractional wavelet. In the known -th layer, the fractional wavelet coefficients are and , and the original signal is through the reconstruction. is the multiresolution analysis, which can be seen from the relationship between , , and the function projection:

Hence,

Equation (13) is the reconstruction process of traditional discrete wavelet coefficients. Firstly, the fractional coefficients and of the layers are modulated, and then one-dimensional wavelet inverse transformation is performed in the wavelet domain to obtain , and then is modulated into the fractional wavelet domain to obtain , and so on, to restore the original signal step by step.

2.4. Realization Process of Fractional Wavelet Transform

With a one-dimensional signal , using the definition of fractional wavelet transform proposed by , the realization process of one-dimensional fractional wavelet transform can be obtained as follows:(1)Input one-dimensional signal (2)Select the appropriate fractional order change range , and use the minimum output energy to search for the best transformation order(3)Perform -order fractional Fourier transform on the input signal to obtain a signal in the fractional domain(4)Perform wavelet decomposition on the obtained signal in the fractional domain to obtain a signal in the fractional wavelet domain(5)Perform wavelet reconstruction on the signal in the fractional wavelet domain to recover the signal in the fractional domain(6)Perform -order fractional Fourier transform on the signal in the fractional domain to recover the filtered signal [10, 11]

The realization process of fractional wavelet transform is shown in Figure 1 [8].

3. Wavelet Energy and Two-Dimensional Convolutional Neural Network

3.1. Feature Extraction Process Based on Wavelet Energy

The main steps are as follows:(1)The signal is decomposed by -layer wavelet packet, the -th layer has frequency band signals, and then features of the -th layer are extracted.(2)In order to improve the denoising ability of the signal, select the low-frequency coefficients and high-frequency coefficients of each frequency band decomposed in (1) to reconstruct the signal, denoted as .(3)Solve the energy of each signal, and the calculation formula for the energy value of each frequency band is as follows:Here, is the decomposition coefficient of the wavelet packet, is the energy value of the -th node in the -th layer after the signal undergoes wavelet decomposition, , and is the wavelet packet reconstruction coefficient of [12].(4)Construct feature vector.

In the process of wavelet decomposition, the energy of each layer is equal to the total energy, and the total energy of the signal is

The wavelet packet energy of each frequency band is

The wavelet packet energy feature vector is

3.2. Structure of a Two-Dimensional Convolutional Neural Network

The current typical two-dimensional convolutional neural network structure is composed of input layer, convolution layer, pooling layer, fully connected layer, and output layer. The network structure of is shown in Figure 2. The input of the convolutional neural network is mainly in the form of a two-dimensional grayscale image or a color image. Its output layer uses the classifier to output the classification and recognition results of a two-dimensional grayscale image or a color image. In other image processing fields such as target detection, other forms of network output layers need to be set up [13, 14].

The convolutional layer is composed of multiple convolutional neurons. The parameters of the convolutional neuron are obtained by using the backpropagation algorithm. The convolutional layer is a key part of the entire convolutional neural network, which is mainly used for input data to extract different features [15]; the process of convolution operation is composed of continuous convolution and discrete convolution.

The process of discrete convolution operation is as follows:

When the image convolution operation is performed, it is the operation between the image pixels. The pixels of the image can be understood as a matrix, and the pixels are not continuous. The process of the convolution operation is the selected convolution kernel and the image. Input for convolution operation: Assuming that the two-dimensional image input is and the two-dimensional convolution kernel is , the image convolution operation process can be expressed as

Convolution operation is alternating, so

Here, is the size of the convolution kernel. After the feature is extracted by the convolution operation, the offset operation needs to be performed after the convolution operation. The calculation formula is as follows:

Here, is the first feature map output by the first layer; is the activation function used by the convolutional layer; is the convolution matrix used by the convolution kernel; and is the offset of the convolution operation.

The pooling layer is also commonly referred to as the downsampling layer. The pooling layer can reduce the training time of the model, improve the robustness of feature extraction, and avoid overfitting of the model. There are usually three ways of pooling: average pooling process, maximum pooling process, and random pooling process. In actual applications, the pooling process is dominated by maximum pooling.

Maximum pooling calculation formula is

In the actual application process, the classifier needs to be trained in the fully connected layer. The commonly used classifier is the classifier. The fully connected process is shown in the following formula:

In formula (23), is the output of the fully connected layer; is the weight value; is the input of the fully connected layer; is the bias term; is the classification function; is the network layer number.

In image classification, is generally used as the classifier. If there are classifications, the output of can be expressed as

3.3. Procedure

In order to accurately classify planetary gear faults in a complex actual industrial environment, this paper proposes a planetary gear fault diagnosis method based on FRWT and 2D-CNN. A flowchart can be drawn as shown in Figure 3.

The specific steps are as follows:(1)Use fractional wavelet transform to separately denoise the gear fault signals(2)Use entropy to extract energy from the signal after noise reduction and calculate the normalized energy value(3)Convert the obtained wavelet energy value into a two-dimensional matrix feature sample set(4)Initialize the two-dimensional convolutional neural network and use the sample set to extract the characteristics of the signal(5)Train and establish a two-dimensional convolutional neural network model to identify planetary gear faults

4. Experimental Verification

4.1. Introduction to the Experimental Sample Set

The planetary gear fault experiment data used in this article is collected by the QPZZ-II mechanical fault simulation and test platform produced by Jiangsu Qianpeng Diagnostic Engineering Co., Ltd. The test platform includes drive motors, planetary gears, rolling bearings, speed control equipment, etc. The fault components used in the entire test platform include planetary gear pitting fault, broken tooth fault, wear fault, and other faults. The sampling frequency of the experimental data is 5120 Hz. A total of 10 fault samples under fault states were selected, 1700 samples were selected for each fault, and the number of sample points for each sample was 512. Among them, the ratio of the number of training samples to the number of test samples is 10 : 7, and the specific sample classification is shown in Table 1.

4.2. FRWT Filter Analysis
4.2.1. Determination of the Optimal Order of FRWT

First, the minimum output energy is used as the objective function to optimize the optimal order. The order optimization process of pitting fault reconstruction signal, broken tooth fault reconstruction signal, and wear fault reconstruction signal is shown in Figure 4. It can be clearly seen from Figure 4 that the minimum value of the FRFT output energy of the pitting fault (Dianshi880-1) reconstructed signal is 11670, and the corresponding order is 1.57; that is, the best order is 1.57; in the broken tooth fault (Duanchi1500) the minimum value of the FRFT output energy of the reconstructed signal is 20020, and the corresponding order is 1.646; that is, the best order is 1.646; the minimum value of the FRFT output energy of the reconstructed signal FRFT for wear fault (Mosun880-1) is 50360. At this time, the corresponding order is 1.558; that is, the best order is 1.558. The output energy value and the corresponding optimal order of the remaining faults are shown in Table 2.

4.2.2. Determination of the Number of FRWT Wavelet Bases and Decomposition Layers

In the fractional wavelet transform, when the selected wavelet base and the number of decomposition layers are different, the noise reduction effect of the signal will be different. Therefore, the wavelet bases are selected as and , respectively, and the number of decomposition levels is 1 to 5, and the optimal wavelet base and decomposition level are selected by calculating the output signal-to-noise ratio (SNR) of the denoising signal. The specific results are shown in Figure 5–Figure 7. The wavelet basis and decomposition layer settings for each fault are shown in Table 3.

4.2.3. FRWT and FRFT Filtering Effect Analysis

The pitting fault reconstruction signal, wear fault reconstruction signal, and broken tooth fault reconstruction signal are, respectively, subjected to FRFT filtering and FRWT filtering, and the filtering results of each fault signal are shown in Figure 8, Figure 9, and Figure 10. Using the found optimal fractional order  = 1.57, 1.646, and 1.558, the pitting reconstruction signal, broken tooth reconstruction signal, and wear reconstruction signal are, respectively, subjected to fractional Fourier transform filtering. The filtering results are shown in Figure 8(b), as shown in Figure 9(b) and Figure 10(b), and then perform wavelet transform on the signal after the fractional Fourier transform in the corresponding fractional domain and finally carry out the transformed signal,  = −1.57, −1.646, −1.558-order fractional Fourier transform to obtain the corresponding output signal time domain diagram as shown in Figure 8(c), Figure 9(c), Figure 10(c).

In order to analyze the influence of the fractional order on the signal filtering effect, this paper calculates the output signal-to-noise ratio of the two filtering methods, respectively. The input signal-to-noise ratio of the pitting fault (Dianshi880-1) signal is -12.25 dB; the broken tooth fault (the input signal-to-noise ratio of Duanchi1500) signal is -13.15 dB; the input signal-to-noise ratio of wear fault (Mosun880-1) signal is -16.47 dB. The comparison result is shown in Figure 11.

It can be seen from Figure 11 that the output signal-to-noise ratios (SNR) of FRWT for pitting faults, wear faults, and broken teeth faults are all greater than the output signal-to-noise ratio (SNR) of FRFT. According to the larger output signal-to-noise ratio (SNR), the signal will be distorted. The smaller the degree and the noise interference, the better the filtering effect of FRWT compared to the filtering effect of FRFT.

4.3. Wavelet Packet Extraction Features

Set the decomposition level of the wavelet packet to 8, which will generate a total of 256 frequency bands, and use the wavelet basis to decompose the fault vibration signals of 10 gears into eight layers, and generate a total of 256 wavelet packet components. Then use entropy to extract the wavelet energy, and then process the energy of the frequency band, that is, obtain the sum of the norm squares of each node of each layer of neurons, and finally obtain the normalized energy amplitude of each node. The corresponding normalized energy value of each frequency band is shown in Figure 12.

Among them, the characteristics of each sample are 256 frequency band energy spectra, and the characteristics of 1700 samples are converted into a matrix form with a two-dimensional form with a dimension of 1616. Figure 13 shows the converted two-dimensional frequency band energy characteristic distribution. Finally, the code is used to set the label category for each type of fault.

4.4. Training and Classification of Fault Models

The specific parameter selection for experimental verification is as follows: the number of layers of the two-dimensional CNN network is set to 6 layers, the convolutional layer and the pooling layer are each two layers, the number of convolution kernels in the first layer is 8, and its size is 33. The number of convolution kernels in the second layer is 16, and its size is 44; the batch size is 10, and the maximum number of iterations is 1500; the pooling layer uses the maximum pooling method, and its size is 22; using regularization reduces overfitting. Extract the wavelet energy values of the signals after FRFT filtering and FRWT filtering to construct a two-dimensional feature matrix as input; randomly select 1000 samples of each type of fault as the training set for model training, and 700 samples as the test set for the two-dimensional convolutional neural. The training model of the network is verified, and the training error curve is shown in Figure 14.

From the analysis in Figure 14, it can be seen that, regardless of whether the fractional Fourier transform or the fractional wavelet transform is used, when the number of iterations is less than or equal to 120, the training error of the two is equal; when the number of iterations is 120, the training error is 0.6667. The effect is extremely poor; when the number of iterations is greater than 120, the training error of the fractional wavelet transform filtering signal is obviously smaller than the training error of the fractional Fourier transform filtering signal; when the number of iterations is 1500, the training error of the fractional wavelet transform filtering method is 0.01623, and the training error of the fractional Fourier transform filtering method is 0.06514, that is; the training error of the fractional wavelet transform filter signal is significantly smaller than the training error of the fractional Fourier transform filter signal. It can be seen that the training effect of FRWT+2D-CNN is better than that of FRWT+2D-CNN.

The classification results of each fault in the test set using the FRWT+2D-CNN and FRFT+2D-CNN models are shown in Figure 15 and Figure 16. The abscissa is the predicted category label of the test set; the ordinate is the actual label category of the test set; the value of the diagonal position is the classification accuracy of each of the 10 types of faults; the position outside the diagonal is the type of fault.

Comparing Figures 15 and 16, it can be found that when FRWT+2D-CNN classifies and recognizes faults, only two samples are misclassified; that is, type 3 faults are misclassified as type 4 faults, and type 7 faults are wrong. The fault is classified as the 8th type of fault; when FRFT+2D-CNN classifies and recognizes the fault, except for the 5th type of fault, the other fault types are all misclassified, and the number of misclassified samples for each type of fault is greater than 1. Therefore, when classifying faults based on FRWT+2D-CNN, each type of fault can be accurately identified to the greatest extent possible.

In order to fully verify the stability and accuracy of the diagnosis method proposed in this article, this article randomly conducts 15 simulation tests on the two diagnosis methods (FRFT+2D-CNN, FRWT+2D-CNN), and the classification accuracy of each test is as shown in Figure 17. The average accuracy of the diagnosis models of the two classification methods is shown in Table 4.

By analyzing Figure 5, 20, it can be seen that the classification accuracy of the two diagnostic methods FRFT+2D-CNN and FRWT+2D-CNN basically remains stable, and the classification accuracies of FRFT+2D-CNN and FRWT+2D-CNN are both within 3%. With fluctuations up and down, from a macroperspective, the classification accuracy of FRWT+2D-CNN is higher than that of FRFT+2D-CNN. It can be seen from Table 4 that when the number of training samples, the number of test samples, and the number of trials are equal, the average accuracy of FRWT+2D-CNN classification is higher than the average accuracy of FRFT+2D-CNN classification. Therefore, in the fault diagnosis of planetary gears, the classification method of FRWT+2D-CNN is obviously better than the fault classification method of FRFT+2D-CNN.

5. Conclusion

(1)This paper adopts the FRWT-based planetary gear vibration signal filtering method. The simulation results show that both the fractional wavelet transform and the fractional Fourier transform can achieve the denoising effect of the signal; the denoising effect of the fractional wavelet transform is better than fractional Fourier transform: The energy-based fractional Fourier transform algorithm is better than the peak search-based fractional Fourier transform algorithm.(2)This paper adopts a two-dimensional convolutional neural network model, and the signals after the fractional Fourier transform and the fractional wavelet transform are filtered, and the one-dimensional wavelet energy value is normalized and converted into a two-dimensional feature matrix for diagnosis model training. The simulation results show that the two-dimensional convolutional neural network can effectively realize fault classification and recognition. In addition, the accuracy of planetary gear fault classification based on FRWT and 2D-CNN is better than the accuracy of planetary gear fault classification based on FRFT and 2D-CNN.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant no. 61973041) and the National Key Research and Development Program of China (Grant no. 2019YFB1705403).