Intelligent Detection Method of Gearbox Based on Adaptive Hierarchical Clustering and Subset

Yuan, Huimiao; Tang, Yongwei; Hao, Huijuan; Zhao, Yuanyuan; Zhang, Yu; Chen, Yu

doi:https://doi.org/10.1155/2022/6464516

Computational Intelligence and Neuroscience

Special Issue

Interpretation of Machine Learning: Prediction, Representation, Modeling, and Visualization 2022

Research Article | Open Access

Volume 2022 | Article ID 6464516 | https://doi.org/10.1155/2022/6464516

Huimiao Yuan,¹ Yongwei Tang,¹,² Huijuan Hao,¹ Yuanyuan Zhao,¹ Yu Zhang,¹ and Yu Chen¹

Academic Editor: Nian Zhang

Received 21 Jun 2022

Revised 01 Aug 2022

Accepted 06 Aug 2022

Published 30 Aug 2022

Abstract

Deep learning trains deep neural networks on mechanical time-frequency signals, which enables automatic feature extraction and intelligent fault diagnosis and removes the dependence on extensive signal processing techniques and expert experience. To address the misclassification of similar samples, a fault diagnosis algorithm based on adaptive hierarchical clustering and subset (AHC-SFD) is proposed for feature extraction and applied to gearbox fault diagnosis. First, the adaptive hierarchical clustering algorithm analyzes the characteristics of the data and clusters the dataset into multiple feature groups. Then, for each feature group, a SubCNN model is established for multiscale feature extraction, on which fault diagnosis is carried out. The test results show that the proposed method achieves a fault recognition rate of more than 99.7% on the gearbox dataset and has better generalization ability.

1. Introduction

Major accidents caused by mechanical equipment failure [1] are a constant reminder of the need for safe and reliable equipment operation. The failure of key equipment at the core of a production line in particular causes significant shutdown losses for the whole line, leading to large economic losses and, in serious cases, endangering personal safety. Online monitoring, fault diagnosis, and prediction of mechanical equipment [2, 3] play an important role in improving operational reliability and optimizing operation and maintenance strategies, and are crucial to equipment maintenance. Traditional intelligent fault diagnosis methods require mastery of a large number of signal processing techniques to extract reasonably accurate feature parameters. Moreover, when a shallow model is used to characterize the relationship between signal and fault, its diagnostic and generalization abilities are insufficient, which makes it difficult to meet the practical needs of fault diagnosis under big data.

In recent years, the application of deep learning to fault diagnosis of complex industrial systems has begun to take shape [4]. Lei et al. [5, 6] proposed a big data health monitoring method for mechanical equipment based on the denoising autoencoder (DAE), which diagnoses a variety of planetary gear faults and reflects the powerful ability of deep learning to extract the characteristics of mechanical vibration signals. Yu and Zhao [7–9] integrated the DAE with Elastic Net (EN) to solve the problem of noise interference in fault diagnosis, to detect abnormal samples in industrial processes, and to isolate fault variables from normal variables. Nguyen et al. [10–12] proposed a deep learning network composed of an autoencoder and a softmax classifier to identify bearing faults of different degrees. Deep belief networks (DBNs) are more often combined with other techniques to solve fault diagnosis problems. Since the CNN was first used to identify bearing faults in 2016, its fault diagnosis performance and scope of application have improved continuously. Hoang and Kang [13–16] proposed a CNN-based method for rolling bearing fault diagnosis that exploits the effectiveness of the CNN in image classification and achieves 100% diagnosis accuracy on the CWRU bearing dataset. Wen et al. [17, 18] proposed TCNN, a transfer learning convolutional neural network based on ResNet-50, for fault diagnosis; its prediction accuracy is significantly better than that of other DL models and traditional diagnosis methods. The application of the RNN to fault diagnosis regained attention in 2015. Abed et al. [19, 20] used an RNN for bearing fault diagnosis and achieved accurate detection and classification of bearing faults under nonstationary conditions. Pan et al. [21–23] proposed a bearing fault classification method that combines a one-dimensional CNN with an LSTM and reaches an experimental test accuracy of 99.6%.

Although the above algorithms have been applied to mechanical equipment fault diagnosis, there is still considerable room to improve the fault recognition rate. Feature extraction is a key part of fault diagnosis. For samples that have similar features but belong to different patterns, a single model extracts similar features, which leads to false recognition [24] and reduces diagnostic accuracy. To address this problem, and drawing on the idea of subsets [25, 26], this study proposes AHC-SFD, a multiscale feature extraction fault diagnosis model based on adaptive hierarchical clustering, and applies it to gearbox fault diagnosis. The test results show that the proposed method achieves a fault recognition rate of more than 99.7% on the gearbox dataset and has better generalization ability.

2. Gear Fault Diagnosis Algorithm Based on Adaptive Hierarchical Clustering and Subset

Gearboxes generally work in environments with strong noise and have complex structures, so the collected vibration signals are easily affected by external factors. To make full use of the feature extraction ability of the CNN, this study proposes a fault diagnosis algorithm based on adaptive hierarchical clustering and subset. First, adaptive hierarchical clustering is applied to all data to obtain the optimal clustering result. A multiscale feature extraction module is then designed according to the clustering result to classify the fault data.

2.1. Adaptive Hierarchical Clustering

The number of clusters is an important parameter that affects clustering quality, but it usually has to be fixed before clustering starts. As the amount of data changes, the original fixed value no longer gives the best clustering result. Considering the characteristics of vibration signals, this study proposes an adaptive hierarchical clustering (DIANA) algorithm. The clustering contour coefficient (commonly known as the silhouette coefficient) is used as the index of clustering effectiveness, so that the number of clusters is determined adaptively from the value of a self-defined discriminant function. The process is shown in Figure 1.

The specific algorithm flow is as follows:

(1) Extract the average value of each original vibration signal to form a feature sample set […]; […] indicates the fault type set.

(2) Start clustering, make […], […].

(3) Let […], take […] as the number of clusters, and perform hierarchical clustering (DIANA) on the input training samples.

(4) Calculate the contour coefficient […] using equations (1)–(4). Equation (1) involves the number of samples of class c, the samples of class c, and the absolute distance between two samples. Equation (2) involves a mark for classes other than class c, the number of samples not of class c, a sample that is not of class c, a sample of class c, and the absolute distance between these two samples. Equation (3) combines the average distance between a sample and all other samples belonging to the same type of fault with the minimum of the average distances between that sample and all samples in each of the other fault classes. Equation (4) involves the contour coefficient of each individual sample, the number of samples in the feature sample set, and the number of clusters.

(5) When […], then […] and […]; perform step 7.

(6) When […], return to step 3.

(7) Judge whether […] is less than […], where n indicates the number of dataset types. When […], […] is the number of clusters and the clustering results are output. When […], repeat step 3.
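The selection loop above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the contour coefficient is computed with the standard silhouette definition (which matches the verbal descriptions of equations (1)–(4)), the distance is the absolute difference between scalar mean features, and `largest_gap_split` is a hypothetical stand-in for DIANA that simply cuts the sorted values at the widest gaps. All function names are illustrative.

```python
def largest_gap_split(values, k):
    """Stand-in for DIANA (assumption): cut sorted 1-D values at the k-1 widest gaps."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    # Positions in sorted order where a new cluster starts.
    cuts = set(sorted(range(1, len(order)),
                      key=lambda p: values[order[p]] - values[order[p - 1]],
                      reverse=True)[:k - 1])
    labels = [0] * len(values)
    cluster = 0
    for pos, idx in enumerate(order):
        if pos in cuts:
            cluster += 1
        labels[idx] = cluster
    return labels


def silhouette(values, labels):
    """Mean silhouette coefficient using the absolute distance |x_i - x_j|.

    Requires at least two clusters.
    """
    clusters = {}
    for v, lab in zip(values, labels):
        clusters.setdefault(lab, []).append(v)
    scores = []
    for v, lab in zip(values, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the same cluster
        a = sum(abs(v - u) for u in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(sum(abs(v - u) for u in other) / len(other)
                for other_lab, other in clusters.items() if other_lab != lab)
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


def choose_cluster_count(values, max_k):
    """Try k = 2..max_k and keep the k with the largest mean silhouette."""
    best_k, best_s = None, float("-inf")
    for k in range(2, max_k + 1):
        s = silhouette(values, largest_gap_split(values, k))
        if s > best_s:
            best_k, best_s = k, s
    return best_k, best_s
```

Calling `choose_cluster_count(features, max_k)` on a list of scalar mean features returns the chosen number of clusters together with its coefficient. The scan starts at 2 because the silhouette coefficient is not defined for a single cluster.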

2.2. Multiscale (Subset) Feature Extraction

To extract as much feature information as possible from the training data and to iterate quickly, this study designs a multilayer, multichannel, multiscale feature extraction module based on the CNN. The structure is shown in Figure 2. The branch structure of each subset (12 layers in total) is the same: the convolution kernel sizes of the 8 convolution layers are 1 × 8, 1 × 8, 1 × 4, 1 × 4, 1 × 4, 1 × 2, and 1 × 2, the numbers of channels are 16, 16, 64, 64, 256, 256, 512, and 512, and the strides are 2, 2, 2, 2, 2, 1, and 1. A ReLU activation function follows each convolution layer, and the 4 max-pooling layers adopt the 1 × 2 structure. Finally, the extracted feature information is output.

2.3. AHC-SFD Diagnostic Algorithm

The flow chart of the proposed fault diagnosis method based on adaptive hierarchical clustering and subset is shown in Figure 3. The mean value of each vibration signal is used as the input of adaptive hierarchical clustering to obtain the optimal clustering result. The labeled samples corresponding to that result are fed into the multiscale feature extraction module to obtain more effective fault data features. Finally, the features extracted by the multiscale feature extraction module are transformed into one-dimensional data by the fully connected layer, and the fault diagnosis result is output through the softmax function.

3. Experimental Verification and Analysis

To evaluate the effectiveness and accuracy of the AHC-SFD network model for fault diagnosis, a gearbox dataset is used for experimental verification. The data are collected from a reference two-stage gearbox. The gear speed is controlled by a motor, and the torque is provided by a magnetic brake, which can be adjusted by changing its input voltage. A 32-tooth pinion and an 80-tooth gear are installed on the first stage input shaft, and the second stage consists of a 48-tooth pinion and a 64-tooth gear. The input shaft speed is measured by a tachometer, and the gear vibration signal is measured by an accelerometer, as shown in Figure 4.

3.1. Fault Dataset Description and Processing

Nine different gear conditions are introduced on the input-shaft pinion: healthy, missing teeth, root cracking, peeling, and tip cutting at five different severity levels. Each condition label has the same number of samples. The collected data are divided into training samples and test samples in a ratio of roughly 4 : 1. Each sample contains 3600 sampling points. The dataset is described in Figures 5–13 and Table 1.
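As a rough illustration of how a long recording could be cut into 3600-point samples, the following sketch uses consecutive non-overlapping windows. The windowing scheme and the function name `segment_signal` are assumptions; the article does not state whether the windows overlap.

```python
def segment_signal(signal, length=3600):
    """Cut a 1-D signal into consecutive, non-overlapping samples of `length` points.

    Trailing points that do not fill a whole sample are dropped (assumption).
    """
    count = len(signal) // length
    return [signal[i * length:(i + 1) * length] for i in range(count)]
```

With a 4 : 1 ratio, every 5 samples produced this way would contribute 4 to the training set and 1 to the test set.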

3.2. Adaptive Hierarchical Clustering

3.2.1. Refactoring Input Data Format

The dataset collected on the test bed consists of one-dimensional vibration signal sequences. To reduce the clustering time and carry out adaptive hierarchical clustering quickly and effectively, this study reduces each one-dimensional vibration signal of 3600 sampling points to its average value and uses that average as the input of the adaptive hierarchical clustering. The specific operation is as follows:

$$\bar{x} = \frac{1}{3600} \sum_{i=1}^{3600} x_i \tag{5}$$

In equation (5), $x_i$ represents the i-th feature value of a sample and $\bar{x}$ represents the average value of the sample.
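The reduction of each sample to a single mean value can be sketched as follows. The function name `mean_features` is illustrative and not taken from the article.

```python
from statistics import fmean


def mean_features(samples):
    """Map each 1-D sample (e.g. 3600 vibration points) to its arithmetic mean."""
    return [fmean(sample) for sample in samples]
```

The resulting list of scalars, one per sample, is what the clustering step would receive as input.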

3.2.2. Result Output

The principle of adaptive clustering is to obtain a clustering result in which the distance between classes is as large as possible and the distance within a class is as small as possible, so that the classes are well separated. As described in Section 2.1, the cluster contour coefficient is used as the index of clustering effectiveness in this study. The closer the cluster contour coefficient is to 1, the better the clustering result; the closer it is to −1, the worse the clustering result. In this study, the number of clusters is set within [1, 9]. The cluster contour coefficients obtained as the number of clusters changes are shown in Figure 14. It can be clearly seen that the cluster contour coefficient (Sk) is largest when the number of clusters is 2. Therefore, the number of branches of the multiscale feature extraction module is set to 2.

3.3. Improved CNN Network

3.3.1. Grouping Label Data According to Clustering Results

Labeled data are used. Each labeled data sample consists of a feature vector and a fault type. According to the clustering results in Section 3.2.2, the labeled data (one-dimensional vibration signals) are divided into two groups. The two groups are split into training samples and test samples in ratios of 39 : 11 and 19 : 6, respectively. The training and testing datasets are described in Table 2.
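A minimal sketch of this grouping step is given below. It assumes that every labeled sample carries the cluster index assigned to it by the clustering step and that each group is split in order without shuffling; both points, and the function names, are assumptions made for illustration.

```python
def group_by_cluster(samples, cluster_ids):
    """Collect samples into one list per cluster index."""
    groups = {}
    for sample, cid in zip(samples, cluster_ids):
        groups.setdefault(cid, []).append(sample)
    return groups


def split_by_ratio(items, train_parts, test_parts):
    """Split a list into train/test parts in the ratio train_parts : test_parts."""
    cut = round(len(items) * train_parts / (train_parts + test_parts))
    return items[:cut], items[cut:]
```

A ratio of 39 : 11 corresponds to 78% training data and a ratio of 19 : 6 to 76% training data.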

3.3.2. Data Standardization Operation

To speed up the training of the network model, simplify computation, and obtain more generalized results, the input data are standardized: the vibration signal data are mapped to the [0, 1] interval using the normalization equation. The mathematical expression is as follows:

$$x_i' = \frac{x_i - x_{\min}}{x_{\max} - x_{\min}} \tag{6}$$

In equation (6), $x_i'$ represents the preprocessed data, $x_i$ represents the frequency value of the vibration signal, $x_{\min}$ and $x_{\max}$ represent the minimum and maximum values of frequency in each group of vibration signals, and $i$ represents the index of each vibration signal value.
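Min-max normalization of one signal can be sketched as follows. The handling of a constant signal (returning all zeros to avoid division by zero) is an assumption that the article does not discuss.

```python
def min_max_normalize(signal):
    """Scale the values of one signal linearly so that min -> 0 and max -> 1."""
    lo, hi = min(signal), max(signal)
    if hi == lo:
        # Constant signal: the formula would divide by zero (assumed fallback).
        return [0.0 for _ in signal]
    return [(x - lo) / (hi - lo) for x in signal]
```

For example, the values 2, 4, and 6 are mapped to 0, 0.5, and 1.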

3.3.3. Diagnostic Result Output

In order to evaluate the difference between the normalized prediction result and the corresponding sample label, the cross entropy function is used to calculate the error loss value. The mathematical expression is as follows:

Equation (7) involves the loss function, the logical indicator function I (I = 1 when its argument is true, otherwise I = 0), and the i-th true label of the fault.
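The softmax output and the cross-entropy loss can be illustrated with a short sketch. It assumes integer class labels and a loss averaged over the batch; the indicator function then simply selects the predicted probability of the true class. This is the standard formulation and may differ in detail from equation (7).

```python
import math


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1 (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(batch_logits, labels):
    """Mean negative log-probability assigned to the true class of each sample."""
    loss = 0.0
    for logits, true_class in zip(batch_logits, labels):
        probs = softmax(logits)
        # The indicator function keeps only the term of the true class.
        loss -= math.log(probs[true_class])
    return loss / len(labels)
```

A confident correct prediction gives a loss close to 0, while a confident wrong prediction gives a large loss.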

The weight matrix is iteratively updated by means of gradient descent. The iterative equation is as follows:

Equation (8) involves the weight matrix of the j-th update.
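A plain gradient descent update can be sketched as below, assuming the usual rule in which each weight moves against its gradient by a step scaled by the learning rate. The quadratic objective in `minimize_quadratic` is a hypothetical toy problem used only to show the iteration; it is not part of the article.

```python
def gradient_descent_step(weights, grads, lr=0.0005):
    """One update: each weight moves against its gradient, scaled by lr."""
    return [w - lr * g for w, g in zip(weights, grads)]


def minimize_quadratic(target=3.0, lr=0.1, steps=200):
    """Toy example (assumption): minimize f(w) = (w - target)^2 from w = 0."""
    w = [0.0]
    for _ in range(steps):
        grad = [2.0 * (w[0] - target)]  # derivative of (w - target)^2
        w = gradient_descent_step(w, grad, lr)
    return w[0]
```

Repeating the step drives the weight toward the minimizer of the toy objective, which mirrors how the loss is reduced iteration by iteration during training.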

3.3.4. Model Parameter Structure

The experiment was implemented on a Linux computer using the PyCharm platform, with Python as the programming language and PyTorch as the deep learning framework.

During network training based on stochastic gradient descent, the multilayer back-propagation of the error signal can easily lead to vanishing gradients (a gradient that is too small makes the back-propagated training error signal extremely weak) or exploding gradients (a gradient that is too large leads to NaN values in the model). As the network depth increases, training becomes more and more difficult. To keep the network lightweight, the Adam optimizer is used during the experiment to update the network training parameters. The batch size is set to 30, the number of iterations is 200, and the learning rate is 0.0005. This study also introduces an early stopping mechanism: by monitoring the change in the training set loss between adjacent iterations, early stopping terminates model training in time to prevent overfitting. Because the model is built on a convolutional neural network, its parameter design is similar to that of a convolutional neural network and is shown in Table 3.
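One possible form of such an early stopping rule is sketched below. It stops once the loss has changed by less than a tolerance between adjacent epochs for several epochs in a row. The tolerance `tol`, the `patience` count, and the class and function names are assumptions; the article gives no concrete stopping criterion.

```python
class EarlyStopping:
    """Stop when the loss changes by less than `tol` for `patience` epochs in a row."""

    def __init__(self, tol=1e-4, patience=3):
        self.tol = tol
        self.patience = patience
        self.prev = None  # loss of the previous epoch
        self.flat = 0     # consecutive epochs with a negligible change

    def step(self, loss):
        """Record one epoch's loss; return True when training should stop."""
        if self.prev is not None and abs(self.prev - loss) < self.tol:
            self.flat += 1
        else:
            self.flat = 0
        self.prev = loss
        return self.flat >= self.patience


def epochs_run(losses, stopper):
    """Return how many epochs are executed before the stopper fires."""
    for epoch, loss in enumerate(losses, start=1):
        if stopper.step(loss):
            return epoch
    return len(losses)
```

In a training loop, `stopper.step(train_loss)` would be called once per epoch and the loop would break when it returns True.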

3.4. Result Analysis

To verify whether the method has a high diagnostic rate and good generalization ability, the experimental results in this study are compared with those using only the CNN. The experimental results are shown in Figure 15.

The comparison results of AHC-SFD and CNN on the test set are shown in Figure 16.

It can be seen from Figures 15 and 16 that after 140 epochs, the accuracy of the AHC-SFD algorithm on the test set reaches 99.7%, while the accuracy of the CNN algorithm on the test set is only 98.9%. The diagnostic method in this study therefore tends to converge faster and more stably, with higher accuracy and stronger generalization ability.

To further demonstrate the ability of the model to learn the features of different categories, the t-SNE dimension reduction algorithm from manifold learning is used to visualize the features learned by the fully connected layer. The experimental results are shown in Figure 17.

It can be seen from the scatter plots in Figure 17 that the AHC-SFD method has identification errors in the samples of class 0 and class 7, while the other samples are gathered at their corresponding positions. The CNN features, however, show recognition errors in the samples of class 1, class 2, class 5, and class 8, and the samples of class 1 and class 5 overlap heavily. AHC-SFD therefore has a stronger feature learning ability than the CNN.

4. Conclusion

The AHC-SFD algorithm established in this study is a diagnosis algorithm based on adaptive hierarchical clustering and subset, and it has the following three advantages: (1) The AHC-SFD algorithm takes the original vibration signal directly as the input of the 1D-CNN, which preserves the characteristics of the vibration signal to the greatest extent. (2) A grouping method based on adaptive hierarchical clustering is proposed, which analyzes the characteristics of different data and then clusters the dataset into multiple feature groups. (3) A multiscale feature extraction module is proposed to reduce the misclassification of similar samples and to extract as much effective information as possible from the data. Verification on the gearbox dataset shows that the diagnostic accuracy is better than that of the single-channel CNN model.

Data Availability

The data set used in this article can be obtained from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding this work.

Acknowledgments

This work was supported by Innovation Ability Improvement Project of Scientific and Technological Small and Medium-Sized Enterprises in Shandong Province (grant no. 2021TSGC1089), “20 New Colleges and Universities” Funded Project in Jinan (grant no. 2021GXRC074), Major Scientific and Technological Innovation Projects in Shandong Province (grant no. 2019JZZY010117), and 2020 Industrial Internet Innovation and Development Project, Solution Application and Promotion Public Service Platform (grant no. TC200802C).

References

[1] Z. Hou, “Research status and development prospect of rotating machinery fault diagnosis,” Forging equipment and manufacturing technology, vol. 56, no. 5, pp. 33–37, 2021.
[2] X. Zhao, “Automatic on-line monitoring and fault diagnosis system for mine electromechanical equipment,” Mining equipment, vol. 11, no. 6, pp. 246-247, 2021.
[3] G. Fan, “Research on on-line monitoring and fault diagnosis of secondary circuit in intelligent substation,” Light source and lighting, vol. 45, no. 2, pp. 228–230, 2022.
[4] B. Shen, B. Chen, C. Zhao, F. Chen, W. Xiao, and N. Xiao, “A review of research on deep learning in mechanical equipment fault prediction and health management,” Machine tools and hydraulics, vol. 49, no. 19, pp. 162–171, 2021.
[5] Y. Lei, F. Jia, and X. Zhou, “A deep learning-based method for machinery health monitoring with big data,” Journal of Mechanical Engineering, vol. 51, no. 21, pp. 49–56, 2015.
[6] F. Jia, Y. Lei, J. Lin, X. Zhou, and N. Lu, “Deep neural networks: a promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data,” Mechanical Systems and Signal Processing, vol. 72-73, pp. 303–315, 2016.
[7] W. Yu and C. Zhao, “Robust monitoring and fault isolation of nonlinear industrial processes using denoising autoencoder and Elastic Net,” IEEE Transactions on Control Systems Technology, vol. 27, pp. 1–9, 2019.
[8] H. Mu, Rolling Bearing Fault Diagnosis Method Based on Integrated Soft Competition Yu Norm Art, Wuhan University of science and technology, Wuhan, China, 2017.
[9] Y. Tang, “Application of integrated diagnosis method in transformer fault diagnosis,” Coal mine machinery, vol. 33, no. 5, pp. 264–266, 2012.
[10] V. H. Nguyen, J. S. Cheng, Y. Yu, and V. T. Thai, “An architecture of deep learning network based on ensemble empirical mode decomposition in precise identification of bearing vibration signal,” Journal of Mechanical Science and Technology, vol. 33, no. 1, pp. 41–50, 2019.
[11] G. Chen, J. Zhang, and G. Kan, “Intelligent fault diagnosis method of bearing based on improved superposition automatic encoder,” Noise and vibration control, vol. 42, no. 1, pp. 156–161, 2022.
[12] S. Liu, Research on Bearing Fault Diagnosis Based on Stack Automatic Encoder, Taiyuan University of science and technology, Taiyuan, China, 2020.
[13] D. T. Hoang and H. J. Kang, “Rolling element bearing fault diagnosis using convolutional neural network and vibration image,” Cognitive Systems Research, vol. 53, pp. 42–50, 2019.
[14] C. Wei, J. Zhou, and J. Zhang, “FDM 3D printing fault diagnosis method based on,” Agricultural equipment and vehicle engineering, vol. 60, no. 2, pp. 149–153, 2022.
[15] K. Zhang, J. Wang, H. Shi, X. Zhang, and L. Fu, “Research on fault diagnosis of rolling bearing under variable working conditions based on,” Control engineering, vol. 29, no. 2, pp. 254–262, 2022.
[16] Y. Ye and Y. Li, “Multi wind turbine fault diagnosis based on CNN ensemble learning,” Journal of Industrial Engineering, vol. 25, no. 1, pp. 136–143, 2022.
[17] L. Wen, X. Li, and L. Gao, “A transfer convolutional neural network for fault diagnosis based on ResNet-50,” Neural Computing & Applications, vol. 31, pp. 1–14, 2019.
[18] J. Ding, Q. Shao, Z. Qi, M. Xie, B. Gao, and Y. Yang, “Convolution neural network fault diagnosis based on transfer learning,” Science, technology and engineering, vol. 22, no. 14, pp. 5653–5658, 2022.
[19] W. Abed, S. Sharma, R. Sutton, and A. Motwani, “A robust bearing fault detection and diagnosis technique for brushless DC motors under non-stationary operating conditions,” Journal of Control, Automation and Electrical Systems, vol. 26, no. 3, pp. 241–254, 2015.
[20] M. Chang, Fault Diagnosis and Prediction of Wind Power Rolling Bearing Based on Deep Learning, Jiangnan University, Wuxi, China, 2021.
[21] H. Pan, X. He, and S. Tang, “An improved bearing fault diagnosis method using one-dimensional CNN and LSTM,” Journal of Mechanical Engineering, vol. 64, no. 7/8, pp. 443–452, 2018.
[22] P. Zhang, X. Shu, X. Li, J. Hang, S. Ding, and Q. Wang, “Research on fault diagnosis method of AC motor system based on LSTM,” Journal of electrical machinery and control, vol. 26, no. 3, pp. 109–116, 2022.
[23] Y. Li, J. Hu, J. Lai, W. Wang, Y. Zhao, and Y. Fan, “Fault diagnosis of wind turbine planetary gearbox based on 1d-cnn-lstm hybrid neural network model,” Electrical automation, vol. 43, no. 5, pp. 20–22+26, 2021.
[24] J. Bai, Y. Wu, J. Zhang, and F. Chen, “Subset based deep learning for RGB-D object recognition,” Neurocomputing, vol. 165, pp. 280–292, 2015.
[25] A. T. Duong, H. T. Phan, and N. D. H. Le, A Hierarchical Approach for Handwritten Digit Recognition Using Sparse Autoencoder. Issues and Challenges of Intelligent Systems and Computational Intelligence, Springer, New York, NY, USA, 2014.
[26] Y. Zhang, X. Li, L. Gao, and P. Li, “A new subset based deep feature learning method for intelligent fault diagnosis of bearing,” Expert Systems with Applications, vol. 110, pp. 125–142, 2018.

Copyright

Copyright © 2022 Huimiao Yuan et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
