A Generative Adversarial Network Based a Rolling Bearing Data Generation Method Towards Fault Diagnosis

Huo, Lin; Qi, Huanchao; Fei, Simiao; Guan, Cong; Li, Ji

doi:https://doi.org/10.1155/2022/7592258

Computational Intelligence and Neuroscience

On this page

Abstract Introduction Related Work Conclusion Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Safe and Fair Machine Learning for Neuroscience

View this Special Issue

Research Article | Open Access

Volume 2022 | Article ID 7592258 | https://doi.org/10.1155/2022/7592258

A Generative Adversarial Network Based a Rolling Bearing Data Generation Method Towards Fault Diagnosis

Lin Huo,^1,2Huanchao Qi,¹Simiao Fei,³Cong Guan,⁴and Ji Li¹

Academic Editor: Shahid Mumtaz

Received12 May 2022

Revised17 Jun 2022

Accepted20 Jun 2022

Published13 Jul 2022

Abstract

As a new generative model, the generative adversarial network (GAN) has great potential in the accuracy and efficiency of generating pseudoreal data. Nowadays, bearing fault diagnosis based on machine learning usually needs sufficient data. If enough near-real data can be generated in the case of insufficient samples in the actual operating condition, the effect of fault diagnosis will be greatly improved. In this study, a new rolling bearing data generation method based on the generative adversarial network (GAN) is proposed, which can be trained adversarially and jointly via a learned embedding, and applied to solve fault diagnosis problems with insufficient data. By analyzing the time-domain characteristics of rolling bearing life cycle monitoring data in actual working conditions, the operation data are divided into three periods, and the construction and training of the generative adversarial network model are carried out. Data generated by adversarial are compared with the real data in the time domain and frequency domain, respectively, and the similarity between the generated data and the real data is verified.

1. Introduction

Nowadays, as an important mechanical component, rolling bearings have an indispensable position in mechanical rotating equipment. 30% of the failures of mechanical rotating equipment are due to the failure of rolling bearings, so the fault diagnosis of rolling bearings is extremely important. Most traditional fault detection methods for rolling bearings are based on human judgment and data-driven algorithms with existing knowledge and theories [1–6]. In the diagnosis, the focus is on the numerical calculation of effective characteristic parameters or the extraction of signals, and the fault is often analyzed and diagnosed in the established digital set. Therefore, human subjective factors will have a certain impact on the analysis results and reduce the accuracy of the analysis results [7–9]. At the same time, for safety and economic considerations, most of the bearings are in safe working state during operation, and most of the collected data are working data in a healthy state, but the working data under fault conditions are extremely lacking. This also makes it difficult to apply data-driven algorithms normally, and the results obtained lack practical reference value. In modern times, with the continuous development of deep learning and artificial neural network technologies, advanced machine learning algorithms have been widely used in the field of fault diagnosis. When these deep learning methods deal with the problem of class imbalance, due to their bias towards most classes [10–12], their accuracy will also be reduced. As an emerging generative model, the generative adversarial network method uses two different neural networks to confront each other and has the ability of projecting the standard data distribution to the high-dimensional real-world data distribution to generate high-accuracy pseudoreal data that makes it a place in the field of fault diagnosis and prognosis.

Fuan et al. [13] proposed an adaptive deep convolutional neural network for rolling bearing fault diagnosis, which reduces the dependence on manual experience to a certain extent by automatically learning the essential fault features layer by layer from the input data. Guo et al. [14] used the Hilbert envelope spectrum and support vector machine to analyze the amplitude modulated pulse in the vibration signal of rolling bearing, and accurately diagnosed many kinds of faults of rolling bearing. Shao et al. [15] used an optimized deep belief network and applied it to the analysis of rolling bearing simulation signals and experimental signals. These research results show high accuracy and feasibility, but there are also some certain shortcomings. No matter what kind of neural network model is used, the final result is always inseparable from the simplification of the vibration signal, such as feature reduction by setting a threshold, and simplifying the vibration signal to a single signal; these methods have a certain theoretical basis, but in actual working conditions, however, certain differences inevitably exist. Nowadays, the neural network theory has spawned many branches, and the practical problems applied to data fusion have gradually increased. However, in the fault diagnosis field, the application of neural networks has always been restricted. It is impossible to simulate the complex situation of the real world no matter what kind of traditional generative model has been used. This kind of huge life cycle data will have some problems for the fitting model. Traditional simulation data generation often involves methods such as MLE (maximum likelihood estimation), Markov chain methods, and approximation methods [16], which are subject to complex calculations, the pros and cons of simulation are affected by other factors, and gradient disappearance. As a result, the data generation efficiency is low and the accuracy is low.

In this environment, GAN (generative adversarial networks) came into being [17]. Through the joint action of the two modules of the generator and the discriminator, it can effectively solve the existing low accuracy problems of data generated by deep learning and neural network models while generating a large amount of simulation data. Some large artificial intelligence company such as Google, OpenAI, and Facebook are using this feature to explore kinds of GAN applications [18]. However, the application of GAN has mainly focused on the generation and processing of images [19], which has not been widely used in the field of fault diagnosis. In recent years, some scholars have begun to think about the application value of GAN as a generative model in the field of fault diagnosis. Gao et al. [20] used GAN to generate a large number of fault samples and applied them to fault diagnosis based on FEM simulation and reflected the practicability of GAN from the side through the simulation results of different classifiers, and achieved certain results. Hua et al. [21] proposed a GAN-based fault diagnosis method for imbalanced data. At the same time, new GAN networks such as WGAN [22], CGAN, and BiGAN have also been proposed successively, making the application of GAN in the field of fault diagnosis a reality. However, some of these studies are limited to the structural innovation of GAN to further improve the performance of GAN and are not applied to actual working conditions; although, some are applied to actual working conditions, the demonstration focuses on the results of fault diagnosis, not the accuracy of GAN itself. Properties and application value. Further applications of GANs in the field of fault diagnosis and the advancement of GANs compared to other generative models remain to be studied.

This paper mainly discusses the bearing data game generation method based on GAN, which effectively solves the problems of fault diagnosis methods caused by insufficient data. The accuracy and application value of the data generated by GAN are fully demonstrated through the values of mean, root mean square, skewness, and other values of the rolling bearing in three different periods and a large number of image comparisons. Compared with other data simulations, GAN has great advantages. It can realize strong nonlinear data fitting function, prevent human interference, and automatically extract the fault characteristics through the computer. The data model based on GAN have higher reliability and can also be used under more extensive working conditions. Facing the development of intelligence and data informatization, it is a trend to combine artificial intelligence and deep learning with rolling bearing fault diagnosis. This combination can promote an intelligent diagnosis model and improve the accuracy and efficiency.

In the field of rolling bearing failure research, compared with the traditional thinking “diagnosis + processing” thinking mode [23], the modern failure prognosis theory can not only prevent the occurrence of failures more effectively and reduce the consequences of accidents but can also effectively reduce maintenance costs. Most of the modern research studies on rolling bearing faults are single-point fault signal modeling [24]. Most of this method is by analyzing the vibration characteristics of a single point of rolling bearings, collecting vibration data obtained in a short period of time, and eliminating the influence of other failure modes through assumptions. This research method can analyze the single failure mode of rolling bearing in a relatively targeted manner [25], but it is difficult to analyze the multipoint compound fault and the coupling caused by it under the actual working conditions [26].

At present, many scholars have advanced and mature research methods in fault diagnosis. Zhang et al. [27] used deep adversarial learning to realize automatic identification of unknown failure modes, which has an extremely high practical application value under the condition of strong data uncertainty; Wen et al. [28] used TCNN (ResNet-50) for fault diagnosis and applied it to three different datasets, all of which have a prediction accuracy of 99%; Jiao et al. [29] used the residual joint adaptation adversarial network (RJAAN) for fault diagnosis, which can learn category discrimination and domain-invariant feature information for cross-domain fault diagnosis, which has strong robustness and superiority; Zhang et al. [30] used federated learning to solve the data island problem in fault diagnosis while ensuring the privacy of different clients, which provided the possibility for further application of confidential decentralized learning in fault diagnosis. Most of these excellent fault diagnosis methods require a large amount of rolling bearing fault data to support, and the lack of this type of data also greatly limits the application value of data-driven algorithms in actual working conditions. The acquisition of bearing fault diagnosis data is roughly divided into two types: (1) Based on real data collected by the instrument in a short time [31]. (2) Based on simulation data generated by deep learning or neural network. The first method often has special assumptions, such as approximating the vibration signal to smooth signal [32] and simplifying the failure mode of the rolling bearing to one. These assumptions often do not match the actual operating conditions. It is difficult to have higher accuracy. By the second method, it is difficult to guarantee the accuracy of the data, and the large amount of simulation data generated has a low reference value. Although there have been many research studies on rolling bearing fault diagnosis through machine learning algorithms, researchers such as Gunerkar et al. [33] have proposed an artificial neural network that uses wavelet transform as a noise reduction tool and extracts sensitive time-series parameters. Another example is convolutional neural network (CNN), convolutional sparse combination learning (CSCL), and other deep learning methods [34]. However, in the process of data input, the variables under various working conditions must be controlled artificially. Moreover, it is difficult to guarantee the application ability of the generated pseudoreal data under actual working conditions. Based on the generative adversarial network, this paper proposes a rolling bearing simulation data generation platform, which can effectively utilize the characteristics of GAN that can generate large amounts of data and generate high accuracy.

In 2014, Goodfellow proposed a new generative network model [35]. This new network model is very different from the traditional generative network. In the structure of the entire network, it not only contains a generative network but also has a discriminant network [36]. There is an antagonistic relationship between these two parts, and the idea of this antagonistic relationship is mainly derived from the game theory. In this theory, both parties are required to be equal in the game and then change their strategies according to the opponent’s strategy changes, so as to achieve the goal of winning in the game. To extend and integrate this theory into the confrontation network needs the generator and the discriminator to be the two sides of the game in the game. The generator can fit the data generation to generate model samples. The optimization goal is to be able to finally achieve Nash equilibrium and finally achieve the generator’s estimation and prediction of the overall distribution of sample data.

The generative model of the GAN can be used to model the distribution of real data and generate simulation data [37]. Compared with the real data, these generated data have extremely high similarity. Therefore, this model can be applied well in unsupervised learning, semi-supervised learning, and multi-learning.

Compared with the traditional rolling bearing fault diagnosis method, GAN abandons the influence of human subjective factors on the results and avoids the problem of poor applicability caused by fault data only applicable to specific fault types. The large amount of data it generates can help the fault diagnosis results based on data-driven algorithms to be closer to the actual working conditions. Also, different from traditional neural networks and deep learning methods [38], the adversarial relationship between the discriminator and the generator eliminates the need for variational lower bounds or approximate inference during data generation; it also avoids the calculation of the partition function caused by the repeated application of the Markov chain learning mechanism. As a new generation of generative models, the GAN solves the problems of low efficiency and inaccurate generated data of traditional generative models. It also has the idea of discriminative models and uses the confrontation between the generator and the discriminator to greatly improve the accuracy and speed of the generated data. At the same time, the limitation of the generation sample dimension and loss function of the traditional generative model is also solved in this new generative model, which also makes the GAN have a very high degree of model design freedom and greatly enhances the possibility of its practical application. A large number of innovations and application methods have emerged in just six years after GAN was available [39–41], which also made its application in the industrial field possible.

Combined with fault diagnosis theory, the application of GAN can effectively solve the problem of insufficient life cycle data that limits fault diagnosis. Inspired by this, we choose rolling bearings that are widely used in industry and has serious failure consequences for research, and use the GAN method to game generate rolling bearing pseudoreal data. By analyzing the time-domain characteristics of the rolling bearing life cycle monitoring data under actual working conditions, the operating data are divided into three periods to construct and train the generative adversarial network model. The adversarial generated data are compared with the real data in the time domain and frequency domain, respectively, to verify the similarity between the generated data and the real data. The actual bearing life cycle data are from the University of Cincinnati. The GAN model is used to generate the rolling bearing data and then verify the similarity of generated data and real data in the time domain and frequency domain.

3. Generative Adversarial Network Method

3.1. Generative Adversarial Network

The generation method is an important branch of the machine learning method. It involves the learning of distribution assumptions and distribution parameters of explicit or implicit variables of the data, and sampling new samples based on the learned model. The principle of the generative model is shown in Figure 1; each point represents an image, sampled from the real data distribution , and the real data distribution area represents real image data. A Gaussian noise distribution is input into the generation model, so that the output of the generation model is as close as possible to the real data distribution so as to accurately approximate the real data. For the choice of objective function, traditional generative models often use the maximum likelihood function as the objective function. However, the GAN introduces a discriminant model in addition to the generative model and achieves the purpose of optimization by adversarial training of the generative model and the discriminant model.

Under the guidance of the binary zero-sum game, the framework of the GAN also contains a pair of opposite models, namely, discriminator and generator. The discriminator is mainly used to make reasonable and correct judgments and distinctions between real data and the generated data by the generator and improve the accuracy of network identification data. The role of the generator is to ensure that the generated data are as close to the actual distribution as possible within a limited range. Therefore, in order to win in the game, both sides need to continuously improve their discrimination and generation ability, so as to achieve the goal of optimizing the entire power generation network, and finally find the Nash equilibrium between the two. The specific GAN framework is shown in Figure 2. It can be seen that the input of the generator is a random noise vector from the public probability distribution. The output is pseudoreal data generated by the computer. The input of the discriminator is the picture x, which can be sampled. The output of the discriminator is scalar, which is used to represent the probability that x is real data. In other words, when the discriminator considers x to be real data, its output is 1, otherwise, it is 0. The discriminator and generator are optimized repeatedly. When the discriminator cannot accurately distinguish the data source, it can be considered that the generator has learned the distribution of real data samples.

3.2. Objective Function

The objective function means that the generator and discriminator in the GAN can be any differentiable function, which needs to be optimized by the stochastic gradient descent method SGD. The first condition of using the SGD method is to establish an objective function that can judge and supervise the learning effect. When the generator G is given, the optimization of the discriminator D is the same as the training effect of the conventional binary classifier, so the objective function can be expressed by the cross entropy, which is as follows:

Among them, G and D represent the differentiable functions of the generator and the discriminator, respectively, x is the real data sample, z is the random noise vector, and is the generated data of the generator. From the perspective of the classifier, the first term of (1) means that D is labeled as 1 for real data x, and the second term is that D is used for generator G to map noise z into generator-generated data, and is labeled as 0. The objective function (1) obtains the optimal solution at:

It can be seen from (2) that GAN estimates the ratio of the distribution density of the two concepts, rather than based on the Markov chains or approximating the lower bound of variation. This is the key difference between GAN and other generative models.

From another perspective, the purpose of the discriminator is to correctly distinguish between real data x and generated data , that is, when the input is real data x, the output probability value should be as close to 1 as possible. When the input is to generate data tend to 0. Combining these two aspects, the formal expression of the objective function of the discriminator is as follows:

Since G and D play a binary zero-sum game, the objective function of generator G is . Therefore, the optimization problem of GAN can be described as the following minimax game problem:

Due to the lack of sufficient training in the initial training stage, the data generated by G are not realistic enough, so D can easily distinguish the generated data from the real data, resulting in insufficient gradient for G. Therefore, training G by maximizing rather than minimizing is a better strategy.

3.3. Objective Function Optimization

3.3.1. Optimal Discriminator

In a continuous space, the mathematical expectation of the objective function (3) can be expanded into the following integral form:

For any nonzero real numbers a and b, and , the expression is as follows:which takes the maximum value at . Therefore, given generator G, (5) takes its maximum value at

This is the optimal solution of the discriminator D. In the actual situation, because the prior is not known, the optimal discriminator cannot be obtained by (7). The function of (7) is to prove the existence of the optimal generator G. is the value to be approximated by training the discriminator in actual training.

3.3.2. Optimal Generator

The target value of the generator is such that . At this time, the discriminator

That is, it is difficult for the discriminator to distinguish between the generated data and the real data. If and only if , G is the global optimal solution of the binary zero-sum game.

3.4. Training GAN

Given a generator G, D can be obtained by . According to (3), the mathematical expectation and must be obtained. However, the above two expectations cannot be obtained through integration in practice. Therefore, the method of sampling from real data and generated data is used to approximate these two expectations. That is, we take m samples from and take m samples from .

Each round of the parameter update process is shown in Figure 3:

The challenge of how to balance generator G and discriminator D is a very important issue. In actual training, in the same round of parameter update, the parameters of each pair of discriminator D are updated k times, and the parameters of generator G are updated once. Otherwise, generator G will easily collapse to the saddle point.

Figure 4 depicts this process intuitively. The equidistant horizontal lines indicate that the data in the sampling domain are uniformly distributed. The arrow pointing from z to x indicates that the random noise vector z is mapped to generated data by . In Figure 4(a), the probability distribution of the generator (green line) has some differences between the probability distribution (black line) of the real data, discriminator output value (blue line) is higher on the left and lower on the right, which means that the discriminator can still accurately distinguish true and false data in the current state ( tend to 1 and tend to 0). As the number of training increases, Figures 4(b) and 4(c) show the process of the generated distribution gradually approaching the true distribution: In Figure 4(b), fix G and train D to converge to the optimal solution ; 4(c) Fix D. After G is updated, the gradient of D moves G to the area where D will make a wrong judgment. If both G and D have very strong learning ability, they will eventually reach , that is, the generated distribution is completely consistent with the real distribution, as shown in Figure 4(d). At this time, the state of D cannot distinguish between real data and generated data, that is , the Nash equilibrium is reached.

4. Case Study Based on GAN

4.1. Rolling Bearing Data Description

First, we process the actual rolling bearing data, build a complete GAN network and train, compare the generated data of the normal state, moderate degradation state, and rapid failure state of the bearing with the real data, and verify the similarity of the generated data to prove the feasibility of the method.

This paper uses bearing data from the University of Cincinnati, and the test bench is shown in Figure 5. The bearing life test bench of the University of Cincinnati is to install four bearings on the shaft, drive the main shaft connected to the shaft through the transmission of the belt, and keep the shaft speed constant at 2000 rpm. Apply a radial load of 6000 pounds to the shaft and bearings through a spring mechanism. At the same time, all bearings are forced to be lubricated.

The experimental bearing model is Rexnord ZA-2115 double row bearing. The structural parameters of the bearing are shown in Table 1. As shown in Figure 5 above, a PCB353B33 high-sensitivity quartz ICP accelerometer is mounted on the bearing box. Each bearing is equipped with two horizontal ( direction) and vertical ( direction) accelerometers. The vibration signals collected by the sensors constitute data set 1, 2 and 3. All failures occurred after 100 million revolutions in the design life, that is, all failures occurred after the design life was exceeded. The running time of the collected data was from 10:32:39 on February 12, 2004 to 06:22:39 on February 19, 2004, and finally stopped due to a failure of the outer ring of the bearing 1.

According to the bearing structure parameters shown in Table 1 and the calculation formula of the fault characteristic frequency given in Chapter 3, the fault characteristic frequency of the test bearing can be obtained, as shown in Table 2.

Considering the training time and other factors, the data period of data set 2 is relatively short, so this article uses data set 2 as the learning data of GAN. This set of data has a total of 984 files, and each file has 20480 data. The specific description of the data is as follows as shown in Table 3. The 4 acquisition channels correspond to A, B, C, and D, where A corresponds to the information collected by bearing 1, and so on, corresponding to the collected information of 4 bearings. This article uses three stages of data as learning data, the 1 to 702 files (normal bearing state), the 703 to 968 files (moderate bearing degradation), and the 969 to 984 files (fast bearing failure). Partial bearing data are shown in Figure 6.

According to the time-domain characteristic parameter expressions in Chapter 3, the change curves of the four time-domain characteristic indexes of the bearing life cycle kurtosis value, root mean square value, form factor, and pulse factor are calculated, as shown in Figure 7.

(a)

(b)

(c)

(d)

By observing the change curve of the four time-domain characteristic indicators shown in Figure 7 during the full life cycle of the bearing, we can know the following:

The amplitude shown in Figures 7(a)–7(c) shows a certain jump near 703 documents (about 117 h), and there are long-term fluctuations, indicating that the azimuth time at this position has shown “moderate degradation.” Since then, the trend of the RMS value is consistent with the trend of the failure development, and the kurtosis value fluctuates greatly in the later stage of the fault development, which cannot accurately reflect the changing law of the bearing health status.

Figures 7(a)–7(d). After 969 documents (about 161 h), the amplitude suddenly soars, which indicates that the bearing has been severely degraded and is in a state of “rapid failure.”

In summary, the training data are divided into 3 parts to prepare for the following training. The first part (files 1 to 702) is the data under normal bearing conditions and the second part (files 703 to 969) is the data of the moderately degraded state of the bearing. The third part (files 969 to 984) is the rapid failure state of the bearing, that is, the failure state.

4.2. The GAN Network Construction

4.2.1. Network Structure

A complete GAN is composed of two parts, one is the generative model and the other is the discriminative model. The generator model has 4 layers including an input layer, two hidden layers, and an output layer. The discriminant model is a three-layer neural network, including an input layer, a hidden layer, and an output layer. The two neural networks are connected by the last layer of the generator and the first layer of the discriminator to form a generative adversarial network, as shown in Figure 8.

4.2.2. GAN Network Model Input and Output

In combination with the test requirements, the experimental data in this article are the full life data of Cincinnati rolling bearings. A single sample is taken for a period of time, and a single sample is a 1 × 20480 vector. At the same time, the output result of the generator is also a 1 × 20480 vector.

In the fault diagnosis of the bearing, the fault condition of the state is mapped from the real state. Therefore, from the perspective of mechanism analysis and experimental results, the input layer of the generated network is a 1 × 128 noise signal, the hidden layer 1 and the hidden layer 2 have 128 neurons, and the output layer has 20480 neurons. The input layer is fully connected with the hidden layer and the output layer.

The input layer of the discriminant network is the output layer of the generating network, that is, 20480 neurons, the hidden layer has 128 neurons, and the output layer has only one neuron. The input layer is fully connected with the hidden layer and the output layer.

4.2.3. Model Hyperparameters

The hyperparameters in the GAN model mainly include the number of iterations of model structure parameters, etc. The values of these parameters will directly affect the final generation result.(1)Structural parameters The generator uses a 4-layer neural network, and the discriminator uses a 3-layer network structure. In the original GAN, the image signal is mapped from the noise, but in the fault diagnosis, the image signal is mapped from a real normal working condition. The fault signal under the same working condition. Therefore, considering the intuitive mechanism and experimental results, the generator uses a 4-layer network. At the same time, the number of neurons in the input layer and the hidden layer does not increase layer by layer, that is, the number of neurons in each layer is the same, and the number of neurons in each layer is 128. The discriminant network adopts a three-layer structure. The first layer coincides with the last layer of the generator. The second reduces the number of neurons to 128. The output layer of the last layer is only one neuron because it only needs the probability of outputting data.(2)Transfer function uses the rule function, and at the same time, to match the transfer function, the learning rate of the generating network and the discriminant network is a small 0.0001. The last layer of the discriminant network only needs to output a probability of 0–1, so the sigma function is adopted.

4.3. The Bearing Game Generated Data Training Based on GAN

In this section, we program the GAN through Python, the simulation data are generated and compared with the real data under the normal state, the moderately degraded state, and the rapid failure state of the bearing.

4.3.1. GAN-Based Training under the Normal State

This training uses the normal bearing state data in the 1st file to the 702nd file and compares them with the real data to verify the feasibility of the generated data. The analysis in this section is the comparison between the generated data and the real data in the same time period.(1)Probability distributions When setting the training termination condition, we gradually expand the number of training rounds from small to large until the probability distribution is close to the real data. The training process is shown in Figures 9, 9(a)∼9(f) are, respectively, selected as the comparison results of the probability distribution of 1000, 3000, 6000, 10000, 13600, and 15000 rounds. After experiments, it is found that under the above parameters and experimental data, the number of rounds can achieve better results at about 13,600 times. At this time, the discriminant score of the discriminant network is 0.51. In the process of training the normal data, the loss function of the generating network and the discriminating network change as shown in Figure 10. It can be seen that the generative network and the discriminant network are fighting against each other, and neither side wants the other to win.(2)Time-domain characteristics Through the comparison of the 6 indicators of mean, root mean square, skewness, kurtosis, form factor, maximum, and minimum in Table 4, it is found that the real data and the generated data under normal bearing conditions are very similar. The maximum relative deviation of the mean is 6.2%, and the minimum relative deviation is 0.(3)Time-domain distribution It can be seen from the time-domain waveform diagram in Figure 11 that the generated data and the real data are very similar at different times, but they are not the same data.(4)Frequency domain distribution The generated data and the real data are subjected to the Hilbert transform and Fourier transform to obtain the frequency domain waveform diagram, as shown in Figure 12. It can be seen that the frequency domain waveforms of the real data and the generated data are very similar. In summary, the generated data under normal conditions can be used as experimental research data.

4.3.2. Training under the Moderate Degradation State

This training uses the moderately degraded bearing state data in 703rd file to the 968th file and compares them with the real data to verify the feasibility of the generated data. The analysis in this section is the comparison between the generated data and the real data in the same time period.(1)Probability distributions When setting the training termination condition, we gradually expand the number of training rounds from small to large until the probability distribution is close to the real data. The training process is shown in Figures 13, 13(a)∼13(f) are, respectively, selected as the comparison results of the probability distribution of 600, 1000, 2000, 3000, 4200, and 5000 rounds. After experiments, it is found that under the above parameters and experimental data, the number of rounds can achieve good results at about 4200 times. At this time, the discriminant score of the discriminant network is 0.51. In the process of training the bearing’s moderately degraded data, the loss function of the generating network and the discriminating network change as shown in Figure 14. It can be seen that the generating network and the discriminating network are fighting against each other, and neither party wants the other to win.(2)Time-domain characteristics Through the comparison of the 6 indicators of mean, root mean square, skewness, kurtosis, form factor, maximum, and minimum in Table 5, it is found that the real data and the generated data under the condition of moderate bearing degradation are very similar, and the maximum relative deviation of the maximum value is 6.2%, the minimum relative deviation of the form factor is 1.1%.(3)Time-domain distribution It can be seen from the time-domain waveform diagram in Figure 15 that the generated data and the real data are very similar at different moments, but they are not the same data.(4)Frequency domain distribution The generated data and the real data are subjected to the Hilbert transform and Fourier transform to obtain the frequency domain waveform diagram, as shown in Figure 16. It can be seen that the frequency domain waveforms of the real data and the generated data are very similar. In summary, the generated data under moderate degradation conditions can be used as experimental research data.

4.3.3. Training under the Rapid Failure State

This training uses the rapid failure state data for training in 969th file to the 984th file and compares them with the real data to verify the feasibility of the generated data. During training, it is found that the discriminant network can always distinguish the true and false of the data, so the generator is strengthened, and the structure of one input layer, four hidden layers and one output layer is used for training. The analysis in this section is the comparison between the generated data and the real data in the same time period.(1)Probability distributions When setting the training termination condition, gradually expand the number of training rounds from small to large until the probability distribution is close to the real data. The training process is shown in Figure 17. Figures 17(a)∼17(f) are selected as the comparison results of the probability distribution of 1600, 3000, 5000, 7000, 8800, and 10000 rounds, respectively. After experiments, it is found that under the above parameters and experimental data, the number of rounds can achieve good results at about 8800 times. At this time, the discriminant score of the discriminant network is 0.54. In the process of training the bearing’s rapid failure data, the loss functions of the generating network and the discriminating network change as shown in Figure 18. It can be seen that the generating network and the discriminating network are fighting against each other, and neither party wants the other to win. When the confrontation reached between 4500 and 7800 times, the two sides reached a balance, and then confronted again.(2)Time-domain characteristics Through the comparison of 6 indicators of mean, root mean square, skewness, kurtosis, form factor, maximum, and minimum in Table 6, it is found that the real data and the generated data under the rapid bearing failure state are very similar, and the relative deviation of the form factor is maximum 0.1%, the minimum relative deviation of the mean is 17%.(3)Time-domain distribution It can be seen from the time-domain waveform diagram in Figure 19 that the generated data and the real data are very similar at different times, but they are not the same data.(4)Frequency domain distribution The generated data and the real data are subjected to the Hilbert transform and Fourier transform to obtain the frequency domain waveform diagram, as shown in Figure 20. It can be seen that the frequency domain waveforms of the real data and the generated data are very similar. In summary, the generated data under rapid failure conditions can be used as experimental research data.

By training the bearing data in the three states and comparing it with the real data, it is found that the generated data and the real data are very similar to the real data in both the time domain and the frequency domain and can replace the real data for experimental research. When generating the bearing operation data in the normal state, the number of training times is 13600 times, the number of training times in the moderately degraded state is at least 4200 times, and the number of the rapid failure state (fault state) is 8800 times. At the same time, the generation network structure of the normal state and the moderately degraded state is 4 layers, and the number of generation network layers in the fast failure state reaches 6 layers.

5. Conclusion

This paper proposes a new way to generate bearing data based on the generative adversarial network method in order to solve fault diagnosis problems with insufficient data. Through the time-domain characteristic analysis of the rolling bearings life cycle monitoring data in actual operation, the actual bearing data are divided into three periods: normal state, moderate degradation state, and rapid failure state. The GAN-based generated network model is then studied for the construction and training. Comparing the generated data with the real data in the time domain and the frequency domain, respectively, and verifying the similarity between the generated data and the real data, the game generated data method shows its effectiveness and that it can provide new enlightenment for solving the insufficient data problem in the fault diagnosis field. At last, the pseudoreal data game generated by GAN model is verified highly similar to the real data and showed the possibility of its application in the fault diagnosis field.

In future work, the universality of the new data set will be further verified to ensure that various data sets can be successfully applied. Moreover, the more fault diagnosis model could be built using multi-category generating bearing data to verify the diagnosis effect.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was funded by Shenyang University of Aeronautics and Astronautics, project number (LJKZ0167), project leader Lin Huo.

References

V. Purushotham, S. Narayanan, and S. A. Prasad, “Multi-fault diagnosis of rolling bearing elements using wavelet analysis and hidden Markov model based fault recognition,” NDT & E International, vol. 38, no. 8, pp. 654–664, 2005.
View at: Publisher Site | Google Scholar
C. Junsheng, Y. Dejie, and Y. Yu, “Application of an impulse response wavelet to fault diagnosis of rolling bearings,” Mechanical Systems and Signal Processing, vol. 21, no. 2, pp. 920–929, 2007.
View at: Publisher Site | Google Scholar
H. Y. Yang, J. Mathew, and L. Ma, “Fault diagnosis of rolling element bearings using basis pursuit,” Mechanical Systems and Signal Processing, vol. 19, no. 2, pp. 341–356, 2005.
View at: Publisher Site | Google Scholar
K. L. Mao and Y. Wu, “fault diagnosis of rolling element bearing based on vibration frequency analysis,” in Proceedings of the Third International Conference on Measuring Technology and Mechatronics Automation, pp. 198–201, IEEE, Shanghai, China, January 2011.
View at: Google Scholar
X. S. Lou and K. A. Loparo, “Bearing fault diagnosis based on wavelet transform and fuzzy inference,” Mechanical Systems and Signal Processing, vol. 18, no. 5, pp. 1077–1095, 2004.
View at: Publisher Site | Google Scholar
J. D. Zheng, J. Cheng, and Y. Yang, “A rolling bearing fault diagnosis approach based on LCD and fuzzy entropy,” Mechanism and Machine Theory, vol. 70, pp. 441–453, 2013.
View at: Publisher Site | Google Scholar
F. Y. Cong, J. Chen, G. Dong, and F. Zhao, “Short-time matrix series based singular value decomposition for rolling bearing fault diagnosis,” Mechanical Systems and Signal Processing, vol. 34, no. 1-2, pp. 218–230, 2013.
View at: Publisher Site | Google Scholar
Z. P. Feng, X. Chen, and T. Wang, “Time-varying demodulation analysis for rolling bearing fault diagnosis under variable speed conditions,” Journal of Sound and Vibration, vol. 400, pp. 71–85, 2017.
View at: Publisher Site | Google Scholar
L. Ou, D. Yu, and H. Yang, “A new rolling bearing fault diagnosis method based on GFT impulse component extraction,” Mechanical Systems and Signal Processing, vol. 81, pp. 162–182, 2016.
View at: Publisher Site | Google Scholar
W. Zhang, G. Peng, C. Li, Y. Chen, and Z. Zhang, “A new deep learning model for fault diagnosis with good anti-noise and domain adaptation ability on raw vibration signals,” Sensors, vol. 17, no. 2, p. 425, 2017.
View at: Publisher Site | Google Scholar
C. Zhang, L. Xu, X. Li, and H. Wang, “A method of fault diagnosis for rotary equipment based on deep learning,” in Proceedings of the 2018 Prognostics and System Health Management Conference (PHM-Chongqing), pp. 958–962, IEEE, Chongqing, China, October 2018.
View at: Publisher Site | Google Scholar
Z. Y. Chen, A. Mauricio, W. Li, and K. Gryllias, “A deep learning method for bearing fault diagnosis based on Cyclic Spectral Coherence and Convolutional Neural Networks,” Mechanical Systems and Signal Processing, vol. 140, Article ID 106683, 2020.
View at: Publisher Site | Google Scholar
W. Fuan, J. Hongkai, S. Haidong, D. Wenjing, and W. Shuaipeng, “An adaptive deep convolutional neural network for rolling bearing fault diagnosis,” Measurement Science and Technology, vol. 28, no. 9, Article ID 095005, 2017.
View at: Publisher Site | Google Scholar
L. Guo, J. Chen, and X. Li, “Rolling bearing fault classification based on envelope spectrum and support vector machine,” Journal of Vibration and Control, vol. 15, no. 9, pp. 1349–1363, 2009.
View at: Publisher Site | Google Scholar
H. W. Shao, H. Jiang, X. Zhang, and M. Niu, “Rolling bearing fault diagnosis using an optimization deep belief network,” Measurement Science and Technology, vol. 26, no. 11, Article ID 115002, 2015.
View at: Publisher Site | Google Scholar
Y. Qin, X. Wang, and J. Zou, “The optimized deep belief networks with improved logistic sigmoid units and their application in fault diagnosis for planetary gearboxes of wind turbines,” IEEE Transactions on Industrial Electronics, vol. 66, no. 5, pp. 3814–3824, 2019.
View at: Publisher Site | Google Scholar
A. Creswell, T. White, V. Dumoulin, K. Arulkumaran, B. Sengupta, and A. A. Bharath, “Generative adversarial networks: an overview,” IEEE Signal Processing Magazine, vol. 35, no. 1, pp. 53–65, 2018.
View at: Publisher Site | Google Scholar
J. Antoni, “The spectral kurtosis: a useful tool for characterising non-stationary signals,” Mechanical Systems and Signal Processing, vol. 20, no. 2, pp. 282–307, 2006.
View at: Publisher Site | Google Scholar
D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp. 91–110, 2004.
View at: Publisher Site | Google Scholar
Y. Gao, X. Liu, and J. Xiang, “FEM simulation-based generative adversarial networks to detect bearing faults,” IEEE Transactions on Industrial Informatics, vol. 16, no. 7, pp. 4961–4971, 2020.
View at: Publisher Site | Google Scholar
F. Hua, “Rolling bearing anomaly detection based on generative adversarial networks,” Artificial Intelligence and Robotics Research, vol. 08, no. 04, pp. 208–218, 2019, in Chinese.
View at: Publisher Site | Google Scholar
M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein GAN,” 2017, https://arxiv.org/abs/1701.07875.
View at: Google Scholar
W.-N. Hsu, Y. Zhang, and J. Glass, “Unsupervised learning of disentangled and interpretable representations from sequential data,” Advances in Neural Information Processing Systems, pp. 1878–1889, 2017.
View at: Publisher Site | Google Scholar
T. Barszcz and A. JabŁoński, “A novel method for the optimal band selection for vibration signal demodulation and comparison with the Kurtogram,” Mechanical Systems and Signal Processing, vol. 25, no. 1, pp. 431–451, 2011.
View at: Publisher Site | Google Scholar
N. G. Nikolaou and I. A. Antoniadis, “Demodulation of vibration signals generated by defects in rolling element bearings using complex shifted Morlet wavelets,” Mechanical Systems and Signal Processing, vol. 16, no. 4, pp. 677–694, 2002.
View at: Publisher Site | Google Scholar
G. L. McDonald, Q. Zhao, and M. J. Zuo, “Maximum correlated Kurtosis deconvolution and application on gear tooth chip fault detection,” Mechanical Systems and Signal Processing, vol. 33, pp. 237–255, 2012.
View at: Publisher Site | Google Scholar
W. Zhang, X. Li, H. Ma, Z. Luo, and X. Li, “Universal domain adaptation in fault diagnostics with hybrid weighted deep adversarial learning,” IEEE Transactions on Industrial Informatics, vol. 17, no. 12, pp. 7957–7967, 2021.
View at: Publisher Site | Google Scholar
L. Wen, X. Li, and L. Gao, “A transfer convolutional neural network for fault diagnosis based on ResNet-50,” Neural Computing & Applications, vol. 32, no. 10, pp. 6111–6124, 2019.
View at: Publisher Site | Google Scholar
J. Jiao, M. Zhao, J. Lin, and K. Liang, “Residual joint adaptation adversarial network for intelligent transfer fault diagnosis,” Mechanical Systems and Signal Processing, vol. 145, Article ID 106962, 2020.
View at: Publisher Site | Google Scholar
W. Zhang, X. Li, H. Ma, Z. Luo, and X. Li, “Federated learning for machinery fault diagnosis with dynamic validation and self-supervision,” Knowledge-Based Systems, vol. 213, Article ID 106679, 2021.
View at: Publisher Site | Google Scholar
H. Shao, H. Jiang, F. Wang, and Y. Wang, “Rolling bearing fault diagnosis using adaptive deep belief network with dual-tree complex wavelet packet,” ISA Transactions, vol. 69, no. 69, pp. 187–201, 2017, in Chinese.
View at: Publisher Site | Google Scholar
O. Janssens, V. Slavkovikj, B. Vervisch et al., “Convolutional neural network based fault detection for rotating machinery,” Journal of Sound and Vibration, vol. 377, no. 337, pp. 331–345, 2016.
View at: Publisher Site | Google Scholar
R. S. Gunerkar, A. K. Jalan, and S. U. Belgamwar, “Fault diagnosis of rolling element bearing based on artificial neural network,” Journal of Mechanical Science and Technology, vol. 33, no. 2, pp. 505–511, 2019.
View at: Publisher Site | Google Scholar
D. Peng, Z. Liu, H. Wang, Y. Qin, and L. Jia, “A novel deeper one-dimensional CNN with residual learning for fault diagnosis of wheelset bearings in high-speed trains,” IEEE Access, vol. 7, Article ID 10293, 2019.
View at: Publisher Site | Google Scholar
I. Goodfellow, “NIPS 2016 Tutorial: Generative Adversarial Networks,” 2016, https://arxiv.org/abs/1701.00160.
View at: Google Scholar
K. F. Wang, C. Gou, Y. Duan, Y. Lin, X. Zheng, and F. Y. Wang, “Generative adversarial networks: introduction and outlook,” IEEE/CAA Journal of Automatica Sinica, vol. 4, no. 4, pp. 588–598, 2017.
View at: Publisher Site | Google Scholar
M. N. Fekri, A. M. Ghosh, and K. Grolinger, “Generating energy data for machine learning with recurrent generative adversarial networks,” Energies, vol. 13, no. 1, p. 130, 2019.
View at: Publisher Site | Google Scholar
N. Kriegeskorte and T. Golan, “Neural network models and deep learning,” Current Biology, vol. 29, no. 7, pp. R231–R236, 2019.
View at: Publisher Site | Google Scholar
P.-L. Dallaire-Demers and N. Killoran, “Quantum generative adversarial networks,” Physical Review A, vol. 98, no. 1, Article ID 012324, 2018.
View at: Publisher Site | Google Scholar
D. Saxena and J. Cao, “Generative adversarial networks (GANs),” ACM Computing Surveys, vol. 54, no. 3, pp. 1–42, 2022.
View at: Publisher Site | Google Scholar
S. Suh, H. Lee, P. Lukowicz, and Y. O. Lee, “CEGAN: classification enhancement generative adversarial networks for unraveling data imbalance problems,” Neural Networks, vol. 133, pp. 69–86, 2021.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2022 Lin Huo et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

429

Downloads

571

Citations

Computational Intelligence and Neuroscience

Safe and Fair Machine Learning for Neuroscience

A Generative Adversarial Network Based a Rolling Bearing Data Generation Method Towards Fault Diagnosis

Abstract

1. Introduction

2. Related Work

3. Generative Adversarial Network Method

3.1. Generative Adversarial Network

3.2. Objective Function

3.3. Objective Function Optimization

3.3.1. Optimal Discriminator

3.3.2. Optimal Generator

3.4. Training GAN

4. Case Study Based on GAN

4.1. Rolling Bearing Data Description

4.2. The GAN Network Construction

4.2.1. Network Structure

4.2.2. GAN Network Model Input and Output

4.2.3. Model Hyperparameters

4.3. The Bearing Game Generated Data Training Based on GAN

4.3.1. GAN-Based Training under the Normal State

4.3.2. Training under the Moderate Degradation State

4.3.3. Training under the Rapid Failure State

5. Conclusion

Data Availability

Conflicts of Interest

Acknowledgments

References

Copyright