Abstract

Under normal circumstances, bearings generally run under variable loading conditions. Under such conditions, the vibration signals of the bearing malfunctions are often nonstationary signals, which are difficult to process effectively. In order to accurately and effectively diagnose the failure types and damage degree of bearings under variable load conditions, an intelligent diagnostic model based on the variational mode decomposition (VMD) of quantum chaotic fruit fly optimization algorithm (QCFOA) and a multiclassification variational relevance vector machine (VRVM) is proposed. First, the key parameters of the VMD are selected using the QCFOA. Secondly, the known bearing fault signal is decomposed by the optimized VMD, and the center frequency and marginal spectral entropy (MSE) of each natural modal component are extracted to construct two-dimensional MSE. Then, the probit model is used to replace the logistic model, and a simpler and more practical multiclassification model is constructed. The two-dimensional MSE of each intrinsic modal component is used as a learning sample for VRVM. Finally, the bearing fault data under 1 hp load are taken as training samples, and the bearing fault data under two loads of 0 hp and 3 hp are used as test samples to verify the effectiveness of the intelligent diagnosis model. The experimental results show that the intelligent fault diagnosis method proposed in this paper can accurately diagnose the type of fault and the degree of damage and has high robustness.

1. Introduction

Rolling bearings, which are the core components of rotating machinery, operate under harsh conditions and are easily damaged. It is necessary to monitor the fault. Once a fault occurs, it may cause huge economic losses and personal safety problems. In recent years, with the continuous update of knowledge in the field of digital signal processing and machine learning, intelligent fault diagnosis technology has become a major development trend [1]. Intelligent diagnosis is essentially a pattern recognition process, including two important steps of fault feature extraction and fault identification. In particular, how to effectively extract weak fault characteristics in the fault signal is key to fault diagnosis.

The vibration signal analysis is widely used in the fault diagnosis of bearing. In general, the vibration signals of the rolling bearing fault are mostly nonstationary signals, and the method suitable for processing nonstationary signals should be adopted. Empirical mode decomposition (EMD) [2] as a powerful tool of nonstationary signal processing has received extensive attention from researchers concerned with mechanical fault diagnosis. The method of bearing fault feature extraction based on EMD has been widely applied [35]. Inspired by the EMD method, Smith et al in [6] proposed another adaptive signal decomposition method named the “local mean decomposition (LMD)” in 2005, which has aroused the attention of a large number of scholars [7]. The EMD and LMD are excellent self-adaptive processing methods for nonstationary and nonlinear signals. However, they have unavoidable deficiencies, such as the end effect, overshoot, and mode mixing. Recently, many scholars proposed optimization and improvement methods for such deficiencies. However, both of them were regarded as recursive mode decomposition methods, and inherent defects are difficult to solve fundamentally. Thus, a new adaptive signal processing method named the “variation mode decomposition (VMD)” was proposed by Dragomiretskiy and Zosso [8]. The method abandons the recursive decomposition mode and effectively alleviates or avoids a series of deficiencies in the EMD and LMD methods.

The excellent features presented by VMD have been used by scholars in the field of fault diagnosis [9]. However, an important feature of VMD is that the number of intrinsic modal components and their penalty parameters must be set in advance; once unsuitable parameter values are selected, the decomposition results will be seriously affected. Therefore, against the selection of key parameter values of VMD, some scholars introduced the particle swarm optimization algorithm to select the key parameters of VMD optimally and achieved certain results [10]. However, the traditional heuristic optimization algorithms, such as particle swarm optimization (PSO) [11], genetic algorithm (GA) [12], ant colony optimization (ACO) [13], and cuckoo algorithm (CA) [14], have some problems such as parameter dependence, computational complexity, convergence speed, and optimization accuracy, which restrict their practical applications. To solve this problem, this paper introduces the fruit fly optimization algorithm (FOA) [15] to optimize the key parameters of VMD. This algorithm is a new swarm intelligence algorithm based on the bionics principle of fruit fly foraging behavior. It is applied in many fields [1618]. However, its convergence accuracy is very sensitive to the initial value. Once the initial value is not selected properly, the search is likely to fall into a local optimum, and the convergence accuracy is low. This paper combines the quantum logistic chaotic mapping [19] and FOA and propose a quantum chaotic fruit fly algorithm in the three-dimensional space search, which strengthens the ergodicity, avoids the search process falling into the local optimal value, and improves the search efficiency. Then, using the local minimum value of the MSE as the fitness function, the quantum chaotic fruit fly optimization algorithm (QCFOA) is used to search two key parameters of VMD and obtain the optimal combination value of the key parameters. Furthermore, the optimized VMD is used to process the known fault signals, and the effective intrinsic mode function (IMF) component and its MSE are obtained.

The traditional diagnosis method usually trains the fault diagnosis model under a certain load and has great limitations in diagnosing the state of equipment under a certain load. In practical applications, many mechanical devices work under variable load conditions. Both the load and the damage degree will affect the amplitude of the fault characteristic frequency of the bearing. Therefore, the single MSE cannot effectively characterize the damage degree of the fault in the variable load condition (the radial load is mainly discussed in this paper). Because the bearing is running under different load conditions, the normal contact load between the rolling body and the raceway will change, which leads to the change in the natural vibration frequency of the bearing. To address these challenges, the center frequency is introduced in this paper, and the one-dimensional MSE is extended into two-dimensional MSE as the learning sample of the variational correlation vector machine. Then, the method is used to identify the various types of faults and the damage degree of the bearing under variable load.

The technology of intelligent diagnosis for mechanical faults is changing fast. Artificial neural network (ANN) and support vector machine (SVM), as intelligent recognizers, have received the most extensive attention, and a multitude of recent research efforts have been made to explore the mechanical fault diagnosis. However, the ANN algorithm requires a large number of training samples and has some inherent problems, such as black box operation, low generalization ability, and overlearning [2024]. Similarly, the SVM also has some unavoidable deficiencies. The method cannot get the probabilistic prediction and the uncertainty in prediction [2529].

RVM is a new machine learning algorithm based on support vector machine (SVM) and Bayesian theory framework [30]. Compared with the SVM method, it can directly give the uncertainty of the result while giving the diagnosis results. The RVM training process needs fewer parameters, and its solution is more sparse [31]. So, the probability output of RVM accords with the actual mechanical fault diagnosis process and has high applied research value. However, when the standard RVM is very limited in the size of the data sample, the computational cost of the training is very high. For this problem, Bishop [32] proposed a method of computing and solving RVM by means of the variational method, named the “VRVM.” In the case of very limited data samples, this method is better than the type II maximum edge likelihood estimation and can give the posterior distribution of parameters and hyperparameters. Compared with the standard RVM, the practicability and performance of the VRVM are better. In addition, VRVM is the same as the standard RVM, and its classification and regression are all mapped by logistic function. Therefore, when the regression problem is converted into a classification problem, the noise variable must be ignored, and the true value of the model cannot be accurately estimated. Therefore, the probit model is used instead of the logistic model in this paper, which makes the classification problem and the regression problem organically combined to avoid the approximate derivation of the logistic model from the continuous output to the discrete output mapping and to reduce the amount of computation.

Finally, the proposed method is used to diagnose the variable load fault data collected from the failure platform to verify the effectiveness and robustness of the method.

2. The Principle of VMD

In the VMD algorithm, the IMF is redefined as an AM-FM signal, which removes the loop iteration method used by the EMD algorithm. Instead, the signal decomposition process is transferred to the variational structure. By constructing and solving the constrained variational problem and decomposing the original signal into a specified number of IMF components, the construction process of the corresponding variational problem is summarized as follows.

For each IMF component , the following analytic signal is obtained through the Hilbert transform:

For each analytic signal, a central frequency is estimated, and the frequency of each analytical signal is transformed to the baseband by shifting frequency:

The Gaussian smoothing index of the frequency shift signal is used to estimate the bandwidth of each IMF component, and then the corresponding constraint variational model is expressed aswhere represents the decomposition of the IMF component, is the number, represents the frequency center of each component, is the input signal, and is the estimated center frequency. In order to obtain the abovementioned constraint variation problem, the augmented Lagrange function is introduced as follows:where is the quadratic penalty parameter and is the Lagrange multiplier. The saddle point of the augmented Lagrange function is obtained by using the alternating direction multiplier algorithm, which is the optimal solution of equation (2) constraint variational model, and the original signal is decomposed into narrowband IMF components. The solution procedure for the variational model is as follows (Algorithm 1).

 Initialize:, , ,
repeat:
   
   for do
   Update for all :
    
   Update :
    
   end for
   Dual ascent for all :
    
until convergence:

It is known from the variational model solution process that the performance of VMD is closely related to the decomposition parameters, such as the total number of modalities K and the secondary penalty .

The performance of VMD is very sensitive to the value of K. If the value of K is too small, the data will be undersegmented and some components will be contained in other modalities, and if the value of K is too large, problems such as modal copying will occur. If the value of is too small, the bandwidth of the modal component will be too large; some components will be included in other modal components, or additional “noise” will be captured; if the value of is too large, the bandwidth of the modal component will be too small and some components in the original signal will be lost.

Therefore, the design of an optimal VMD should be focused on how to obtain the optimal combination value of key parameters of the VMD.

3. VMD Optimization Based on Quantum Chaotic FOA

The VMD algorithm needs to set the number of IMF components in advance when processing signals. Different values will result in different decomposition results. In addition, the penalty parameter also has a great influence on the decomposition result. The smaller the , the larger the bandwidth of each IMF component; conversely, the bandwidth of the component signal is smaller. Therefore, parameters and are two critical parameters that affect VMD performance. In practical application, the components of the bearing fault signal are very complex, and the selection of suitable parameters and is key to the effective extraction of bearing fault characteristics by the VMD algorithm. Meanwhile, and have interactive effects on the performance of VMD. Therefore, a swarm intelligence algorithm which can optimize parameters and is needed to avoid the contingency and blindness of manually setting parameters.

3.1. Quantum Chaotic Mapping

Quantum chaotic system has the natural properties of the classical chaotic system [33]. For the same classical chaotic system, different quantitative criteria can produce different quantum chaotic maps. In the work of Goggin [34], the classical logistic system is quantified by the recoil rotor model, and the corresponding quantum logistic mapping is obtained. The definition is as follows:where is a chaos control parameter, is a dissipative parameter, , , and are the state values of the system, and and are complex conjugation of and , respectively. If the initial value in the chaotic system is real, then the chaotic sequence generated by the system is real, and there are and . In [3537], it is proved that the pseudorandom sequence based on the quantum chaotic mapping not only has all the advantages of the traditional chaotic system but also has a weaker correlation and stronger ergodicity than the traditional chaotic system.

3.2. Quantum Chaotic FOA

The advantage of the FOA is that it is easy to understand, simple to search, and easy to implement. Therefore, it is widely used in parameter optimization problems [38, 39]. In this paper, a quantum logistic chaotic map is used to extend the search space of the FOA into a three-dimensional search space, and the location of the fruit fly group is initialized by the better characteristics of nonperiodicity, ergodicity, and class randomness and more sensitivity to system parameters and initial conditions than traditional chaotic systems. The method improves the diversity of population and strengthens the ergodicity of the search, avoiding the premature convergence of the search process to the local optimum and improving search efficiency. The sketch map of its three-dimensional search space foraging is shown in Figure 1.

The steps of the QCFOA are shown as follows:

Step 1. Initialize the fruit fly swarm location = with random function randomly and parameters of the first iteration, including the maximum number of iterations , number of fruit flies , chaos control parameter , and dissipation parameter .

Step 2. Update the position of each fruit fly by using equation (5):where represents the random variable of uniform distribution, . Substituting equation (5) into of equation (6), we can get the following:where

Step 3. Each component from Step 2 is mapped to the value of odor concentration judgment via equation (7):

Step 4. Calculate odor concentration values based on fitness function:

Step 5. Select maximum odor concentration:

Step 6. Update maximum odor concentration:

Step 7. Check the termination condition: this step compares the current maximum odor concentration to the previous maximum odor concentration. When the current maximum concentration is no longer superior to the previous one, or the current iteration is equal to the maximum number of iterations, the iteration process is terminated. Otherwise, execute Step 2.

3.3. VMD Model Based on Quantum Chaotic FOA
3.3.1. The Fitness Function

The decomposition capability of the VMD is heavily determined by the selected parameters and . For nonstationary and nonlinear signal processing, it is not feasible to search the optimal parameters of artificially. In the study, the optimization algorithm of quantum chaotic fruit fly is used to search the optimal parameters of VMD. But, before the optimization, a fitness function needs to be determined.

Information entropy is a measure of uncertainty in information quality, which represents the average uncertainty of signals. The greater the entropy, the greater the uncertainty of the signal and the more complex the signal. This paper defines the VMD marginal spectral entropy of the signal based on the information entropy, which can represent the uncertainty of the signal in the frequency domain and measure the complexity of signal frequency.

Hilbert transformation for each IMF component of VMD is as follows:

The analytic signal is constructed as follows:and we can obtain the following:where is the number of IMF components, is the instantaneous amplitude, is the instantaneous phase, and represents the instantaneous frequency.

The amplitude and frequency of the Hilbert transform are time-domain functions. The amplitude of the signal can be expressed as a function of time and frequency in the three-dimensional space, and it is the Hilbert amplitude spectrum:

The marginal spectrum of the signal can be obtained by the time integral of :

The variation rule of amplitude with frequency is described by equation (15). The marginal spectrum entropy of the signal is defined as follows:where is the probability of the corresponding amplitude of the ith frequency and is the marginal spectrum of the ith IMF component. In order to facilitate analysis, the marginal spectrum entropy value is normalized: , and the value range is [0,1]. L determines the length of sequence.

3.3.2. Proposed Improved Algorithm Framework

When the bearing is in fault, the vibration signal mainly appears as a periodic impact signal. Therefore, when using the VMD algorithm to deal with bearing fault signals, multiple IMF components will be obtained. According to the definition of marginal spectrum entropy in equation (18), if the IMF component contains more noise components, its corresponding MSE value is larger. Conversely, if the IMF component mainly contains the periodic impact component of bearing failure, the marginal spectrum entropy of VMD is very small. Therefore, the minimum value of the marginal spectrum entropy is expressed as a local minimum, and the corresponding IMF component is taken as the optimal component. The local minimum marginal spectral entropy (LMMSE) of the VMD is shown as follows:

In this study, the LMMSE of the VMD is taken as the fitness value in the optimization process, and the global optimal IMF component is searched. The local minimum value of the marginal spectrum entropy is used as the final optimization target. The optimization process is the same as the FOA, and the process is carried out as follows:

Step 8. Initialize the fruit fly swarm parameters of the first iteration, including the fruit fly group number , number of iterations , and maximum iterations .

Step 9. The parameters in equation (3) are set as = 3.99 and ≥ 6.

Step 10. Generate two large groups of and randomly by using equation (7), and the number of fruit flies in each group is . The positions and of each fruit fly are then continuously updated using equation (7).

Step 11. Calculate according to equation (5), and it is represented by and . Set and , and is represented by and . Get the odor concentration by the fitness value, and obtain the best smell concentration value and the corresponding location .

Step 12. Enter iterative optimization to repeat the implementation of Step 3, .When the current smell concentration is not superior to the previous iterative smell concentration any more or the iteration number reaches the maximum number of iterations , the circulation is terminated, and return optimal combination parameters . Otherwise, carry out Step 3.

3.4. Simulations and Analysis
3.4.1. Simulation Signal

In practice, rolling bearings generate periodic impact signals when pitting or cracking occurs. But in the early stage of failure, the impact signal is very weak and is easily drowned by noise, so it is difficult to find fault characteristic frequency in traditional signal analysis methods. This paper proposes a quantum chaotic fruit fly algorithm to search the optimal parameters of VMD, and the flow chart of algorithm is shown in Figure 2. In order to qualitatively and quantitatively analyze the validity and superiority of this method, the simulation signal of the early fault of the bearing under the simulated strong noise background is analyzed [30], and the expression of the simulation signal is as follows:where is an impact simulation signal with periodic pulse attenuation with a frequency of 12 Hz and maximum amplitude of 0.5 V, is a cosine combined signal with a frequency of 35 Hz and 15 Hz, and is a Gauss white noise signal. The time-domain waveform of the simulation signal is shown in Figure 3.

From the simulation signal of early fault of the bearing in Figure 3, it can be seen that the impact attenuation signal in the early stage of simulated bearing failure is almost submerged by low-frequency components and strong noise due to its small amplitude.

3.4.2. Parameter Optimization Analysis

In order to verify the effectiveness and superiority of the improved chaotic FOA parameter optimization proposed in this paper, the performance is compared with that of classical PSO, QPSO, FOA, and CFOA. In this experiment, the number of fruit flies is set as = 30 and the maximum number of iterations is = 200. Other parameters are selected according to relevant literature to ensure the best results of each algorithm. The combination parameters are optimized by the above method. Figure 4 and Table 1 show that different algorithms have different effects on the solution.

From Table 1, it can be observed that the QCFOA method achieves the minimum number of iterations and the best global optimal solution. The searched VMD parameter combination is [690, 4], and the reconstruction error is 1.02 × 10−5. Figure 4 demonstrates the iterative process using PSO, QPSO, FOA, CFOA, and QCFOA. As shown in Figure 4, the convergence speed of the QCFOA algorithm is the fastest and the reconstruction error of the signal is the smallest.

In the search process, the LMMSE changes with the evolution of the population, and the corresponding VMD optimal parameter combination is shown in Table 1.

From Table 1, it can be observed that the QCFOA method achieves the minimum number of iterations and the best global optimal solution. The searched parameter combination of VMD [] is [690, 4], its reconstruction error is 1.02 × 10−5, and it has the least number of iterations. Figure 4 demonstrates the search process by using PSO, QPSO, FOA, CFOA, and QCFOA. The experimental results indicate that the convergence speed of the QCFOA is faster than others. Therefore, the experimental results indicate that the proposed method is more effective and superior than the mentioned optimization algorithms.

3.4.3. Comparison and Analysis of EEMD, LMD, and VMD Methods

In order to validate the effectiveness and superiority of the VMD method based on the QCFOA, EEMD, LMD, and VMD methods are used, respectively, to process the simulation signals in Figure 3, and the results are shown in Figure 5.

From Figure 5, it can be seen that the EEMD and LMD methods can effectively extract the characteristic frequencies of the low-frequency cosine components in the simulation signals including 25 Hz and 15 Hz but cannot extract the characteristic frequency of the weak impact signal of 12 Hz, whereas there is modal mixing which is an unavoidable deficiency in the decomposition of EEMD and LMD. The low-frequency cosine components of 25 Hz and 15 Hz appear in different components. In addition, the frequency amplitude of the extracted 12 Hz impact signal is also relatively weak. The VMD method based on the QCFOA proposed in this paper not only can effectively extract the characteristic frequency of the low-frequency cosine components of 25 Hz and 15 Hz but also can effectively extract the characteristic frequency of the weak impact signal of 12 Hz and its corresponding doubling frequency. The number of signal components decomposed by the improved VMD method is also significantly less than that obtained by EEMD and LMD. The experimental results show the better effectiveness and superiority of the proposed method.

4. Kernel Parameter Self-Optimization Variational Relevance Vector Machine

4.1. Variational Relevance Vector Machine

Many problems in machine learning belong to the category of supervised learning, giving an input vector and the corresponding target output . For regression problems, they can be arbitrary values. For classification problems, they are class labels. For regression, can be any value, and for classification, is the category label.

For the standard RVM regression model, the formula is defined as follows:where , is the weight parameter of the model, is the kernel function, and an RBF kernel function is selected in this section: , . represents the noise, which obeys normal distribution .

If the input vectors are independent and identically distributed, then the likelihood function of the corresponding target output is defined aswhere the parameter is Gaussian prior distribution and is noise variance:where is a hyperparameter vector, and each weight value is independently assigned a parameter . In order to make parameter learning more flexible, ultra-prior distribution is defined for and noise variance , respectively. The appropriate prior distribution is the gamma distribution:where is the gamma function, and it is usually defined that the hyperparameter is a very small value such as = = = = 10−4; such a hyperparameter before does not provide information for posterior learning, so the posterior depends entirely on the data.

When the model is established, the posterior distribution of , , and can be obtained by the variable Bayesian method [40]. In the process of iterative solution, most of tends to infinity and the corresponding is zero, realizing the sparse model.

RVM classification and regression essentially use the same framework model, except that the conditional distribution of the target value is changed. For the binary classification, the logistic function is used in the continuous latent variable , and assuming that is the Bernoulli distribution, the likelihood function is as follows [41]:

It is important to note that, in the classification problem without considering the noise variable , it is very difficult to directly use the variational method to solve the above model. In [28], a lower bound is introduced by using the inequality:where , , is a variational parameter, and when =, equation (26) is established. Finally, the variational method is used to solve the lower boundary.

The VRVM method is the posterior distribution of RVM model parameters and superparameters by variational Bayesian (VB) function. In the VB method [42] based on the RVM model, the observed variables are and , the hidden variables are , and the parameter is ; therefore, the logarithmic edge likelihood function can be written aswhere is the joint probability distribution function between the hidden variable and parameter. The variational Bayesian algorithm assumes that , , , and are independent of each other; therefore, the joint probability distribution of four variables can be written approximately as

Following the assumption of equation (28), equation (21) can be rewritten as follows:

It can be seen from equation (29) that the log-likelihood function has a lower bound and that the real value can be approximated by maximizing the lower bound . Then, the lower bound of the log-likelihood is solved using EM (expectation-maximization), and the posterior distribution of the hidden variable and all parameters is obtained. The lower bound expression is as follows:

Because the variable distributions of , , , and are all conjugate prior distributions, they have the same distribution form as their posterior distribution, and the following is their probability distribution:where represents truncated normal distribution, and the direction of truncation or is determined by .

According to the theory of the EM algorithm, the posterior distribution of the variables is actual expectation of the logarithm of the complete likelihood function with respect to other variables (indicated by ), and then the terms related to the variable are extracted. The posterior distributions of , , , and are, respectively, solved as follows:

Step 13. Extract the term related to variable in equation (30):Set the derivative of in equation (35) to be 0; hence,Solve equation (8) and get the following equation:

Step 14. Extract the term related to variable in equation (30):Set the derivative of in equation (38) to be 0. Hence,

Step 15. Extract the term related to variable in equation (29):Set the derivative of in equation (40) to be 0. Hence,

Step 16. Extract the term related to variable in equation (29):Set the derivative of in equation (42) to be 0. Hence,Through the process above, the parameter iteration formulas in each variable’s posterior distribution function are obtained, but they are expressed by the expectations of other variables. Therefore, the expectation of these parameters needs to be found [42].
For equation (32), determines the truncation direction of the truncation normal distribution; hence,where is a normal probability density function Moreover,The function is defined as follows:After the model is trained, for a test sample , its probability of prediction can be calculated as follows:where is a normal cumulative integral distribution function. According to the prediction probability, the test sample can be classified and identified. When , the test sample is judged as a classification; otherwise, it is judged as another class.

4.2. Introduction of Probit Model

The binary logistic model of standard RVM is

The logistic function is a mapping from continuous variables to binary output tn. Logistic mapping function is easy to understand, but it is not a standard probability function; there are many difficulties in the process of reasoning. In addition, the traditional RVM classification model introduces a lower bound to the likelihood function, which is an approximate derivation. Therefore, the real value of the model cannot be estimated accurately. To solve this problem, a mapping method from continuous quantity to discrete quantity through the probit model is defined as follows:where the hidden variable is a continuous random variable hidden behind . Based on the requirement of the model, the target value is assumed to be −1 or +1. The probability relationship is as follows:where is an indicator function. By integrating , it can be found thatwhere is a normal cumulative distribution function.

The advantage of the probit model is to transform the problem of binary-classification output into a regression problem by introducing hidden variables. It makes the models of classification and regression completely equivalent, and the noise variable must be ignored by using the logistic model. It can be seen form Figure 6 that the probit model approximates the logistic model very well. Therefore, the reasoning algorithm based on the probit model can be flexibly applied to the classification model directly. In addition, the use of the probit model can also easily extend the binary classification to multiple classifications [43]. Compared with the multivariate logistic model [44], the multivariate probit model can also avoid complex approximate calculations, has more simple and practical characteristics, and can be well approximated to the logistic model.

5. Analysis of Bearing Fault Diagnosis

5.1. Experimental Scheme

In order to further verify the effectiveness of the proposed method for bearing fault diagnosis under unknown variable load conditions, the experiments are conducted to collect vibration signals from bearings operating under variable load conditions. The experimental platform is shown in Figure 7; it uses bearing type 6203-2RS JEM SKF deep groove rolling bearing. The bearing inner diameter is 25 mm, outer diameter is 52 mm, and thickness is 15 mm; the rolling diameter is 8.18 mm, and the pitch diameter of the bearing is 44.2 mm; the sampling frequency is 12 kHz, and the sampling data length is 12000. The fault with different etch diameters (simulating varying degrees of damage to the inner ring, the outer ring, and the rolling body) is processed by electric spark. Bearing loads include 3 types: load 0 (0 N, 1797 r/min, simulated no load), load 1 (800 N, 1772 r/min, simulated light load), and load 2 (1600 N, 1750 r/m, simulated heavy load). The pitting diameter of the bearing is 0.1778 mm (it simulates minor fault), 0.3556 mm (it simulates medium fault), and 0.5334 mm (it simulates serious fault), which is used to simulate three different damage degrees of the bearing.

5.2. Fault Feature Extraction

To further verify the effectiveness of the VMD method based on the quantum chaotic FOA in the early bearing fault feature extraction, this paper divides the fault feature extraction into two cases: the fault characteristics of the minor faults under different loads and the fault characteristics of different damage faults under the same load.

5.2.1. Fault Feature Extraction of Weak Damage Degree under Different Loads

Under three different loads, the measured signals of small bearing failure (pitting diameter 0.1778 mm) and small features of the rolling body fault are used as experimental signals. The fault signals are shown in Figure 8. The parameter of the VMD method optimized by the quantum chaotic FOA is used to search the optimal value, and the search results are shown in Table 2.

The optimal IMF component and its corresponding marginal spectrum and center frequency for the small faults of rolling bodies under three different loads are shown in Figures 9(a)9(c), respectively. represents the rotor frequency, and represents the fault characteristic frequency of the rolling element. It can be seen from Figure 9 that the weak fault characteristic frequencies can be effectively extracted under the three loads, the frequency amplitude decreases with the increase of the load, and the bearing fault feature frequency amplitude under load 2 hp is minimum. In addition, it can be seen from Table 2 that the center frequency of the slight fault of the rolling body is also different under different loads. To further validate the effectiveness of VMD based on the QCFOA, the EEMD and LMD methods are used to deal with the rolling body fault signal with the pitting diameter of 0.1778 mm under load 2 hp, respectively. It can be seen from Figure 9 that the characteristic frequency of the bearing fault obtained by the EEMD method is very weak and almost drowned by other frequency components. The LMD method has improved relative to the EEMD method, but the fault characteristic frequency is still very weak. Compared with the VMD method in Figure 10, the characteristic frequency of the bearing fault is superior than that of EEMD and LMD. Therefore, the above experimental results indicate that the VMD method using the quantum chaotic FOA can accurately extract the characteristic frequency of the weak rolling body fault under heavy loads, and the validity of the proposed method is also verified.

5.2.2. Fault Feature Extraction of Different Defects under the Same Load

The experiment is conducted on a bearing with different degrees of fault, and the collected vibration signals shown in Figure 9 are of a bearing with different defects.

It can be seen from Figure 11 that the frequency of fault features with different damage degrees under the same load can be effectively extracted, the amplitude of the frequency increases with the increase of the degree of rolling damage, and the frequency amplitude of the minor damage fault features is the least. The center frequencies of rolling body fault with different damage degrees used for the experiment are given in Table 3, and the changes are minor.

5.3. Selection of Eigenvectors

The marginal spectrum can accurately reflect the distribution of the actual frequency components of the signal and the degree of uncertainty of the signal spectrum. The smaller the marginal spectral entropy of the signal is, the greater the concentration of the energy spectrum of the signal is and the more concentrated the marginal spectrum of the signal is. The analysis of the marginal spectrum energy can effectively reflect the working state of the bearing. From Table 2, it is seen that there is a small difference in the LMMSE of the same fault type and the fault level at different loads, but the difference in their center frequency values is large. It can be seen from Table 3 that the LMMSE under the same load is significantly different for the same fault type and different fault degrees, but the difference between their central frequencies is small. The experimental data show that the magnitude of load and the degree of fault have a certain effect on the marginal spectrum entropy of failure. Therefore, the marginal spectrum entropy of a single IMF component is difficult to accurately characterize the degree of fault under variable load conditions. For the above reasons, the center frequency is combined with the basis of one-dimensional MSE to extend it into two-dimensional MSE in this paper, and its central frequency can be obtained through equation (2).

In order to verify the effectiveness and robustness of the proposed method under variable load conditions, the experimental data under 1 hp load are used as a training sample, with 0 hp and 2 hp representing the unknown load, and the experimental data are used as a test set. The length of each data sample is 4096 points. The detailed data description is shown in Table 4.

5.4. Fault Diagnosis
5.4.1. The Process of Intelligent Diagnosis

First, the sample data are decomposed by VMD, and the corresponding IMF components are obtained. The two-dimensional MSE synthesized by marginal spectrum entropy and the central frequency corresponding to each IMF component is taken as the eigenvector. Then, the most commonly used RBF kernel function is selected as the kernel function of VRVM. The parameters of the kernel function are optimized through 10-fold cross-validation. The detailed diagnosis process is shown in Figure 12.

In Figure 12, three multiclassifiers of VRVM are used. Although the probit model is proposed in this paper, it can easily extend the binary classification to the multiple classification and avoid complex approximate calculation. However, as the number of classes increases, the Hessian matrix in the process of constructing the model will also increase, resulting in an increase in computational complexity. Therefore, in this paper, three VRVM multiclassifiers are constructed and a multiclassification intelligent diagnosis model is constructed based on the combination strategy of “maximum probability win”.

5.4.2. Diagnostic Results

In order to verify the validity and robustness of the proposed method in intelligent fault diagnosis under variable load conditions, the fault data of different damage degrees under the same load and the fault data with slight damage under different radial loads are tested for the diagnosis model in this section. The three different levels of damage are as follows: (1) minor damage, (2) moderate damage, and (3) severe damage. The three different radial loads are 0 hp, 1 hp, and 2 hp, respectively. 0 hp represents the radial load force of 0 N, 1 hp represents the radial load force of 400 N, and 2 hp represents the radial load force of 800 N.

According to the proposed bearing fault intelligent diagnosis procedure, the diagnosis results of bearing outer race damage with different degrees under 0 hp load are shown in Figure 13(a). It can clearly distinguish the running data of bearing outer race under normal and three different damage degree conditions. Similarly, the diagnosis results of bearing rolling balls under 1 ph load and bearing inner race under 2 hp load are shown in Figures 13(b)13(c), respectively. Under different loading conditions, the fault diagnosis results of the bearing inner ring, rolling body, and outer ring with slight damage are shown in Figure 14.

According to the proposed procedure shown in Figures 13 and 14, fault types and degrees can be effectively identified under variable loads with the intelligent diagnosis method. The fault recognition rate is 100%, and the fault degree recognition rate also reaches 90%; the overall recognition rate is 95%, which proves the validity of the method. Meanwhile, VRVM is compared with other classification methods in terms of classification accuracy and running time. The results are shown in Table 5. The experiment results show that the VRVM method has a good performance in classification accuracy, but the running time is longer. In practice, equipment fault samples are often scarce, and it is even more difficult to get data of different fault types under the same load. It is a practical significance to identify fault types and fault levels under unknown loads by using fault data under known loads.

6. Conclusions

In this paper, the optimal variational mode decomposition (VMD) based on the quantum chaotic fruit fly optimization algorithm and variational relevance vector machine (VRVM) are combined as a hybrid method to diagnose the bearings’ fault under variable load conditions. The results of the experiment and application demonstrate the superiority of the proposed method. The conclusions are summarized as follows:(1)The VMD method is a new adaptive signal processing method. In the process of bearing fault signal processing, its performance is influenced by the two parameters such as the number of its own components and the penalty factor. Therefore, using the quantum chaotic fruit fly optimization algorithm to filter the two key parameters of the optimal value can guarantee the effectiveness and reliability of VMD performance.(2)FOA is a new swarm intelligence optimization algorithm. Its principle is simple and easy to implement and has strong local search capability; however, its global search is weak. If the initial value is not set properly, it will easily fall into local minimum, thus losing population diversity and premature convergence. Compared with the traditional chaotic system, the quantum chaotic system has the characteristics of better aperiodicity, ergodicity, and class randomness and more sensitivity to system parameters and initial conditions. Therefore, the location of the fruit fly population is initialized by the proposed quantum chaotic system, which can improve the diversity of the population, strengthen the ergodicity of the search, avoid the search process falling into the local optimal value prematurely, and improve the search efficiency.(3)The posterior distribution of all parameters and superparameters is obtained by using RVM. Then, the probit model is used to replace the logistic model in the original variational RVM classification so that the classification and regression are organically combined. The approximate deduction of the logistic model from continuous output to discrete output mapping is avoided so that the reasoning algorithm of the RVM regression model can be directly applied to the classification model.(4)Usually, the fault diagnosis model is trained by the traditional diagnosis method under a specific load, and it has a great limitation. In practice, most mechanical equipment work under variable load conditions. A single marginal spectrum entropy cannot effectively characterize the degree of the fault under variable load conditions. Therefore, by introducing the central frequency of VMD, the one-dimensional marginal spectrum entropy is extended to two-dimensional marginal spectrum entropy as the learning sample of VRVM.(5)To develop an intelligent fault diagnosis model which is a systematic problem, it includes experimental condition design, feature extraction, feature selection, and model training, and each step will affect the validity of the final model. The method proposed in this paper provides an effective diagnostic strategy for multiclass faults and fault degree diagnosis under variable load conditions. However, identification of the accuracy and running time of bearing damage recognition needs to be improved in this paper, which is also a problem that needs further study in the later stage.

Data Availability

All data included in this study are available upon request by contacting the corresponding author.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Nos. 61174106 and 51707079), Research Project of Hubei Provincial Department of Education, China (No. B2016006), and Hubei Provincial Natural Science Foundation of China (No. 2016CFB463).