Research Article  Open Access
Tao Wu, Chang Chun Liu, Cheng He, "Fault Diagnosis of Bearings Based on KJADE and VNWOALSSVM Algorithm", Mathematical Problems in Engineering, vol. 2019, Article ID 8784154, 19 pages, 2019. https://doi.org/10.1155/2019/8784154
Fault Diagnosis of Bearings Based on KJADE and VNWOALSSVM Algorithm
Abstract
In order to accurately diagnose the faulty parts of the rolling bearing under different operating conditions, the KJADE (Kernel Function Joint Approximate Diagonalization of Eigenmatrices) algorithm is proposed to reduce the dimensionality of the highdimensional feature data. Then, the VNWOA (Von Neumann Topology Whale Optimization Algorithm) is used to optimize the LSSVM (Least Squares Support Vector Machine) method to diagnose the fault type of the rolling bearing. The VNWOA algorithm is used to optimize the regularization parameters and kernel parameters of LSSVM. The lowdimensional nonlinear features contained in the multidomain feature set are extracted by KJADE and compared with the results of PCA, LDA, KPCA, and JADE methods. Finally, VNWOALSSVM is used to identify bearing faults and compare them with LSSVM, GALSSVM, PSOLSSVM, and WOALSSVM. The results show that the recognition rate of fault diagnosis is up to 98.67% by using VNWOALSSVM. The method based on KJADE and VNWOALSSVM can diagnose and identify fault signals more effectively and has higher classification accuracy.
1. Introduction
Rolling bearings are widely used in modern machinery and are one of the most vulnerable mechanical parts in rotating machinery [1]. Sometimes it is possible to determine the safety of the entire machine due to the state of motion of the bearing. Serious accidents may occur, directly threatening the safety of life and property, so researchers pay close attention to these accidents [2]. In order to reduce the damage caused by bearing failure, fault detection is very necessary and has great practical value [3]. At this stage, with the continuous development of artificial intelligence algorithms, optimization problems, and improved algorithm convergence, as well as improving system reliability and stability [4, 5], scholars in various fields have carried out indepth research [6]. However, mechanical failure is a complex nonlinear timevarying system that is difficult to understand and estimate [7, 8], and its failure process is more complicated. How to perform rapid fault detection and diagnosis through optimization algorithms for specific mechanical faults has always been a challenge. Therefore, indepth and scientific research on the rotating mechanical bearing parts will have great value and significance in industrial production or daily life [9–11].
1.1. Literature Review
The traditional bearing fault diagnosis process based on feature extraction mainly includes four steps: vibration signal acquisition, signal preprocessing, feature extraction, and fault identification. In the traditional bearing fault diagnosis method, the three steps of signal preprocessing, feature extraction, and fault identification are closely connected [12]. If any of the steps are not satisfactory, it will affect the final recognition effect. Since the 21st century, with the tremendous improvement of computer computing power, hardware support for deep neural network training has been provided. At present, deep learning has become the research object of many scholars and has been successfully applied in the fields of image recognition [13], target detection [14], and computer vision [15]. Deep learning has the advantage of automatically extracting features, eliminating the need for cumbersome artifacts. Many scholars at home and abroad have used deep learning in bearing fault diagnosis research [16, 17] to achieve better results. For instance, Manjurul and Jong [18] used a discrete wavelet packet transform combined with an adaptive learning method to train CNN bearing fault diagnosis methods to improve the performance of the network and achieve highprecision diagnosis under multiple fault types. Du and Yang [19] proposed a method for demodulating the fault signal of a bearing based on the wavelet transform and Hilbert transform. The phase compensation method is used to refine the envelope spectrum to improve the frequency of the envelope spectrum. The resolution makes the recognition accuracy of bearing fault diagnosis improved. Shao et al. [20] proposed a bearing fault diagnosis method based on a doubletree complex wavelet packet transform and DBN. The dualtree complex wavelet packet transform is used to refine the feature information, and the diagnostic effect is improved on the basis of the original DBN.
In the aspect of mechanical fault diagnosis based on empirical mode decomposition, Ali et al. [21] proposed a new fault diagnosis method based on EMD decomposition and energy entropy feature extraction. Firstly, multiple inherent modal components of the bearingfault signal were obtained by EMD. The energy entropy characteristics of each component are obtained, and the fault type diagnosis is realized by the artificial neural network. Liu et al. [22] provided new ideas for the detection and fault diagnosis of rolling bearingvibration signals. They use the EMD method to decompose the noisereduced bearingfault signal and calculate the AR spectrum characteristics of each intrinsic modal component to achieve bearing fault type diagnosis. Yang et al. [23] obtained the envelope spectrum of multiple scale components of the bearingfault signal through empirical mode decomposition. The amplitude ratio characteristics are calculated for the envelope spectrum and used as feature vectors and are classified into classifiers for classification. Finally, an effective diagnosis of the inner ring and outer ring of the rolling bearing is realized. The SVM (Support Vector Machine) method [24] is a very important identification method in mechanical fault diagnosis. It is proposed for machine learning problems under finite samples and can effectively solve problems such as small samples and nonlinearities. Chen [25] uses JADE combined with EMD (Empirical Mode Decomposition) spectral correlation to extract the characteristic parameters reflecting the type of bearing fault. Then, the combination of the processed data and the strong generalization can realize the recognition of different fault types for the support vector machine (SVM) and finally achieved good results. He [26] extracted the time domain characteristics of the bearingvibration signal, the frequency domain characteristics, and the energy characteristics of the wavelet packet to form a multidomain feature set. Then, used KJADE to effectively extract the lowdimensional nonlinear features contained in the multidomain feature set and finally combine the SVM for fault diagnosis and achieve good results. Shao et al. [27] used the original vibration signal to train the automatic encoder, which significantly improved the diagnostic accuracy compared with traditional fault identification methods such as BPNN, SVM, and Boosting. Yan and Jia [28] proposed a support vector machine (SVM) fault classification algorithm based on multisource features. This method introduces the Laplacian fractional algorithm to filter redundant information, select sensitive features, and finally use PSOSVM to achieve identification of multiple fault conditions of the bearing. Saimurugan et al. [29] used the decision tree to select the optimal statistical characteristics of the vibration signal and then used the support vector machine to realize fault classification. In the fault type discrimination, a better diagnosis classification effect was obtained.
Past research has made significant progress, but there are some limitations. The fault diagnosis is essentially a pattern recognition problem, and deep learning can be used for pattern recognition by extracting the features layer by layer, which can be largely rid of relying on various advanced signal processing techniques and heavy artificial feature extraction. Therefore, the fault diagnosis method improved by the conventional method has achieved a large room for improvement and the effect is also good. Therefore, the study of reasonable methods and optimization algorithms is particularly important for the realization of intelligent and efficient diagnosis of rolling bearings.
1.2. Motivation and Contributions
The main purpose of this study is to extract the nonlinear characteristic parameters of normal bearing, inner ring fault, outer ring fault, rolling element fault, and three fault levels (a total of ten bearing states) under various fault modes. Then, the KJADE algorithm is used to feature the extracted original highdimensional feature matrix to extract the effective features that are more sensitive to the bearing state. Finally, a fault diagnosis method for rolling bearing based on the improved whale algorithm is proposed to effectively identify bearing faults. The diagnostic results of LSSVM, GALSSVM, PSOLSSVM, WOALSSVM, and VNWOALSSVM are compared, respectively, to verify the effect of the improved whale algorithm on fault diagnosis accuracy and diagnostic recognition rate of rolling bearings. Three contributions have been made.
1.2.1. Experimental Study on Rolling Bearing Fault
This study used Case Western Reserve University’s bearing test data as the verification object. The experimental data of the model SKF6205 deep groove ball bearing were used as the simulation data. The experimental data are processed into different points of failure at different positions of the bearing by electric spark. The vibration signal of the bearing is recorded by the acceleration sensor at a sampling frequency of 48 kHz. The torque sensor is mainly used to monitor the speed of the bearing at a speed of 1750 rpm. The load motor provides a load to the bearing. The bearing failure damage diameters are 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm), respectively. By taking the bearing test data as the original input signal, the bearing normal, inner ring fault, outer ring fault, and rolling element fault are extracted. Finally, the fault diagnosis research and analysis of the nonlinear characteristic parameters of three fault levels (a total of ten bearing states) under different fault modes are carried out.
1.2.2. Modeling
Vibration signal feature extraction is an important step in the research of rolling bearing fault diagnosis. It is essential to extract the most useful features from the vibration signal. The extracted feature is a highdimensional feature matrix, but there will still be some useless signals in it. Therefore, how to eliminate the redundancy between features and reduce the complexity of decision making is very important. In terms of fault diagnosis, due to some problems such as the convergence and accuracy of the optimization algorithm, the final fault diagnosis accuracy and fault recognition rate are not very high. This paper attempts to extract features from the time domain, frequency domain, and timefrequency domain of the vibration signal. Then, the algorithm is used to reduce the dimension of the feature. Finally, the Von Neumann topology is used to improve the WOA and optimize the LSSVM regularization parameters and the parameters of the nuclear parameters to diagnose the rolling bearing fault.
1.2.3. ModelBased Fault Diagnosis
A method based on KJADE and VNWOALSSVM is proposed to diagnose the rolling bearingfault signal. Firstly, the multidomain feature set of nonlinear feature parameters is obtained by extracting timefrequency domain features from time domain, frequency domain, and VMD envelope spectrum. The feature fusion of the highdimensional feature matrix in the feature set is then performed by KJADE. Finally, the Von Neumann topology is used to improve the WOA and optimize the LSSVM regularization parameters and kernel parameters, which effectively improves the accuracy and convergence speed of the LSSVM. At the same time, it maintains a good overall performance, which improves the ability to diagnose rolling bearing faults.
1.3. Organization of This Paper
The remainder of this paper is organized as follows. In Section 2, the theoretical methods of the timefrequency domain feature extraction, feature fusion, and two types of models are mainly described. In Section 3, the description of the proposed fault diagnosis research method based on the VNWOAoptimized LSSVM algorithm is introduced. In Section 4, several experimental cases are used to verify the effectiveness of the fault diagnosis method used in this paper compared with other traditional algorithms and the results and analysis of the method are given. Finally, the conclusion is presented in Section 5.
Figure 1 shows the framework of the specific idea of this article.
2. TimeFrequency Domain Feature Extraction and Feature Fusion
2.1. VMD Feature Extraction
The original signal is decomposed by VMD (Variational Mode Decomposition) to obtain corresponding components, wherein the th component is represented as . Then, each component is separately Hilbert transformed to obtain the transformed amplitude function . Finally, the Fourier transform is performed on each amplitude function signal to obtain the envelope spectrum [30, 31].
The Hilbert transform is performed on the th component obtained by the decomposition to obtain
Construct the parsing signal:
Corresponding amplitude function
The resulting Hilbert spectral expression is
From the entire process of HHT analysis, the frequency and amplitude of each IMF (Intrinsic Mode Function) are a function of time, so describes the distribution of the amplitude of the signal over time and frequency. If we consider as the energy density of the signal, also has the physical meaning of energy density after the Hilbert transform. is called the Hilbert energy spectrum, which is the timefrequency representation of the signal energy.
2.2. KJADE Dimensionality Reduction Method
KJADE is a new feature fusion method based on the JADE (Joint Approximative Diagonalization of Eigenmatrix) algorithm, and it has greatly improved the handling of nonlinear problems and the improvement of the robustness of nonlinear results. The core idea of this algorithm is to perform highdimensional mapping of the observed data (and ) in the sample space through the nonlinear function and obtain the corresponding highdimensional feature space . Then, the JADE algorithm is applied to the highdimensional feature space , which can finally transform the nonlinearity into a linearly separable problem [32–34]. The mapping process is shown in Figure 2.
The specific steps of the KJADE algorithm are summarized as follows:(1)The sample point data in the observed data are mapped by high dimension or infinite dimension through the nonlinear mapping function and the feature space is .(2)Calculating the covariance matrix in the feature space Introducing the concept of kernel function, convert the complex and timeconsuming inner product calculation into kernel function and get a kernel matrix [33]:The form is , where represents the width parameter of the function and is required to satisfy the Mercer condition, i.e., .(3)For the abovementioned kernel matrix K whitening process, , respectively, obtain the whitening matrix W and the whitened data .(4)Selecting the matrix group and finding the fourthorder cumulant matrix corresponding to each , respectively:(5)Calculating the rotation matrixwhere is the argument of the complex number, off is the square of the nondiagonal element, and is the pseudoinverse of U so that the cumulant matrix obtained in the previous step can be a matrix of diagonalized form.(6)Finally, the mixed matrix B is calculated to obtain the optimal approximation matrix of the source signal:
2.3. The Two Classes of Model
When the bearing fails, the extracted characteristic distribution of each bearing has a good class separability. Therefore, the difference between the vibration signals of each bearing and each part can be calculated by constructing two classes of model, which can be used as the basis for evaluating the performance degradation of the bearing. In the classification measurement of the sample, the distance between betweenclass and withinclass has been successfully applied to the class separability metric [35] and the extraction of the bearing performance degradation curve [36].
The two classes of model are shown in Figure 3. It is assumed that the feature set extracted by the bearing in the healthy state is and the feature set extracted by the bearing at time t is . The two classes of model is , where , n is the number of samples, and (D is the extracted feature dimension).
Then, the betweenclass scatter matrix is
The withinclass scatter matrix iswhere
C is the number of categories, and in this model; is the feature mean in category i; and m is the feature mean in the entire sample. The betweenclass scatter matrix represents the degree of convergence between different classes, and the withinclass scatter matrix represents the degree of aggregation of the same class. In order to comprehensively describe the feature part, the evaluation factor SS composed of betweenclass and withinclass scatters is employed to depict the clustering performance of the proposed new features quantitatively. Then, the SS is defined as the following equation:
3. Research on Fault Classification Based on the VNWOAOptimized LSSVM Algorithm
3.1. LSSVM Algorithm
SVM is a relatively new machine learning method based on statistical theory as basic research. In the case, where the sample is very limited, it can be used to find the optimal solution. LSSVM introduces the least squares based on the traditional SVM. Based on Vapnik’s support vector machine, the optimization goal is defined by a quadratic loss function with a square term. Finally, the inequality constraint in the original quadratic optimization problem becomes the equality constraint problem for solving linear equations, which can reduce the complexity of the overall calculation and increase the operation speed [37–39].
The optimization objective function using the LSSVM algorithm is
The final optimization problem becomes
In order to solve the problem of optimization well, the Lagrange multiplier is introduced and the function of Lagrange is constructed. The formula is as follows:
Then, using the constraints on the KKT condition. The relevant parameters of the Lagrange function are separately subjected to a partial derivative operation at the extreme points sought, and the result is zero. The resulting linear matrix expression is as follows:
There are some relationships in the above expression: ; ; ; ; and .
In the LSSVM algorithm, the resulting optimal classification function expression is
In the above formula, is defined as a kernel function satisfying the Mercer condition and the expression is the same as equation (6).
3.2. Whale Optimization Algorithm
3.2.1. Principle of Whale Optimization Algorithm
Mirjalili and Lewis studied the predatory behavior of humpback whales [40] and designed a new heuristic search optimization algorithm with simple operation, less adjustment parameters, and strong local optimal ability. The algorithm is called the WOA (Whale Optimization Algorithm), which has some natural inspiration, mainly to imitate the predation behavior of humpback whales, including modeling the three behaviors of the humpback whales surrounding prey, hunting prey, and hunting prey, as shown in Figure 4.
In the above formulas, is expressed as the position of any individual in the current whale population.
3.2.2. Improvement of Whale Algorithm
In view of the shortcomings of the WOA in optimizing some complex problems, the accuracy is not high and the convergence speed is slow. Therefore, the WOA is improved by using the Von Neumann topology. As shown in Figure 5, the Von Neumann topology [41], each humpback whale is surrounded by up, down, left, and right directions to form a grid, while the humpback whale is in contact with the surrounding whales in the center. The head whales have the best solution to affect other whales around them, allowing the fish to communicate frequently and maintain the multidirectionality and diversity of the group. The premature occurrence has been improved in terms of maintaining globality, convergence speed, and adjustment accuracy.
Studying the abovementioned whale position update formula, it shows that the change has a great relationship with the global optimal solution and will change with the update of the global optimal solution. The midpoints of the local and global optimal positions are selected and adopted so that the whales are affected by the global and local optimal solutions while performing position updating. It can exchange sufficient information with other local whales to enhance the local search ability of the algorithm. As the number of iterations increases, the local optimum and the global optimal will appear approximately coincident and the position of the whale will be updated to return to the original formula [42]. The improved expression is as follows:
In the above formula, indicates the optimal position of the th search agent of the humpback whale in the Von Neumann topology field , which can also be understood as being in this field . Each whale search agent's own fitness function obtains the historical optimal position of the particle corresponding to the minimum value; represents the optimal position of the whale in the global update.
3.2.3. Analysis of the Advantages of the Improved Whale Optimization Algorithm
(1)The VNWOA uses the Von Neumann topology. Within each neighborhood, the optimal solution found by the humpback whale at each central location affects only the other four whales in the same neighborhood. The full exchange of whales in the neighborhood can maintain the multidirectionality and diversity of the population.(2)While multiple neighborhoods maintain the diversity of fish stocks, avoiding the fact that a whale finds a local optimal solution prematurely, the whole population falls into local optimum, and it is improved in terms of maintaining globality and convergence speed.(3)By improving the whale position update formula in the algorithm, the midpoint of the local optimal position and the global optimal position is selected. In this way, the whale can not only be guided by the global optimum when the location is updated, but also enhance the whale communication ability in the local area. The update of its location is also affected by a part of the local optimum, thus enhancing the local search ability of the algorithm.(4)As the iteration progresses, the local optimal value gradually coincides with the global optimal value and the whale’s position update eventually returns to the original formula. The ability of the algorithm to achieve convergence and precision is enhanced.
3.3. VNWOAOptimized LSSVM Algorithm Steps
Step 1 (data preprocessing): The time domain, frequency domain, and timefrequency domain characteristics are obtained for the original vibration signals and normalized. The feature reduction is performed by KJADE, and the training set and test set are divided by 7 : 3. Step 2: Initializing the whale position and setting the population to , and the maximum number of iterations is . Step 3: Determining the network topology of the VNWOA and the range of values for initializing the LSSVM, that is, the range of values of and gamma . Step 4: Calculating the corresponding fitness of each whale, and determining the initial optimal individual and initial optimal fitness according to the order of fitness. Step 5: Using the Von Neumann topology to perform neighborhood search, exchanging information in the neighborhood, finding the best whale in the neighborhood, and then following the formulas (19), (20), (26), and (28) to perform a location update. Step 6: The whale swims in a spiral shape to the prey while shrinking the encirclement. The position of the other whales is updated according to the selected whale position, forcing the whale to deviate from the prey thus obtaining the best training accuracy in the LSSVM. Step 7: Repeating steps 5 and 6. Until the maximum number of iterations is reached, the whale position with the best fitness is trained as a parameter of the LSSVM, and then the test set is diagnosed and classified.
4. Experimental Results and Analysis
4.1. Basic Steps for Diagnosis
(1)Vibration signal preprocessing: by selecting the normal state of the bearing and three types of faults (corresponding to three fault levels for each of the three fault types), a total of ten bearing states are used as verification objects. Then, the time domain and frequency domain of ten bearing states and the method of using VMD envelope spectrum are used to extract the timefrequency domain features of the rolling bearing so as to obtain the multidomain feature set of nonlinear characteristic parameters.(2)Feature fusion: KJADE is used for feature fusion in the highdimensional feature matrix of the feature set, and the evaluation factor SS composed of betweenclass and withinclass scatters is employed to depict the clustering performance of the proposed new features quantitatively. It is then compared to the PCA, LDA, KPCA, and JADE clustering effects.(3)Selection of fault signal for training and testing: a total of 1000 fault signal samples are randomly selected from ten types of bearing states. Among them, 700 samples were used as training samples and 300 samples were used as test samples and the data length of each sample was 15.(4)Fault diagnosis: the method used for fault diagnosis mainly uses the Von Neumann topology to improve the WOA and optimize the LSSVM regularization parameters and the parameters of the nuclear parameters. In the experiment, the maximum number of iterations is set to 100, and then the 700 sets of samples in step 3 are input to the whale position with the best fitness as the parameters of the LSSVM for training. Finally, the remaining 300 sets of test data are diagnosed.
4.2. Introduction to Experimental Equipment
The rolling bearing fault data used in this experiment were obtained from the Case Western Reserve University Bearing Data Center [43], which provides a large number of rolling bearing fault data. The corresponding experimental device is shown in Figure 6. The experimental support consists of a 2 hp motor (left), a torque sensor/decoder (middle), a dynamometer (right), and an electrical control unit (not shown). The treated bearing will be mounted on the drive end of the motor, while the inner ring of the bearing will follow the rolling axis but will remain stationary. The acceleration signal is used to record the vibration signal of the bearing during the whole process. The sampling frequency is 48 kHz. The torque sensor is mainly used to monitor the rotation speed of the bearing. The rotation speed is 1750 rpm, and the load motor provides the load for the bearing. In the test, the model data of SKF6205 deep groove ball bearing is used as the simulation data to test the bearing support motor shaft and three types of test bearings are prepared.
This experiment uses an acceleration sensor adsorbed on a magnetic body to obtain the vibration data. At the drive and blast ends of the motor frame, the sensor is placed at 12 o'clock. The vibration signal was recorded with a 16channel digital recording recorder and saved as a Matlab (mat) format in the form of a file. The speed and power data are manually recorded by a torque sensor/decoder. By using the bearing test data as the original input signal, the nonlinear characteristic parameters of the bearing normal, inner ring fault, outer ring fault, rolling element fault, and three fault levels (a total of ten bearing states) in various fault modes are extracted. The EDM technology is used to test the inner ring, the outer ring, and the rolling element of the bearing, respectively. Single point damage failures of 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm) with a depth of 0.011 inches (0.2794 mm) were set [44, 45].
“Normal” indicates bearing under normal conditions; “IR” indicates inner ring failure; “OR” indicates outer ring failure; and “B” indicates rolling element failure. The ten bearing data obtained are numbered and indicated by IX. The specific bearing data are shown in Table 1.

4.3. Feature Extraction
When the motor speed is 1797 rpm (the frequency conversion is also considered to be ), the drive end fault bearing is selected as the analysis object. In the case of bearing failure damage diameters of 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm), respectively, the bearingvibration signals of the outer ring, the inner ring, and the rolling element position are selected as the object. In the case of bearing failure damage diameters of 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm), the bearingvibration signals of the three failure positions were selected as objects and compared with healthy bearings. The bearing signals of the various bearing states collected therein are shown in Figures 7(a)–7(j) correspond to the bearing health status, outer ring failure status, inner ring fault status, and rolling element fault status.
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
(j)
A timedomain waveform diagram of the bearing in a healthy state is shown in Figure 7(a), and it can be found that its vibration and noise are relatively small. The vibration waveform is disorderly and irregular, and the amplitude is small. Figures 7(b)∼7(d) show timedomain waveforms for the case, where the bearing inner ring failure damage diameter is 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm). Similarly, Figures 7(e)∼7(j) are the three impaired waveforms corresponding to the rolling element and the outer ring, respectively. From the waveform in the figure, it can be concluded that when the bearing is partially damaged, an impact signal is generated, so a shorttime lowfrequency pulse occurs in the vibration signal of the faulty bearing. At the same time, this shock will arouse the highfrequency natural vibration and the amplitude will be modulated by this excitation.
Four characteristic indexes of the center frequency, frequency standard deviation, root mean square frequency, and frequency concentration of the bearing signal are extracted as frequency domain features, and the timefrequency domain characteristic signals are extracted by using the VMD envelope spectrum method to form a multidimensional feature set. Since the corresponding spectrum and the envelope spectrum have the same range of values on the abscissa, image blending is employed and normalized.
The two characteristics of the timefrequency spectrum and the envelope spectrum when the bearing is in normal operation is shown in Figure 8. It can be seen that the bearing has the most prominent frequency conversion, the spectrum is relatively simple, and the energy is concentrated in the lowfrequency band. Figures 9–11 show the corresponding image of the rolling element. The signal is rather messy. This is because when the roller fails, the failed roller not only rotates with the inner ring, but also rotates itself, so the defect information is often submerged in the noise. In the signal, it is difficult to be as obvious as the outer ring and the inner ring, so we will see that there is a strong energy in both the low and highfrequency bands. Figures 12–17 show the image of the inner and outer rings of the bearing. We will find that the energy is mainly concentrated in the middle frequency band because the local position of the damage will cause the natural vibration of each component. On the other hand, for sensors with a resonant frequency less than 20 kHz, the resonance information caused by the impulse signal is also reflected in this frequency band. In addition, the highfrequency natural vibration of the bearing is modulated by the shock caused by these damages, so the fault information can be separated from the chaotic modulation signal by the demodulation technique. By comparing these processes, the results show that when the bearing develops from a normal state to a fault, the main energy in the spectrum will gradually shift from the lowfrequency band to the middlefrequency band. At the same time, for the same part of the bearing, as the degree of damage increases, the amplitude corresponding to the same frequency will also become larger.
4.4. Feature Fusion
From the original bearing vibration information, the number of samples is 1000 and the multidomain feature set is extracted from the time domain, the frequency domain, and the timefrequency domain, thereby obtaining a feature matrix of . On this basis, the parameters of PCA, LDA, KPCA, and JADE are reasonably set to ensure the fairness of feature fusion. The PCA selects a k value of 3 and a guaranteed error of less than 0.05 (95% of the information is retained). The dimension reduction selected by LDA is 3. The kernel function used by KPCA is a radial basis function with a parameter value of 4. The dimension of JADE is taken as 3 for feature fusion and sensitive feature extraction. Through the above parameters and dimension settings, on the one hand, it is ensured that enough sensitive feature information is retained and the feature set can be displayed more conveniently and facilitated by us. The distribution of the feature fusions of the 10 fault states using PCA, LDA, KPCA, JADE, and KJADE is shown in Figures 18–22.
It can be clearly seen from Figures 18–22 that the lowdimensional features extracted by the KJADE method compared with PCA, LDA, KPCA, and JADE have very good clustering effect in the feature space. The KJADEbased feature fusion method is more compact, and the characteristics of each bearing state are basically concentrated in a small area. Moreover, there are few overlapping parts of various sample features and these distribution characteristics will facilitate the identification of different fault types. This indicates that the extracted features obtained by KJADE are more suitable as classifiers than those obtained by conventional methods. In order to better quantitatively describe the clustering performance of the extracted new features, the evaluation factor SS composed of betweenclass and withinclass scatters is employed to depict the clustering performance of the proposed new features quantitatively. It can be seen from Table 2 that the clustering evaluation SS of KJADE features is higher than the traditional method, which further shows that the KJADE algorithm is suitable for extracting more stable and effective lowdimensional feature nonlinear lowdimensional feature components embedded in highdimensional data. It also shows that KJADE can eliminate the redundancy between features and reduce the dimension of features to obtain effective lowdimensional features. It can reduce the complexity of decision making under the premise of retaining the classifier classification performance in fault diagnosis degree.

4.5. Fault Diagnosis Results
From the total sample of bearing vibration data, 700 samples were randomly selected as training samples and 300 samples were used as test samples, and the data length of each sample was 15. The characteristics of the training set are identified by the category and then input to the VNWOALSSVM training and compared with the LSSVM, GALSSVM, PSOLSSVM, and WOALSSVM.
It can be seen from the fitness curve of Figure 23 that the fitness value of the VNWOA optimization algorithm proposed in this paper can reach 0.9784 and stabilize after 25 iterations. Its characteristics show that the convergence rate of the population is relatively slow at the beginning, and then the convergence speed is accelerated so that the algorithm quickly converges to the optimal fitness. Compared with several other algorithms, several other optimization algorithms will eventually have higher fitness values but slower convergence. The number of iterations will be 30 or more from the graph. This shows that the VNWOA algorithm is superior to other algorithms, indicating that this method can be used as an effective means of bearing fault pattern recognition.
In Figures 24–28, the ordinates 1–10 represent the category labeling symbol IX in Table 1 , indicating the type of diagnosis in the case of the algorithm. Comparing Figures 24∼28 with Table 3, it can be seen that the result of fault diagnosis of VNWOALSSVM can reach 98.67%, which is greatly improved compared to the result of fault diagnosis without optimized LSSVM of 85.67%. The diagnostic results of GA, PSO, WOA, and VNWOAoptimized LSSVM algorithm can clearly show that the diagnostic accuracy obtained by VNWOAoptimized LSSVM is the highest, indicating that VNWOALSSVM can accurately diagnose faults.

Figure 29 shows the time complexity of GALSSVM, PSOLSSVM, WOALSSVM, and VNWOALSSVM algorithms. It can be seen from the figure that the time complexity of the algorithm for fault diagnosis of LSSVM optimization by different algorithms is also different. The population set in this study is 10, the ordinate is the running time of the algorithm, and the abscissa is the number of iterations. As the number of iterations increases, the time complexity of the four optimization algorithms is increasing. Moreover, it can be clearly found in combination with Table 3 that VNWOALSSVM is much more accurate in terms of diagnosis results, but it is also much more complicated in terms of algorithm complexity. On the other hand, it can be seen from Table 3 that after optimizing the LSSVM with the VNWOA algorithm, the training accuracy, test accuracy, and training variance are obviously superior to the other four algorithms, and the mean square error is relatively small, indicating that the algorithm has good stability and generalization ability.
5. Conclusion
Through the normal, inner ring fault, outer ring fault, rolling element fault and three fault levels in various fault modes (a total of ten bearing states), the time domain, the frequency domain, and the method using the VMD envelope spectrum, the timefrequency domain features are extracted to obtain the multidomain feature set of nonlinear feature parameters. Then, the KJADE is used to feature the highdimensional feature matrix in the feature set. The extracted lowdimensional features have very good clustering in the feature space. The effect and the effective characteristics that are more sensitive to the bearing state can be obtained. At the same time, in the fault diagnosis, the Von Neumann topology is used to improve the WOA and optimize the LSSVM regularization parameters and the parameters of the nuclear parameters. The accuracy and convergence speed of LSSVM are improved and a good overall globality is maintained, thus improving the ability of fault diagnosis of rolling bearings. It shows that the proposed method for fault diagnosis of rolling bearings has a good diagnostic effect and also provides a diagnosis method for rolling bearing fault diagnosis.
Abbreviations
CNN:  Convolutional neural network 
DBN:  Deep belief network 
AR:  Autoregression 
BPNN:  Back propagation neural network 
PSOSVM:  Particle swarm optimizationsupport vector machine 
HHT:  Hilbert–Huang transform 
IMF:  Intrinsic mode function 
EMD:  Empirical mode decomposition 
EDM:  Electrical discharge machining 
VMD:  Variational mode decomposition 
SVM:  Support vector machine 
LSSVM:  Least squares support vector machine 
PCA:  Principal component analysis 
LDA:  Linear discriminant analysis 
KPCA:  Kernel principal component analysis 
JADE:  Joint approximative diagonalization of eigenmatrix 
KJADE:  Kernel function joint approximate diagonalization of eigenmatrices 
WOA:  Whale optimization algorithm 
GALSSVM:  Genetic algorithmleast squares support vector machine 
PSOLSSVM:  Particle swarm optimizationleast squares support vector machine 
WOALSSVM:  Whale optimization algorithmleast squares support vector machine 
VNWOALSSVM:  Von Neumann topology whale optimization algorithmleast squares support vector machine 
Data Availability
The data used to support the findings of this study are available from the specific operational procedures upon request. Specific operational procedures, the bearing data used to support the findings of this study, have been deposited in the “http://csegroups.case.edu/bearingdatacenter/pages/downloaddatafile.” Operating procedures: (1) Slide the mouse and find‘Normal Baseline Data’. (2) Slide the mouse and find ‘Normal_2’. (3) The page returns to the previous page. (4) Slide the mouse and find’48k Drive End Bearing Fault Data’. (5) Slide the mouse and find ‘IR007_2’. (6) In the same way, click the mouse to download: B007_2, OR007@6_2, IR014_2, B014_2, OR014@6_2, IR021_2, B021_2, and OR021@6_2. (7) End.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This article was supported by Shanghai Polytechnic University Graduate Program Fund [EGD18YJ0003].
References
 C. Z. Li, J. D. Zheng, H. Y. Pan, and Q. Y. Liu, “Rolling bearing fault diagnosis method based on fine composite multiscale scatter entropy and support vector machine,” China Mechanical Engineering, vol. 30, no. 14, pp. 1713–1719, 2019. View at: Google Scholar
 L. X. Ma, Y. L. Huang, H. C. Fan, and J. Y. Wang, “Motor bearing fault diagnosis on adaptive mutation particle Swarm optimization of SVM,” Electric Power Science and Engineering, vol. 32, no. 2, pp. 66–71, 2016. View at: Google Scholar
 Z. Huang, L. Chen, Y. Zhang, Z. Yu, H. Fang, and T. Zhang, “Robust contactpoint detection from pantographcatenary infrared images by employing horizontalvertical enhancement operator,” Infrared Physics & Technology, vol. 101, pp. 146–155, 2019. View at: Publisher Site  Google Scholar
 Y. S. Li, H. G. Zhang, B. N. Huang, and J. Han, “A distributed Newton–Raphsonbased coordination algorithm for multiagent optimization with discretetime communication,” Neural Computing & Applications, vol. 2018, 2018. View at: Publisher Site  Google Scholar
 Y. Li, H. Zhang, X. Liang, and B. Huang, “Eventtriggeredbased distributed cooperative energy management for multienergy systems,” IEEE Transactions on Industrial Informatics, vol. 15, no. 4, pp. 2008–2022, 2019. View at: Publisher Site  Google Scholar
 R. Wang, Q. Sun, D. Ma, and Z. Liu, “The smallsignal stability analysis of the droopcontrolled converter in electromagnetic timescale,” IEEE Transactions on Sustainable Energy, vol. 10, no. 3, pp. 1459–1469, 2019. View at: Publisher Site  Google Scholar
 R. Xiong, F. Sun, Z. Chen, and H. He, “A datadriven multiscale extended Kalman filtering based parameter and state estimation approach of lithiumion polymer battery in electric vehicles,” Applied Energy, vol. 113, pp. 463–476, 2014. View at: Publisher Site  Google Scholar
 H. Zhang, Y. Li, D. W. Gao, and J. Zhou, “Distributed optimal energy management for energy internet,” IEEE Transactions on Industrial Informatics, vol. 13, no. 6, pp. 3081–3097, 2017. View at: Publisher Site  Google Scholar
 X. Xia, “Forecasting method for product reliability along with performance data,” Journal of Failure Analysis and Prevention, vol. 12, no. 5, pp. 532–540, 2012. View at: Publisher Site  Google Scholar
 G. Mukhopadhyay and S. Bhattacharya, “Failure analysis of a cylindrical roller bearing from a rolling mill,” Journal of Failure Analysis and Prevention, vol. 11, no. 4, pp. 337–343, 2011. View at: Publisher Site  Google Scholar
 L. Q. Wang, H. X. Jia, D. Z. Zheng, and Z. H. Ye, “Research progress in high reliability ceramic bearing technology,” Aero Engine, vol. 39, no. 2, pp. 6–13, 2013. View at: Google Scholar
 Q. Y. Wan, Research on Fault Diagnosis Method for Rolling Bearing of Helicopter Swashplate Based on DCAECNN, Nanchang Hangkong University, Nanchang, China, 2019.
 H. Deng, L. Zhang, and X. Shu, “Feature memorybased deep recurrent neural network for language modeling,” Applied Soft Computing, vol. 68, no. 1, pp. 432–446, 2018. View at: Publisher Site  Google Scholar
 B. Pan, Z. Shi, and X. Xu, “RVCANet: a new deeplearningbased hyperspectral image classification method,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 10, no. 5, pp. 1975–1986, 2017. View at: Publisher Site  Google Scholar
 S. Pang, J. J. del Coz, Z. Yu, O. Luaces, and J. Díez, “Deep learning to frame objects for visual target tracking,” Engineering Applications of Artificial Intelligence, vol. 65, no. 1, pp. 406–420, 2017. View at: Publisher Site  Google Scholar
 J. T. Wen, C. H. Yan, J. X. Sun, and Y. L. Qiao, “Bearing fault diagnosis method based on compression acquisition and deep learning,” Journal of Scientific Instrument, vol. 39, pp. 171–179, 2018. View at: Google Scholar
 S. Ma and F. Chu, “Ensemble deep learningbased fault diagnosis of rotor bearing systems,” Computers in Industry, vol. 105, no. 1, pp. 143–152, 2019. View at: Publisher Site  Google Scholar
 L. Manjurul and M. K. Jong, “Automated bearing fault diagnosis scheme using 2D representation of wavelet packet transform and deep convolutional neural network,” Computers in Industry, vol. 106, no. 1, pp. 142–153, 2019. View at: Publisher Site  Google Scholar
 Q. H. Du and X. N. Yang, “Application of refined envelope analysis in diagnosis of rolling bearing defects,” Bearing, vol. 3, pp. 31–34, 2004. View at: Google Scholar
 H. Shao, H. Jiang, F. Wang, and Y. Wang, “Rolling bearing fault diagnosis using adaptive deep belief network with dualtree complex wavelet packet,” ISA Transactions, vol. 69, no. 1, pp. 187–201, 2017. View at: Publisher Site  Google Scholar
 J. B. Ali, N. Fnaiech, L. Saidi, B. ChebelMorello, and F. Fnaiech, “Application of empirical mode decomposition and artificial neural network for automatic bearing fault diagnosis based on vibration signals,” Applied Acoustics, vol. 89, pp. 16–27, 2015. View at: Publisher Site  Google Scholar
 C. C. Liu, P. L. Nie, and Y. Tong, “Feature fault feature extraction based on morphological filtering and EMDAR spectrum,” Noise and Vibration Control, vol. 35, no. 3, pp. 159–162, 2015. View at: Google Scholar
 Y. Yang, D. J. Yu, and J. S. Cheng, “Fault diagnosis method of rolling bearing based on empirical mode decomposition envelope spectrum,” China Mechanical Engineering, vol. 15, no. 16, pp. 1469–1471, 2008. View at: Google Scholar
 J. A. K. Suykens and J. Vandewalle, “Least squares support vector machine classifiers,” Neural Processing Letters, vol. 9, no. 3, pp. 293–300, 1999. View at: Google Scholar
 F. L. Chen, JADE Based Bearing Fault Diagnosis and Life Prediction Research, Anhui University, Hefei, China, 2015.
 B. He, Research on Bearing Fault Identification and Performance Degradation Evaluation Method Based on KJADE, Anhui University, Hefei, China, 2017.
 H. Shao, H. Jiang, Y. Lin, and X. Li, “A novel method for intelligent fault diagnosis of rolling bearings using ensemble deep autoencoders,” Mechanical Systems and Signal Processing, vol. 102, no. 1, pp. 278–297, 2018. View at: Publisher Site  Google Scholar
 X. Yan and M. Jia, “A novel optimized SVM classification algorithm with multidomain feature and its application to fault diagnosis of rolling bearing,” Neurocomputing, vol. 313, pp. 47–64, 2018. View at: Publisher Site  Google Scholar
 M. Saimurugan, K. I. Ramachandran, V. Sugumaran, and N. R. Sakthivel, “Multi component fault diagnosis of rotational mechanical system based on decision tree and support vector machine,” Expert Systems with Applications, vol. 38, no. 4, pp. 3819–3826, 2011. View at: Publisher Site  Google Scholar
 B. B. He, H. Y. Dai, and H. L. Shi, “Metro vehicle fault analysis based on time spectrum and aggregate empirical mode decomposition (EEMD) envelope spectrum analysis,” Urban Rail Transit Research, vol. 18, no. 7, pp. 30–34, 2015. View at: Google Scholar
 C. Liu, L. Zhu, and C. Ni, “Chatter detection in milling process based on VMD and energy entropy,” Mechanical Systems and Signal Processing, vol. 105, pp. 169–182, 2018. View at: Publisher Site  Google Scholar
 M. E. Tipping and C. C. Nh, “Sparse kernel principal component analysis,” Advances in Neural Information Processing Systems, vol. 13, pp. 633–639, 2001. View at: Google Scholar
 H. Hoffmann, “Kernel PCA for novelty detection,” Pattern Recognition, vol. 40, no. 3, pp. 863–874, 2007. View at: Publisher Site  Google Scholar
 Y. Liu, B. He, F. Liu, S. Lu, and Y. Zhao, “Feature fusion using kernel joint approximate diagonalization of eigenmatrices for rolling bearing fault identification,” Journal of Sound and Vibration, vol. 385, pp. 389–401, 2016. View at: Publisher Site  Google Scholar
 Q. He, “Vibration signal classification by wavelet packet energy flow manifold learning,” Journal of Sound and Vibration, vol. 332, no. 7, pp. 1881–1894, 2013. View at: Publisher Site  Google Scholar
 X. X. Ding and Q. B. He, “Twoclass model based on nonlinear manifold learning for bearing health monitoring,” in Proceedings of the IEEE International Instrumentation and Measurement Technology Conference, pp. 1–6, Taipei, Taiwan, May 2016. View at: Publisher Site  Google Scholar
 T. X. Wen and B. Zhang, “fault diagnosis of rolling bearing based on EMD and LSSVM,” World Science and Technology Research and Development, vol. 36, pp. 76–79+88, 2014. View at: Google Scholar
 X. F. Yue and H. H. Shao, “Fault diagnosis of rolling bearing based on DELSSVM,” Computer Measurement & Control, vol. 23, no. 12, pp. 3933–3935, 2015. View at: Google Scholar
 H. Xu and G. Chen, “An intelligent fault identification method of rolling bearings based on LSSVM optimized by improved PSO,” Mechanical Systems and Signal Processing, vol. 35, pp. 167–175, 2013. View at: Google Scholar
 S. Mirjalili and A. Lewis, “The whale optimization algorithm,” Advances in Engineering Software, vol. 95, pp. 51–67, 2016. View at: Publisher Site  Google Scholar
 X. Y. Min, X. F. Xu, and Z. J. Wang, “Combining von neumann neighborhood topology with approximatemapping local search for ABCbased service composition,” in Proceedings of the 2014 IEEE International Conference on Services Computing, pp. 187–194, IEEE, Chicago, IL, USA, August 2014. View at: Google Scholar
 J. Y. Xu, Y. Wang, and Z. C. Ji, “Fault diagnosis of rolling bearing based on WKELM optimized by whale algorithm,” Journal of System Simulation, vol. 29, no. 9, pp. 2189–2197, 2017. View at: Google Scholar
 K. A. Loparo, Bearing Vibration Dataset, Case Western Reserve University, Cleveland, OH, USA, 2003, http://www.eecs.cwru.edu/laboratory/bearing/download.Html.
 S. P. Wang, Research on Fault Feature Extraction Method of Rolling Bearing Based on Popular Learning, Dalian University of Technology, Dalian, China, 2013.
 A. L. Pei, Research on Fault Feature Extraction Method of Rolling Bearing, Shenyang Aerospace University, Shenyang, China, 2015.
Copyright
Copyright © 2019 Tao Wu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.