Abstract

In order to accurately diagnose the faulty parts of the rolling bearing under different operating conditions, the KJADE (Kernel Function Joint Approximate Diagonalization of Eigenmatrices) algorithm is proposed to reduce the dimensionality of the high-dimensional feature data. Then, the VNWOA (Von Neumann Topology Whale Optimization Algorithm) is used to optimize the LSSVM (Least Squares Support Vector Machine) method to diagnose the fault type of the rolling bearing. The VNWOA algorithm is used to optimize the regularization parameters and kernel parameters of LSSVM. The low-dimensional nonlinear features contained in the multidomain feature set are extracted by KJADE and compared with the results of PCA, LDA, KPCA, and JADE methods. Finally, VNWOA-LSSVM is used to identify bearing faults and compare them with LSSVM, GA-LSSVM, PSO-LSSVM, and WOA-LSSVM. The results show that the recognition rate of fault diagnosis is up to 98.67% by using VNWOA-LSSVM. The method based on KJADE and VNWOA-LSSVM can diagnose and identify fault signals more effectively and has higher classification accuracy.

1. Introduction

Rolling bearings are widely used in modern machinery and are one of the most vulnerable mechanical parts in rotating machinery [1]. Sometimes it is possible to determine the safety of the entire machine due to the state of motion of the bearing. Serious accidents may occur, directly threatening the safety of life and property, so researchers pay close attention to these accidents [2]. In order to reduce the damage caused by bearing failure, fault detection is very necessary and has great practical value [3]. At this stage, with the continuous development of artificial intelligence algorithms, optimization problems, and improved algorithm convergence, as well as improving system reliability and stability [4, 5], scholars in various fields have carried out in-depth research [6]. However, mechanical failure is a complex nonlinear time-varying system that is difficult to understand and estimate [7, 8], and its failure process is more complicated. How to perform rapid fault detection and diagnosis through optimization algorithms for specific mechanical faults has always been a challenge. Therefore, in-depth and scientific research on the rotating mechanical bearing parts will have great value and significance in industrial production or daily life [911].

1.1. Literature Review

The traditional bearing fault diagnosis process based on feature extraction mainly includes four steps: vibration signal acquisition, signal preprocessing, feature extraction, and fault identification. In the traditional bearing fault diagnosis method, the three steps of signal preprocessing, feature extraction, and fault identification are closely connected [12]. If any of the steps are not satisfactory, it will affect the final recognition effect. Since the 21st century, with the tremendous improvement of computer computing power, hardware support for deep neural network training has been provided. At present, deep learning has become the research object of many scholars and has been successfully applied in the fields of image recognition [13], target detection [14], and computer vision [15]. Deep learning has the advantage of automatically extracting features, eliminating the need for cumbersome artifacts. Many scholars at home and abroad have used deep learning in bearing fault diagnosis research [16, 17] to achieve better results. For instance, Manjurul and Jong [18] used a discrete wavelet packet transform combined with an adaptive learning method to train CNN bearing fault diagnosis methods to improve the performance of the network and achieve high-precision diagnosis under multiple fault types. Du and Yang [19] proposed a method for demodulating the fault signal of a bearing based on the wavelet transform and Hilbert transform. The phase compensation method is used to refine the envelope spectrum to improve the frequency of the envelope spectrum. The resolution makes the recognition accuracy of bearing fault diagnosis improved. Shao et al. [20] proposed a bearing fault diagnosis method based on a double-tree complex wavelet packet transform and DBN. The dual-tree complex wavelet packet transform is used to refine the feature information, and the diagnostic effect is improved on the basis of the original DBN.

In the aspect of mechanical fault diagnosis based on empirical mode decomposition, Ali et al. [21] proposed a new fault diagnosis method based on EMD decomposition and energy entropy feature extraction. Firstly, multiple inherent modal components of the bearing-fault signal were obtained by EMD. The energy entropy characteristics of each component are obtained, and the fault type diagnosis is realized by the artificial neural network. Liu et al. [22] provided new ideas for the detection and fault diagnosis of rolling bearing-vibration signals. They use the EMD method to decompose the noise-reduced bearing-fault signal and calculate the AR spectrum characteristics of each intrinsic modal component to achieve bearing fault type diagnosis. Yang et al. [23] obtained the envelope spectrum of multiple scale components of the bearing-fault signal through empirical mode decomposition. The amplitude ratio characteristics are calculated for the envelope spectrum and used as feature vectors and are classified into classifiers for classification. Finally, an effective diagnosis of the inner ring and outer ring of the rolling bearing is realized. The SVM (Support Vector Machine) method [24] is a very important identification method in mechanical fault diagnosis. It is proposed for machine learning problems under finite samples and can effectively solve problems such as small samples and nonlinearities. Chen [25] uses JADE combined with EMD (Empirical Mode Decomposition) spectral correlation to extract the characteristic parameters reflecting the type of bearing fault. Then, the combination of the processed data and the strong generalization can realize the recognition of different fault types for the support vector machine (SVM) and finally achieved good results. He [26] extracted the time domain characteristics of the bearing-vibration signal, the frequency domain characteristics, and the energy characteristics of the wavelet packet to form a multidomain feature set. Then, used KJADE to effectively extract the low-dimensional nonlinear features contained in the multidomain feature set and finally combine the SVM for fault diagnosis and achieve good results. Shao et al. [27] used the original vibration signal to train the automatic encoder, which significantly improved the diagnostic accuracy compared with traditional fault identification methods such as BPNN, SVM, and Boosting. Yan and Jia [28] proposed a support vector machine (SVM) fault classification algorithm based on multisource features. This method introduces the Laplacian fractional algorithm to filter redundant information, select sensitive features, and finally use PSO-SVM to achieve identification of multiple fault conditions of the bearing. Saimurugan et al. [29] used the decision tree to select the optimal statistical characteristics of the vibration signal and then used the support vector machine to realize fault classification. In the fault type discrimination, a better diagnosis classification effect was obtained.

Past research has made significant progress, but there are some limitations. The fault diagnosis is essentially a pattern recognition problem, and deep learning can be used for pattern recognition by extracting the features layer by layer, which can be largely rid of relying on various advanced signal processing techniques and heavy artificial feature extraction. Therefore, the fault diagnosis method improved by the conventional method has achieved a large room for improvement and the effect is also good. Therefore, the study of reasonable methods and optimization algorithms is particularly important for the realization of intelligent and efficient diagnosis of rolling bearings.

1.2. Motivation and Contributions

The main purpose of this study is to extract the nonlinear characteristic parameters of normal bearing, inner ring fault, outer ring fault, rolling element fault, and three fault levels (a total of ten bearing states) under various fault modes. Then, the KJADE algorithm is used to feature the extracted original high-dimensional feature matrix to extract the effective features that are more sensitive to the bearing state. Finally, a fault diagnosis method for rolling bearing based on the improved whale algorithm is proposed to effectively identify bearing faults. The diagnostic results of LSSVM, GA-LSSVM, PSO-LSSVM, WOA-LSSVM, and VNWOA-LSSVM are compared, respectively, to verify the effect of the improved whale algorithm on fault diagnosis accuracy and diagnostic recognition rate of rolling bearings. Three contributions have been made.

1.2.1. Experimental Study on Rolling Bearing Fault

This study used Case Western Reserve University’s bearing test data as the verification object. The experimental data of the model SKF6205 deep groove ball bearing were used as the simulation data. The experimental data are processed into different points of failure at different positions of the bearing by electric spark. The vibration signal of the bearing is recorded by the acceleration sensor at a sampling frequency of 48 kHz. The torque sensor is mainly used to monitor the speed of the bearing at a speed of 1750 rpm. The load motor provides a load to the bearing. The bearing failure damage diameters are 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm), respectively. By taking the bearing test data as the original input signal, the bearing normal, inner ring fault, outer ring fault, and rolling element fault are extracted. Finally, the fault diagnosis research and analysis of the nonlinear characteristic parameters of three fault levels (a total of ten bearing states) under different fault modes are carried out.

1.2.2. Modeling

Vibration signal feature extraction is an important step in the research of rolling bearing fault diagnosis. It is essential to extract the most useful features from the vibration signal. The extracted feature is a high-dimensional feature matrix, but there will still be some useless signals in it. Therefore, how to eliminate the redundancy between features and reduce the complexity of decision making is very important. In terms of fault diagnosis, due to some problems such as the convergence and accuracy of the optimization algorithm, the final fault diagnosis accuracy and fault recognition rate are not very high. This paper attempts to extract features from the time domain, frequency domain, and time-frequency domain of the vibration signal. Then, the algorithm is used to reduce the dimension of the feature. Finally, the Von Neumann topology is used to improve the WOA and optimize the LSSVM regularization parameters and the parameters of the nuclear parameters to diagnose the rolling bearing fault.

1.2.3. Model-Based Fault Diagnosis

A method based on KJADE and VNWOA-LSSVM is proposed to diagnose the rolling bearing-fault signal. Firstly, the multidomain feature set of nonlinear feature parameters is obtained by extracting time-frequency domain features from time domain, frequency domain, and VMD envelope spectrum. The feature fusion of the high-dimensional feature matrix in the feature set is then performed by KJADE. Finally, the Von Neumann topology is used to improve the WOA and optimize the LSSVM regularization parameters and kernel parameters, which effectively improves the accuracy and convergence speed of the LSSVM. At the same time, it maintains a good overall performance, which improves the ability to diagnose rolling bearing faults.

1.3. Organization of This Paper

The remainder of this paper is organized as follows. In Section 2, the theoretical methods of the time-frequency domain feature extraction, feature fusion, and two types of models are mainly described. In Section 3, the description of the proposed fault diagnosis research method based on the VNWOA-optimized LSSVM algorithm is introduced. In Section 4, several experimental cases are used to verify the effectiveness of the fault diagnosis method used in this paper compared with other traditional algorithms and the results and analysis of the method are given. Finally, the conclusion is presented in Section 5.

Figure 1 shows the framework of the specific idea of this article.

2. Time-Frequency Domain Feature Extraction and Feature Fusion

2.1. VMD Feature Extraction

The original signal is decomposed by VMD (Variational Mode Decomposition) to obtain corresponding components, wherein the th component is represented as . Then, each component is separately Hilbert transformed to obtain the transformed amplitude function . Finally, the Fourier transform is performed on each amplitude function signal to obtain the envelope spectrum [30, 31].

The Hilbert transform is performed on the th component obtained by the decomposition to obtain

Construct the parsing signal:

Corresponding amplitude function

The resulting Hilbert spectral expression is

From the entire process of HHT analysis, the frequency and amplitude of each IMF (Intrinsic Mode Function) are a function of time, so describes the distribution of the amplitude of the signal over time and frequency. If we consider as the energy density of the signal, also has the physical meaning of energy density after the Hilbert transform. is called the Hilbert energy spectrum, which is the time-frequency representation of the signal energy.

2.2. KJADE Dimensionality Reduction Method

KJADE is a new feature fusion method based on the JADE (Joint Approximative Diagonalization of Eigenmatrix) algorithm, and it has greatly improved the handling of nonlinear problems and the improvement of the robustness of nonlinear results. The core idea of this algorithm is to perform high-dimensional mapping of the observed data (and ) in the sample space through the nonlinear function and obtain the corresponding high-dimensional feature space . Then, the JADE algorithm is applied to the high-dimensional feature space , which can finally transform the nonlinearity into a linearly separable problem [3234]. The mapping process is shown in Figure 2.

The specific steps of the KJADE algorithm are summarized as follows:(1)The sample point data in the observed data are mapped by high dimension or infinite dimension through the nonlinear mapping function and the feature space is .(2)Calculating the covariance matrix in the feature space Introducing the concept of kernel function, convert the complex and time-consuming inner product calculation into kernel function and get a kernel matrix [33]:The form is , where represents the width parameter of the function and is required to satisfy the Mercer condition, i.e., .(3)For the abovementioned kernel matrix K whitening process, , respectively, obtain the whitening matrix W and the whitened data .(4)Selecting the matrix group and finding the fourth-order cumulant matrix corresponding to each , respectively:(5)Calculating the rotation matrixwhere is the argument of the complex number, off is the square of the nondiagonal element, and is the pseudoinverse of U so that the cumulant matrix obtained in the previous step can be a matrix of diagonalized form.(6)Finally, the mixed matrix B is calculated to obtain the optimal approximation matrix of the source signal:

2.3. The Two Classes of Model

When the bearing fails, the extracted characteristic distribution of each bearing has a good class separability. Therefore, the difference between the vibration signals of each bearing and each part can be calculated by constructing two classes of model, which can be used as the basis for evaluating the performance degradation of the bearing. In the classification measurement of the sample, the distance between between-class and within-class has been successfully applied to the class separability metric [35] and the extraction of the bearing performance degradation curve [36].

The two classes of model are shown in Figure 3. It is assumed that the feature set extracted by the bearing in the healthy state is and the feature set extracted by the bearing at time t is . The two classes of model is , where , n is the number of samples, and (D is the extracted feature dimension).

Then, the between-class scatter matrix is

The within-class scatter matrix iswhere

C is the number of categories, and in this model; is the feature mean in category i; and m is the feature mean in the entire sample. The between-class scatter matrix represents the degree of convergence between different classes, and the within-class scatter matrix represents the degree of aggregation of the same class. In order to comprehensively describe the feature part, the evaluation factor SS composed of between-class and within-class scatters is employed to depict the clustering performance of the proposed new features quantitatively. Then, the SS is defined as the following equation:

3. Research on Fault Classification Based on the VNWOA-Optimized LSSVM Algorithm

3.1. LSSVM Algorithm

SVM is a relatively new machine learning method based on statistical theory as basic research. In the case, where the sample is very limited, it can be used to find the optimal solution. LSSVM introduces the least squares based on the traditional SVM. Based on Vapnik’s support vector machine, the optimization goal is defined by a quadratic loss function with a square term. Finally, the inequality constraint in the original quadratic optimization problem becomes the equality constraint problem for solving linear equations, which can reduce the complexity of the overall calculation and increase the operation speed [3739].

The optimization objective function using the LSSVM algorithm is

The final optimization problem becomes

In order to solve the problem of optimization well, the Lagrange multiplier is introduced and the function of Lagrange is constructed. The formula is as follows:

Then, using the constraints on the KKT condition. The relevant parameters of the Lagrange function are separately subjected to a partial derivative operation at the extreme points sought, and the result is zero. The resulting linear matrix expression is as follows:

There are some relationships in the above expression: ; ; ; ; and .

In the LSSVM algorithm, the resulting optimal classification function expression is

In the above formula, is defined as a kernel function satisfying the Mercer condition and the expression is the same as equation (6).

3.2. Whale Optimization Algorithm
3.2.1. Principle of Whale Optimization Algorithm

Mirjalili and Lewis studied the predatory behavior of humpback whales [40] and designed a new heuristic search optimization algorithm with simple operation, less adjustment parameters, and strong local optimal ability. The algorithm is called the WOA (Whale Optimization Algorithm), which has some natural inspiration, mainly to imitate the predation behavior of humpback whales, including modeling the three behaviors of the humpback whales surrounding prey, hunting prey, and hunting prey, as shown in Figure 4.

(1)In the model surrounding the prey, assuming that the target prey position or the position closest to the target prey is equated with the current best candidate solution, after the defined best search agent is subsequently defined, the other agents will update their position and gradually nearest to the best search agent. Its location update expressions are as follows:In the above equations (19)∼(23), t represents the number of current iterations, is the vector of the whale in the best position at the current stage, represents the vector of the current whale position, a is represented as a constant and the range is changed from 2 to 0 linear decreasing law, belongs to any number in [0, 1], and represents the maximum number of iterations.(2)In the model of hunting prey, the model consists of two parts.The first part is narrowing the range of prey, that is, the value of a in equation (21) is decreasing. At the same time, the value of A varies by and the value of a will also change with changes. If , it means that the target position of the whale hunting prey is at any coordinate between the current position and the prey, thus showing that the humpback whale has a good local search ability.The second part is position update with spiral features:In equations (24) and (25), represents the distance between the whale and the prey at the current stage; b represents the logarithmic spiral shape constant; l represents a random number; and the value ranges from .Humpback whales contain both the first and second parts described above when hunting prey, and the system defines a 50% probability as a threshold to apply to the position of the humpback whale. The specific expression is as follows:In the above equation, is any number belonging to [0, 1].(3)In the model of searching for prey, for the individual whales, they are randomly supplemented with prey within a certain range. When , the distance D will also change randomly and the whale will change accordingly in the search for the target of feeding. Therefore, the method can improve its original feeding ability and has a global search ability. The expressions are as follows:

In the above formulas, is expressed as the position of any individual in the current whale population.

3.2.2. Improvement of Whale Algorithm

In view of the shortcomings of the WOA in optimizing some complex problems, the accuracy is not high and the convergence speed is slow. Therefore, the WOA is improved by using the Von Neumann topology. As shown in Figure 5, the Von Neumann topology [41], each humpback whale is surrounded by up, down, left, and right directions to form a grid, while the humpback whale is in contact with the surrounding whales in the center. The head whales have the best solution to affect other whales around them, allowing the fish to communicate frequently and maintain the multidirectionality and diversity of the group. The premature occurrence has been improved in terms of maintaining globality, convergence speed, and adjustment accuracy.

Studying the abovementioned whale position update formula, it shows that the change has a great relationship with the global optimal solution and will change with the update of the global optimal solution. The midpoints of the local and global optimal positions are selected and adopted so that the whales are affected by the global and local optimal solutions while performing position updating. It can exchange sufficient information with other local whales to enhance the local search ability of the algorithm. As the number of iterations increases, the local optimum and the global optimal will appear approximately coincident and the position of the whale will be updated to return to the original formula [42]. The improved expression is as follows:

In the above formula, indicates the optimal position of the th search agent of the humpback whale in the Von Neumann topology field , which can also be understood as being in this field . Each whale search agent's own fitness function obtains the historical optimal position of the particle corresponding to the minimum value; represents the optimal position of the whale in the global update.

3.2.3. Analysis of the Advantages of the Improved Whale Optimization Algorithm

(1)The VNWOA uses the Von Neumann topology. Within each neighborhood, the optimal solution found by the humpback whale at each central location affects only the other four whales in the same neighborhood. The full exchange of whales in the neighborhood can maintain the multidirectionality and diversity of the population.(2)While multiple neighborhoods maintain the diversity of fish stocks, avoiding the fact that a whale finds a local optimal solution prematurely, the whole population falls into local optimum, and it is improved in terms of maintaining globality and convergence speed.(3)By improving the whale position update formula in the algorithm, the midpoint of the local optimal position and the global optimal position is selected. In this way, the whale can not only be guided by the global optimum when the location is updated, but also enhance the whale communication ability in the local area. The update of its location is also affected by a part of the local optimum, thus enhancing the local search ability of the algorithm.(4)As the iteration progresses, the local optimal value gradually coincides with the global optimal value and the whale’s position update eventually returns to the original formula. The ability of the algorithm to achieve convergence and precision is enhanced.

3.3. VNWOA-Optimized LSSVM Algorithm Steps

Step 1 (data preprocessing): The time domain, frequency domain, and time-frequency domain characteristics are obtained for the original vibration signals and normalized. The feature reduction is performed by KJADE, and the training set and test set are divided by 7 : 3.Step 2: Initializing the whale position and setting the population to , and the maximum number of iterations is .Step 3: Determining the network topology of the VNWOA and the range of values for initializing the LSSVM, that is, the range of values of and gamma .Step 4: Calculating the corresponding fitness of each whale, and determining the initial optimal individual and initial optimal fitness according to the order of fitness.Step 5: Using the Von Neumann topology to perform neighborhood search, exchanging information in the neighborhood, finding the best whale in the neighborhood, and then following the formulas (19), (20), (26), and (28) to perform a location update.Step 6: The whale swims in a spiral shape to the prey while shrinking the encirclement. The position of the other whales is updated according to the selected whale position, forcing the whale to deviate from the prey thus obtaining the best training accuracy in the LSSVM.Step 7: Repeating steps 5 and 6. Until the maximum number of iterations is reached, the whale position with the best fitness is trained as a parameter of the LSSVM, and then the test set is diagnosed and classified.

4. Experimental Results and Analysis

4.1. Basic Steps for Diagnosis

(1)Vibration signal preprocessing: by selecting the normal state of the bearing and three types of faults (corresponding to three fault levels for each of the three fault types), a total of ten bearing states are used as verification objects. Then, the time domain and frequency domain of ten bearing states and the method of using VMD envelope spectrum are used to extract the time-frequency domain features of the rolling bearing so as to obtain the multidomain feature set of nonlinear characteristic parameters.(2)Feature fusion: KJADE is used for feature fusion in the high-dimensional feature matrix of the feature set, and the evaluation factor SS composed of between-class and within-class scatters is employed to depict the clustering performance of the proposed new features quantitatively. It is then compared to the PCA, LDA, KPCA, and JADE clustering effects.(3)Selection of fault signal for training and testing: a total of 1000 fault signal samples are randomly selected from ten types of bearing states. Among them, 700 samples were used as training samples and 300 samples were used as test samples and the data length of each sample was 15.(4)Fault diagnosis: the method used for fault diagnosis mainly uses the Von Neumann topology to improve the WOA and optimize the LSSVM regularization parameters and the parameters of the nuclear parameters. In the experiment, the maximum number of iterations is set to 100, and then the 700 sets of samples in step 3 are input to the whale position with the best fitness as the parameters of the LSSVM for training. Finally, the remaining 300 sets of test data are diagnosed.

4.2. Introduction to Experimental Equipment

The rolling bearing fault data used in this experiment were obtained from the Case Western Reserve University Bearing Data Center [43], which provides a large number of rolling bearing fault data. The corresponding experimental device is shown in Figure 6. The experimental support consists of a 2 hp motor (left), a torque sensor/decoder (middle), a dynamometer (right), and an electrical control unit (not shown). The treated bearing will be mounted on the drive end of the motor, while the inner ring of the bearing will follow the rolling axis but will remain stationary. The acceleration signal is used to record the vibration signal of the bearing during the whole process. The sampling frequency is 48 kHz. The torque sensor is mainly used to monitor the rotation speed of the bearing. The rotation speed is 1750 rpm, and the load motor provides the load for the bearing. In the test, the model data of SKF6205 deep groove ball bearing is used as the simulation data to test the bearing support motor shaft and three types of test bearings are prepared.

This experiment uses an acceleration sensor adsorbed on a magnetic body to obtain the vibration data. At the drive and blast ends of the motor frame, the sensor is placed at 12 o'clock. The vibration signal was recorded with a 16-channel digital recording recorder and saved as a Matlab (mat) format in the form of a file. The speed and power data are manually recorded by a torque sensor/decoder. By using the bearing test data as the original input signal, the nonlinear characteristic parameters of the bearing normal, inner ring fault, outer ring fault, rolling element fault, and three fault levels (a total of ten bearing states) in various fault modes are extracted. The EDM technology is used to test the inner ring, the outer ring, and the rolling element of the bearing, respectively. Single point damage failures of 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm) with a depth of 0.011 inches (0.2794 mm) were set [44, 45].

“Normal” indicates bearing under normal conditions; “IR” indicates inner ring failure; “OR” indicates outer ring failure; and “B” indicates rolling element failure. The ten bearing data obtained are numbered and indicated by I-X. The specific bearing data are shown in Table 1.

4.3. Feature Extraction

When the motor speed is 1797 rpm (the frequency conversion is also considered to be ), the drive end fault bearing is selected as the analysis object. In the case of bearing failure damage diameters of 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm), respectively, the bearing-vibration signals of the outer ring, the inner ring, and the rolling element position are selected as the object. In the case of bearing failure damage diameters of 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm), the bearing-vibration signals of the three failure positions were selected as objects and compared with healthy bearings. The bearing signals of the various bearing states collected therein are shown in Figures 7(a)7(j) correspond to the bearing health status, outer ring failure status, inner ring fault status, and rolling element fault status.

A time-domain waveform diagram of the bearing in a healthy state is shown in Figure 7(a), and it can be found that its vibration and noise are relatively small. The vibration waveform is disorderly and irregular, and the amplitude is small. Figures 7(b)7(d) show time-domain waveforms for the case, where the bearing inner ring failure damage diameter is 0.007 inches (0.1778 mm), 0.014 inches (0.3556 mm), and 0.021 inches (0.5334 mm). Similarly, Figures 7(e)7(j) are the three impaired waveforms corresponding to the rolling element and the outer ring, respectively. From the waveform in the figure, it can be concluded that when the bearing is partially damaged, an impact signal is generated, so a short-time low-frequency pulse occurs in the vibration signal of the faulty bearing. At the same time, this shock will arouse the high-frequency natural vibration and the amplitude will be modulated by this excitation.

Four characteristic indexes of the center frequency, frequency standard deviation, root mean square frequency, and frequency concentration of the bearing signal are extracted as frequency domain features, and the time-frequency domain characteristic signals are extracted by using the VMD envelope spectrum method to form a multidimensional feature set. Since the corresponding spectrum and the envelope spectrum have the same range of values on the abscissa, image blending is employed and normalized.

The two characteristics of the time-frequency spectrum and the envelope spectrum when the bearing is in normal operation is shown in Figure 8. It can be seen that the bearing has the most prominent frequency conversion, the spectrum is relatively simple, and the energy is concentrated in the low-frequency band. Figures 911 show the corresponding image of the rolling element. The signal is rather messy. This is because when the roller fails, the failed roller not only rotates with the inner ring, but also rotates itself, so the defect information is often submerged in the noise. In the signal, it is difficult to be as obvious as the outer ring and the inner ring, so we will see that there is a strong energy in both the low- and high-frequency bands. Figures 1217 show the image of the inner and outer rings of the bearing. We will find that the energy is mainly concentrated in the middle frequency band because the local position of the damage will cause the natural vibration of each component. On the other hand, for sensors with a resonant frequency less than 20 kHz, the resonance information caused by the impulse signal is also reflected in this frequency band. In addition, the high-frequency natural vibration of the bearing is modulated by the shock caused by these damages, so the fault information can be separated from the chaotic modulation signal by the demodulation technique. By comparing these processes, the results show that when the bearing develops from a normal state to a fault, the main energy in the spectrum will gradually shift from the low-frequency band to the middle-frequency band. At the same time, for the same part of the bearing, as the degree of damage increases, the amplitude corresponding to the same frequency will also become larger.

4.4. Feature Fusion

From the original bearing vibration information, the number of samples is 1000 and the multidomain feature set is extracted from the time domain, the frequency domain, and the time-frequency domain, thereby obtaining a feature matrix of . On this basis, the parameters of PCA, LDA, KPCA, and JADE are reasonably set to ensure the fairness of feature fusion. The PCA selects a k value of 3 and a guaranteed error of less than 0.05 (95% of the information is retained). The dimension reduction selected by LDA is 3. The kernel function used by KPCA is a radial basis function with a parameter value of 4. The dimension of JADE is taken as 3 for feature fusion and sensitive feature extraction. Through the above parameters and dimension settings, on the one hand, it is ensured that enough sensitive feature information is retained and the feature set can be displayed more conveniently and facilitated by us. The distribution of the feature fusions of the 10 fault states using PCA, LDA, KPCA, JADE, and KJADE is shown in Figures 1822.

It can be clearly seen from Figures 1822 that the low-dimensional features extracted by the KJADE method compared with PCA, LDA, KPCA, and JADE have very good clustering effect in the feature space. The KJADE-based feature fusion method is more compact, and the characteristics of each bearing state are basically concentrated in a small area. Moreover, there are few overlapping parts of various sample features and these distribution characteristics will facilitate the identification of different fault types. This indicates that the extracted features obtained by KJADE are more suitable as classifiers than those obtained by conventional methods. In order to better quantitatively describe the clustering performance of the extracted new features, the evaluation factor SS composed of between-class and within-class scatters is employed to depict the clustering performance of the proposed new features quantitatively. It can be seen from Table 2 that the clustering evaluation SS of KJADE features is higher than the traditional method, which further shows that the KJADE algorithm is suitable for extracting more stable and effective low-dimensional feature nonlinear low-dimensional feature components embedded in high-dimensional data. It also shows that KJADE can eliminate the redundancy between features and reduce the dimension of features to obtain effective low-dimensional features. It can reduce the complexity of decision making under the premise of retaining the classifier classification performance in fault diagnosis degree.

4.5. Fault Diagnosis Results

From the total sample of bearing vibration data, 700 samples were randomly selected as training samples and 300 samples were used as test samples, and the data length of each sample was 15. The characteristics of the training set are identified by the category and then input to the VNWOA-LSSVM training and compared with the LSSVM, GA-LSSVM, PSO-LSSVM, and WOA-LSSVM.

It can be seen from the fitness curve of Figure 23 that the fitness value of the VNWOA optimization algorithm proposed in this paper can reach 0.9784 and stabilize after 25 iterations. Its characteristics show that the convergence rate of the population is relatively slow at the beginning, and then the convergence speed is accelerated so that the algorithm quickly converges to the optimal fitness. Compared with several other algorithms, several other optimization algorithms will eventually have higher fitness values but slower convergence. The number of iterations will be 30 or more from the graph. This shows that the VNWOA algorithm is superior to other algorithms, indicating that this method can be used as an effective means of bearing fault pattern recognition.

In Figures 2428, the ordinates 1–10 represent the category labeling symbol I-X in Table 1 , indicating the type of diagnosis in the case of the algorithm. Comparing Figures 2428 with Table 3, it can be seen that the result of fault diagnosis of VNWOA-LSSVM can reach 98.67%, which is greatly improved compared to the result of fault diagnosis without optimized LSSVM of 85.67%. The diagnostic results of GA, PSO, WOA, and VNWOA-optimized LSSVM algorithm can clearly show that the diagnostic accuracy obtained by VNWOA-optimized LSSVM is the highest, indicating that VNWOA-LSSVM can accurately diagnose faults.

Figure 29 shows the time complexity of GA-LSSVM, PSO-LSSVM, WOA-LSSVM, and VNWOA-LSSVM algorithms. It can be seen from the figure that the time complexity of the algorithm for fault diagnosis of LSSVM optimization by different algorithms is also different. The population set in this study is 10, the ordinate is the running time of the algorithm, and the abscissa is the number of iterations. As the number of iterations increases, the time complexity of the four optimization algorithms is increasing. Moreover, it can be clearly found in combination with Table 3 that VNWOA-LSSVM is much more accurate in terms of diagnosis results, but it is also much more complicated in terms of algorithm complexity. On the other hand, it can be seen from Table 3 that after optimizing the LSSVM with the VNWOA algorithm, the training accuracy, test accuracy, and training variance are obviously superior to the other four algorithms, and the mean square error is relatively small, indicating that the algorithm has good stability and generalization ability.

5. Conclusion

Through the normal, inner ring fault, outer ring fault, rolling element fault and three fault levels in various fault modes (a total of ten bearing states), the time domain, the frequency domain, and the method using the VMD envelope spectrum, the time-frequency domain features are extracted to obtain the multidomain feature set of nonlinear feature parameters. Then, the KJADE is used to feature the high-dimensional feature matrix in the feature set. The extracted low-dimensional features have very good clustering in the feature space. The effect and the effective characteristics that are more sensitive to the bearing state can be obtained. At the same time, in the fault diagnosis, the Von Neumann topology is used to improve the WOA and optimize the LSSVM regularization parameters and the parameters of the nuclear parameters. The accuracy and convergence speed of LSSVM are improved and a good overall globality is maintained, thus improving the ability of fault diagnosis of rolling bearings. It shows that the proposed method for fault diagnosis of rolling bearings has a good diagnostic effect and also provides a diagnosis method for rolling bearing fault diagnosis.

Abbreviations

CNN:Convolutional neural network
DBN:Deep belief network
AR:Autoregression
BPNN:Back propagation neural network
PSO-SVM:Particle swarm optimization-support vector machine
HHT:Hilbert–Huang transform
IMF:Intrinsic mode function
EMD:Empirical mode decomposition
EDM:Electrical discharge machining
VMD:Variational mode decomposition
SVM:Support vector machine
LSSVM:Least squares support vector machine
PCA:Principal component analysis
LDA:Linear discriminant analysis
KPCA:Kernel principal component analysis
JADE:Joint approximative diagonalization of eigenmatrix
KJADE:Kernel function joint approximate diagonalization of eigenmatrices
WOA:Whale optimization algorithm
GA-LSSVM:Genetic algorithm-least squares support vector machine
PSO-LSSVM:Particle swarm optimization-least squares support vector machine
WOA-LSSVM:Whale optimization algorithm-least squares support vector machine
VNWOA-LSSVM:Von Neumann topology whale optimization algorithm-least squares support vector machine

Data Availability

The data used to support the findings of this study are available from the specific operational procedures upon request. Specific operational procedures, the bearing data used to support the findings of this study, have been deposited in the “http://csegroups.case.edu/bearingdatacenter/pages/download-data-file.” Operating procedures: (1) Slide the mouse and find‘Normal Baseline Data’. (2) Slide the mouse and find ‘Normal_2’. (3) The page returns to the previous page. (4) Slide the mouse and find’48k Drive End Bearing Fault Data’. (5) Slide the mouse and find ‘IR007_2’. (6) In the same way, click the mouse to download: B007_2, OR007@6_2, IR014_2, B014_2, OR014@6_2, IR021_2, B021_2, and OR021@6_2. (7) End.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This article was supported by Shanghai Polytechnic University Graduate Program Fund [EGD18YJ0003].