Abstract

An increasing popularity of researches focuses on the vibration signal with the characteristics of nonstationary, nonlinear, and strong noise interference. A nonlinear dimension and feature reduction method called multiple empirical mode entropy decomposition-nonlocal orthogonal preserving embedding (MEMED-NLOPE) is proposed to implement condition monitoring in this paper. Different from multiple empirical mode decomposition (MEMD), MEMED adopts maximum entropy method, which can directly output the subsignal with the maximum correlation and realize nonlinear dimensionality reduction. Besides, multiscale feature extraction method is used during preprocessing nonlinear data process, which realizes feature reduction. Finally, nonlocal orthogonal preserving embedding algorithm-exponentially weighted moving average (NLOPE-EWMA) realizes the automatic detection of the fault. Taking the laboratory rolling bearing test and naval gun pendulum mechanism test as cases, the effectiveness of MEMED-NLOPE is verified.

1. Introduction

Mechanical components as the vital parts of mechanical equipment are prone to wear and cracks on the surface with long-term overload operation. Wear increases the mechanical components transmission error, generally resulting in increased vibration, noise, and dynamic loads [1]. If the early minor damage of components cannot be detected in time, once the fatigue deteriorates and the parts break, the mechanical equipment will be shut down. With the deterioration of the fault degree, the mechanical equipment may be shut down for a long time, resulting in catastrophic failures and unexpected economic losses [2]. Therefore, condition monitoring of mechanical components is an effective measure to avoid the continuous deterioration of parts after damage. Vibration signals are widely used to characterize the state of mechanical equipment because of their ease of acquisition, but usually the collected vibration signals have many interference components and have nonstationary and nonlinear characteristics, which also bring difficulties to fault diagnosis.

In recent years, the multivariate statistical process monitoring (MSPM) technology is often used to detect faults in industrial production processes, such as partial least squares (PLS) [3], principal component analysis (PCA) [4], and independent component analysis (ICA) [5]. Those traditional monitoring methods process the intermediate data by dimensionality reduction and extract a small number of components to construct the monitoring statistics that can reflect the characteristics of the original data. At this time, the performance of dimensionality reduction will affect the monitoring effect.

Different from the dimensionality reduction method that maintains the global data structure, manifold learning is used to maintain the characteristics of local data structure, such as locally linear embedding (LLE) [6], Laplacian eigenmap (LE) [7], local preserving projections (LPP) [8], and neighborhood preserving embedding (NPE) [9]. Both LPP and NPE belong to linear projection methods, but these methods may lose the key information contained in the global data structure because they only consider the neighborhood relationship to maintain the local characteristics. Therefore, in order to consider the global and local data structure characteristics, a method combining LPP and PCA method is proposed [10, 11]. The test results show that its monitoring performance is better than that of single method. Besides, orthogonal neighborhood preserving embedding (ONPE) is developed from NPE [12]; by setting additional orthogonal constraints on the projection vector, it not only maintains the characteristics of local structure but also avoids the distortion defects of NPE [13]. In order to fully consider the global and local structure characteristics of data, combined with the basic principles of PCA and ONPE algorithm, a nonlocal orthogonal preserving embedding (NLOPE) algorithm is proposed [14]. However, those methods still belong to linear method and have limitations in dealing with nonlinear data.

In the data preprocessing stage, empirical mode decomposition (EMD) is often used to describe the characteristics of nonlinear and nonstationary signals [15]. However, when processing multiple signals (multichannel signals), EMD may lead to different number and frequency scale of IMF for signal decomposition of each channel [16]. The proposal of multivariate empirical mode decomposition (MEMD) [17] ensures the matching of IMF components in quantity and scale. However, in the process of data preprocessing, the dimension of subsignals and features may increase, which will affect the effect of condition monitoring. To realize subsignals and nonlinear dimensionality reduction, entropy has been widely developed and used in this field, which can measure the correlation, uncertainty, and complexity of signals and features [18, 19].

In this paper, a linear dimension and feature reduction method called multiple empirical mode entropy decomposition-nonlocal orthogonal preserving embedding (MEMED-NLOPE) is proposed on the basis of MEMD and NLOPE. To reduce the redundancy of subsignal set and reduce the complexity of the system, MEMED takes both advantages of MEMD and maximum entropy method into account. To verify the effectiveness of MEMED-NLOPE, MEMED-NLOPE and MEMD-NLOPE are employed to detect the faults of naval gun pendulum mechanism, and MEMED-NLOPE is verified by the experimental data set of rolling bearing in laboratory.

The rest of the paper is organized as follows. MEMD, PCA, ONPE, and NLOPE are reviewed and analyzed in Section 2. The proposed MEMED-NLOPE is developed in Section 3. In Section 4, two cases are used to demonstrate the effectiveness of the proposed method. Finally, conclusions are drawn in Section 5.

2. Background Techniques

2.1. MEMD

EMD is suitable for one-dimensional real signals. For the processing of multichannel signals (multichannel signals), the EMD method often needs to solve the single channel signals, respectively, which may lead to the different number and frequency scale of IMF decomposed by each channel signal; that is, there is the problem of oscillation mode calibration of different channels, which is not conducive to the synchronous correlation analysis between multichannel signal channels. Although CMED [20], BEMD [21], and TEMD [22] are Multivariate Applications of EMD methods in multivariate data, they are limited to multivariate data: only binary and ternary signals. For real multivariate signals, it is still impossible to decompose the signal on the premise of correctly analyzing the physical meaning of the signal. The proposal of MEMD realizes the multichannel synchronous joint analysis of multichannel signal oscillation modes, obtains the common modes of different channels, ensures the matching of intrinsic mode function (IMF) components in quantity and scale, and solves the problem of mode calibration of multichannel signals.

The specific implementation of MEMD can be summarized as the following steps.

Let an -dimensional vector group sequence represent a -tuple signal, the length of the signal sequence is , and represents the direction vector set of the corresponding angle on the -dimensional sphere. If you want to establish direction vectors in spherical space, then .(1)The Hammersley sequence sampling method is used to obtain a suitable set of uniform sampling points on the -dimensional sphere, that is, the direction vector of the -dimensional space.(2)The mapping of the input signal on each direction vector is calculated.(3)Determine the instantaneous time corresponding to the extreme value of the mapping signal of all direction vectors, and represents the extreme point position, .(4)The extreme point is interpolated by multivariate spline interpolation function to obtain multivariate envelopes .(5)For direction vectors in spherical space, the mean of -tuple signal is as follows:(6)Extract the intrinsic mode function through . If meets the judgment standard of multivariate IMF, take the result as the input signal in step (2), continue the iterative calculation in steps (2)∼(6), and extract a new multivariate IMF component ; otherwise, take as the input signal of step (2) and continue the iteration of steps (2)∼(6).

After a series of MEMD decomposition processes, similar to the EMD algorithm, the original -tuple signal is decomposed into a series of addition forms of IMF and Residual , as follows:where represents the decomposed multivariate IMF function, is , is , corresponding to groups of IMF components and margins of -tuple signals, respectively. The number of IMF decomposed by each channel of multivariate signal is the same, and the frequencies of IMF in each layer are different. The first decomposed IMF frequency is high, and then the decomposed IMF frequency is low, and the decomposed residual frequency is the lowest. The IMF corresponding to each variable of -tuple signal is aligned according to the frequency scale in channels to form multiple IMF.

2.2. Principal Component Analysis

PCA, namely, principal component analysis, is one of the most widely used data dimensionality reduction algorithms. In this study, the PCA algorithm is implemented based on eigenvalue decomposition covariance matrix. The specific steps are as follows:Step 1: input data set , which needs to be reduced to dimension.Step 2: deaveraging (i.e., decentralization), that is, each feature subtracts its own average.Step 3: calculate the covariance matrix . Note: dividing or not dividing the number of samples or has no effect on the calculated eigenvector.Step 4: find the eigenvalue and eigenvector of covariance matrix by eigenvalue decomposition method.Step 5: sort the eigenvalues from large to small, and select the largest of them. Then, the corresponding x eigenvectors are used as row vectors to form the eigenvector matrix .Step 6: convert the data into a new space constructed by eigenvectors, i.e., .

2.3. Orthogonal Neighborhood Preserving Embedding

Given data set , as a kind of linear dimensionality reduction method, the goal of the orthogonal neighborhood preserving embedding (ONPE) algorithm is to reduce the dimension of high-dimensional data to low-dimensional data , that is, , using a transformation matrix , and the low-dimensional data can express the essential characteristics of the original high-dimensional data. The NPE algorithm is the basic form of ONPE. NPE maintains the local characteristics in the data structure by constructing the neighborhood graph between adjacent samples. Therefore, each sample can be expressed as a linear combination of adjacent samples and their corresponding weight coefficients. The weight coefficient matrix minimizes the following objective functions:

In order to fully maintain the local characteristics of the data structure, the high-dimensional spatial data are mapped to the low-dimensional feature space to obtain , and the weight coefficients between and its nearest neighbors will be projected to the low-dimensional feature space to be saved to characterize the connection relationship between and its nearest neighbors. The low-dimensional mapping of high-dimensional data can calculate the following loss functions:where , , is the number of nearest neighbors in the neighborhood of . If is not the nearest neighbor of , there is .

ONPE adds an orthogonal constraint on the basis of NPE, that is, mapping high-dimensional data to low-dimensional feature space through an orthogonal projection matrix . According to (3) and (4), the projection matrix is calculated by the following formulas:where , . Through iterative calculation by Lagrange operator, the expression of orthogonal matrix is as follows:(a) is the eigenvector corresponding to the minimum eigenvalue of matrix ;(b) is the eigenvector corresponding to the minimum eigenvalue of matrix , where iswhere .

2.4. Objective Function of Nonlocal Orthogonal Preserving Embedding

In order to fully consider the global and local structure characteristics of data, combined with the basic principles of PCA and ONPE algorithm, a nonlocal orthogonal preserving embedding (NLOPE) algorithm is proposed. Assuming data set , the objective function of NLOPE is as follows:s.t. ,where , .

Using Lagrange operator, projection matrix can be obtained by calculating the following feature decomposition problem:(a) is the eigenvector corresponding to the minimum eigenvalue of matrix ;(b) is the eigenvector corresponding to the minimum eigenvalue of matrix , where iswhere , is the dimensions of data in NLOPE feature space, + , , and .

For the new sample , the mapping in the low-dimensional NLOPE feature space iswhere .

The detailed derivation and calculation of projection matrix are shown in [12]; there is

Thus, is the eigenvector corresponding to the minimum eigenvalue of matrix , and the expression of is as follows:

2.5. Calculation of Parameters

In the construction of the NLOPE model, parameter makes the global data structure characteristics and local data structure characteristics occupy different components in the above model. The selection of parameter affects the extraction of potential features in the data and then affects the effect of mechanical equipment fault detection, fault detection, and degradation performance evaluation.

It can be seen from (12) that the objective function of NLOPE is composed of two subobjective functions. Therefore, the objective function optimization problem of the NLOPE model is essentially a double objective optimization problem. Usually, it is difficult to obtain the optimal solution of the two subobjective functions at the same time. However, by balancing the two subobjective functions, a relatively better solution can be obtained.

By balancing the global data structure characteristics and local data structure characteristics of the model, the calculation of parameter is as follows:where and represent the energy changes of and , respectively.

According to (7), parameter is used to balance matrix and matrix in the NLOPE model, which can be regarded as the energy change of balance and . Based on the principle of the PCA method, the eigenvectors corresponding to the first few large eigenvalues can characterize the distribution of matrix energy. Therefore, the maximum eigenvalues of matrix and matrix can be used to estimate the energy change.

In the NLOPE model, parameter is calculated as follows:where is the spectral radius of the matrix, and matrix and matrix are defined in (7).

2.6. Detection Index

Hotelling’s and statistics are often used as indicators of industrial process fault detection to judge whether the production process is abnormal. Hotelling’s is used to measure the change of sample variables in the potential variable space, and is mainly used to measure the change of sample variables in the residual space. When the statistics or exceed their respective control limits, it indicates that the process may be abnormal. and are calculated as follows:where is the low-dimensional feature sample of sample projected in NLOPE feature space, and is the covariance matrix of the projection vector of training sample in NLOPE feature space.

Among them, .

In order to detect the early faults of mechanical equipment more accurately and reliably, the exponential weighted moving average (EWMA) is proposed by integrating and statistics. The statistic is a linear combination of and statistics, includingwhere and are the control limits of statistics and , respectively, which can be calculated by kernel density estimation (KDE). and normalize and to (0, 1), respectively.

The detection index is calculated as follows:where represents the detection index, which is composed of the current index quantity and the historical index quantity, and is the smoothing coefficient between (0, 1). When takes a larger value, the current detection quantity has a larger proportion in the detection quantity than the historical detection quantity . The control limit of the detection amount is also calculated by the kernel density estimation method. In this chapter, the smoothing coefficient takes the empirical value of 0.2.

3. Proposed Condition Monitoring Model

Based on the analysis of the above background techniques, the MEMED-NLOPE model is proposed. Firstly, the architecture of MEMED-NLOPE is proposed; secondly, the preprocessing stage of nonlinear data is described in detail; finally, the steps of the automatic fault detection model based on NLOPE-EWMA is described.

3.1. Proposed Architecture

The MEMED-NLOPE model is mainly divided into two parts. The first part is nonlinear data preprocessing, and the second part is automatic fault detection model.

For the first part, MEMED decomposes the signals collected by each sensor, quantitatively analyzes the correlation and orthogonality between the multiscale subsignal and the original signal, selects the subsignal with the maximum correlation for the preliminary extraction of multi domain features, and uses the feature measurement criterion based on mutual information to optimize and eliminate redundant features, as the input of the fault detection model.

For the second part, it proposes condition monitoring model, adopts PCA which extracts the correlation between multidimensional variables from the historical normal operation data, and diagnoses abnormalities through their unexpected changes, but PCA only considers the global structure relationship between samples and ignores the local structure relationship. Therefore, based on the PCA multivariate statistical process monitoring method, combined with the local orthogonal preserving embedding (ONPE) algorithm, this project proposes NLOPE, which uses exponential weighted moving average (EWMA) statistics as detection index to realize the construction of the condition monitoring model. The research scheme is shown in Figure 1.

3.2. Preprocessing Nonlinear Data
3.2.1. MEMED

(1) Maximum Entropy Method. Based on the information entropy theory, the mutual information between different subsignals and source signals is measured to characterize the correlation of subsignals, reduce the redundancy of subsignal set, and reduce the complexity of the system. The formula of information entropy is as follows:where represents the amount of information of :

is the probability of occurrence of . The number of information ontologies contained in a randomly generated event is only related to the probability of occurrence of the event. The lower the probability of an event, the larger the information ontology contained in the received information when the event really occurs. The meaning is that the event with probability 0 has a large amount of information; on the contrary, it has a small amount of information. The reason for taking logarithm is to make the product sum. Two independent events : and .

Information entropy is the mathematical expectation of information.

Mutual information is

Average mutual information is the mathematical expectation of mutual information:

From the formula,

(2) MEMED Flow Chart. Based on MEMD, MEMED adopts the maximum entropy method to output the subsignal with the maximum correlation, which realizes the function of dimension reduction and avoid data explosion. The specific flow of MEMD is shown in Figure 2.

3.2.2. Multiscale Feature Extraction Method

(1) Fault Feature Construction Method. The vibration signal is used to evaluate the running state of mechanical equipment. Generally, the corresponding features are extracted from the time domain, frequency domain, and time-frequency domain of the signal as the basis of diagnosis. Time-domain analysis is to describe the change of signal waveform and amplitude with time. Frequency domain analysis is to describe the change of signal power or energy with frequency. Time-frequency analysis is to study the change of signal spectrum with time and represent the distribution of signal strength or energy in both time and frequency dimensions.

Time-domain and frequency-domain features generally include root mean square, kurtosis, skewness, peak factor, spectral mean square deviation, and envelope spectral variance. Table 1 contains 11 times domain characteristic parameters () and 13 frequency domain characteristic parameters (). In each characteristic expression, is the time-domain signal sequence, , are the number of samples, is the spectrum of signal , , are the number of spectral lines, and is the frequency value of the spectral line. The time-domain characteristic parameters and describe the amplitude and energy changes of the time-domain signal; and describe the time series distribution of time-domain signals. The frequency domain characteristic parameter describes the change of frequency domain energy; , and reflect the concentration and dispersion of the spectrum; , reflects the change of the position of the main frequency band.

The time-frequency domain features include sample entropy, permutation entropy, wavelet energy entropy, and EEMD information entropy, which are generally calculated by time-frequency analysis methods such as wavelet analysis and empirical mode decomposition.

(2) Fault Feature Selection Method. Similarly, based on the information entropy theory, the mutual information between different features is measured to characterize the correlation between features and reduce the redundancy of feature sets.

3.3. Automatic Fault Detection Model Based on NLOPE-EWMA

The offline modeling steps based on NLOPE are as follows:Step 1: the features of training samples after MEMD adaptive decomposition, multiscale subsignal selection, and multi-scale feature extraction are constructed as feature samples, and the feature samples are standardizedStep 2: calculate the projection coefficient matrix from (9)Step 3: calculate the sum SPE statistics of all training samples, and calculate the sum of control limits, so as to calculate the detection index EWMA and its control limits

The steps of online detection based on NLOPE are as follows:Step 1: after multiscale feature extraction, the feature samples of each test sample are constructed, and the feature test samples are standardized by using the mean and variance of the training feature samplesStep 2: calculate the projection of the test sample in the low-dimensional feature space from (10)Step 3: calculate the detection amount EWMA corresponding to the test sample and judge whether it exceeds the monitoring limit

The fault detection process based on the NLOPE method is shown in Figure 3. Using the NLOPE method, the offline detection model is constructed by using normal vibration signals, the new data samples are input into the detection model, and the fault detection of mechanical equipment can be carried out by calculating the detection indexes.

4. Experimental Verification and Analysis

To verify the effectiveness of MEMED-NLOPE, the laboratory rolling bearing test and naval gun pendulum mechanism test are taken as cases. Besides, the programming software used in the experiment is MathWorks Matlab R2018a, and the computer configuration is Core i7-10875H CPU @ 2.30 GHz.

4.1. Experimental Data Set of Naval Gun Pendulum Mechanism
4.1.1. Experimental Design

The life test of typical mechanical parts of naval gun is carried out by using the test bench of energy storage mechanism of single 130 mm naval gun pendulum. In the test, the data of the health and fracture damage of the pressing plate and the health and crack damage of the roller are collected. The damage of mechanical parts is shown in Figures 4 and 5. In the test, six vibration acceleration sensors, numbered a1–a6, and acoustic sensors are arranged near the sliding plate and pressing plate mechanism of the pendulum. The location of the sensor measuring points is shown in Figure 6. Two vibration acceleration sensors (No. a7-a8) are arranged near the roller track, and the measuring point positions of the sensors are shown in Figures 7 and 8.

The test data collected are composed of the following:(1)Platen Data. The composition of platen data collected in the test is shown in Table 2, and the sampling frequency is 10 kHz. The number of times of one test in the table indicates that the artillery test bench has completed a complete action cycle of latch closing, recoil, reentry, latch opening, lower swing, and upper swing.(2)Roller Data. The roller test adopts rollers in two states, and the test data composition is shown in Table 3.

4.1.2. Condition Monitoring and Analysis of Pendulum Mechanism

MEMD-NLOPE and MEMED-NLOPE are used to monitor the condition of platen in different status, and the results are shown in Figure 9.

It can be seen from Figure 9 that under the condition monitoring model constructed by MEMD-NLOPE, when the monitoring object is the healthy platen and roller, some of the detection indicators of the test sample exceed the monitoring limit; the part exceeding the monitoring limit indicates that the detection model generates false alarm. While when the monitoring object is the platen and roller tending to be damaged, some of the detection indicators of the test sample are below the monitoring limit. Under the condition monitoring model constructed by MEMED-NLOPE, most of the detection indicators of the test samples are within the normal monitoring range, and the collected training samples are consistent with the state of the naval gun platform.

4.1.3. Summary

Through the action cycle test of mechanical mechanism on the naval gun test bench, the data information of key mechanical parts in the damaged state is obtained. Using the proposed detection model, the normal operation state and abnormal operation state of the naval gun test bench are detected and analyzed. The performance evaluation results based on MEMD-NLOPE and MEMED-NLOPE are shown in Table 4, respectively.

According to the results in Table 4, the performance of condition monitoring based on MEMED-NLOPE is better than that based on MEMD-NLOPE, and the average accuracy of normal and damage detection of platen and roller is greater than 90%, indicating that MEMED-NLOPE can determine the normal operation state of pendulum mechanism and detect the faults of mechanical components. However, when the platen is in a healthy state, the false alarm rate is little high, which reflects that MEMD-NLOPE has certain instability in this test.

4.2. Experimental Data Set of Rolling Bearing in Laboratory
4.2.1. Experimental Design

The mechanical failure test bench used in the laboratory is purchased from Anhui Chaokun Testing Equipment Co., Ltd. The test of bearing is shown in Figure 10.

The data used in the study are shown in Table 5.

4.2.2. Condition Monitoring and Analysis

MEMED-NLOPE is used to monitor the condition of bearing in different status, and the results are shown in Figure 11; the corresponding data of different test groups are as follows:(a)Zc_Data1 (Taining data), Zc_Data2 (First group of test data), and Zc_Data3 (Second group of test data)(b)Zc_Data1 (Training data), Zc_Data4 (First group of test data), and Zc_Data5 (Second group of test data)(c)Zc_Data1 (Training data), Zc_Data6 (First group of test data), and Zc_Data7 (Second group of test data)(d)Zc_Data1 (Training data), Zc_Data8 (First group of test data), and Zc_Data9 (Second group of test data)(e)Zc_Data1 (Training data), Zc_Data10 (First group of test data), and Zc_Data11 (Second group of test data)

4.2.3. Summary

Through the rolling bearing test on the mechanical fault test-bed in the laboratory, the data information of the bearing under different states is obtained, and the normal operation state and abnormal operation state of the bearing are detected and analyzed by using the proposed detection model. The performance evaluation results of MEMED-NLOPE are shown in Table 6.

It can be seen that MEMED-NLOPE can detect the bearing in different states, and its performance is verified.

5. Conclusions

In this paper, a linear dimension and feature reduction method called multiple empirical mode entropy decomposition-nonlocal orthogonal preserving embedding is proposed. In order to reduce the dimension of multivariate signals and consider the correlation between sub signals and source signals, MEMED adopts the maximum entropy method to directly output the subsignal with the maximum correlation. Then, the multiscale feature extraction method reduces the redundancy of feature set by describing the correlation between features. Finally, the automatic fault detection model based on NLOPE-EWMA is proposed to realize condition monitoring. Based on the results of two cases, the performance of condition monitoring based on MEMED-NLOPE is verified, in which the average accuracy of normal and damage detection is higher in comparison with MEMD-NLOPE. For the future work, the massive amounts of data from multiple sensors could be considered for naval gun in health condition monitoring and fault diagnostics.

Data Availability

The experimental data set of naval gun pendulum mechanism data and rolling bearing in laboratory used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by National Nature Science Foundation of China (nos. 61640308 and 61573364) and Nature Science Foundation of Naval University of Engineering (no. 20161579).