Abstract

In order to diagnose bearing faults under different operating state and limited sample condition, a fault diagnosis method based on adjusted spectrum image of vibration signal is proposed in this paper. Firstly, the Davies–Bouldin index (DBI) is employed to select a proper capture focus (CF) and image size, and the spectrum of vibration signal is computed via fast Fourier transformation (FFT) and adjusted according to the average rotating speed. Then, the spectrum is plotted and captured as a two-dimensional (2D) image with the optimized CF and image size. Two-dimensional principal component analysis (2DPCA) is used to reduce the dimension of images, and finally a nearest neighbour method is applied to classify the faults of bearings. Two experiments are carried out to validate the effectiveness of the proposed method. Besides, a further investigation on the effect of spectrum frequency resolution is conducted and a recommended selection method of frequency resolution is given based on the experimental performances. In our method, the training samples could be from only one operating condition, while the testing samples are from all possible operation conditions. All experiment results have demonstrated that the proposed method could achieve high classification accuracy even with very limited training samples.

1. Introduction

As the key element of rotating machinery, the faults of rolling bearing could lead to mechanical breakdown and great economic loss. In general, the operating speed of rolling bearing is slightly fluctuant due to the influences from the load, controller, and other components; therefore, its vibration signals are commonly considered approximately stationary. However, rolling bearing may operate in different speeds, which results in great challenges for its accurate fault diagnosis, especially when the number of fault samples is small. Most fault diagnosis methods are based on vibration signals [13], and the diagnostic procedure mainly includes two steps: (1) extracting features from the vibration signal; and (2) classifying features with a classifier. In last decades, many methods were proposed to improve the diagnosis accuracy in two different ways. One is to find proper features in order to represent the characteristics of faulty vibration signals; and the other one is to find proper classifiers with strong classification capability.

The features could be extracted in time domain, frequency domain, or time-frequency domain [4], such as peak amplitude, skewness, kurtosis, fractal dimension, Fourier spectrum, cepstrum, and envelope spectrum. Images were also utilized as features for machine fault diagnosis in recent years. For example, Li et al. [5] proposed a method of feature extraction with spectrum image for bearing fault diagnosis. Amar et al. [6] proposed to use vibration spectrum imaging feature for denoising and fault classification. Klein et al. [7] utilized time-frequency image of vibration signal to detect fault signatures with an object detection method. Griffaton et al. [8] realized bearing damage detection of aircraft engines through enhanced visual analysis with the help of image processing method. This feature expression of image for one-dimensional (1D) vector seems to be a very promising way to dig the intrinsic information contained in the vibration signal.

Once the fault features are extracted, the last step is to perform the fault classification to recognize the corresponding health state. Fault classification can be realized with minimum distance classifier, k-nearest neighbour (kNN) classifier, artificial neural network (ANN), support vector machine (SVM), and so on [4]. With proper features, those classifiers may achieve acceptable accuracy of diagnosis. When noise is heavy or the number of fault samples is small, the accuracy may decrease significantly. Recently, deep neural networks (DNNs) were applied for fault diagnosis of rotating machine [911]. Such DNNs can extract useful features from some raw information and achieve high classification accuracy. Nevertheless, DNNs usually require a high number of training samples and high computation efforts. When the operating conditions are different, the training samples under all operating conditions may be required, and the training effort could be extremely high. Therefore, a robust method of feature extraction or pattern classification needs to be further investigated to improve the performance on fault diagnosis.

Focusing on extracting a more effective feature, we proposed to conduct bearing fault diagnosis with adjusted vibration spectrum images. This work is an extension of the spectrum images proposed in our previous publication [5], where spectrum images are first proposed as features for bearing fault diagnosis and preferable diagnostic effects are reached especially in the case of acquiring samples under the same working condition. However, it exhibits a poor performance when the training and testing samples are derived from different operating conditions (see the results marked with in [5]) due to the operating-condition difference and the nonoptimized parameters such as frequency resolution, image size, and capture focus of the spectrum images. Driven by the desire to extract a more robust feature for different operating conditions, an improved fault diagnosis method with adjusted spectrum images is proposed in this paper. We consider a more difficult situation: (1) the operating condition could be different, i.e., the rotating speed and/or load is different; (2) the number of training samples is very limited, and the training samples could be from only one operating condition. Firstly, a validity metric is introduced to optimally select the parameters for generating images, and the spectrum of vibration signal is computed with the help of the fast Fourier transformation (FFT). Secondly, the frequency lines of the spectrum are adjusted according to the rotating speed frequency to reduce its influence on the spectrum image. Thirdly, such adjusted spectrum is captured as a two-dimensional (2D) image with the optimized parameters and then processed through two-dimensional principal component analysis (2DPCA) to reduce its dimension. Finally, fault classification is achieved with a simple nearest neighbour classifier (NNC). Experimental examples are applied to demonstrate the effectiveness of the proposed method. And it is shown that classification accuracy using such features is higher than that obtained by directly using the FFT spectrum.

The main contributions of this paper can be concluded as follows: (1) the influence of operating conditions (especially for the rotating speed) on vibration spectrum is reduced through rotating-speed-adjusting manipulation; (2) the adjusted vibration spectrum with optimized parameters is captured as image feature to perform bearing fault diagnosis; (3) higher classification accuracy can be achieved by serving such adjusted spectrum images as features even in the case where simple classifier and very limited samples are employed especially under different operating conditions.

The paper is organized in the following way. Section 2 presents the generation of adjusted spectrum images, and the parameter determination for generating images is depicted in Section 3. Section 4 is dedicated to illustrating 2DPCA and classification measure, including feature processing with 2DPCA and fault classification based on NNC. Section 5 provides the experimental analysis and discussion is given in Section 6. Finally, Section 7 presents the conclusions.

2. Generation of Adjusted Spectrum Images

The spectrum of vibration signal can be computed through FFT. Such a spectrum contains lots of useful information of bearing, and the vector of spectrum coefficients is widely used as features for fault diagnosis [4, 9]. As we know, the bearing characteristic frequencies are associated with the rotating speed. As a result, the frequency lines corresponding to those characteristic frequencies and their harmonics could be dominant components of the spectrum. When the rotating speed is different, those dominant frequency components are also changing with the speed. In this case, bearing diagnosis becomes difficult. Hence, we propose to use the adjusted spectrum image as feature for fault diagnosis instead of the spectrum coefficient vector.

Firstly, the spectrum of vibration signal is computed via FFT. We plot the spectrum as a diagram, where the x-axis is the frequency and the y-axis is the amplitude. As discussed before when the speed of bearing is changing, the structure of the diagram is also changing. It is reasonable to adjust all frequency lines of the spectrum according to the rotating speed such that the change of diagram structure can be reduced, i.e.,where is the average rotating speed in Hz, is the frequency line of the original spectrum, and is the frequency line of the adjusted spectrum.

After adjusting, the spectrum diagram can be captured as an image with a given size. Two parameters of the image are selectable, i.e., the coordinate range and image size when storing. The coordinate range includes the range of the x-axis () and the range of the y-axis (). determines the frequency range shown in the image, and determines the amplitude range shown in the image. We call and together as the capture focus (CF). Later, we will show the selection method of CF and image size.

The spectrum is actually a 1D vector of FFT coefficients, while the spectrum image is a 2D representation of the 1D vector. Such images can keep the geometric information of frequency lines and provide richer information for fault diagnosis than the 1D vector of spectrum. By adjusting the spectrum according to the rotation speed, the obtained images are robust against speed variations. In order to demonstrate the effectiveness of the proposed adjusted spectrum images in suppressing the interference of rotating speed and reducing the changes of diagram structure, a comparison is conducted between before and after spectrum adjusting using vibration signals collected from the drive end bearing with inner-race fault size being 0.021 inches under two speed conditions [12]. The analyzed signals both have a length of 1024 data points in time domain. Moreover, they are both transformed into frequency domain through N-point discrete Fourier transform (DFT), where N is derived as the next power of two from the length of the time series, and as well. For simplicity, the coordinate system are not hid to present the role of the spectrum image adjusting, and the CF of the original and adjusted spectrum are set as and , respectively.

To be clear, the spectra’s places may influence diagnostic performance when the spectrum plot, in fact a 2D matrix, is captured as feature for fault diagnosis. Hence, we construct an index to evaluate the performance of adjusted spectrum as follows:where stands for calculating absolute value, and denote the area between the Fourier transform curve and the horizontal axis (frequency axis) under rotating speed frequency and , respectively, and is the spectrum scaling factor for horizontal axis, i.e., for the original spectrum and for the adjusted one here. Obviously, a smaller value of H indicates a better similarity between two curves from the view of image.

As shown in Figure 1(a), the frequency spectrum may mismatch due to the variation of rotating speed. However, the adjusted frequency is employed to plot the spectrum in Figure 1(b). In this process, the spectrum line under the same order of different rotating frequencies is placed in the same position of the adjusted spectrum plot. Obviously, a smaller H represents a better case for fault diagnosis, for the original spectrum, , and for the adjusted spectrum, . It is clear that the adjusted images could decouple the rotating speed for the similar health condition of machines. This adjusted spectrum obtained as image shows its stronger robustness to speed variations thanks to the adjusting manipulation. It will be demonstrated by the subsequent experiments.

3. Determination of CF and Image Size

The main concept of our approach is to catch the structural information of spectrum to perform bearing fault diagnosis. A vibration spectrum image with different sizes may contain different information of the spectrum structure. Generally, the larger a spectrum image is, the more abundant information it carries within a certain range. However, a larger image could consume more storage space and increase the influence of noise. Moreover, different CFs can mostly capture different levels of information in the adjusted spectrum images, and a CF specifies the window for capturing images to define the feature contents. Therefore, it is crucial to determine these two parameters when applying to bearing fault diagnosis.

In this research, a validity indicator, Davies–Bouldin index (DBI) proposed in [13], is employed to evaluate the image size and the CF for capturing adjusted spectrum images. The DBI is defined as the ratio of within-class scatter to between-class scatter and its calculating process can be concluded as follows:

Firstly, the within-class scatter for each cluster (i.e., the fault category in this research) is calculated aswhere is the centroid of the ith cluster, is the number of data samples in ith cluster, represents the data sample in cluster, and p is generally set as 2 in real applications.

Secondly, the between-class scatter for two different clusters is computed aswhere and are the column vector of and , respectively.

Finally, the so-called DBI is denoted aswhere is the number of clusters, and .

From the calculation process of the DBI, it is not hard to find that a image size or CF for capturing adjusted spectrum images with a smaller DBI could be beneficial to achieving better classification.

4. Feature Processing and Classification

Since the size of adjusted spectrum images is large, the computation effort could be considerably high if we carry out diagnosis by directly using those images. Hence, the adjusted spectrum images are firstly processed by 2DPCA in order to reduce their dimension and then classified with the NNC.

4.1. Feature Processing with 2DPCA

Two-dimensional principal component analysis (2DPCA) is a dimension reduction method for image representation. In 2DPCA, the image covariance matrix is constructed directly using the original image matrices, and its eigenvectors are derived for image feature extraction. The principle of 2DPCA can be found in [14].

As illustrated in Figure 2, M adjusted spectrum images are firstly obtained with the size of pixels. Then, the global image scatter matrix G is evaluated according towhere is the image, and is the averaged image of all the M images, i.e., .

Secondly, the eigenvalue and the corresponding eigenvector of G are calculated. Then is normalized and sorted in descending order by . That is, , and the corresponding eigenvalue is , where

The first d largest eigenvalues are selected according to Equation (8):where γ is called eigenvalue contribution and . Then, the projection basis is constructed as

Finally, all the M samples are projected into the so-called eigenspace, and the corresponding eigenimages are obtained. Given an adjusted spectrum image B, it can be projected according towhere . Then the eigenimage of B is obtained as .

4.2. Classification Based on NNC

The projected eigenimage with reduced dimension will be obtained after an adjusted image is processed with 2DPCA. Then the following NNC [15] is applied for fault classification.

Suppose that there are M training images being categorized into classes and the classification labels are defined as

Denote E as the eigenimage of a testing image F, and is the eigenimage of the ith training image. The distance function of F is defined aswhere denotes the Euclidean distance between and , and represents the rth vector of matrix E, and represents the rth vector of matrix .

Then the decision-making process is carried out according towhere means the training image. If the training image belongs to the class , then the classification label of the testing image F is determined as . In fact, the testing image is classified according to the nearest training image, where the Euclidean distance is used.

Other classifiers, such as ANN and SVM, can also be applied to classify the dimension-reduced features, but the model parameters are hard to optimize with limited training samples during the training process. However, the NNC is without training procedure for model-parameter determination during fault classification. Hence, selecting NNC as the classifier for fault diagnosis in this work mainly takes account of the computational efficiency and the limited sample condition.

Therefore, the main procedure of the proposed fault diagnosis scheme can be described as follows:(i)The vibration signals are first converted into adjusted spectrum images via FFT according to Sections 2 and 3(ii)Then the training images are used to calculate the projection basis with 2DPCA according to Equations (6), (8), and (9), and all images are processed with the derived projection basis to obtain the eigenimages according to Equation (10)(iii)Finally, the testing images are classified through NNC according to Equations (12) and (13)

5. Experimental Analysis

The proposed adjusted spectrum image can be widely used in diagnosis of bearing, including fault type classification and fault severity detection. Two experiments are carried out to demonstrate the effectiveness of the proposed method. In the first one, different types of bearing faults are classified. In the second one, single-fault and multifault severity diagnosis of bearings are conducted, where the rotating speed and the load are different. The results of the second experiment are also comprehensively compared with that in previous publications.

5.1. Experiment I: Fault Type Classification of Bearings
5.1.1. Experimental Setup

The first experiment is performed on a machinery fault simulator manufactured by Spectra Quest, Inc. shown in Figure 3. The simulator is a simplified version of complex rotating machines, which consists of motor, bearing, shaft, rotor disc, gearbox, and belt. In this experiment, a normal bearing was installed in the inboard bearing seat and the testing bearings were mounted on the outboard bearing seat in turn, which simulate following health conditions: (i) normal (NO), (ii) inner-race fault (IF), (iii) outer-race fault (OF), and (iv) ball fault (BF). Figure 4 presents the bearing fault conditions. All the bearings are the same type ER-12k with 8 rolling elements. Vibration signals were collected under three operating conditions, i.e., 600 rpm, 900 rpm, and 1200 rpm. And the sampling frequency was set to 20 kHz.

The vibration signals are organized as three datasets according to the operating conditions, i.e., A-600 rpm, B-900 rpm, and C-1200 rpm as described in Table 1. Every dataset contains 300 vibration signal samples from each health condition. Hence, every dataset consists of 1200 samples. The length of each sample is 1024 points in time domain, and each time series is converted into frequency domain via N-point DFT (N is derived as the next power of two from the length of the time series, so as well) to generate its adjusted spectrum image. The number of classification labels is four, where each label is corresponding to a health condition of bearings.

5.1.2. Parameters Determination

The CF selection is first considered with the DBI evaluation approach. In this study, the in CF is manually designated to guarantee that the derived images could contain all the adjusted frequency components, i.e., for dataset A, for dataset B, and for dataset C according to the sampling rate ( Hz). Therefore, the of dataset A is set as , the of dataset B is set as , and the of dataset C is set as .

In order to optimize a proper in CF to generate the final adjusted spectrum images, data samples from dataset A are utilized to calculate the DBI for four bearing health conditions. Here, the image size with pixels is preassigned empirically according to our experimental experiences. In view of the enormous computing effort for DBI evaluations, a scope of from 0.0005 to 0.2 (this range is determined with experience) covering 12 CFs are considered to select an appropriate CF with the assistance of DBI evaluations, i.e.,and different training sizes of each health state () are employed for the DBI assessment. The DBI values for different CFs are presented in Figure 5. It can be found that the DBI reaches a minimum at the 9th , i.e., . In this experiment, the of CF is set to for datasets A, B, and C.

As what considered above, a scope of image size from to pixels are taken into account with experimental experience and 12 image sizes are applied to test on dataset A with the derived CF to select an appropriate one, i.e., pixels, where

Also considering different kinds of image samples, the DBI values are depicted in Figure 6. It can be shown that an optimized image size is achieved at the 6th testing size, i.e., pixels.

Thereafter, the spectrum is obtained for each sample through FFT and adjusted according to Equation (1). The CF of dataset A is set as , the CF of dataset B is set as , and the CF of dataset C is set as . Finally, all the adjusted spectrum images are captured with the size being pixels. Figures 79 show the adjusted spectrum images of dataset A, B, and C, respectively.

5.1.3. Experimental Results

After obtaining the adjusted spectrum images using the optimally selected parameters, feature extraction is carried out with 2DPCA to derive the eigenimages of the adjusted spectrum images. Herein, the dataset A is employed to demonstrate the eigenimages in the phase of feature processing, where the training size of each class is set as 20 and the reduced dimension in Equation (8) is set as manually. Figure 10 shows an example of the eigenimages of dataset A.

With defined in Equation (8) for dimension reduction, the diagnosis performance using the proposed method is tested and shown in Table 2, where different number of samples are used for training the NNC and the average accuracies are presented for 20 randomized trials. Obviously, the classification accuracy reaches 100% with 20 images per class for training, which means that all the health conditions can be correctly classified in these cases. Even when only one sample is used for training, high classification accuracies are still achieved.

5.2. Experiment II: Fault Severity Detection of Bearings
5.2.1. Data Description

In order to further verify the effectiveness of the proposed method, vibration data collected from the bearing data centre of Case Western Reserve University (CWRU) [12] are also applied. The bearing vibration signals under four health conditions are used for evaluation, i.e., NO, IF, BF, and OF, which are the same with experiment I. The sampling rate is 12 kHz. IF, BF, and OF of bearings are with four different fault severities, i.e., 0, 0.007, 0.014, and 0.021 inches. Four load conditions with different rotating speeds are considered, i.e., Load 0 = 0 hp/1797 rpm, Load 1 = 1 hp/1772 rpm, Load 2 = 2 hp/1750 rpm, and Load 3 = 3 hp/1730 rpm, which indicate that the operating condition of this experiment is different in speed and load. The collected vibration signals are organized as dataset D, E, F, and G as shown in Tables 3 and 4.

Dataset D, E, and F are vibration signal samples with IF, BF, and OF, respectively, and they are used for single-fault severity detection. Each dataset consists of four subsets, and each subset is corresponding to a load condition. For instance, D1 is the subset with IF, and its training samples are from Load 0 and its testing samples are from all the four load conditions. Subset D1 contains 50 samples of each fault severity, and totally there are 200 samples in D1. The number of classification labels of dataset D, E, and F is four, where each label is corresponding to a fault severity.

Dataset G includes all fault types, and it is used for multifault severity detection. It consists of five subsets. G1–G4 are the subsets whose training and testing samples are not exactly from the same load condition. For instance, the training samples of G1 are from Load 0, and the testing samples are from all the four load conditions. Subset G1 contains 50 samples of each fault severity and type. There are 10 different combinations of fault severities and fault types, and therefore there are totally 500 samples in G1. Subset G5 contains 200 samples under each fault severity and type, whose training and testing samples are from all the four load conditions, so a total of 2000 data samples are comprised in G5. Ten classification labels are designated to mark each bearing health condition for this dataset.

5.2.2. Parameters Determination

Taking the above-mentioned tests into consideration, we chose the dataset G5 to evaluate the optimized image size and CF for generating the adjusted spectrum images. As for the CF selection, is manually set as to cover the four operating conditions (i.e., 1797, 1772, 1750, and 1730 rpm) as considered in previous experiment. Similarly, the image size is preset as pixels, then 12 CFs () and four kinds of training size for each class () are considered to calculate the DBI values, i.e.,then the DBI values for different CFs are shown in Figure 11. It can be found that the DBI reaches a minimum at the 10th , i.e., .

With the optimized CF for capturing the adjusted spectra, 12 image sizes are utilized to derive an appropriate one with the similar considerations before, i.e., pixels, where

As presented in Figure 12, it can be shown that a minimum DBI is reached at the 3rd testing size, i.e., pixels.

Thus, the parameters for generating images are optimally obtained by DBI evaluations. In this experiment, each signal segment has a length of 1024 data points in time domain and is also converted to frequency domain using N-point DFT to obtain the corresponding adjusted spectrum image, where N is computed in the same manner as before (in Section 5.1). The adjusted spectrum images are obtained with the size being pixels. Figure 13 gives the adjusted image examples of dataset D1 with the optimized CF ().

5.2.3. Experimental Results

Likewise, we first present an example of the eigenimages using the adjusted spectrum images of dataset D1 shown in Table 3. In this example, 10 and 50 are set for the training size of each class and the reduced dimension, i.e., d in Equation (8), respectively. The example of the eigenimages of dataset D1 is demonstrated in Figure 14.

With defined in Equation (8) for dimension reduction, datasets D1–D4 are applied for IF severity detection and the results are shown in Table 5; datasets E1–E4 are used to carry out BF severity detection and the results are shown in Table 6; datasets F1–F4 are employed to conduct OF severity detection and the results are shown in Table 7; and datasets G1–G5 are utilized to evaluate the performance of multifault severity detection and the results are shown in Table 8. Every mentioned test routine is performed for 20 times to obtain the average classification accuracy, where training samples are randomly selected.

As demonstrated in Tables 58, acceptable classification accuracies could be achieved with the presented method, especially in case of IF and OF severity detection. From Tables 5 and 7, we can see that the classification accuracies achieve 100% with only 5 training samples per class. As illustrated in Table 6, the classification accuracy is still over 90% in most tests, but around 75% in individual cases. In the experiments with dataset G1-G5, the proposed method also shows its robust capability in handling multiple classification problems.

5.3. Effect of Different Spectrum Resolution for Fault Classification

As presented above, the proposed method exhibits a desirable performance in bearing fault diagnosis. However, there is still one thing to be noted that the spectrum resolution may influence its robustness for applications. Therefore, apart from the selections of CF and image size, it is worthwhile to investigate what frequency resolution is required to generate the original spectrum by specifying an appropriate sampling length as well as an appropriate sampling frequency.

In order to explore the effect of different spectrum resolution for fault classification, the datasets used for determining the parameters of CF and image size, i.e., the dataset A of experiment I and the dataset G5 of experiment II are also employed to make a further investigation. In this research, considered the computational efficiency of FFT algorithm with Matlab software, different signal lengths of power of two, i.e., (m is a positive integer), are applied to calculate the original spectrum via N-point DFT algorithm (here ). Thus, the frequency resolution of the original spectrum can be expressed as . Herein, the value of is considered in increasing order.

For dataset A of experiment I (sampling rate:  Hz), the frequency resolution series is

For dataset G5 of experiment II (sampling rate:  Hz), the frequency resolution series is

With the above spectrum resolution series, the classification performance is tested on dataset A and dataset G5 separately. In this investigation, the performance tests are evaluated with different training size of each health state under each operating condition (i.e., ) as before. That is, for dataset A and for dataset G5. Figures 15 and 16 demonstrate the corresponding classification results in detailed.

Examining the classification accuracies in Figures 15 and 16, it is noticed that when the training size , a desirable classification rate can be reached at the spectrum resolution of 19.53 Hz for experiment I and at the spectrum resolution of 23.44 Hz for experiment II, respectively. Therefore, we designate the corresponding thresholds of for the two experiments, i.e.,  Hz for experiment I and  Hz for experiment II. In other words, when the frequency resolution satisfies that for experiment I and for experiment II, the proposed methodology could deliver satisfactory performances, respectively. Hence, a recommendation of the selection of frequency resolution for applying this approach is given as

To further take the computational efficiency of FFT algorithm into consideration, we also recommend that the frequency resolution is not smaller than 1 Hz. As a result, the selection method of spectrum frequency resolution can be determined as

As for the validation experiments presented in Sections 5.1 and 5.2, the frequency resolutions are  Hz in experiment I and  Hz in experiment II, which satisfy the requirement of limiting condition on spectrum resolution.

5.4. Comparisons of Classification Performance on CWRU Datasets

In order to evaluate the performance of the proposed method using the adjusted spectrum images for bearing diagnosis, a comparative study is carried out between current work and some recent publications, where the authors adopted the same CWRU bearing datasets.

In [16, 17], different methods for extracting features and detecting single-fault severity of bearings are proposed and tested with the datasets in Table 3. In [16], manifold regularization-based semisupervised learning (SSL) is introduced into fault diagnosis of bearings. In [17], bearing diagnosis is performed by applying SVMs and fractal dimension. The comparisons are presented in Table 9. The proposed method achieves similar classification accuracy with the results in [16, 17] for IF and OF severity detection, but better performance is obtained for BF severity detection with the proposed method.

Similar comparisons for multifault severity detection are also carried out based on the results in some published literatures. DNN is utilized to diagnose the faults of bearings in [9], which deeply exploits the characteristics of vibration signals. In [18], reconstructed phase space (RPS) and Gaussian mixture model (GMM) are applied for fault diagnosis of bearings. In [19], lifting wavelet packet transform (LWPT) and binary tree system are employed to realize bearing fault diagnosis. In [20], the authors utilized multiscale feature extraction (MFE) and multiclass support vector machine (MSVM) for bearing fault feature extraction and classification, respectively. In [21], trace ratio criterion LDA (TR-LDA) and kNN classifier are used for feature reduction and fault classification to realize multifault severity detection of bearings. In [22], the authors applied multiple adaptive neurofuzzy inference system (ANFIS) and genetic algorithm (GA) for the fault diagnosis of bearings. Also in [23], modifed kernel marginal Fisher analysis (MKMFA) is used for feature extraction, and kNN is applied to classify the bearing fault pattern. As shown in Table 10, the classification accuracy of the proposed method could achieve with 10 training samples per class under the same operating conditions, where its classification results are even not inferior to that obtained with DNN.

In addition, aiming at the complicated ten-class classification problem, nonadjusted spectrum images are also utilized to make a comparison, so as to illustrate the effectiveness of spectrum adjusting. In these tests with nonadjusted spectrum images, the CF for capturing nonadjusted spectra is set to and and defined in (8). Each experiment (10 samples/class/load for training) is repeated 20 times to acquire the average accuracy. As depicted in Table 10, for dataset G1-G5, classification rates with 84.97%, 92.30%, 89.66%, 90.72%, and 100% are achieved, respectively, with nonadjusted spectrum images. But with the proposed method for spectrum adjusting, higher classification rates with 99.13%, 99.99%, 98.07%, 95.72%, and 100% could be reached for datasets G1–G5, respectively.

It is clear that the proposed spectrum adjusting scheme could suppress the interference of rotating speed and reduce the changes of diagram structure, which facilitates that this approach could obtain excellent classification accuracies under different operating conditions. The robustness of fault diagnosis could also be highlighted using the adjusted image as feature.

6. Discussion

The adjusted spectrum image is indeed a 2D representation of the 1D vector of FFT coefficients. In [9], the vector of FFT coefficients is utilized as the inputs of DNNs, and the achieved classification accuracy is with a large number of training samples. By using the proposed 2D adjusted spectrum images together with a simple NNC classifier, the classification accuracy is and the number of required training samples is much smaller (Table 10). From this point of view, such a 2D representation in the form of image contains more useful information for bearing fault diagnosis.

The experimental results also demonstrate that the proposed method could diagnose bearing faults effectively under different operating conditions, even with very limited training samples. The training samples in dataset D, E, F, G1, G2, G3, and G4 are only from one operating condition, while the testing samples are from four different operating conditions (Tables 3 and 4). In these cases, high classification accuracies are still achieved with the proposed method as shown in Tables 57. It can also be observed that, a relatively high classification accuracy is also obtained even with one single training sample (Table 8). For the fault pattern classification with limited training samples, artificial intelligence classifiers (e.g., ANN, SVM) may demonstrate worse performance than NNC because there commonly are needed large amounts of samples to obtain the model parameters. Here, taking datasets G1–G4 for example, Table 11 presents their classification performance with NNC and SVM (using polynomial kernel function) under limited training sample condition. According to the comparison results, when there only exist 10 training samples for single-fault type, SVM is inferior to NNC due to lacking of enough training samples to optimize SVM parameters. This is the main reason why NNC is selected as the classifier for fault classification in this work.

It is worth mentioning that the proposed method belongs to the scope of fault classification of bearings; therefore, the training samples must be obtained in advance. However, thanks to the spectrum adjusted by the rotating speed frequency, the geometrical structure of the spectrum image is not significantly influenced by the speed. Therefore, the training data need not contain samples of all operating conditions, but contain those of all fault types. When the geometrical structures of bearing spectra are similar for a same failure type, those bearings could be diagnosed with the proposed method. Furthermore, the proposed feature is easier to compute and postprocess compared with References [6, 7] because the presented spectrum image is a binary image which avoids performing binary conversion. All these characteristics promote it become a robust feature for bearing fault diagnosis.

7. Conclusions

In this paper, the adjusted spectrum image is proposed as feature for bearing fault diagnosis under different operation conditions. First, the validity indicator, DBI, is introduced to optimize the CF and image size for capturing spectrum images. Then with the optimized CF and image-size, the adjusted spectrum images are generated via FFT according to the rotating speed and 2DPCA is applied to reduce the image dimension for facilitating classification with an NNC. Two data examples are employed to verify the effectiveness of the proposed approach. On the base of these bearing data, the effect of spectrum frequency resolution is further investigated and a recommended selection method is presented to determine the frequency resolution. In the proposed method, the training samples could be from one operating condition, while the testing samples could be from different operating conditions. Experimental results illustrate that this methodology could achieve high classification accuracy with very limited training samples.

Data Availability

Some experimental data used in this study are acquired from the bearing data centre of CWRU and the web page http://csegroups.case.edu/bearingdatacenter/home (accessed September 2018). The other data used to support the fndings are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The work was supported by National Key R&D Program of China (grant no. 2016YFC0802908), National Natural Science Foundation of China (grant nos. 51475455 and 51605478), Natural Science Foundation of Jiangsu Province (grant no. BK20160251), and the Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).