Abstract

In this paper, an algorithm based on two novel shape descriptors and support vector machine (SVM) is proposed to improve the recognition accuracy and speed of shaft orbits of rotating machines. Firstly, two novel shape descriptors, respectively, named accurate Fourier height functions 1 (AFHF1) and accurate Fourier height functions 2 (AFHF2) are presented based on height function (HF) and Fourier transformation. Both AFHF1 and AFHF2 shape descriptors are constant to similarity transforms and also have intrinsic invariance to the starting point change and are more compacted than HF. Therefore, they perform well on the global or local features of the contours of shaft orbits. Then, the AFHF1 and AFHF2 shape descriptors are utilized to extract features of shaft orbits in the simulated dataset and measured dataset. Taking extracted feature vectors as the input, SVM is adopted in order to classify the fault types according to the shapes of shaft orbits. Finally, a series of descriptors including shape context (SC), inner-distance shape context (IDSC), triangular centroid distances (TCDs), and HF were compared to verify the performance of the proposed AFHF1 and AFHF2 shape descriptors. The average accuracy of our method in simulated dataset and measured dataset are all higher than 99.83%, the average recognition time of each sample is no more than 19 milliseconds. The experiments demonstrate that the proposed method has the best recognition accuracy and real-time and antinoise performance.

1. Introduction

Rotating machines play a critical role in industrial production [1]. Once failure occurs, it may cause the enterprise to suffer huge economic losses, and even catastrophic accidents. Therefore, the condition monitoring and fault identification of rotating machines are of great significance to improve productivity and reduce maintenance costs and prolong the service life of equipment [2].

The shaft orbit [38] is a two-dimensional representation synthesized by shaft vibration signals [9, 10], which presents a snapshot of the rotor actual motion at its centre [11] and contains a lot of fault information [12]. In the last ten years, many researchers have regarded the identification of shaft orbits as pattern recognition of two-dimensional images and put forward many methods.

Identification methods of shaft orbits mainly include two fundamental procedures: feature extraction and classification. In the first procedure, the quality of the extracted features could directly determine the accuracy of the identification on shaft orbits [1]. Many scholars have proposed many efficient methods for feature extraction of shaft orbits, such as Fourier descriptors (FD) [13], chain code [14, 15], Walsh descriptor (WD) [16], seven improved invariant moment [17], histogram of oriented gradients (HOG) [18], fractal theory [19], and comprehensive geometric characteristic (CGC) [1]. Each of them can extract feature of shaft orbits well, but has its disadvantages.

Fourier descriptors [13] are selected to extract features of shaft orbits because of its simplification and high efficiency, but sensitive to the starting point and indirectly sensitive to similarity transforms. Although chain code [14, 15] feature had low computation complexity and the correlation between local feature points was also preserved, lots of global features were lost. Xiang et al. calculated the distance vector between the points of shaft orbits and the centre point and adopted Walsh orthogonal matrix to transform the distance vector [16]. The Walsh descriptor was obtained with rotation invariance and scale invariance, but it often adopts floating-point operation, which leads to complex calculation. Yan et al. proposed seven improved invariant moment based on Hu invariant moment to extract the features of shaft orbits well [17]. But the ability of invariant moment to represent complex graphics is limited, because the dimension of its features is only 7. Bao et al. applied HOG as the low-level local shape descriptor to extract feature vectors from shaft orbits [18]. But HOG is sensitive to translation and rotation. Chang et al. proposed fractal theory to extract features of shaft orbits [19]. Although the dimension of the feature extracted by the fractal theory is small, the recognition rate is not high enough. Chen et al. proposed a method of comprehensive geometric characteristics [1], which extracted features of shaft orbits from three different aspects: structure, region, and boundary. Although it can achieve satisfactory accuracy, its real-time performance is not good enough, extraction time of which is longer than chain code and Walsh descriptor.

In recent year, Height Function (HF) [20] method is proposed to extract features of 2D images well, and constant to similarity transforms. Several improvement algorithms have been proposed based on the HF method. Multiscale arch height (MARCH) description [21] is a method for mobile retrieval of leaf images, which is based on the K-scale arch height as a measure of the curvature surrounding a contour point. This method utilizes arch height features instead of the height values for leaf shape representation. Nanni et al. [22] presented an approach of a matrix descriptor based on the local phase quantization to improve the performance of shape context (SC) [23], inner-distance shape context (IDSC) [24], and HF [20]. Shekar and Pilar [25] proposed a combined classifier model based on HF and two-dimensional discrete cosine transform (2D-DCT) to exact shape representation and classification [26]. All of these methods are achieved by combining HF with other algorithms. However, there is little work on improving the principle of HF descriptor itself. The principle of the HF algorithm and its deficiencies are shown in the following paragraphs.

In Figure 1, let X = {xi} (i = 1, 2, … , N) denote the sequence of equidistant sample points along the outer contour in a given shape, and the sampling point i follows the contour in an ordered counterclockwise direction. [20] For each sample point xi, its tangent line li is used as the reference axis, which inherits its orientation from the contour orientation. The distance between the jth (j = 1, 2, … , N) sample point xj and the tangent line li is defined as the height value Hi,j, and it is calculated for every sample point. The feature vector of point xj for shape X is the ordered sequence of height values:where denotes the height value of the sample point according to the reference axis of the point .

Feature vectors of each sampling point are calculated, and they are composed of the feature matrix of the shape X according to the sampling sequence. Then the dimension of the feature matrix is reduced from to by smoothing [20], where (the integer part of N/k). Feature matrix is normalized row by row to make the shape representation scale invariant.

However, there are three obvious deficiencies of HF shape descriptor according to its definition. (1) In HF descriptor, the reference axis direction of the point xi is starting from xi−1 to xi+1. Therefore, the line of xi−1 to xi+1 is defined as the reference axis instead of the real tangent line, as shown in Figure 2. For the image with a large shape contour or strong edge noise, the small error of shape feature description will greatly influence the retrieval accuracy, antinoise property, and robustness of the algorithm. (2) For the same contour, the HF descriptors are different when the starting points of the shape contour are different. When HF shape descriptor is used in image matching, it is necessary to find the best corresponding points, so the dynamic programming (DP) is used to find the best corresponding points between different shape contours, which may reduce the retrieval efficiency. (3) Although smoothing process decreases the dimensions of the HF descriptor matrix, it also reduces the local feature characterization performance of the descriptor, and the dimension of the descriptor is still high, and the compactness of HF descriptor is not good enough. Therefore, when using HF descriptors, it is imperative to improve the three deficiencies above to improve its performance.

The second key procedure of identification of shaft orbit is classification. The main methods of classification are support vector machine (SVM) and the neural network. SVM performs better in accuracy and real time than the neural network when the samples are few and has been successfully applied in many different fields such as fault diagnosis [15, 27].

In this paper, two novel descriptors called accurate Fourier height functions (AFHFs) including accurate Fourier height function 1 (AFHF1) and accurate Fourier height function 2 (AFHF2) are proposed based on HF and Fourier transformation to improve the accuracy, starting point invariance, and compactness. Then, we take advantage of AFHFs and SVM, and a novel recognition method for the shaft orbits is proposed in Section 2. Experiments are shown in Section 3. And the conclusion is shown in the final section.

2. Shaft Orbit Identification Method Based on AFHFs and SVM

In this section, a new identification method of shaft orbit in rotating machine based on improved HF descriptors is introduced. Firstly, AFHFs shape descriptors are presented in detail. Then, the orbit identification method based on AFHFs shape descriptors and support vector machine (SVM) is proposed.

2.1. Accurate Fourier Height Functions Shape Descriptor

In order to decrease the error of the HF shape descriptor in contour representation, the HF shape descriptor is corrected by using an accurate height value of the contour point.

As shown in Figure 2, we draw the parallel line li of line l that is the actual axis of the original height function at the reference point xi and set line li as the new axis of improved HF. Therefore, the height error ΔH of original HF can be eliminated and the height value H can be rectified with the exact height value H. At the same time, the distance from the sample point xi to its own axis li is always 0, so height value Hi,i can be removed. Finally, the improved HF vector of point xj which is named as Accurate Height Function (AHF) shape descriptor can be expressed as follows:where denotes the exact height value of the sample point .

In our method, two improvement algorithms are compensated for the second and the third deficiencies. (1) Fourier transformation is performed on each row of the AHF shape descriptor that is not smoothed, and the phase information is discarded to get the new shape descriptor Accurate Fourier Height Function 1 (AFHF1). (2) Fourier transformation is performed for each row of the AHF shape descriptor that is smoothed, and the phase information is discarded to get the new shape descriptor Accurate Fourier Height Function 2 (AFHF2). The above specific algorithms will be showed in the following two subsections.

2.1.1. AFHF1 Shape Descriptor

Firstly, let denote the tth row of the feature matrix of AHF shape descriptor that is not smoothed, where t = 1, 2, … , N−1. The Fourier transformation for is given bywhere and is the absolute value of and represents the modulus of the discrete Fourier transform coefficient . It is easy to prove that is invariant to the starting point of the contour for a given shape [28]. Therefore, the value of Fourier transform coefficient is used to describe the shape contour. In order to make the generated shape descriptors robust and compact, the lowest order coefficients is used, where . So the final AFHF1 shape descriptor can be defined as follows:where ; . is the discrete Fourier transform coefficient.

It can be seen from the definition of AFHF1 that its feature matrix dimension is . The feature matrix of AFHF1 is more compact than that of the original HF shape descriptor whose matrix dimension is , where P is less than M.

2.1.2. AFHF2 Shape Descriptor

AFHF1 removes the smoothing process from the original HF, while AFHF2 reserves the HF smoothing process and directly improves it on the basis of the AHF descriptor. By applying Fourier transforms in each row of the AHF shape descriptor and discarding the phase information, a new descriptor AFHF2 can be obtained. Similarly, the final AFHF2 shape descriptor can be defined as follows:where ; . is the discrete Fourier transform coefficient.

From the definition of AFHF2 shape descriptor, it is not difficult to know that its matrix dimension is M × P, which is lower than that of AFHF1. However, the smoothing process takes the average height values of multiple adjacent points, which may affect the invariance of starting point on AFHF2 shape descriptor. Compared to the original HF shape descriptor, the choice of starting point may have a much smaller impact on AFHF2 shape descriptor.

2.1.3. Characteristic of the Improved Accurate Fourier Height Functions

The matrix dimensions of improved AFHF1 and AFHF2 shape descriptors and related descriptors are shown in Table 1, where N is the number of contour sample points and equal to 128, and the number of inner-distance bins Nd is equal to 8, the number of inner-angle Nθ is equal to 12, the number of Fourier coefficients P is equal to 16, and the smoothed vector dimension M is equal to 20. The descriptors for comparison include IDSC [24], HF [20], and triangular centroid distances (TCDs) [28].

As shown in Table 1, the dimension of the feature matrix of the improved AFHF1 and AFHF2 descriptor is much lower than that of the original HF descriptor. In particular, AFHF2 descriptor is more compact than HF and other descriptors.

Compared to the original HF descriptor, the contour starting point has almost no significant influence on AFHF1 and AFHF2 descriptor. In order to determine the influence of the starting point on the descriptors, the shape descriptors extracted from the contours A and B can be defined as and , respectively. The similarity of them is measured by the distance L1. The smaller the distance is, the more similar they are. The distance is defined as follows:

In order to estimate the influence of the starting point on three descriptors, HF, AFHF1, and AFHF2, 100 sample points are sampled at equal intervals in the clockwise direction on the outline of a petal-shaped shaft orbit, as shown in Figure 3. Different points are selected as the starting point, as shown in Figures 3(b) and 3(c), the red points represent the sample points of each shape contour, and the blue point represents the starting point of each shape contour. Then the feature matrices corresponding to HF, AFHF1, and AFHF2 are calculated, respectively, which are shown in Figures 3(d)3(i). When the starting point is different, HF feature matrix changes greatly, but the AFHF1 and AFHF2 description matrices are almost unchanged.

In order to describe the influence of the starting point on the above three shape descriptors quantitatively, the similarities of three sets of feature matrices are calculated individually by using Equation (6). The similarity between two HF feature matrices in Figures 3(d) and 3(e) is 0.5348, the similarity between two AFHF1 feature matrices in Figures 3(f) and 3(g) is 7.0968 × 10−16, and the similarity between two AFHF2 feature matrices in Figures 3(h) and 3(i) is 7.5922 × 10−16. The results show that the starting point has a great impact on HF shape descriptor, while it has almost no effect on AFHF1 and AFHF2 shape descriptors. Moreover, starting points have slightly less impact on AFHF1 than on AFHF2, which is due to the smoothing process in AFHF2.

2.2. Identification Method of Shaft Orbit Based on AFHFs

Since the starting point has almost no influence on AFHF1 and AFHF2 shape descriptors, there is no need to use the DP method, which is used by the original HF shape descriptors to obtain the matching result. The SVM, which is a classification algorithm based on statistical learning theory, is adopted as the classifier in this paper. SVM has unique advantages in solving a small amount of samples: nonlinear and high-dimension classification and recognition.

It is the most important for SVM to choose the proper kernel function and its parameters δ and the optimal penalty factor c. Different kernel functions of SVM are used to test on the shaft orbit dataset, and the experimental results show that the recognition rate of the linear kernel function is much higher than that of the others. Since the linear kernel function requires few parameters to be set specifically, and the dimension of feature matrices is high, the linear kernel function is adopted.

Combined with the AFHF1 and AFHF2 descriptors proposed in Section 2.1, a shaft orbit identification method for fault diagnosis of rotating machine is proposed based on AFHF1, AFHF2, and SVM. The specific process is shown in Figure 4.

3. Experiment and Analysis of Shaft Orbit Identification

3.1. Experiments and Analysis on Simulation Data
3.1.1. Simulated Shaft Orbit Dataset

Using MATLAB software, the shaft orbits are simulated according to the following equation (6).where , and , are the first and second frequency components, respectively, is the angular velocity, and and represent the initial position of x and y, respectively. The corresponding relationship between the faults and the graphs is shown in Table 2 [16, 29]. In addition, petal-shaped shaft orbit corresponds to Oil whirl [29], which increases the type of shaft orbit and increases the difficulty of identification compared to many researches [13, 1319]. Although both banana-shaped and outer “8” shaft orbit correspond to misalignment faults, they correspond to different severity of misalignment faults [30]. When the severity of the misalignment of the shaft is small, the shaft orbit is an ellipse. But the short axis of ellipse shaft orbit will gradually become shorter when the severity of the fault is increasing, and when the fault is increased to a certain extent, the shaft will become an outer “8” shape.

200 images of shaft orbits are simulated by MATLAB for each fault type, of which 100 images are for training and the remaining 100 images are for testing. The typical samples of shaft orbit dataset are shown in Figure 5.

3.1.2. Feature Extraction and Identification of Simulated Shaft Orbit Datasets

Firstly, a series of descriptors including SC, IDSC, TCDs, and HF were compared to verify the performance of the proposed AFHF1 and AFHF2 shape descriptors. Secondly, in order to illustrate the superiority of SVM, it is compared with BP neural network.

All algorithms selected 100 feature points as samples. SC, IDSC, TCDs, HF, AFHF1, and AFHF2 shape descriptors are used to extract the feature of shaft orbits, respectively. And the parameter P of AFHF1 and AFHF2 is set to 16 in all experiments. The smoothed vector dimension M of AFHF2 is set to 19. Then half of the feature vectors extracted by each shape descriptor are randomly selected as training samples, and the rest are selected as testing samples. The training samples are used to train SVM and BP neural network, and the testing samples are used to test the trained SVM and BP neural network for verifying the performance of each algorithm. The experiment flowchart is shown in Figure 6.

The parameters of SVM are set as follows: the “linear kernel function” is selected and the other parameters are the default. The parameters of BP neural network in MATLAB toolbox are set as follows: the period is set to 1000, the target error is set to 0.0001, the node number of hidden layer is set to 15, S-type function “logsig” is selected as the excitation function, and linear function “Purelin” is adopted as the output layer excitation function.

Experimental results of simulated shaft orbit dataset are shown in Tables 3 and 4, where SVM and neural network are used as the classifier, respectively. The accuracy comparison of all algorithms is intuitively shown in Figure 7. The conditions of all experiments are the same.

From the perspective of shape descriptors, the accuracies of the algorithms using the AFHF1 and AFHF2 shape descriptors are higher than 99.57%, but the highest accuracy of the algorithms using other shape descriptors is 98.03%.

From the perspective of the classifier, the performance of the algorithms using SVM as classifier are slightly better than using BP neural network as classifier in accuracy and real time when AFHF1 and AFHF2 shape descriptors are used to extract feature in the experiments of simulated shaft orbit identification. Therefore, the proposed method has the best performance, its average time for identifying a shaft orbit is less than 19 milliseconds, and the average accuracy is higher than 99.88%.

3.2. Experiments and Analysis on Actual Measured Data

In order to verify the performance of the proposed algorithm, a rotor test bench is used to produce different faults, and the same methods which are used in the simulation experiment are adopted for contrast. The measured database of shaft orbits is created on the bearing rotor test bench as shown in Figure 8. The rotor test bench consists of the rotor table, the sensors, the DC motor controller, the signal acquisition card, and the analysis software. Different state experiments were performed on the rotor test bench, 200 shaft orbits were collected in each state, 100 samples of each type were randomly selected as training samples, and the remaining 100 samples were used as testing samples.

As shown in Figure 9(a), the original measured shaft orbit has much noise, so the features of the shaft orbit cannot be extracted directly and the image must be preprocessed. The preprocessing method used in this paper includes two steps, wavelet filtering and binarization. Figure 9(b) shows a shaft orbit image after preprocessing. The same processing is applied to different types of shaft orbits, and then the features of these images are extracted and classified. And experimental results of measured shaft orbit dataset are shown in Tables 5 and 6, where SVM and neural network are used as the classifier, respectively.

By comparing experimental results on simulated shaft orbits and actual measured shaft orbits in Tables 36, the accuracy of identification on the measured shaft orbit is lower than the simulation shaft orbit with the same algorithm. This is because simulated shaft orbits are noiseless. However, the reduction in the accuracies of the algorithms using the AFHF1 and AFHF2 shape descriptors is small, not exceeding 0.3%, and the accuracies of the algorithms using other shape descriptors decreased greatly, ranging from 0.7% to 3.3%. It shows that the AFHF1 and AFHF2 shape descriptors proposed in this paper have a great antinoise performance.

Similar to the analysis of simulation result in section 3.1.2, the following conclusions can be drawn:(1)From the perspective of shape descriptors, AFHF1 and AFHF2 shape descriptors are more suitable for identification on actual measured shaft orbit than SC, IDSC, TCDs, and HF shape descriptors.(2)From the perspective of the classifier, the performance of the algorithms using SVM as classifier is slightly better than that using BP neural network as classifier in accuracy and real time when AFHF1 and AFHF2 shape descriptors are used to extract feature in the experiments of actual measured shaft orbit identification.

4. Conclusions and Future Works

An algorithm based on two novel shape descriptors and SVM is proposed. In the algorithm, two novel shape descriptors, respectively, named AFHF1 and AFHF2 based on HF and Fourier transformation are presented to extract features from shaft orbits. Both AFHF1 and AFHF2 shape descriptors are constant to similarity transforms, also have intrinsic invariance to the starting point change and are more compacted than HF. Therefore, they perform well on the global or local features of the contours of shaft orbits. SVM is adopted as the classifier to efficiently identify the extracted feature. In our experiments, AFHF1 and AFHF2 are compared with SC, IDSC, HF, and TCDs, and BP neural network is compared with SVM. The identification accuracies on the simulated and actual measured shaft orbit datasets and the comparisons with the algorithms mentioned above are obtained. The experiments show that the proposed algorithm can quickly and accurately identify shaft orbits. As for simulated and measured shaft orbits, the average time of identifications is all less than 19 milliseconds and the average of accuracies are all higher than 99.83%. The proposed method has the best recognition accuracy, and real-time and antinoise performance.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This study is supported by the National Natural Science Foundation of China (nos. 51775177and 51675166).