Research Article  Open Access
Fen Wei, Gang Wang, Bingyin Ren, Jianghua Ge, Yaping Wang, "Multisensor Fused Fault Diagnosis for Rotation Machinery Based on Supervised SecondOrder Tensor Locality Preserving Projection and Weighted Nearest Neighbor Classifier under Assembled Matrix Distance Metric", Shock and Vibration, vol. 2016, Article ID 1212457, 14 pages, 2016. https://doi.org/10.1155/2016/1212457
Multisensor Fused Fault Diagnosis for Rotation Machinery Based on Supervised SecondOrder Tensor Locality Preserving Projection and Weighted Nearest Neighbor Classifier under Assembled Matrix Distance Metric
Abstract
In order to sufficiently capture the useful faultrelated information available in the multiple vibration sensors used in rotation machinery, while concurrently avoiding the introduction of the limitation of dimensionality, a new fault diagnosis method for rotation machinery based on supervised secondorder tensor locality preserving projection (SSTLPP) and weighted knearest neighbor classifier (WKNNC) with an assembled matrix distance metric (AMDM) is presented. Secondorder tensor representation of multisensor fused conditional features is employed to replace the prevailing vector description of features from a single sensor. Then, an SSTLPP algorithm under AMDM (SSTLPPAMDM) is presented to realize dimensional reduction of original highdimensional feature tensor. Compared with classical secondorder tensor locality preserving projection (STLPP), the SSTLPPAMDM algorithm not only considers both local neighbor information and class label information but also replaces the existing Frobenius distance measure with AMDM for construction of the similarity weighting matrix. Finally, the obtained lowdimensional feature tensor is input into WKNNC with AMDM to implement the fault diagnosis of the rotation machinery. A fault diagnosis experiment is performed for a gearbox which demonstrates that the secondorder tensor formed multisensor fused fault data has good results for multisensor fusion fault diagnosis and the formulated fault diagnosis method can effectively improve diagnostic accuracy.
1. Introduction
As one of the most common mechanical equipment classes, rotation machinery occupies an important role in industrial applications such as manufacturing, metallurgy, energy, and transportation. Due to tough working environments, similar materials, and structural properties, rotation machinery can be subject to malfunctions or failures. This can significantly decrease machinery service performance including manufacturing quality and operation safety and cause machinery to break down, which may lead to serious catastrophes [1]. Accordingly, research into fault diagnosis of rotation machinery has attracted considerable attention by researchers in related domains in recent years. The vibration signals collected from velocity or accelerator sensors located in machinery housing are generally regarded as the foundation of fault diagnostic procedures. However, most existing studies on fault diagnosis of rotation machinery have empirically or experimentally focused on analyzing single sensor signals [2–4], and the remaining studies have performed multisensor fused fault diagnosis through complex fusion algorithms such as blind source separation (BSS) [5] and DS evidence theory. The singlesensorbased fault diagnosis methods belonging to the former category of studies generally lead to loss of valuable information available from multiple sensors, and the multisensor fused diagnosis methods appearing in the latter category of studies are tend to cause a high computational load. To tackle these issues, this paper presents a secondorder tensor representation of fault samples including fault feature dimensions and sensor locations dimensions, which is used in an efficient multisensor fused fault diagnosis framework.
Large volumes of feature parameters generated by timedomain, frequencydomain, and timefrequencydomain analysis of vibration signals are commonly integrated into a high dimensional data set to obtain accurate fault diagnostic results [6]. This highdimensional feature set can provide more valuable information, but it also increases the computational load and may even trigger dimensionality issues. One approach to address this problem is to apply dimension reduction technology. Compared with classical linear dimensionality reduction methods such as principle component analysis (PCA) [7], linear discriminate analysis (LDA) [8], and multidimensional scaling (MDS) [9], a new technology for discovering intrinsic lowdimensional structure of nonlinear distributed data hidden in highdimensional space has emerged which is known as manifold learning and has become a current research focus. Representative manifold learning methods include isometric mapping (ISOMAP) [10], locality linear embedding (LLE) [11], Laplacian eigenmaps (LE) [12], and local tangent space alignment (LTSA) [13]. The effectiveness of these basic manifold learning algorithms and their variants for fault diagnosis of rotation machinery has been validated frequently by a large number of studies. For instance, Li et al. [14]. proposed a fault diagnosis method using dimension reduction with linear local tangent space alignment (LLTSA). Ding et al. [15] developed a fusion feature extraction method based on locality preserving projection (LPP) for rolling element bearing fault classification. Additionally, an envelope manifold demodulation method was investigated for planetary gear fault detection in [16]. It should be observed that the input sample for these methods is generally represented by a vector with a highdimensional feature space. It is obvious that these manifold learning algorithms are not suitable when a multisensor fused faulty sample is represented as a secondorder tensor, namely, a matrix. Furthermore, tensor representation based manifold learning methods have received little investigation for fault diagnosis. Fortunately, there are several secondorder or higherorder tensor extended manifold algorithms, such as secondorder tensor locality preserving projection (STLPP) [17], tensor neighborhood preserving embedding (TNPE) [18], a tensor version of discriminant locality linear embedding (DLLE/T) [19], and tensor PCA [20]. These algorithms have been progressively applied in the areas of twodimensional or higherdimensional image classification, computer vision, and pattern recognition and offer a feasible solution for tensorrepresented fault diagnosis. Out of the methods mentioned above, STLPP possesses the ability to discover intrinsic local geometric and topological properties of a manifold embedded in a secondorder tensor space, on the basis of inherited strengths of LPP. However, it has been found that there are several limitations of the STLPP algorithm. For instance, STLPP is an unsupervised method for dimension reduction and thus does not consider discriminant information which is useful for fault classification. Secondly, the similarity with secondorder tensor formed samples in traditional STLPP has been computed using the Frobenius distance measure [17] which is the same as the Euclidean distance of the vectorized version of matrix formed samples, so it may still cause a loss of spatial locality information. To tackle these problems, this paper introduces the concept of supervision into the framework of a traditional STLPP and employs an assembled matrix distance metric (AMDM) which has been successfully utilized in 2DPCA [21] into the construction of a similarity weighting matrix to obtain better matching between two secondorder tensor formed faulty samples.
To further improve the accuracy and the efficiency of fault diagnosis, intelligent classification methods are considered as an indispensable component in the diagnostic procedure. These methods include artificial neural networks (ANN) [22], support vector machines (SVM) [23], and fuzzybased systems [24] as well as Bayesian based classifiers [25]. Compared with these methods, the knearest neighbor classifier (KNNC) ranks k neighbors of testing samples from training samples and uses the class labels of similarity neighbors to classify input test samples by evaluating the similarity between samples in the feature space [26, 27]. The KNNC method has many benefits, including a lower calculation requirement, quicker speed, and higher pattern recognition accuracy [28]. Therefore, it is considered to be the simplest tool for faulty pattern recognition. There are some existing shortcomings for traditional KNNC which classifies the sample labels using unified weights, and thus the weighted knearest neighbor classifier (WKNNC) was developed which assigns different weights to nearest neighbors to represent the impact of each neighbor on each unknown sample. Therefore, this paper uses the WKNNC to establish the relationships between features of samples and conditional classifications. Additionally, the AMDM mentioned above is also employed for the similarity evaluation of lowdimensional secondorder formed samples after SSTLPP based dimension reduction in WKNNC.
The remainder of this paper is organized as follows. The proposed supervised secondorder tensor locality preserving projection based on assembled matrix distance metric (SSTLPPAMDM) algorithm is discussed in detail in Section 2. The weighted knearest neighbor classifier with an assembled matrix distance metric (WKNNCAMDM) is described in Section 3. Section 4 provides the overall framework for the proposed multisensors fused fault diagnosis. In Section 5, a fault diagnosis experiment is performed for a gearbox to validate the proposed method. Finally, the conclusions are given in Section 6.
2. Supervised SecondOrder Tensor Locality Preserving Projection Based on Assembled Matrix Distance Metric (SSTLPPAMDM)
2.1. Introduction to SecondOrder Tensor Locality Preserving Projection (STLPP)
As the tensor extension of LPP, TLPP is essentially equivalent to finding a linear approximation of the eigenfunctions of the Laplace Beltrami operator in a tensor space. The incipient TLPP which was initially presented by He et al. [17] in 2005 is a secondorder case and was reviewed and then extended for a universal norder version by Dai and Yeung [18] in 2008. Since the multisensor fused faulty sample studied in this paper is represented by a secondorder tensor form, namely, in a matrix form, the secondorder TLPP (STLPP) algorithm is the focus in the following discussion. Given matrix formed samples , the aim of STLPP is to find two transformation matrices and by optimizing the following formulation:where is the Frobenius norm of the matrix; that is, ; denotes the elements of the weight matrix of the nearest neighbor graph , which is equal to when is one of the nearest neighbors of or is one of the nearest neighbors of ; otherwise it is equal to zero. is a diagonal matrix; .
Using a series of mathematical derivations, the optimal values for and are obtained by iteratively computing the generalized eigenvectors of the following formulations:where , , , and .
Finally, the lowdimensional representations of the original data are obtained using .
2.2. Computation of a Supervised Similarity Weighting Matrix Based on AMDM
As described in the previous section, there are a certain number of limitations when using the prevailing computation method for the similarity weighting matrix of the nearest neighbor graph . For instance, the Frobenius distance metric (FDM) used for the similarity evaluation between different secondorder tensor formed samples is essentially the Euclidean distance of the vectorized version of the matrix formed samples, and thus it neglects spatial geometrical information of each element in matrix formed samples and thus has poor matching performance for different samples. Additionally, class label information of the training samples is not effectively used in the traditional STLPP, although this information can be helpful for subsequent accurate classification assignment. To address these issues, this paper formulates a novel supervised similarity matrix computation method that decides the similarity between matrix formed samples using an assembled matrix distance metric that takes the classification information into account.
Firstly, for any two arbitrary matrix formed samples and , the distance between the two samples can be measured using the following assembled matrix distance metric (AMDM) [21]:where denotes a variable parameter which strongly affects the representation ability of the defined distance function for subsequent classification assignment. It is obviously that the Frobenius distance metric is a special case of the AMDM with , and the Yang distance metric proposed by Yang et al. in [29] is another special case for . It has also been theoretically and experimentally verified that an assembled matrix distance metric with a lower value of , that is, , outperforms existing Frobenius distance and Yang distance measures in terms of the final classification accuracy. Accordingly, the value of for the employed AMDM is set between 0 and 1, , and its exact value is determined by repeated experiments.
Secondly, by understanding the class label information of the training samples and the AMDM based distances between samples, the proposed supervised similarity weighting matrix based on AMDM can be defined aswhere denotes the element at column and row in the new formulated supervised similarity matrix , which represents the similarity degree of the matrix formed samples and . and are the class labels of samples and , respectively. is the penalty coefficient which is used to characterize the reduction in the similarity degree. Since is one of the nearest neighbors of or is one of the nearest neighbors of , the corresponding class labels are inconsistent, and thus the value of should be set to .
The newly formulated similarity weighting matrix computation equation shown in (4) can be viewed as the combination and extension of the prevailing “0heat kernel function” and the “01” binary mode, in which the former is intimately related to the manifold structure and the latter is regarded as the direct expression of the label information. The properties and corresponding advantages of the supervised similarity weighting matrix based on AMDM can be summarized as follows. (i) A more accurate representation of the matching relationship between matrix formed samples can be achieved using AMDM rather than traditional STLPP, which uses the Frobenius distance metric. (ii) The inclusion of the penalty parameter results in larger differences between 1 and as the assembled matrix distance increases, which allows the interclass and intraclass similarity to be easily distinguished.
2.3. SSTLPPAMDM Algorithm
This paper proposes a novel supervised secondorder tensor locality preserving projection algorithm with the assembled matrix distance metric (SSTLPPAMDM) that uses the improvements in both the matrix distances computation of samples in the projection space and the similarity weighting matrix computation expression. In contrast to traditional STLPP, the two transformation matrices and that represent both the neighborhood graph structure and the class label information are obtained by solving the following objective function:The distance between two mapped sample points and in the embedded tensor space is measured using the assembled matrix distance metric to achieve a better matching result. The element of the supervised similarity weighting matrix which is computed by (4) is employed to represent the neighboring degree of samples and and considers both the local structure and class information. The diagonal matrix has the ability to characterize the degree of importance of the mapped sample point in the embedded tensor space to represent the original sample point .
The optimal transformation matrices and are solved in a similar way to traditional STLPP by applying an iterative scheme. The specific implementation process can be described as follows. Firstly, an initial matrix is set as an identity matrix and the first iterative solution of is then obtained by solving the generalized eigenvector problem shown in (6). Secondly, is updated by solving the generalized eigenvector problem shown in (7). By iteratively computing the generalized eigenvectors of (6) and (7) for a predefined number of repetitions, the optimal transformation matrices and are obtained. Finally, the secondorder lowdimensional projection of the original secondorder highdimensional sample is obtained.
In summary, there are two main advantages to the newly proposed SSTLPPAMDM. () The local structure information and the class information act cooperatively in the computation of the similarity weighting matrix, and thus the supervised similarity weighting matrix proposed in this paper outperforms other prevailing similarity weighting matrix computation methods in terms of representation of the similarity degree between samples. () The application of AMDM to measure the distance between both the sample points in the original secondorder tensor space and the mapped sample points in the embedded secondorder tensor space ensures that the measured samples have a better matching performance than the existing Frobenius distance measure. Therefore, the SSTLPPAMDM algorithm has superior classification and dimension reduction characteristics than traditional STLPP.
3. Weighted Nearest Neighbor Classifier with Assembled Matrix Distance Metric (WKNNCAMDM)
As stated above, the KNNC method proposed by Cover and Hart in 1967 [28] is regarded by many as the simplest pattern classification algorithm. Due to its advantages of a lower calculation requirement, quicker speed, and higher identification accuracy, KNNC has been widely applied to various types of pattern recognition problems, especially fault diagnosis issues. The main KNNC concept is described in the following two steps.
Step 1. For a given unknown labeled sample , k similar samples in the training sample set are searched to construct a neighbor set .
Step 2. A maximum voting rule is used on all samples in to obtain the class that belongs to.
The above description shows that there are two focus points to KNNC: a similarity measurement method between samples and the establishment of a decision rule. For the first focus point, there have been many similarity measurement methods suggested by previous publications, such as the Euclidean distance, the Manhattan distance, and the cosine angle. However, these vector representations of the databased metric indexes described above are unsuitable for similarity measurement of the matrix formed data points appearing in this paper. Thus, the AMDM is introduced for the similarity computation of samples in KNNC. It is known that AMDM outperforms common FDM in terms of the similarity presentation between matrix formed samples for classification. Additionally, since selection of neighbors is greatly impacted by the sparsity of the sample distribution, this paper employs a novel assembled matrix distance based on density to efficiently measure the similarity between and its neighbor , using the following formula:
Unlike the classical KNNC voting strategy that uses unified weights for neighbors, in this paper, a weighted voting strategy is used to form the weighted knearest neighbor classifier (WKNNC), which assigns different weights to each sample in , reflecting the influence each neighbor has on an unknown sample . A new neighbor set is generally reconstructed in ascending order of distance; that is, , and thus the voting weight of sample is computed using the following equation:
Consequently, the class label of an unknown labeled sample can be determined as follows:where denotes the class label of in and is the Di carat function which has the functional value equal to 1 when , and otherwise it is equal to zero.
Additionally, the selection of is an issue that requires attention in the WKNNC algorithm. In this paper the value of is set to , since the classification precision is only just assured when the number of samples in equals , where the number of classes in training set is [30].
4. Overall Framework of the Proposed Fault Diagnostic Method
Based on the preparations above, this paper proposes a novel multisensor fused fault diagnosis method based on SSTLPPAMDM and WKNNCAMDM for rotation machinery. The flow chart for the proposed method is shown in Figure 1. There are three main steps to the diagnostic procedure, which will be discussed in detail in this section.
Firstly, through prevalent multidomain signal analysis and truncated sampling, a multisensor fused faulty sample set with an dimensional thirdorder tensor representation is constructed and then decomposed into an dimensional training sample set and an testing sample set, where is the number of vibration sensors located in the equipment being diagnosed, is the number of features originating from the vibration signal of a single sensor, and is the number of samples and .
The second step is compression of the highdimensional tensor into a relatively lowdimensional tensor using SSTLPPAMDM. By constructing a supervised similarity weighting matrix based on AMDM, the minimization problem is formulated for the weighted sum of the assembled matrix distances between samples in the embedded tensor space, in order to find the optimal transformation matrices and .
Finally, the lowdimensional projection of the testing sample set and the lowdimensional projection of the training sample set obtained in the previous step are input into WKNNC for fault diagnosis.
5. Experimental Results and Analysis
5.1. Experimental Setup
The validity of the newly proposed method will now be demonstrated using a fault diagnosis experiment of a singlestage gearbox. As shown in Figure 2(a), this paper employed a rotation machinery fault diagnosis experiment platform system of type QPZZII, which was converted into a gearbox fault test bench while the timing belt pulley at the side of gearbox was connected to the motor shaft. A diagram of the gearbox fault experiment system is displayed in Figure 2(b). Seven displacement sensors and accelerometers which were reinstalled to the input shaft and the end housings of the four bearings were employed to collect faulty vibration signals. Specific location information is shown in Table 1.

(a)
(b)
During the experiment, the sampling frequency was 5120 Hz, there were 53248 sampling points, the rotation speed of the drive motor was 880 rev/min, and the load was 0.2 A. There were six types of conditions used in the gearbox fault simulation experiment: () normal (Norm), () corrosion of the gearwheel (C_G), () broken teeth in the gearwheel (B_G), () wear of the pinion (W_P), () broken teeth in the gearwheel coupled with wear of the pinion (B_G_C_W_P), and () corrosion of the gearwheel coupled with wear of the pinion (C_G_C_W_P). Figure 3 shows the timedomain waveforms of the faulty samples originating from the seven different sensors under each condition, and the timedomain waveforms of the faulty samples originating from a single sensor under the six conditions are displayed in Figure 4. It can be observed from these graphs that the sensors reinstalled to different equipment positions have the distinct ability to characterize changes in the machinery condition, and thus it is a feasible method of fusing faulty information from multiple sensors for accurate fault diagnosis.
(a)
(b)
(c)
(d)
(e)
(f)
(a)
(b)
(c)
(d)
(e)
(f)
(g)
50 samples under each condition from a single sensor were subsequently selected, and 30 of these samples were used to train the fault diagnosis model, with the remaining samples used for the testing purposes. The length of each sample was 1024. Furthermore, five timedomain feature parameters and five frequencydomain parameters were calculated to construct a feature set: root mean square, skewness, kurtosis, impulse factor, peak factor, mean frequency, frequency center, root mean square frequency, standard deviation frequency, and kurtosis frequency, as commonly defined in the previous literature [31, 32]. Accordingly, a dimensional tensor formed sample set labeled with the corresponding classes was modeled to act as the input for the entire fault diagnosis experiment, which was composed of a dimensional tensor formed sample set for the training sample set and the remainder as the testing set.
5.2. Performance Analysis and Comparison of Different Dimension Reduction Algorithms
This subsection validates the effectiveness of the proposed SSTLPPAMDM algorithm for dimension reduction in fault diagnosis of rotation machinery, as well as its superiority to the traditional STLPP method. Using the calculation procedure described in Section 2.2 with the training sample set constructed above as the input, SSTLPPAMDM based dimension reduction is implemented to obtain the explicit transformation matrices and , as well as the lowdimensional secondorder tensor formed projection . In this experiment, the neighbor parameter was set to 13, the similarity penalty coefficient was 2, and the value of in the employed AMDM was 0.35. The first threedimensional diagram of the vectorrepresented dimension reduction result is shown in Figure 5(a). For comparison purposes, the traditional STLPPbased dimension reduction result for the same input is also displayed in Figure 5(b). It can be observed from the scatter plots that, after dimension reduction, the SSTLPPAMDM based samples have better separation with a distinct clustering distribution between classes. In contrast, after dimension reduction, the traditional STLPPbased samples show inferior plots with some overlapping samples, which indicates that the proposed SSTLPPAMDM dimension reduction algorithm is superior to the traditional STLPP in terms of the clustering performance of lowdimensional projection of the original highdimensional secondorder tensor formed samples.
(a)
(b)
For further confirmation of the superiority of the proposed secondorder tensor formed faulty samples originating from multisensor fusion over the vectorrepresented multisensor fused samples and the prevailing vectorformed faulty samples that originated from merely a single sensor, two further groups of experiments were designed. These experiments are the LPPbased dimension reduction of vector expressed multisensor fused samples (LPPVM) and the LPPbased dimension reduction of faulty samples from any single sensor (LPPVS) and their purpose is to compare with SSTLPPAMDM which is input by the proposed secondorder tensorrepresented faulty samples in terms of the dimension reduction effect shown as Figure 5(a). The dimension reduction results of these two experiments are displayed in Figures 6(a) and 6(b), respectively. In contrast to Figure 5(a), these results demonstrate that the proposed secondorder tensorrepresented multisensor fused samples combined with SSTLPPAMDM achieve the best clustering performance of the three experiments, that is, LPPVM, LPPVS, and the proposed SSTLPPAMDM. Beyond that, by comparing Figure 6(a) with Figure 6(b), it can be intuitively seen that the first diagram shows better sample clustering results than the second diagram, not only in terms of betweenclass decentralization but also in terms of withinclass aggregation. This indirectly demonstrates the benefit of using multisensor data fusion to increase the integrity of the fault information. In order to ensure the precision of these experimental verification conclusions, two other sets of comparison experiments were further implemented. The first was a quantitative analysis of the dimension reduction results of ten types of approaches which respectively adopted SSTLPPAMDM, STLPP, and LPP as well as three different inputs including the secondorder tensor formed multisensor fused data, the vectorrepresented multisensor fused data, and the vectored samples data from a single sensor. The detailed comparison results are provided in Table 2 and Figure 7. The other group of experiments compared the classification accuracies of these ten types of fault feature dimension reduction results using three popular intelligent fault classifiers: a support vector machine (SVM), a multilayer perception (MLP) neural network, and a support vector data description (SVDD). The specific experimental description is discussed in the following paragraphs and the comparison results are shown in Table 3.


(a)
(b)
(a)
(b)
(c)
The results shown in the scatter distribution diagrams in Figure 6 were used for the qualitative analysis to assess the characteristics of the dimension reduction results based on SSTLPPAMDM, STLPP, and LPP combined with different inputs. Three commonly used clustering performance measure indicators were used to quantitatively evaluate the ability of the dimension reduction algorithms to be used for subsequent fault classification: the withinclass scatter , the betweenclass scatter , and the synthesized withinclassbetweenclass scatter . The mathematical equations for these three indicators can be written as follows:where is the number of conditional classes, is the total number of samples where is the number of samples belonging to the th class, is the feature value of the th sample in the th class, is the mean feature value of the th class, and is the total mean feature value of all classes. It should be noted that the clustering performance of each feature is proportional to the values of the betweenclass scatter and the synthesized withinclassbetweenclass scatter and yet is inversely proportional to the value of the withinclass scatter .
The previouslymentioned training sample set is used, which contains sixclass faulty condition data for the gearbox as the input. Ten groups of experiments are performed to calculate the corresponding scatter parameters of the first threedimensional features of the vectored dimension reduction results: () SSTLPPAMDM based dimension reduction for secondorder tensor formed multisensor fused data (SSTLPPAMDM for STMD), () traditional STLPPbased dimension reduction for secondorder tensor formed multisensor fused data (STLPP for STMD), () LPPbased dimension reduction for vectorrepresented multisensor fused data (LPP for VMD), and ()–() LPPbased dimension reduction of vectored sample data from seven different positional sensors (LPP for VSD1~7). The scatter computation results for each dimensional feature based on each of the different ten methods are shown in Table 2 and the corresponding average scatter parameter values are displayed in Figure 7. It can be seen that the SSTLPPAMDM based dimension reduction results of tensor formed multisensor fused samples have the smallest withinclass scatter , the largest betweenclass scatter , and thus the largest synthesized scatter . The traditional STLPPbased dimension reduction for tensor formed multisensor fused data and the LPPbased dimension reduction for vectorrepresented multisensor fused data obtain larger values, smaller values, and smaller values than SSTLPPAMDM for STMD but achieve smaller values, larger values, and larger values than the other seven types of LPPbased dimension reduction approaches for vectored sample data originating from a single sensor. These comparison and analysis results indicate that SSTLPPAMDM for STMD is much more effective than any of the other nine dimension reduction methods for different types of samples data in terms of the clustering performance of the dimension reduction results.
As mentioned earlier, in order to acquire direct evidence of the superiority of the proposed SSTLPPAMDM algorithm as well as the multisensor data fusion, three frequentlyused intelligent classifiers (SVM, MLP neural network, and SVDD), respectively, acted on the first threedimensional features of the vectored dimension reduction results of the ten methods (M1~M10), which are marked as F1~F10. Each experiment is carried out ten times. For the SVM classifier, this paper employs a radial basis kernel function and the value of the kernel parameter is 1. For the MLP neural network, the commonly used three layers structure is employed: input layer, hidden layer, and output layer, and the numbers of nodes in the input and output layers are set to 3 and 6. These values depend on the number of input features and output classes. The geometric pyramid rule determines that the number of hidden layer nodes is 5. The Gaussian kernel function is used for the SVDD model, and the corresponding kernel parameter is set to 3. The classification results of the three models which are applied to each of the ten types of feature sets originating from the previous experiment are listed in Table 3.
As shown in Table 3, the reduced feature set of the tensor formed multisensor fused fault data based on the proposed SSTLPPAMDM dimension reduction algorithm (F1) achieves higher classification accuracy than the other nine types of reduced feature sets (F2~F10) for all three classifiers. The reduced feature sets of the tensor formed multisensor fused fault data based on the traditional STLPP algorithm (F2) and that of the vectorrepresented multisensor fused data based on LPP (F3) achieve the second and third highest classification accuracy. The seven types of reduced feature sets of vectored fault data originating from a single sensor (F4~F10) have the lowest level of classification accuracy. These results further confirm the effectiveness of the proposed SSTLPPAMDM combined with the formulated tensorrepresented multisensor data fusion to increase the amount of useful information in the feature set and facilitate the subsequent classification task.
5.3. Overall Performance Validation of the Proposed Fault Diagnosis Approach
The following experiments and analysis were also employed to verify the superiority of the proposed WKNNCAMDM method, as well as the overall fault diagnosis approach proposed by this paper. Using the implementation procedure for the proposed fault diagnosis method shown in Figure 1, a final fault diagnostic result is achieved by inputting the lowdimensional tensor formed testing sample set after dimension reduction with SSTLPPAMDM into WKNNCAMDM. Furthermore, the classification performance of WKNNCAMDM combined with the lowdimensional secondorder tensor formed multisensor fused sample data after dimension reduction is compared with the WKNNCFDM and the KNNCFDM, which both have the same input data. For all three classifiers, the neighborhood size was set to 13. Each experiment was performed ten times and the classification results of the three classifiers for the fault sample data of a gearbox including the cumulative number of false classification samples (Cum. number of FCS), the distribution of false classification samples within the six different faulty classes (number of FCS within Classes 1~6), and the total testing accuracy are listed in detail in Table 4.

It can be seen from Table 4 that although the same input data is used for the three classifiers, namely, the lowdimensional tensor formed testing sample set after dimension reduction with SSTLPPAMDM, the proposed WKNNC under AMDM has a classification accuracy of 100%, which is higher than the 89.17% accuracy achieved using WKNNC under traditional FDM and the 70.83% accuracy achieved using the classical KNNC with FDM. These results indicate that the WKNNCAMDM method has superior classification performance to WKNNCFDM and KNNCFDM, due to the addition of the assembled matrix distance metric for the similarity representation of the secondorder tensor formed samples and the weighted voting strategy for the nearest neighbor classifier. This experiment also effectively demonstrated the performance of the overall proposed total fault diagnosis framework, which comprehensively includes the SSTLPPAMDM based dimension reduction and the WKNNCAMDM.
6. Conclusions
This paper has presented a novel multisensor fused fault diagnosis approach for rotation machinery based on SSTLPPAMDM and WKNNCAMDM. Based on significant experimental analysis and comparisons that were performed, the main conclusions can be summarized as follows.(1)In contrast with traditional STLPP, the proposed SSTLPPAMDM algorithm can obtain better dimension reduction effects for the original highdimensional secondorder tensorrepresented samples. This was achieved by the addition of the class label information and improvement of the similarity evaluation method for matrix formed samples by AMDM. Furthermore, it was also verified that SSTLPPAMDM based dimension reduction of multisensor fused secondorder tensor formed samples is superior to LPPbased dimension reduction of multisensor fused vectorformed samples and LPPbased dimension reduction of vectorformed samples from a single sensor in terms of the clustering performance of samples of different classes after reduction.(2)The proposed WKNNCAMDM can obtain higher classification accuracy than WKNNCFDM and KNNCFDM due to the introduction of weighted voting strategy and assembled matrix distance metric for similarity representation of secondorder tensor formed samples.(3)Using the advantages of secondorder tensor formed multisensor fused faulty sample representation, SSTLPPAMDM for efficient dimension reduction, and WKNNCAMDM for rapid fault classification, the proposed fault diagnosis approach achieves higher classification accuracy for rotation machinery than the other homogenous methods.
In summary, the proposed fault diagnosis approach has the following strengths: more adequate fault information, lower calculation complexity, and higher fault recognition accuracy. Therefore, it is extremely suited to engineering applications for fault diagnosis of rotation machinery.
Competing Interests
The authors declare that they have no competing interests.
Acknowledgments
This research is supported by National Natural Science Foundation of China (no. 51575143).
References
 Y. Lei, J. Lin, Z. He, and M. J. Zuo, “A review on empirical mode decomposition in fault diagnosis of rotating machinery,” Mechanical Systems and Signal Processing, vol. 35, no. 12, pp. 108–126, 2013. View at: Publisher Site  Google Scholar
 Z. Feng, M. Liang, and F. Chu, “Recent advances in timefrequency analysis methods for machinery fault diagnosis: a review with application examples,” Mechanical Systems and Signal Processing, vol. 38, no. 1, pp. 165–205, 2013. View at: Publisher Site  Google Scholar
 Y. Lei, J. Lin, M. J. Zuo, and Z. He, “Condition monitoring and fault diagnosis of planetary gearboxes: a review,” Measurement, vol. 48, no. 1, pp. 292–305, 2014. View at: Publisher Site  Google Scholar
 Y. Wang, J. Xiang, R. Markert, and M. Liang, “Spectral kurtosis for fault detection, diagnosis and prognostics of rotating machines: a review with applications,” Mechanical Systems and Signal Processing, vol. 6667, pp. 679–698, 2016. View at: Publisher Site  Google Scholar
 Z. Li, X. Yan, Z. Tian, C. Yuan, Z. Peng, and L. Li, “Blind vibration component separation and nonlinear feature extraction applied to the nonstationary vibration signals for the gearbox multifault diagnosis,” Measurement, vol. 46, no. 1, pp. 259–271, 2013. View at: Publisher Site  Google Scholar
 J. Zhang, W. Ma, J. Lin, L. Ma, and X. Jia, “Fault diagnosis approach for rotating machinery based on dynamic model and computational intelligence,” Measurement, vol. 59, pp. 73–87, 2015. View at: Publisher Site  Google Scholar
 Z. Li, X. Yan, C. Yuan, Z. Peng, and L. Li, “Virtual prototype and experimental research on gear multifault diagnosis using waveletautoregressive model and principal component analysis method,” Mechanical Systems and Signal Processing, vol. 25, no. 7, pp. 2589–2607, 2011. View at: Publisher Site  Google Scholar
 S. W. Ji and J. P. Ye, “Generalized linear discriminant analysis: a unified framework and efficient model selection,” IEEE Transactions on Neural Networks, vol. 19, no. 10, pp. 1768–1782, 2008. View at: Publisher Site  Google Scholar
 T. F. Cox and M. A. Cox, MultiDimensional Scaling, Chapman & Hall, London, UK, 1994.
 J. B. Tenenbaum, V. De Silva, and J. C. Langford, “A global geometric framework for nonlinear dimensionality reduction,” Science, vol. 290, no. 5500, pp. 2319–2323, 2000. View at: Publisher Site  Google Scholar
 S. T. Roweis and L. K. Saul, “Nonlinear dimensionality reduction by locally linear embedding,” Science, vol. 290, no. 5500, pp. 2323–2326, 2000. View at: Publisher Site  Google Scholar
 M. Belkin and P. Niyogi, “Laplacian eigenmaps for dimensionality reduction and data representation,” Neural Computation, vol. 15, no. 6, pp. 1373–1396, 2003. View at: Publisher Site  Google Scholar
 Z. Y. Zhang and H. Y. Zha, “Principal manifolds and nonlinear dimensionality reduction via tangent space alignment,” SIAM Journal on Scientific Computing, vol. 26, no. 1, pp. 313–338, 2004. View at: Publisher Site  Google Scholar  MathSciNet
 F. Li, B. Tang, and R. Yang, “Rotating machine fault diagnosis using dimension reduction with linear local tangent space alignment,” Measurement: Journal of the International Measurement Confederation, vol. 46, no. 8, pp. 2525–2539, 2013. View at: Publisher Site  Google Scholar
 X. Ding, Q. He, and N. Luo, “A fusion feature and its improvement based on locality preserving projections for rolling element bearing fault classification,” Journal of Sound and Vibration, vol. 335, pp. 367–383, 2015. View at: Publisher Site  Google Scholar
 W. Wen, R. X. Gao, and W. Cheng, “Planetary gearbox fault diagnosis using envelope manifold demodulation,” Shock and Vibration, vol. 2016, Article ID 3952325, 13 pages, 2016. View at: Publisher Site  Google Scholar
 X. He, D. Cai, and P. Niyogi, “Tensor subspace analysis,” in Advances in Neural Information Processing Systems, pp. 499–506, 2005. View at: Google Scholar
 G. Dai and D.Y. Yeung, “Tensor embedding methods,” in Proceedings of the Neural Conference on Artificial Intelligence, pp. 330–335, July 2006. View at: Google Scholar
 X. Li, S. Lin, S. Yan, and D. Xu, “Discriminant locally linear embedding with highorder tensor data,” IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, vol. 38, no. 2, pp. 342–352, 2008. View at: Publisher Site  Google Scholar
 K. Lee and H. Park, “Probabilistic learning of similarity measures for tensor PCA,” Pattern Recognition Letters, vol. 33, no. 10, pp. 1364–1372, 2012. View at: Publisher Site  Google Scholar
 W. Zuo, D. Zhang, and K. Wang, “An assembled matrix distance metric for 2DPCAbased image recognition,” Pattern Recognition Letters, vol. 27, no. 3, pp. 210–216, 2006. View at: Publisher Site  Google Scholar
 N. Saravanan and K. I. Ramachandran, “Incipient gear box fault diagnosis using discrete wavelet transform (DWT) for feature extraction and classification using artificial neural network (ANN),” Expert Systems with Applications, vol. 37, no. 6, pp. 4168–4181, 2010. View at: Publisher Site  Google Scholar
 F. Chen, B. Tang, T. Song, and L. Li, “Multifault diagnosis study on roller bearing based on multikernel support vector machine with chaotic particle swarm optimization,” Measurement, vol. 47, no. 1, pp. 576–590, 2014. View at: Publisher Site  Google Scholar
 N. Saravanan, S. Cholairajan, and K. I. Ramachandran, “Vibrationbased fault diagnosis of spur bevel gear box using fuzzy technique,” Expert Systems with Applications, vol. 36, no. 2, pp. 3119–3135, 2009. View at: Publisher Site  Google Scholar
 J. Yu, M. Liu, and H. Wu, “Local preserving projectionsbased feature selection and Gaussian mixture model for machine health assessment,” Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, vol. 225, no. 7, pp. 1703–1717, 2011. View at: Publisher Site  Google Scholar
 R. Stoklasa, T. Majtner, and D. Svoboda, “Efficient kNN based HEp2 cells classifier,” Pattern Recognition, vol. 47, no. 7, pp. 2409–2418, 2014. View at: Publisher Site  Google Scholar
 F. Li, J. Wang, B. Tang, and D. Tian, “Life grade recognition method based on supervised uncorrelated orthogonal locality preserving projection and Knearest neighbor classifier,” Neurocomputing, vol. 138, pp. 271–282, 2014. View at: Publisher Site  Google Scholar
 T. M. Cover and P. E. Hart, “Nearest neighbor pattern classification,” IEEE Transactions on Information Theory, vol. 13, no. 1, pp. 21–27, 1967. View at: Publisher Site  Google Scholar
 J. Yang, D. Zhang, A. F. Frangi, and J.Y. Yang, “Twodimensional PCA: a new approach to appearancebased face representation and recognition,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 1, pp. 131–137, 2004. View at: Publisher Site  Google Scholar
 G. Gates, “The reduced nearest neighbor rule,” IEEE Transactions on Information Theory, vol. 18, no. 3, pp. 431–433, 1972. View at: Publisher Site  Google Scholar
 B. Samanta, “Gear fault detection using artificial neural networks and support vector machines with genetic algorithms,” Mechanical Systems and Signal Processing, vol. 18, no. 3, pp. 625–644, 2004. View at: Publisher Site  Google Scholar
 Y. Lei, Z. He, Y. Zi, and X. Chen, “New clustering algorithmbased fault diagnosis using compensation distance evaluation technique,” Mechanical Systems and Signal Processing, vol. 22, no. 2, pp. 419–435, 2008. View at: Publisher Site  Google Scholar
Copyright
Copyright © 2016 Fen Wei et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.