Abstract

Semisupervised Discriminant Analysis (SDA) is a semisupervised dimensionality reduction algorithm that easily resolves the out-of-sample problem. Related works usually focus on exploiting the geometric relationships of data points, which are not obvious, to enhance the performance of SDA. Different from these works, we study the construction of the regularized graph, which is crucial in graph-based semisupervised learning methods. In this paper, we propose a novel graph for Semisupervised Discriminant Analysis, called the combined low-rank and $k$-nearest neighbor (LRKNN) graph. In our LRKNN graph, we map the data to the low-rank (LR) feature space, and the $k$-nearest neighbor (KNN) algorithm is then adopted to satisfy the algorithmic requirements of SDA. Since the low-rank representation captures the global structure and the $k$-nearest neighbor algorithm maximally preserves the local geometrical structure of the data, the LRKNN graph can significantly improve the performance of SDA. Extensive experiments on several real-world databases show that the proposed LRKNN graph is an efficient graph constructor that largely outperforms other commonly used baselines.

1. Introduction

In real-world data mining and pattern recognition applications, labeled data are expensive or difficult to obtain, while unlabeled data are often copious and readily available. How to improve learning performance by exploiting the copious unlabeled data has therefore attracted considerable attention [1, 2]. Semisupervised dimensionality reduction can be applied directly to the whole dataset and does not require separate training and testing sets [3].

Inspired by semisupervised learning [4–6], Semisupervised Discriminant Analysis (SDA) was first proposed by Cai et al. [2]. It easily resolves the out-of-sample problem [7]. In the SDA algorithm, the labeled samples are used to maximize the separability between different classes, while the unlabeled ones are used to estimate the intrinsic geometric structure of the data. Since then, many variants of semisupervised LDA have been proposed. Zhang and Yeung proposed SSDA [3] using a path-based similarity measure. Similarly, SMDA [8] and UDA [9] perform LDA in a semisupervised setting with manifold regularization. The work in [6] utilizes unlabeled data to maximize an optimality criterion of LDA and solves the resulting optimization problem with the constrained concave-convex procedure.

Although these methods perform semisupervised LDA in different ways, they all need the geometric relationships between all the data points, obtained by constructing a regularized graph. This graph remarkably impacts the performance of these methods; however, little attention has been paid to graph construction. In this paper we therefore study the regularized graph construction problem of SDA [2]. Our main contributions are summarized below.

(i) Inspired by low-rank representation (LRR) [10] and the $k$-nearest neighbor (KNN) algorithm, we construct a novel graph called the combined low-rank and $k$-nearest neighbor (LRKNN) graph. LRR jointly obtains the representation of all the samples under a global low-rank constraint and is thus better at capturing the global data structure.

(ii) Since KNN is used to satisfy the algorithmic requirements of SDA, the affinity of the local geometrical structure is maximally preserved in the LRKNN graph.

(iii) Extensive experiments on real-world datasets show that our proposed LRKNN regularized graph significantly boosts the performance of Semisupervised Discriminant Analysis.

The rest of the paper is organized as follows. We briefly review related work in Section 2 and give the preliminaries in Section 3. We then introduce the combined low-rank and $k$-nearest neighbor graph construction framework in Section 4. Section 5 reports the experimental results on real-world databases, and Section 6 concludes the paper.

2. Related Work

This paper proposes a combined low-rank and $k$-nearest neighbor graph to boost the performance of Semisupervised Discriminant Analysis. Our work is related to both improvement techniques for Semisupervised Discriminant Analysis and graph constructor design. We briefly discuss both.

Cai et al. [2] proposed the semisupervised dimensionality reduction algorithm SDA, which captures the local structure of the data for dimensionality reduction. Zhang and Yeung proposed SSDA [3], which uses a path-based similarity measure to capture the global manifold structure of the data. SMDA [8] and UDA [9] also perform semisupervised LDA with manifold regularization. Nie et al. [11] proposed a semisupervised orthogonal discriminant analysis method with an orthogonality constraint. Zhang et al. [1] utilized must-link and cannot-link constraints to capture the underlying structure of the dataset. Song et al. [5] utilized labeled data to discover the class structure and unlabeled data to capture the intrinsic local geometry. Li et al. [12] presented a Probabilistic Semisupervised Discriminant Analysis (PSDA) algorithm, which utilizes unlabeled samples to approximate the class structure instead of the local geometry. Dhamecha et al. [13] presented an incremental Semisupervised Discriminant Analysis algorithm, which utilizes the unlabeled data to enable incremental learning. The work in [14] developed a graph-based semisupervised learning method based on PSDA for dimensionality reduction.

Our work is also related to another line of research: graph constructor design. Many methods have been proposed for graph construction; the $k$-nearest neighbor based method and the $\epsilon$-ball based method [15] are the two most popular for building the graph adjacency structure. On top of these two methods, various schemes such as the heat kernel [15] and the inverse Euclidean distance [16] are used to set the graph edge weights. However, all these methods rely on pairwise Euclidean distances, which are very sensitive to data noise. Moreover, since only the local pairwise relationships between data points are taken into account, the constructed graph cannot sufficiently reveal the clustering relationships among the samples. Yan et al. proposed an $\ell_1$-graph via sparse representation [10, 17]: an $\ell_1$-graph over a dataset is derived by encoding each datum as a sparse representation of the remaining samples. Zhuang et al. [18] proposed a novel method to construct an informative low-rank graph (LR-graph) for semisupervised learning, and Gao et al. proposed a graph construction method via group sparsity [19]. Li and Fu [20] developed an approach to construct a graph based on low-rank coding with a $b$-matching constraint, and proposed a supervised regularization based robust subspace (SRRS) approach via low-rank learning [21]. Zhao et al. proposed an approach to construct a sparse graph with a blockwise constraint for face representation, named SGB [22]. Sparse and low-rank graph-based discriminant analysis (SLGDA) combines both sparsity and low-rankness to maintain the global and local structures simultaneously [23]. In the work [24], Li and Fu incorporated the KNN constraint and the $b$-matching constraint into the low-rank representation model as the balanced (or unbalanced) graph. We focus on constructing a novel graph for SDA: we capture the data structure using LRR and then utilize the KNN algorithm to satisfy the algorithmic requirements of SDA.

The work most closely related to ours is the low-rank kernel-based Semisupervised Discriminant Analysis [25], our previous work, in which the LRR is used as the kernel in KSDA [2]. In the current work, we propose a novel graph for Semisupervised Discriminant Analysis, called the combined low-rank and $k$-nearest neighbor (LRKNN) graph. In our LRKNN graph, the KNN algorithm is adopted to satisfy the algorithmic requirements of SDA. Since the low-rank representation captures the global structure and the $k$-nearest neighbor algorithm maximally preserves the local geometrical structure of the data, the LRKNN graph captures not only the global structure but also the local information of the data, which largely improves the performance of SDA.

3. Preliminary

3.1. Overview of SDA

Given a set of samples $X = \{x_1, x_2, \ldots, x_n\}$, where $x_i \in \mathbb{R}^{d}$, the first $l$ samples are labeled and the remaining $n - l$ are unlabeled; the labeled samples belong to $c$ classes. The SDA method [2] seeks a projection matrix whose columns $a$ incorporate the prior assumption of consistency through a regularization term. The objective function is as follows:
$$a^{*} = \arg\max_{a} \frac{a^{T} S_{b} a}{a^{T} S_{t} a + \alpha J(a)}, \tag{1}$$
where $S_b$ and $S_t$ are the between-class scatter matrix and the total scatter matrix, respectively, $J(a)$ is the regularization term, and the within-class scatter matrix is defined as $S_w = S_t - S_b$.

The parameter $\alpha$ in (1) balances the model complexity and the empirical loss. The regularization term gives us the flexibility to incorporate prior knowledge into the application. We aim at constructing a graph that encodes the manifold structure through the available unlabeled samples.

Given the set of samples $X = \{x_1, \ldots, x_n\}$, we can construct a graph $G = (V, E)$ to represent the relationship between nearby samples, putting an edge between samples that are $k$-nearest neighbors of each other. The corresponding weight matrix is defined as follows:
$$W_{ij} = \begin{cases} 1, & \text{if } x_i \in N_k(x_j) \text{ or } x_j \in N_k(x_i), \\ 0, & \text{otherwise}, \end{cases} \tag{2}$$
where $N_k(x_i)$ denotes the set of $k$-nearest neighbors of $x_i$. The regularization term $J(a)$ can then be defined as follows:
$$J(a) = \sum_{i,j} \left( a^{T} x_i - a^{T} x_j \right)^{2} W_{ij} = 2\, a^{T} X L X^{T} a, \tag{3}$$
where $D$ is a diagonal matrix whose entries are the column (or row, since $W$ is symmetric) sums of $W$, that is, $D_{ii} = \sum_{j} W_{ij}$, and $L = D - W$ is the graph Laplacian [10]. We then obtain the objective function of SDA with the regularization term $J(a)$:
$$a^{*} = \arg\max_{a} \frac{a^{T} S_{b} a}{a^{T} \left( S_{t} + \alpha X L X^{T} \right) a}. \tag{4}$$
By solving the corresponding generalized eigenvalue problem, we obtain the projective vectors $a$.
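For concreteness, the following is a minimal sketch of SDA with the graph regularizer of (3)–(4), assuming numpy/scipy; the function name sda_fit is illustrative, samples are stored as columns, and any symmetric weight matrix $W$ over all $n$ samples can be supplied.

```python
import numpy as np
from scipy.linalg import eigh

def sda_fit(X, y, labeled_idx, W, alpha=0.1, dim=10):
    """X is d x n (one sample per column); y is length n but only read at labeled_idx."""
    d, n = X.shape
    D = np.diag(W.sum(axis=1))
    L = D - W                                    # graph Laplacian L = D - W
    Xl, yl = X[:, labeled_idx], y[labeled_idx]
    mu = Xl.mean(axis=1, keepdims=True)
    Sb = np.zeros((d, d))
    for c in np.unique(yl):                      # between-class scatter S_b
        Xc = Xl[:, yl == c]
        mc = Xc.mean(axis=1, keepdims=True)
        Sb += Xc.shape[1] * (mc - mu) @ (mc - mu).T
    St = (Xl - mu) @ (Xl - mu).T                 # total scatter S_t
    R = St + alpha * (X @ L @ X.T) + 1e-6 * np.eye(d)  # small ridge keeps R definite
    vals, vecs = eigh(Sb, R)                     # generalized eigenproblem of (4)
    return vecs[:, np.argsort(vals)[::-1][:dim]] # top projective vectors
```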

3.2. Low-Rank Representation

Liu et al. proposed the low-rank representation and used it to construct the affinities of an undirected graph (here called the LR-graph) [26]. It jointly obtains the representation of all the samples under a global low-rank constraint and is thus better at capturing the global data structure [16].

Let $X = [x_1, x_2, \ldots, x_n]$ be a set of samples, where each column is a sample that can be represented by a linear combination of the atoms in a dictionary $A$ [26]. Here, we select the samples themselves as the dictionary, $A = X$:
$$X = X Z, \tag{5}$$
where $Z = [z_1, z_2, \ldots, z_n]$ is the coefficient matrix with each $z_i$ being the representation coefficient of $x_i$. LRR seeks the lowest-rank solution by solving the following optimization problem [26]:
$$\min_{Z} \ \operatorname{rank}(Z) \quad \text{s.t. } X = X Z. \tag{6}$$
The above optimization problem can be relaxed to the following convex optimization [27]:
$$\min_{Z} \ \| Z \|_{*} \quad \text{s.t. } X = X Z, \tag{7}$$
where $\| \cdot \|_{*}$ denotes the nuclear norm (or trace norm) [28] of a matrix, that is, the sum of its singular values. Taking into account the noise or corruption present in real-world applications, a more reasonable objective function is
$$\min_{Z, E} \ \| Z \|_{*} + \lambda \| E \|_{\ell} \quad \text{s.t. } X = X Z + E, \tag{8}$$
where $\| E \|_{\ell}$ can be the $\ell_1$-norm or the $\ell_{2,1}$-norm. In this paper we choose the $\ell_{2,1}$-norm as the error measure, defined as $\| E \|_{2,1} = \sum_{j=1}^{n} \sqrt{\sum_{i=1}^{d} ([E]_{ij})^{2}}$. The parameter $\lambda$ balances the effect of the low-rank term and the error term. The optimal solution can be obtained via the inexact augmented Lagrange multipliers (ALM) method [29, 30].
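A hedged sketch of the inexact ALM solver for (8) follows; the update order and the parameters rho, mu, and tol are typical choices from the LRR literature [29, 30], not values specified in this paper.

```python
import numpy as np

def solve_l21(Q, tau):
    """Column-wise shrinkage: argmin_E tau*||E||_{2,1} + 0.5*||E - Q||_F^2."""
    norms = np.linalg.norm(Q, axis=0)
    scale = np.maximum(norms - tau, 0) / (norms + 1e-12)
    return Q * scale

def svt(Q, tau):
    """Singular value thresholding: argmin_J tau*||J||_* + 0.5*||J - Q||_F^2."""
    U, s, Vt = np.linalg.svd(Q, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0)) @ Vt

def lrr(X, lam=0.1, rho=1.1, mu=1e-2, mu_max=1e6, tol=1e-6, max_iter=500):
    """min ||Z||_* + lam*||E||_{2,1}  s.t.  X = XZ + E  (dictionary A = X)."""
    d, n = X.shape
    Z = np.zeros((n, n)); J = np.zeros((n, n)); E = np.zeros((d, n))
    Y1 = np.zeros((d, n)); Y2 = np.zeros((n, n))
    XtX = X.T @ X
    inv = np.linalg.inv(np.eye(n) + XtX)        # constant across iterations
    for _ in range(max_iter):
        J = svt(Z + Y2 / mu, 1.0 / mu)          # auxiliary variable update
        Z = inv @ (XtX - X.T @ E + J + (X.T @ Y1 - Y2) / mu)
        E = solve_l21(X - X @ Z + Y1 / mu, lam / mu)
        R1 = X - X @ Z - E; R2 = Z - J          # constraint residuals
        Y1 += mu * R1; Y2 += mu * R2            # dual (multiplier) updates
        mu = min(rho * mu, mu_max)
        if max(np.abs(R1).max(), np.abs(R2).max()) < tol:
            break
    return Z, E
```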

3.3. -Nearest Neighbor Algorithm

Samples $x_i$ and $x_j$ are considered neighbors if $x_i$ is among the $k$-nearest neighbors of $x_j$ or $x_j$ is among the $k$-nearest neighbors of $x_i$. There are different ways to assign the weights $W_{ij}$. The following are three of them.

(i) Inverse Euclidean distance [16] (here called KNNE, to distinguish the variants):
$$W_{ij} = \begin{cases} \dfrac{1}{\| x_i - x_j \|}, & \text{if } x_i \in N_k(x_j) \text{ or } x_j \in N_k(x_i), \\ 0, & \text{otherwise}. \end{cases} \tag{10}$$

(ii) 0-1 weighting [15] (here called KNNB), as used in the original SDA:
$$W_{ij} = \begin{cases} 1, & \text{if } x_i \in N_k(x_j) \text{ or } x_j \in N_k(x_i), \\ 0, & \text{otherwise}. \end{cases} \tag{11}$$

(iii) Heat kernel weighting [15] (here called KNNK):
$$W_{ij} = \begin{cases} e^{- \| x_i - x_j \|^{2} / t}, & \text{if } x_i \in N_k(x_j) \text{ or } x_j \in N_k(x_i), \\ 0, & \text{otherwise}, \end{cases} \tag{12}$$

where $N_k(x_i)$ denotes the set of $k$-nearest neighbors of $x_i$ in (10), (11), and (12). Using the regularization in (12), the affinity of the local geometrical structure can be maximally preserved.
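A compact sketch of the three weighting schemes (10)–(12), under the same assumptions as before (samples stored as columns; knn_weights is an illustrative name):

```python
import numpy as np

def knn_weights(X, k=4, mode="knnk", t=1.0):
    """Symmetric KNN weight matrix for a d x n sample matrix X."""
    n = X.shape[1]
    d2 = np.sum(X**2, axis=0)
    dist2 = np.maximum(d2[:, None] + d2[None, :] - 2.0 * (X.T @ X), 0.0)
    order = np.argsort(dist2 + np.diag([np.inf] * n), axis=1)[:, :k]
    mask = np.zeros((n, n), dtype=bool)
    rows = np.repeat(np.arange(n), k)
    mask[rows, order.ravel()] = True
    mask |= mask.T                       # neighbors if either direction holds
    if mode == "knne":                   # (10) inverse Euclidean distance
        W = 1.0 / (np.sqrt(dist2) + 1e-12)
    elif mode == "knnb":                 # (11) 0-1 weighting (original SDA)
        W = np.ones((n, n))
    else:                                # (12) heat kernel weighting
        W = np.exp(-dist2 / t)
    return np.where(mask, W, 0.0)
```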

4. Proposed Algorithm

4.1. Combined Low-Rank and -Nearest Neighbor (LRKNN) Graph Constructor Algorithm

Finding an appropriate subspace for classification is an important task, known as dimensionality reduction. Dimensionality reduction aims at finding a labeling of the graph that is consistent with both the initial labeling and the geometric structure of the data (the edges $E$ and weights $W$).

Existing SDA-style methods typically analyze the relationships in the data in a one-to-others manner. For example, the common $k$-nearest neighbor graph only records the edges, with weights set to 1, while the $\ell_1$-graph and the sparse representation graph (SR-graph) determine the graph weights through $\ell_1$-norm or $\ell_2$-norm constraints. These $\ell_1$-based graphs lack global constraints, which greatly degrades performance when the data are grossly corrupted. To remedy this drawback, Liu et al. proposed the low-rank representation and used it to construct the affinities of an undirected LR-graph [26]. The LR-graph jointly obtains the representation of all the samples under a global low-rank constraint and is thus better at capturing the global data structure [31].

Since the LR-graph, SR-graph, and $\ell_1$-graph are all asymmetric, a graph symmetrization step, that is, $W = ( |Z| + |Z|^{T} ) / 2$, was often used in previous works to satisfy the algorithmic requirements of SDA. Since the LRR is good at capturing the global data structure while the local geometrical structure can be maximally preserved by the $k$-nearest neighbor algorithm, we instead propose to use the $k$-nearest neighbor algorithm to satisfy those algorithmic requirements. The combined LRKNN method can therefore improve the performance to a very large extent. Heat kernel weighting [15] is used here.
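To make the contrast concrete, a small sketch follows. It assumes the columns of $|Z|$ serve as the low-rank feature space (our reading of "mapping to the LR feature space") and reuses the knn_weights helper sketched in Section 3.3:

```python
import numpy as np

def symmetrize(Z):
    """Previous works: W = (|Z| + |Z|^T) / 2 on the raw coefficient matrix."""
    return (np.abs(Z) + np.abs(Z).T) / 2.0

def lrknn_graph(Z, k=4, t=1.0):
    """Ours: heat-kernel KNN graph built in the low-rank feature space."""
    return knn_weights(np.abs(Z), k=k, mode="knnk", t=t)
```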

4.2. SDA Using Combined Low-Rank and -Nearest Neighbor Graph

The graph structure remarkably impacts the performance of these SDA-style methods; however, little attention has been paid to graph construction. In this paper we therefore present a novel combined low-rank and $k$-nearest neighbor graph algorithm, which largely improves the performance of SDA.

First, we map the labeled and unlabeled data to the LR-graph feature space. Second, we obtain a symmetric graph via the $k$-nearest neighbor algorithm with heat kernel weighting; by choosing an appropriate kernel parameter, this increases the similarities among intraclass samples and the differences among interclass samples. We then run the SDA algorithm for dimensionality reduction and finally apply the nearest neighbor method for classification in the derived low-dimensional feature subspace. The procedure is summarized in Algorithm 1.

Input: The whole dataset $X = \{x_1, \ldots, x_n\}$, where the first $l$ samples are labeled
and the remaining $n - l$ are unlabeled.
Output: The classification results.
Step 1. Map the labeled and unlabeled data to feature space by the LRR algorithm.
Step 2. Obtain the symmetric graph by the $k$-nearest neighbor algorithm.
Step 3. Implement the SDA algorithm for dimensionality reduction.
Step 4. Execute the nearest neighbor approach for the final classification.
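A hedged end-to-end sketch of Algorithm 1, wiring together the illustrative helpers lrr, lrknn_graph, and sda_fit sketched earlier; the parameter defaults are placeholders, not tuned values from the experiments:

```python
import numpy as np

def lrknn_sda_classify(X, y, labeled_idx, lam=0.1, k=4, t=1.0, alpha=0.1, dim=10):
    Z, _ = lrr(X, lam=lam)                     # Step 1: low-rank representation
    W = lrknn_graph(Z, k=k, t=t)               # Step 2: symmetric heat-kernel KNN graph
    A = sda_fit(X, y, labeled_idx, W, alpha=alpha, dim=dim)  # Step 3: SDA projection
    P = A.T @ X                                # project all samples
    labeled = P[:, labeled_idx]
    preds = np.empty(P.shape[1], dtype=y.dtype)
    for j in range(P.shape[1]):                # Step 4: 1-NN in the reduced subspace
        dists = np.linalg.norm(labeled - P[:, [j]], axis=0)
        preds[j] = y[labeled_idx][np.argmin(dists)]
    return preds
```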

5. Experiments and Analysis

To examine the performance of the LRKNN graph in the SDA algorithm, we conducted extensive experiments on several real-world datasets. In this section, we introduce the datasets we used and the experiments we performed, and then present the experimental results along with our analysis. The experiments were conducted on machines with Intel Core CPUs at 2.60 GHz and 8 GB of RAM.

5.1. Experiment Overview
5.1.1. Datasets

We evaluate the proposed method on 4 real-world datasets: three face databases and the USPS database. In these experiments, we normalize each sample to unit norm.

(i) ORL Database [10]. The ORL dataset contains 10 different images of each of 40 distinct subjects. The images were taken at different times, with varying lighting, facial expressions, and facial details. Each face image is manually cropped and resized to 32 × 32 pixels, with 256 grey levels per pixel.

(ii) Extended Yale Face Database B [32]. This dataset contains 38 individuals with around 64 near-frontal images under different illuminations per individual. Each face image is resized to 32 × 32 pixels. We select the first 20 persons and choose 20 samples per subject.

(iii) CMU PIE Face Database [2]. It contains 68 subjects with 41,368 face images captured under varying poses, illuminations, and expressions. Each image is resized to 32 × 32 pixels. We select the first 20 persons and choose 20 samples per subject.

(iv) USPS Database [33]. The USPS handwritten digit database is a popular benchmark containing 9,298 handwritten digit images of size 16 × 16 in total. Here, we randomly select 300 examples for the experiments.

5.1.2. Comparative Algorithms

To demonstrate how the combined LRKNN graph improves the dimensionality reduction performance of SDA, we compare against several graphs, including SR and LLE combined with the KNNK algorithm, as well as the separate algorithms (without KNN): SR, LLE, and KNNK. For the separate LR, SR, and LLE algorithms, the symmetrization process described in Section 4.1 is used to satisfy the algorithmic requirements of SDA, as in previous works.

(i) SR-Graph [29]. The SR-graph takes the reconstruction coefficients from the sparse representation obtained by solving $\min_{\alpha_i} \| \alpha_i \|_{1}$ s.t. $x_i = D \alpha_i$, where the dictionary $D$ consists of the remaining samples. The graph weight is defined as $W_{ij} = | \alpha_i(j) |$. A sketch of this construction appears after this list.

(ii) LLE-Graph [34]. The LLE-graph reconstructs each sample from its neighboring points and minimizes the reconstruction error; $W_{ij} = 0$ if $x_j$ does not belong to the neighbors of $x_i$. The number of nearest neighbors is set to 4. A sketch of the weight computation also appears after this list.

(iii) KNNK Graph [29]. We adopt the Euclidean distance as the similarity measure and use a Gaussian (heat) kernel to reweight the edges. The number of nearest neighbors is set to 4; similarly, the original SDA with the KNNB graph also uses 4 nearest neighbors.
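Below is a hedged sketch of the SR-graph baseline; it leans on scikit-learn's Lasso as a stand-in for the exact $\ell_1$ problem, and gamma is an assumed sparsity parameter rather than a value from the paper.

```python
import numpy as np
from sklearn.linear_model import Lasso

def sr_graph(X, gamma=0.01):
    """SR-graph: code each x_i over the remaining samples with an l1 penalty."""
    d, n = X.shape
    W = np.zeros((n, n))
    for i in range(n):
        others = [j for j in range(n) if j != i]
        coder = Lasso(alpha=gamma, fit_intercept=False, max_iter=5000)
        coder.fit(X[:, others], X[:, i])       # columns of X act as dictionary atoms
        W[i, others] = np.abs(coder.coef_)     # weight = |coefficient|
    return (W + W.T) / 2.0                     # symmetrize for SDA
```

And a sketch of the LLE-graph reconstruction weights, solving the sum-to-one constrained local least squares with a common ridge stabilizer (the reg value is an assumption):

```python
import numpy as np

def lle_graph(X, k=4, reg=1e-3):
    """LLE-graph: reconstruct each sample from its k nearest neighbors."""
    d, n = X.shape
    d2 = np.sum(X**2, axis=0)
    dist2 = d2[:, None] + d2[None, :] - 2.0 * (X.T @ X)
    np.fill_diagonal(dist2, np.inf)
    W = np.zeros((n, n))
    for i in range(n):
        nbrs = np.argsort(dist2[i])[:k]
        G = X[:, nbrs] - X[:, [i]]             # neighbors shifted to the origin
        C = G.T @ G
        C += reg * np.trace(C) * np.eye(k)     # regularize the local Gram matrix
        w = np.linalg.solve(C, np.ones(k))
        W[i, nbrs] = w / w.sum()               # enforce the sum-to-one constraint
    return W
```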

5.2. Experiment 1: Performances of SDA Using Different Regularized Graphs

To examine the effectiveness of the proposed combined LRKNN graph for SDA, we conduct experiments on the four databases. In each experiment, we randomly select 30% of the samples from each class as labeled samples and evaluate the performance with different numbers of selected features. The evaluations are conducted over 20 independent runs for each algorithm, and the averaged results are reported. We first use the different graph construction methods to obtain the regularization term $J(a)$, then run the SDA algorithm for dimensionality reduction, and finally employ the nearest neighbor approach for classification in the derived low-dimensional feature subspace. For each database, the classification accuracy of the different graphs is shown in Figure 1. Table 1 compares the different graph algorithms; the reported numbers are the best results over all the feature counts mentioned above, and the bold numbers mark the best result among the graph algorithms. From these results, we observe the following:

(i) In most cases, our proposed LRKNN graph consistently achieves the highest classification accuracy among the compared graphs, improving the classification performance to a large extent, which suggests that the LRKNN graph is more informative and better suited to SDA.

(ii) In most conditions, the performance of each combined algorithm is superior to its separate counterpart (without KNN), which means that our combination strategy is highly effective, especially for the LRR algorithm.

(iii) Since the SR-graph ($\ell_1$-graph) lacks global constraints, its improvement is not obvious even when combined with the KNN algorithm.

(iv) In some cases (perhaps at sufficiently high dimensionality), traditional graph construction methods such as the $\ell_1$-graph and the LLE-graph may achieve good performance on some databases, but they are not as stable as our proposed algorithm.

Table 2 shows the execution time of the eight methods mentioned above, reporting the average runtime over 20 independent runs with 10 features. Although our algorithm is slower than the traditional ones, its performance is much better than these baselines at an acceptable runtime.

5.3. Experiment 2: Parameters Settings

We examine the effect of the heat kernel parameter in the LRKNN, SR-, LLE-, and KNNK graphs. We vary the graph parameters and examine the classification accuracy on the four databases, again selecting 30% of the samples from each class as labeled. The evaluations are conducted over 20 independent runs and the averaged results are adopted. We take the average over the 10 different numbers of selected features mentioned in Section 5.2 as the final result, shown in Figure 2. We can see that the classification accuracy is influenced by the kernel parameter.

We also evaluate the effect of the number of nearest neighbors for the LRKNN graph, namely, the value of $k$ in the KNN algorithm. Here we conduct the experiments on the ORL database and the Extended Yale Face Database B, following the same procedure as above, and adopt the average over the 20 runs as the final result, shown in Figure 3. The classification accuracy improves as the number of nearest neighbors increases, but once it reaches around 3 or 4 the performance decreases slightly; we therefore choose 4 as the number of nearest neighbors in our experiments.

5.4. Experiment 3: Influence of the Label Number

We evaluate the influence of the number of labeled samples in this subsection. The experiments are conducted over 20 independent runs for each algorithm and the averaged results are reported, following the same procedure as in Section 5.2. For each database, we vary the percentage of labeled samples from 20% to 50%; the recognition accuracy is shown in Table 3, where the bold numbers mark the best results and the percentage after each database name is the label rate. From these results we observe the following.

In most cases, our proposed LRKNN graph consistently achieves the best results and is robust to variations in the label percentage. It is worth noting that even at very low label rates our proposed method achieves high classification accuracy, while some of the compared algorithms are not as robust, especially when the label rate is low. Thus, our proposed method has a clear advantage over the traditional graph construction methods. These traditional methods may sometimes achieve good performance on some databases when the label rate is high enough, but they are not as stable as our proposed algorithm. Since labeled data are expensive and difficult to obtain, our proposed graph makes the SDA algorithm more robust and better suited to real-world data.

5.5. Experiment 4: Performance of LRKNN Graph with Different Weight Methods

We evaluate the performance of the different weighting methods described in Section 3.3 for our LRKNN graph. We conduct 20 independent runs for each algorithm and average the results, following the same procedure as in Section 5.2. For each database, Figure 4 shows the performance of the three KNN weighting methods (KNNE, KNNB, and KNNK) for our LRKNN graph, from which we observe the following.

Overall, the KNNK-based LRKNN graph achieves the best results among the three methods. In some datasets the performance gap between the three methods is very small, while in others it is much larger, because KNNE and KNNB cannot capture the local structure well on those datasets and are not as stable as the KNNK algorithm. This is why we choose the heat kernel weighting method for the LRKNN graph.

5.6. Experiment 5: Robustness to Different Types of Noises

In this test we compare the performance of the different graphs in noisy environments, using the Extended Yale Face Database B. Gaussian white noise, "salt and pepper" noise, and multiplicative noise are added to the data, respectively. The Gaussian white noise has mean 0 and variance varying from 0 to 0.1. The "salt and pepper" noise is added to the images with noise densities from 0 to 0.1. The multiplicative noise is added to the data using the equation $\tilde{X} = X + n X$, where $X$ and $\tilde{X}$ are the original and noisy data and $n$ is uniformly distributed random noise with mean 0 and variance varying from 0 to 0.1.
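A sketch of the three corruptions, mirroring the usual MATLAB imnoise conventions (an assumption; the paper does not name its implementation); images are assumed scaled to [0, 1].

```python
import numpy as np

rng = np.random.default_rng(0)

def gaussian_noise(X, var):
    """Additive white Gaussian noise with mean 0 and the given variance."""
    return X + rng.normal(0.0, np.sqrt(var), X.shape)

def salt_and_pepper(X, density):
    """Flip a `density` fraction of pixels to 0 (pepper) or 1 (salt)."""
    Xn = X.copy()
    flips = rng.random(X.shape) < density
    Xn[flips] = rng.integers(0, 2, X.shape)[flips].astype(X.dtype)
    return Xn

def multiplicative_noise(X, var):
    """X~ = X + n * X with n uniform, mean 0: Var(U(-a, a)) = a^2/3."""
    half = np.sqrt(3.0 * var)
    n = rng.uniform(-half, half, X.shape)
    return X + n * X
```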

The number of labeled samples in each class is 30%. The experiments are conducted over 20 runs for each graph and the averaged results are reported, following the same procedure as in Section 5.2. The bold numbers represent the best results. For each graph, we vary the noise parameter; the results are shown in Tables 4, 5, and 6. As we can see, the results of our method are stable under Gaussian noise, "salt and pepper" noise, and multiplicative noise. Owing to the robustness of the low-rank representation to noise, our LRKNN method is much more robust than the other graphs: as the various kinds of noise gradually increase, the performance of some methods drops considerably, while our method remains robust and decreases only slightly.

6. Conclusions

In this paper, we proposed a novel combined low-rank and $k$-nearest neighbor graph algorithm, which largely improves the performance of SDA. The LRR naturally captures the global structure of the data, and the $k$-nearest neighbor algorithm maximally preserves its local geometrical structure; therefore, using the KNN algorithm to satisfy SDA's algorithmic requirements largely improves performance. Empirical studies on four real-world datasets show that our proposed LRKNN graph makes Semisupervised Discriminant Analysis more robust and better suited to real-world applications.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (no. 51208168), Tianjin Natural Science Foundation (no. 13JCYBJC37700), Hebei Province Natural Science Foundation (no. E2016202341, no. F2013202254, and no. F2013202102), and Hebei Province Foundation for Returned Scholars (no. C2012003038).