Advances in Nonnegative Matrix and Tensor Factorization

Cichocki, A.; Mørup, M.; Smaragdis, P.; Wang, W.; Zdunek, R.

doi:https://doi.org/10.1155/2008/852187

Computational Intelligence and Neuroscience

On this page

Acknowledgments Copyright Related Articles

Special Issue

Advances in Non Negative Matrix and Tensor Factorization

View this Special Issue

Editorial | Open Access

Volume 2008 | Article ID 852187 | https://doi.org/10.1155/2008/852187

Advances in Nonnegative Matrix and Tensor Factorization

A. Cichocki,¹M. Mørup,²P. Smaragdis,³W. Wang,⁴and R. Zdunek⁵

Received16 Jun 2008

Accepted16 Jun 2008

Published06 Jul 2008

Nonnegative matrix factorization (NMF) and its extension known as nonnegative tensor factorization (NTF) are emerging techniques that have been proposed recently. The goal of NMF/NTF is to decompose a nonnegative data matrix into a product of lower-rank nonnegative matrices or tensors (i.e., multiway arrays). An NMF approach similar to independent component analysis (ICA) or sparse component analysis (SCA) is very useful and promising for decomposing high-dimensional datasets into a lower-dimensional space. A great deal of interest has been given very recently to NMF models and techniques due to their capability of providing new insights and relevant information on the complex latent relationships in experimental datasets, and due to providing meaningful components with physical or physiological interpretations. For example, in bioinformatics, NMF and its extensions have been successfully applied to gene expression, sequence analysis, functional characterization of genes, clustering, and text mining. The main difference between NMF and other classical factorizations such as PCA, SCA, or ICA methods relies on the nonnegativity, and usually also additional constraints such as sparseness, smoothness, and/or orthogonality imposed on the models. These constraints tend to lead to a parts-based representation of the data, because they allow only additive, not subtractive, combinations of data items. In this way, the nonnegative components or factors produced by this approach can be interpreted as parts of the data. In other words, NMF yields nonnegative factors, which can be advantageous from the point of view of interpretability of the estimated components. Furthermore, in many real applications, data have a multiway (multiway array or tensor) structure. Exemplary data are video stream (rows, columns, RGB color coordinates, time), EEG in neuroscience (channels, frequency, time, samples, conditions, subjects), bibliographic text data (keywords, papers, authors, journals), and so on. Conventional methods preprocess multiway data, arranging them into a matrix. Recently, there has been a great deal of research on multiway analysis which conserves the original multiway structure of the data. The techniques have been shown to be very useful in a number of applications, such as signal separation, feature extraction, audio coding, speech classification, image compression, spectral clustering, neuroscience, and biomedical signal analysis.

This special issue focuses on the most recent advances in NMF/NTF methods, with emphasis on the efforts made particularly by the researchers from the signal processing and neuroscience area. It reports novel theoretical results, efficient algorithms, and their applications. It also provides insight into current challenging areas, and identifies future research directions.

This issue includes several important contributions which cover a wide range of approaches and techniques for NMF/NTF and their applications. These contributions are summarized as follows.

The first paper, entitled “Probabilistic latent variable models as nonnegative factorizations” by M. Shashanka et al., presents a family of probabilistic latent variable models that can be used for analysis of nonnegative data. The paper shows that there are strong ties between NMF and this family, and provides some straightforward extensions which can help in dealing with shift invariances, higher-order decompositions, and sparsity constraints. Furthermore, it argues through these extensions that the use of this approach allows for rapid development of complex statistical models for analyzing nonnegative data.

The second paper, entitled “Fast nonnegative matrix factorization algorithms using projected gradient approaches for large-scale problems” by R. Zdunek and A. Cichocki, investigates the applicability of projected gradient (PG) methods to NMF, based on the observation that the PG methods have high efficiency in solving large-scale convex minimization problems subject to linear constraints, since the minimization problems underlying NMF of large matrices well match this class of minimization problems. In particular, the paper has investigated several modified and adopted methods, including projected Landweber method, Barzilai-Borwein gradient projection, projected sequential subspace optimization, interior-point Newton algorithm, and sequential coordinatewise minimization algorithm, and compared their performance in terms of signal-to-interference ratio and elapsed time, using a simple benchmark of mixed partially dependent nonnegative signals.

The third paper, entitled “Theorems on positive data: on the uniqueness of NMF” by H. Laurberg et al., investigates the conditions for which NMF is unique, and introduces several theorems which can determine whether the decomposition is in fact unique or not. Several examples are provided to show the use of the theorems and their limitations. The paper also shows that corruption of a unique NMF matrix by additive noise leads to a noisy estimation of the noise-free unique solution. Moreover, it uses a stochastic view of NMF to analyze which characterization of the underlying model will result in an NMF with small estimation errors.

The fourth paper, entitled “Nonnegative matrix factorization with Gaussian process priors” by M. N. Schmidt and H. Laurberg, presents a general method for including prior knowledge in NMF, based on Gaussian process priors. It assumes that the nonnegative factors in the NMF are linked by a strictly increasing function to an underlying Gaussian process specified by its covariance function. The NMF decompositions are found to be in agreement with the prior knowledge of the distribution of the factors, such as sparseness, smoothness, and symmetries.

The fifth paper, entitled “Extended nonnegative tensor factorisation models for musical sound source separation” by D. FitzGerald et al., presents a new additive synthesis-based NTF approach which allows the use of linear-frequency spectrograms as well as imposing strict harmonic constraints, resulting in an improved model as compared with some existing shift-invariant tensor factorization algorithms in which the use of log-frequency spectrograms to allow shift invariance in frequency causes problems when attempting to resynthesize the separated sources. The paper further studies the addition of a source filter model to the factorization framework, and presents an extended model which is capable of separating mixtures of pitched and percussive instruments simultaneously.

The sixth paper, entitled “Gene tree labeling using nonnegative matrix factorization on biomedical literature” by K. E. Heinrich et al., addresses a challenging problem for biological applications, that is, identifying functional groups of genes. It examines the NMF technique for labeling hierarchical trees. It proposes a generic labeling algorithm as well as an evaluation technique, and discusses the effects of different NMF parameters with regard to convergence and labeling accuracy. The primary goals of this paper are to provide a qualitative assessment of the NMF and its various parameters and initialization, to provide an automated way to classify biomedical data, and to provide a method for evaluating labeled data assuming a static input tree. This paper also proposes a method for generating gold standard trees.

The seventh paper, entitled “Single-trial decoding of bistable perception based on sparse nonnegative tensor decomposition” by Z. Wang et al., presents a sparse NTF-based method to extract features from the local field potential (LFP), collected from the middle temporal visual cortex in a macaque monkey, for decoding its bistable structure-from-motion perception. The advantages of the sparse NTF-based feature-extraction approach lie in its capability to yield components common across the space, time, and frequency domains, yet discriminative across different conditions without prior knowledge of the discriminating frequency bands and temporal windows for a specific subject. The results suggest that imposing the sparseness constraints on the NTF improves extraction of the gamma band feature which carries the most discriminative information for bistable perception.

The eighth paper, entitled “Pattern expression nonnegative matrix factorization: algorithm and applications to blind source separation” by J. Zhang et al., presents a pattern expression NMF (PE-NMF) approach from the view point of using basis vectors most effectively to express patterns. Two regularization or penalty terms are introduced to be added to the original loss function of a standard NMF for effective expression of patterns with basis vectors in the PE-NMF. A learning algorithm is presented, and the convergence of the algorithm is proved theoretically. Three illustrative examples for blind source separation including heterogeneity correction for gene microarray data indicate that the sources can be successfully recovered with the proposed PE-NMF when the two parameters can be suitably chosen from prior knowledge of the problem.

The last paper, entitled “Robust object recognition under partial occlusions using NMF” by D. Soukup and I. Bajla, studies NMF methods for recognition tasks with occluded objects. The paper analyzes the influence of sparseness on recognition rates for various dimensions of subspaces generated for two image databases, ORL face database, and USPS handwritten digit database. It also studies the behavior of four types of distances between a projected unknown image object and feature vectors in NMF subspaces generated for training data. In the recognition phase, partial occlusions in the test images have been modeled by putting two randomly large, randomly positioned black rectangles into each test image.

Acknowledgments

The guest editors of this special issue are extremely grateful to all the reviewers who took time to carefully read the submitted manuscripts and to provide critical comments which helped to ensure the high quality of this issue. The guest editors are also much indebted to the authors for their important contributions. All these tremendous efforts and dedication have contributed to make this issue a reality.

A. Cichocki
M. Mørup
P. Smaragdis
W. Wang
R. Zdunek

Copyright

Copyright © 2008 A. Cichocki et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

2858

Downloads

1340

Citations