Abstract

The emerging theory of compressive sensing (CS) provides a new sparse signal processing paradigm for reconstructing sparse signals from undersampled linear measurements. Numerous algorithms have been developed to solve convex optimization problems for CS sparse signal recovery, yet in certain circumstances greedy algorithms exhibit superior performance to convex methods. This paper is a follow-up to the recent work of Wang and Yin (2010), who refine BP reconstructions via iterative support detection (ISD). We apply the heuristic idea of ISD to greedy algorithms and develop two approaches for accelerating the ECME iteration. The resulting algorithms, named ECME thresholding pursuits (EMTP), introduce two greedy strategies: in each iteration, EMTP detects a support set $I$ by thresholding the result of the ECME iteration and estimates the reconstructed signal by solving a truncated least-squares problem on the support set $I$. Two effective support detection strategies are devised for sparse signals with a fast-decaying distribution of nonzero components. Experimental studies are presented to demonstrate that EMTP offers an appealing alternative to state-of-the-art algorithms for sparse signal recovery.

1. Introduction

Sparsity exploitation has recently received a great amount of attention in the applied mathematics and signal processing communities, and sparse signal processing algorithms have been developed for various applications. A recent Proceedings of the IEEE special issue was devoted to applications of sparse representation and compressive sensing; some of the exciting developments are addressed in [1-7]. Compressed sensing, also known as compressive sensing or compressive sampling (CS) [8, 9], has been the subject of significant activity in sparse signal processing, where one seeks to efficiently recover a sparse unknown signal vector of dimension $n$ from a much smaller number $m$ of undersampled linear measurements. For a $k$-sparse unknown signal $x_0 \in \mathbb{R}^n$, sparse signal recovery is intimately related to solving an underdetermined system of linear equations $y = Ax_0$ under a sparseness constraint:
$$(P_{\ell_0})\colon\quad \min_x \|x\|_{\ell_0} \quad \text{s.t.} \quad Ax = y, \tag{1.1}$$
where $x$ is the true signal to be recovered, $\|x\|_{\ell_0}$ denotes the number of nonzero components of $x$, $A \in \mathbb{R}^{m \times n}$ is the measurement matrix, and $y \in \mathbb{R}^m$ is the measurement vector. Since $(P_{\ell_0})$ is nonconvex and combinatorial, the key insight in the pioneering work on CS [8, 9] is to replace $\ell_0$ by $\ell_1$, which yields the basis pursuit (BP) problem [10]:
$$\min_x \|x\|_{\ell_1} \quad \text{s.t.} \quad Ax = y. \tag{1.2}$$
Since then, numerous researchers have developed computational algorithms for sparse signal recovery [11, 12]. Generally, there are three major classes of computational algorithms: convex relaxation, Bayesian inference, and greedy pursuit. Convex relaxation replaces the combinatorial problem $(P_{\ell_0})$ with a convex optimization problem such as (BP); examples include basis pursuit [10], basis pursuit denoising [13], the least absolute shrinkage and selection operator (LASSO) [14], and least angle regression (LARS) [15]. Bayesian inference derives the maximum a posteriori estimator from a sparse prior model, as in sparse Bayesian learning (SBL) [16, 17] and Bayesian compressive sensing (BCS) [18, 19]. Another popular approach is to use greedy algorithms, which iteratively refine a sparse solution by successively selecting one or more elements. These algorithms include matching pursuit (MP) [20], orthogonal matching pursuit (OMP) [21-23], subspace pursuit (SP) [24], compressive sampling matching pursuit (CoSaMP) [25], and iterative thresholding algorithms [26, 27].

Iterative hard thresholding (IHT) [27] is a simple and powerful approach for sparse recovery. Recently, alternative algorithms have been presented to alleviate the slow convergence of IHT [28, 29]. Qiu and Dogandzic [28, 29] derived an expectation-conditional maximization either (ECME) iteration from a probabilistic framework based on Gaussian-distributed signals with unknown variance and proposed an acceleration method, termed the double overrelaxation (DORE) thresholding scheme, to improve the convergence speed of the ECME algorithm. They further proposed an automatic double overrelaxation (ADORE) thresholding method for the case where the underlying sparsity level is not available. In a related study [30], Wang and Yin presented an iterative support detection (ISD) algorithm that refines failed BP reconstructions by thresholding the solution of a truncated BP problem.

Inspired by the theoretical and empirical evidence of the favorable recovery performance of ECME [29] and ISD [30], we combine the two to devise novel sparse signal recovery methods, dubbed ECME thresholding pursuits (EMTP). EMTP detects a support set $I$ using an ECME iteration, estimates the reconstructed signal by solving a truncated least-squares problem on the support set $I$, and iterates these two steps for a small number of times. We present two effective support detection strategies (hard thresholding and dynamic thresholding) for sparse signals with a fast-decaying distribution of nonzeros (called fast-decaying signals in [30]), which include sparse Gaussian signals, sparse Laplacian signals, and certain power-law decaying signals.

This paper considers iterative greedy algorithms and abstracts them into two types [31], One-Stage Thresholding (OST) and Two-Stage Thresholding (TST), discussed in Sections 2.1 and 2.2, respectively. We then review the initial work on Iterative Support Detection (ISD) in Section 2.3. In Section 3, we describe the proposed approaches. Section 4 details the experimental results, and Section 5 concludes the paper.

1.1. Notation

We introduce the notation used in this paper.
(i) $x^{(t)}$: the algorithms described in this paper are iterative, and the reconstructed signal $x$ at the current iteration $t$ is denoted by $x^{(t)}$. The same convention is used for other vectors and matrices.
(ii) $I$, $A_I$: for an index set $I$, the matrix $A_I$ denotes the submatrix of $A$ containing only those columns of $A$ with indexes in $I$. The same convention is used for vectors.
(iii) $[1,n]\setminus I$: the complement of the set $I$ in the set $\{1,2,\ldots,n\}$.
(iv) $\operatorname{supp}(x)$: the support set of a vector $x$, that is, the index set corresponding to the nonzeros of $x$: $\operatorname{supp}(x) = \{i : x_i \neq 0\}$.
(v) $H_k(x)$: the hard thresholding operator that sets all but the $k$ largest-in-magnitude elements of a vector $x$ to zero.
(vi) $|x|$, $\|x\|_{\ell_p}$, $x^T$: the absolute value, $\ell_p$ norm, and transpose of a vector $x$, respectively.
(vii) $A^\dagger$: the Moore-Penrose pseudoinverse of a matrix $A \in \mathbb{R}^{m \times n}$; $A^\dagger = A^T(AA^T)^{-1}$ for $m \le n$ and $A^\dagger = (A^T A)^{-1} A^T$ for $m \ge n$.
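For concreteness, the hard thresholding operator $H_k(\cdot)$ takes only a few lines of MATLAB. The following is a minimal sketch (the function name hard_threshold is ours and is reused by later sketches):

function xk = hard_threshold(x, k)
% Keep the k largest-magnitude entries of x and set the rest to zero.
[~, idx] = sort(abs(x), 'descend');  % order indices by decreasing magnitude
xk = zeros(size(x));
xk(idx(1:k)) = x(idx(1:k));          % retain only the k leading entries
end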

2.1. One-Stage Thresholding

Qiu and Dogandzic derived an expectation-conditional maximization either (ECME) iteration based on a probabilistic framework [29]:
$$x^{(t)} = H_k\left(x^{(t-1)} + A^T\left(AA^T\right)^{-1}\left(y - Ax^{(t-1)}\right)\right). \tag{2.1}$$
Note that the ECME iteration reduces to the IHT step when the measurement matrix $A$ has orthonormal rows (i.e., $AA^T$ is the identity matrix). These one-stage thresholding algorithms (e.g., IHT [27] and ECME [29]) are guaranteed to recover sparse signals and to converge under suitable conditions. However, OST is often not the choice for practical applications due to slow convergence. To this end, Qiu and Dogandzic proposed an acceleration method, termed the double overrelaxation (DORE) thresholding scheme [28, 29], to improve the convergence speed of the ECME algorithm. DORE utilizes two overrelaxation steps:
$$x_1^{(t)} = x^{(t)} + \alpha_1\left(x^{(t)} - x^{(t-1)}\right), \qquad x_2^{(t)} = x_1^{(t)} + \alpha_2\left(x_1^{(t)} - x^{(t-2)}\right), \tag{2.2}$$
where $\alpha_1, \alpha_2$ are line search parameters. Finally, an additional hard thresholding step
$$x^{(t)} = H_k\left(x_2^{(t)}\right) \tag{2.3}$$
ensures that the resulting signal is $k$-sparse. In addition, Qiu and Dogandzic further presented an automatic double overrelaxation (ADORE) thresholding method for the case where the underlying sparsity level is not available.
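As an illustration, one ECME step (2.1) can be sketched in MATLAB as follows, reusing the hard_threshold sketch above (the variable names are ours; in practice the Gram matrix G would be precomputed once):

G = A * A';                                         % m-by-m Gram matrix
x = hard_threshold(x + A' * (G \ (y - A * x)), k);  % one ECME step (2.1)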

2.2. Two-Stage Thresholding

The algorithms described in this paper fall into a general family of iterative greedy pursuit algorithms. Following [31], we adopt the name "Two-Stage Thresholding" (TST). In a CS recovery problem, sparse recovery algorithms aim to detect the support set $I$ and approximate $y$ by
$$y = A_I x_I. \tag{2.4}$$
Starting with the initial solution $x = 0$, TST iterates between two main steps:

Step 1 (support detection). Detect the support set $I$ of the signal $x$; that is, select the atoms of the measurement matrix $A$ that have been used to generate $y$, or, in other words, determine the active atoms in the sparse representation of the signal $x$. In some of the literature, this step is also called basis selection or atom selection.

Step 2 (signal estimation). Update the signal $x$ using the least-squares solution on the detected support set $I$: $x_I = \arg\min_z \{\|y - A_I z\|_2^2 : \operatorname{supp}(z) \subseteq I\}$, $x_{[1,n]\setminus I} = 0$.

Many algorithms (e.g., Orthogonal Matching Pursuit (OMP) [23], Subspace Pursuit (SP) [24], Compressive Sampling Matching Pursuit (CoSaMP) [25], and Gradient Pursuits (GP) [32]) developed various approaches for Step 1 or Step 2.
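Schematically, one TST iteration might look as follows in MATLAB, where detect_support and tol are placeholders of ours: detect_support stands for whatever Step 1 rule a particular algorithm uses.

n = size(A, 2);
x = zeros(n, 1); r = y;
while norm(r) / norm(y) > tol
    I = detect_support(x, A, r);  % Step 1: support detection (algorithm specific)
    x = zeros(n, 1);
    x(I) = A(:, I) \ y;           % Step 2: least-squares estimate on I
    r = y - A * x;                % residual for the next iteration
end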

2.3. Iterative Support Detection

Considering the failed reconstructions of BP, Wang and Yin [30] proposed an algorithmic framework, called iterative support detection (ISD), to refine BP reconstructions. ISD alternates between two steps: support detection and signal reconstruction. Initializing the detected support $I = \emptyset$ and the iteration number $t = 0$, ISD iteratively performs the following steps:
(1) signal reconstruction: solve the truncated BP problem with $T = I^C$:
$$x^{(t)} = \arg\min_x \|x_T\|_1 \quad \text{s.t.} \quad y = Ax; \tag{2.5}$$
(2) support detection: detect the support set $I$ using $x^{(t)}$ as the reference.

The reliability of ISD relies on the support detection. Wang and Yin devised several detection strategies for sparse signals with a fast-decaying distribution of nonzero components (called fast-decaying signals [30]). One of the detection strategies is based on thresholding (in the following, we use ISD to refer to the implementation defined by (2.6)):
$$I^{(t)} = \left\{ i : \left|x_i^{(t)}\right| > \beta^t \max\left|x^{(t)}\right| \right\}, \quad \beta \in (0,1). \tag{2.6}$$
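In MATLAB, the thresholding rule (2.6) amounts to a single line (a sketch, with beta and t as in (2.6)):

I = find(abs(x) > beta^t * max(abs(x)));  % indices above the decaying threshold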

3. ECME Thresholding Pursuits

In this section, we describe our proposed approaches, named ECME Thresholding Pursuits (EMTP), which combine OST and TST using the heuristic idea of ISD. The detailed description of the proposed algorithms is presented as follows.

Step 1 (initialization). Initialize the reconstructed signal $x^{(0)} = 0$, the residual signal $r^{(0)} = y$, and the support set $I^{(0)} = \emptyset$, and set the iteration counter $t = 1$.

Step 2 (support detection). Update the signal approximation:
$$x^{(t)} = x^{(t-1)} + A^\dagger r^{(t-1)}. \tag{3.1}$$
Detect the support set $I$ using one of the following strategies:
strategy 1 (hard thresholding): $I^{(t)} = \operatorname{supp}(H_k(x^{(t)}))$;
strategy 2 (dynamic thresholding): $I^{(t)} = \{i : |x_i^{(t)}| > \beta^t \max|x^{(t)}|\}$.

Step 3 (signal estimation). Estimate the signal:
$$x_{I^{(t)}}^{(t)} = A_{I^{(t)}}^\dagger y, \qquad x_{[1,n]\setminus I^{(t)}}^{(t)} = 0. \tag{3.2}$$
Update the residual:
$$r^{(t)} = y - Ax^{(t)}. \tag{3.3}$$

Step 4 (halting). Check the stopping condition; if it is not yet time to stop, increment $t = t + 1$ and return to Step 2. If it is time to stop, the recovered signal $x$ has its nonzero entries on the support set $I^{(t)}$, with the corresponding values given by $x_{I^{(t)}}^{(t)}$.

We present the two thresholding proposals and the corresponding algorithms EMTP-$k$ (shown as Algorithm 1) and EMTP-$\beta$ (shown as Algorithm 2), where $\beta \in (0,1)$ is the thresholding parameter.

Algorithm 1: EMTP-$k$.
Input: measurement matrix $A$, measurements $y$, sparsity level $k$
Output: the reconstructed signal $x$
(1) Initialization:
(2)   $t = 1$ // iteration number
(3)   $x^{(0)} = 0$ // initial signal
(4)   $r^{(0)} = y$ // initial residual
(5)   $I^{(0)} = \emptyset$ // initial support set
(6) while halting criterion false do
(7)   $x^{(t)} = H_k(x^{(t-1)} + A^\dagger r^{(t-1)})$
(8)   $I^{(t)} = \{i : x_i^{(t)} \neq 0\}$
(9)   $x_{I^{(t)}}^{(t)} = A_{I^{(t)}}^\dagger y$
(10)  $x_{[1,n]\setminus I^{(t)}}^{(t)} = 0$
(11)  $r^{(t)} = y - Ax^{(t)}$
(12)  $t = t + 1$
(13) end while
(14) return $x$

Algorithm 2: EMTP-$\beta$.
Input: measurement matrix $A$, measurements $y$, thresholding parameter $\beta$
Output: the reconstructed signal $x$
(1) Initialization:
(2)   $t = 1$ // iteration number
(3)   $x^{(0)} = 0$ // initial signal
(4)   $r^{(0)} = y$ // initial residual
(5)   $I^{(0)} = \emptyset$ // initial support set
(6) while halting criterion false do
(7)   $x^{(t)} = x^{(t-1)} + A^\dagger r^{(t-1)}$
(8)   $I^{(t)} = \{i : |x_i^{(t)}| > \beta^t \max|x^{(t)}|\}$
(9)   $x_{I^{(t)}}^{(t)} = A_{I^{(t)}}^\dagger y$
(10)  $x_{[1,n]\setminus I^{(t)}}^{(t)} = 0$
(11)  $r^{(t)} = y - Ax^{(t)}$
(12)  $t = t + 1$
(13) end while
(14) return $x$
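For reference, a minimal MATLAB implementation of EMTP-$k$ following Algorithm 1 might read as follows; the arguments tol and maxit realize the halting criterion and are our additions:

function x = emtp_k(A, y, k, tol, maxit)
% A sketch of EMTP-k (Algorithm 1).
n = size(A, 2);
Adag = pinv(A);                        % precompute the pseudoinverse
x = zeros(n, 1); r = y; t = 1;
while norm(r) / norm(y) > tol && t <= maxit
    u = x + Adag * r;                  % line 7: ECME update
    [~, idx] = sort(abs(u), 'descend');
    I = idx(1:k);                      % line 8: support of the k leading entries
    x = zeros(n, 1);
    x(I) = A(:, I) \ y;                % lines 9-10: truncated least squares
    r = y - A * x;                     % line 11: residual update
    t = t + 1;                         % line 12
end
end

EMTP-$\beta$ is obtained by replacing lines 7 and 8 with the plain ECME update and the dynamic thresholding rule I = find(abs(u) > beta^t * max(abs(u))).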

Remarks. (1) EMTP updates the leading elements of $x^{(t)}$ using a least-squares solution on the support set $I$, whereas DORE updates all elements of $x^{(t)}$ using double overrelaxation steps and finally needs a hard thresholding step to ensure that the new approximation is $k$-sparse. EMTP-$\beta$ demands no prior knowledge of the underlying sparsity level $k$.
(2) EMTP is a special case of TST that utilizes OST in the support detection stage, which distinguishes it from OMP, SP, and CoSaMP. When the ECME iteration reduces to the IHT step, the reference for support detection, $x^{(t)} = x^{(t-1)} + A^T r^{(t-1)}$, is a gradient descent step for the least-squares objective. OMP, SP, and CoSaMP instead use the correlation between the residual signal and the atoms of the measurement matrix $A$ (i.e., $A^T r^{(t-1)}$, the negative gradient of the least-squares objective) to detect the support set. The ECME iteration thus provides an estimate of the underlying sparse signal, and we can employ the heuristic idea of ISD to devise various support detection strategies depending on the underlying sparse signal distribution; the dynamic thresholding method, in particular, can be performed without the sparsity level $k$. EMTP directly detects the support of the underlying sparse signal by referencing the ECME iteration, while OMP, SP, and CoSaMP augment the support by picking out the leading values of the negative gradient. OMP adds one index to the support per iteration, so it requires more iterations than EMTP. SP and CoSaMP add several indexes per iteration, so they need an additional step (i.e., an orthogonal projection) to ensure that the recovered signal is $k$-sparse. Like ISD, EMTP has an appealing self-correction capacity. An EMTP demo is presented in Figure 1.
(3) Like other greedy pursuits such as Subspace Pursuit [24] and CoSaMP [25], EMTP-$k$ fixes the cardinality of the support set $I$ and removes previous false detections. In contrast, EMTP-$\beta$ refines a support set $I$ that is not necessarily increasing and nested over the iterations.
(4) The dynamic thresholding strategy used in EMTP-$\beta$ is inspired by ISD [30]. It finds significant nonzeros by comparison against a threshold rather than by maintaining a fixed number ($k$) of items, which is appealing when the underlying sparsity level $k$ is not available.
(5) EMTP resembles ISD [30]: the two share the same iterative idea in the support detection step. However, their support detection steps are based on different sparse recovery methods, ECME and BP, respectively. Moreover, EMTP updates the reconstructed signal using a least-squares solution on the detected support set $I$, whereas ISD iteratively refines the BP solution on the complement of the detected support set $I$.
(6) EMTP-$\beta$ with large $\beta$ obtains high-quality reconstruction from a small number of measurements; however, because the support set $I$ then grows slowly, EMTP-$\beta$ takes a larger number of iterations (to be discussed in Section 4). For EMTP-$k$, wrongly detected elements can often be pruned out in later iterations.
(7) Like ISD (as discussed in [30]), EMTP-$\beta$ performs well only for fast-decaying signals; it does not work on sparse signals that decay slowly or have no decay at all (e.g., ternary and binary sparse signals). EMTP-$k$ also performs worse for sparse signals whose nonzero elements have similar magnitudes [31, 33]. However, EMTP can be applied to non-fast-decaying signals via a linear or nonlinear premapping [34, 35].

3.1. Complexity Analysis

As in the DORE algorithm, the basic operations are matrix-vector multiplications and sorting. Assume that the pseudoinverse matrix $A^\dagger$ is precomputed (at a cost of $\mathcal{O}(m^2 n + mn)$). Line 7 of Algorithms 1 and 2, which updates the ECME iteration, requires $\mathcal{O}(mn)$ operations. The computational complexity of the thresholding operation (line 8 of Algorithms 1 and 2) is $\mathcal{O}(n \log k)$. Line 9 involves the partial least-squares solver, costing $\mathcal{O}(k^2 m + km)$. The residual update requires an additional matrix-vector operation, costing $\mathcal{O}(km)$. To summarize, one EMTP iteration costs $\mathcal{O}(mn + n \log k + k^2 m + 2km)$, which is significantly less than the cost of one DORE step ($\mathcal{O}(2n \log k + 2m^2 n + 3mn)$).

4. Experimental Results

To assess the performance of the proposed approaches, we conducted numerical experiments via computer simulations. All algorithms were implemented and tested in MATLAB v7.6 running on Windows XP with a 2.53 GHz Intel Celeron CPU and 2 GB of memory. We compared the proposed approaches with the accelerated ADORE/DORE approaches, whose code is available on the authors' homepage (http://home.eng.iastate.edu/~ald/DORE.htm). We initialized $x$ with the zero signal. The search length of ADORE was set to 1. All results were averaged over 100 Monte Carlo problem instances. We used our unoptimized code (http://cs-notes.googlecode.com/files/EMTP.zip). The least-squares solution $x = \arg\min_x \|y - Ax\|_2^2$ was implemented using the MATLAB pseudoinverse function as x = pinv(A)*y. The MATLAB code was partially adapted from [30, 33, 36]. In all experiments, the measurement matrix $A$ was drawn from the uniform spherical ensemble (USE); that is, we generated the measurement matrix by sampling each entry independently from the standard normal distribution and then normalizing each column to have unit norm. The MATLAB code is given by

A = randn(m, n); A = A ./ repmat(sqrt(sum(A.^2)), [m 1]);

The underlying $k$-sparse vectors were generated by randomly selecting a support set of size $k$ and sampling each entry on the support set independently from a specific distribution. The sparse signals were generated in MATLAB by

x = zeros(n,1); support = randperm(n); x(support(1:k)) = v;

with the nonzero values v generated by the following code.

The sparse Gaussian signals were generated in MATLAB by

v = randn(k,1);

The sparse Laplacian signals were generated by

z = rand([k, 1]);
v = zeros([k, 1]);
in = z <= 0.5;
ip = z > 0.5;
v(in) = 1/lambda * log(2 * z(in));
v(ip) = -1/lambda * log(2 * (1 - z(ip)));

The power-law decaying signals were generated by

v = sign(randn(k,1)) .* ((1:k).^(-1/lambda))';

The variable lambda controls the rate of decay. We set lambda=10 and lambda=0.5 for sparse Laplacian signals and power-law decaying signals, respectively.

For a fair comparison, we stopped the iterations once the relative error fell below a convergence tolerance or the number of iterations exceeded 100. The convergence criterion is
$$\frac{\left\|r^{(t)}\right\|_2}{\|y\|_2} = \frac{\left\|y - Ax^{(t)}\right\|_2}{\|y\|_2} \le 10^{-6}. \tag{4.1}$$
We empirically evaluate the reconstruction performance in terms of the signal-to-noise ratio (SNR) and the number of iterations. The SNR is defined as
$$\mathrm{SNR(dB)} = -20 \log_{10} \frac{\left\|x_0 - x\right\|_2}{\left\|x_0\right\|_2}, \tag{4.2}$$
where $x_0$ is the true signal and $x$ is the recovered signal.
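In MATLAB, the stopping test (4.1) and the SNR (4.2) are one-liners (a sketch using our variable names):

stop = norm(y - A*x) / norm(y) <= 1e-6;         % convergence test (4.1)
snr_db = -20 * log10(norm(x0 - x) / norm(x0));  % SNR in dB, cf. (4.2)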

4.1. Influence of Thresholding Parameter

We evaluated the influence of the thresholding parameter $\beta$ by varying $\beta$ from 0.4 to 0.7. First, we fixed the signal dimension $n = 400$ and the underlying sparsity level $k = 20$ and varied the number of measurements. The influence of the thresholding parameter $\beta$ for sparse Gaussian signals is plotted in Figure 2. It is clear that a larger thresholding parameter achieves better recovery performance from fewer measurements, at the cost of more iterations; large $\beta$ is particularly competitive when the number of measurements is fairly small. It is worth noting that the number of iterations required for exact recovery is smaller than the underlying sparsity level $k$, especially for large $k$. As shown in Figure 2, EMTP-$\beta$ achieves an SNR of around 300 dB; that is, the relative error is almost at the level of double precision. Second, we repeated the comparison with the number of measurements fixed at $m = 80$, as shown in Figure 3. Finally, we enlarged the signal dimension ($n = 2000$), as shown in Figures 4 and 5. The latter tests lead to the same conclusions as the first.

We also fixed the ratios $m/n$ and $k/m$ and plotted the number of iterations as a function of the problem size $n$. Given sufficient measurements, as shown in Figure 6 with $m/n = 0.4$, $k/m = 0.2$, and n = 200:400:2200, EMTP-$k$ requires by far the fewest iterations, and EMTP-$\beta$ requires relatively more iterations than DORE. The number of iterations for EMTP-$k$ and DORE is stable with respect to the problem size, while EMTP-$\beta$ needs an acceptable number of additional iterations as the problem size increases. In the second case, shown in Figure 7 with $m/n = 0.3$, $k/m = 0.3$, and n = 200:400:2200, DORE requires more iterations, EMTP-$k$ needs significantly fewer iterations as the problem size increases, and EMTP-$\beta$ requires a relatively stable number of iterations.

4.2. Comparisons with the Accelerated OST

We present comparisons in terms of SNR (dB) and the number of iterations. We omit comparisons with $\ell_1$-minimization methods and some other OST/TST methods, as they have already been investigated in [31, 33]. First, we fixed the signal dimension $n = 400$ and the underlying sparsity level $k = 20$ and varied the number of measurements; the results are plotted in Figure 8 (zoomed in for better illustration). Next, we repeated the comparison with the number of measurements fixed at $m = 80$, as shown in Figure 9. Finally, the tests were repeated with larger-dimension signals ($n = 2000$); the corresponding results are depicted in Figures 10 and 11. Figures 8 and 10 show the average SNR (dB) (a) and number of iterations (b) for each indeterminacy level $m/n$, while Figures 9 and 11 show them for each sparsity level $k/m$. EMTP achieves significantly higher SNR (dB) than ADORE/DORE; for exact recovery, EMTP obtains a relative error almost as low as double precision. EMTP-$\beta$ consistently needs a small number of iterations, and EMTP-$k$ needs the smallest number of iterations for exact recovery, whereas ADORE needs hundreds of iterations. The larger-dimension tests lead to the same conclusions as the first test. These figures do not fully capture the performance of EMTP-$\beta$, since we only set $\beta = 0.5$; even so, EMTP-$\beta$ achieved almost uniformly superior performance to ADORE/DORE.

4.3. Performance for Other Fast-Decaying Signals

As discussed in [30, 33], thresholding-based sparse recovery methods achieve superior performance only if the nonzero elements of the signal have a fast-decaying distribution. We present empirical studies on other fast-decaying signals. Sparse Laplacian signals containing $k = 20$ nonzero elements with dimension $n = 400$ were tested, and the corresponding results are plotted in Figure 12; for larger-dimension signals ($n = 2000$), the results are shown in Figure 13. Power-law decaying signals containing $k = 20$ nonzero elements with dimension $n = 400$ were tested, and the corresponding results are plotted in Figure 14; for larger-dimension signals ($n = 2000$), the results are depicted in Figure 15. ADORE/DORE were derived from a probabilistic framework based on the Gaussian distribution. Surprisingly, they also work for these fast-decaying signals, as depicted in Figures 12, 13, 14, and 15. The conclusions for sparse Gaussian signals thus generalize to sparse Laplacian signals and power-law decaying signals. Power-law decaying signals at low indeterminacy levels $m/n$ are an exception, for which EMTP-$\beta$ achieves inferior performance.

4.4. Phase Transitions

Following [31, 33], we present numerical comparisons in terms of phase transitions. For sparse signal recovery from compressed measurements, the indeterminacy $\delta = m/n$ measures the undersampling of a vector, and $\rho = k/m$ is a normalized measure of the sparsity. We fix the problem size $n = 400$ and test 100 Monte Carlo realizations of each sparse vector at each pair $(\rho, \delta)$, on a grid of $16 \times 16$ linearly spaced $(\rho, \delta)$ combinations with $\rho$ and $\delta$ each varying from 0.05 to 0.5 in 16 steps. For each sparsity and indeterminacy pair, we then find the probability of exact recovery, where exact recovery is declared when $\|x_0 - x\|_2 / \|x_0\|_2 < 0.01$. For each problem indeterminacy, we interpolate the results over all sparsities to find where successful recovery occurs with probability 0.5. As a result, we obtain a phase transition plot showing the boundary above which most recoveries fail and below which most recoveries succeed. Figure 16 presents the recovery rates of DORE, SP, and EMTP-$\beta$ for Gaussian-distributed sparse vectors as a function of problem sparsity, for four problem indeterminacies drawn from the thickest to the thinnest lines; we observe that EMTP-$\beta$ outperforms SP and DORE in recovery rate. Figure 17 shows the phase transitions of DORE, SP, and EMTP-$\beta$ for compressively sensed sparse vectors sampled from the Gaussian distribution; it indicates that the transitions obey the following hierarchy in recovery performance: EMTP-$\beta$ > SP > DORE.
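The phase-transition experiment can be sketched as the following MATLAB loop over the $(\rho, \delta)$ grid; recover stands for the recovery algorithm under test and is a placeholder of ours:

n = 400; trials = 100;
deltas = linspace(0.05, 0.5, 16); rhos = linspace(0.05, 0.5, 16);
success = zeros(numel(rhos), numel(deltas));
for di = 1:numel(deltas)
    m = round(deltas(di) * n);
    for ri = 1:numel(rhos)
        k = round(rhos(ri) * m);
        for trial = 1:trials
            A = randn(m, n); A = A ./ repmat(sqrt(sum(A.^2)), [m 1]);
            x0 = zeros(n, 1); p = randperm(n); x0(p(1:k)) = randn(k, 1);
            x = recover(A, A * x0, k);            % algorithm under test
            ok = norm(x0 - x) / norm(x0) < 0.01;  % exact recovery test
            success(ri, di) = success(ri, di) + ok;
        end
    end
end
success = success / trials;  % empirical probability of exact recovery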

4.5. Summary

To sum up, we compared EMTP with ADORE/DORE. EMTP has significant advantages over the accelerated OST in terms of SNR and the number of iterations: it greatly reduces the number of iterations required and achieves significantly higher SNR. At low indeterminacy levels $m/n$, EMTP requires fewer measurements. EMTP-$\beta$ works with no prior knowledge of the underlying sparsity level $k$, yet achieves better recovery performance than ADORE. Furthermore, we compared EMTP with the state-of-the-art greedy algorithm SP in terms of phase transitions; among all the methods, EMTP-$\beta$ appeared to be the best.

5. Conclusions and Future Work

In this paper, we propose ECME thresholding pursuits (EMTP) for sparse signal recovery. EMTP detects a support set $I$ using the ECME iteration, estimates the reconstructed signal by solving a truncated least-squares problem on the support set $I$, and iterates these two steps for a small number of times. We present two effective support detection strategies (hard thresholding and dynamic thresholding) for sparse signals with a fast-decaying distribution of nonzero components. The experimental studies demonstrate that EMTP offers an attractive alternative to state-of-the-art algorithms for sparse signal recovery and can significantly reduce the number of iterations required. We now turn to the problem of reducing the computational cost of each iteration, especially for large-scale applications. Future research includes three directions.
(1) EMTP requires the precomputation and storage of the pseudoinverse matrix $A^\dagger$. An advisable choice is therefore to replace the ECME iteration by the IHT step in the support detection stage; IHT requires neither the computation nor the storage of $A^\dagger$, which is a significant advantage for large-scale applications (see the sketch after this list).
(2) In the signal estimation stage, that is, when updating the reconstructed signal by solving a least-squares problem, EMTP uses the orthogonal projection $x_I = A_I^\dagger y$, where $A_I^\dagger$ is the pseudoinverse of $A_I$. We will approximate the orthogonal projection efficiently by gradient pursuits [32].
(3) This paper is devoted to devising computational algorithms that are experimentally reproducible and to providing empirical studies as useful guidelines for practical applications. Future investigations will test other OST methods as the reference and more sophisticated thresholding methods for support detection.
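As a sketch of direction (1), the support detection reference in line 7 of Algorithm 1 would become a plain IHT step, reusing the hard_threshold sketch from Section 1.1 and avoiding $A^\dagger$ entirely:

x = hard_threshold(x + A' * r, k);  % IHT step: no pseudoinverse required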

Acknowledgment

This work was supported by the National Natural Science Foundation of China under Grants no. 61074167 and 61170126 and by the Scientific Research Foundation for Advanced Talents of Jiangsu University under Grant no. 12JDG050.