Abstract

Compressive sensing microarrays (CSM) are DNA-based sensors that operate using group testing and compressive sensing principles. Mathematically, one can cast the CSM as sparse nonnegative recovery (SNR), which is to find the sparsest solution subject to an underdetermined system of linear equations and nonnegative restrictions. In this paper, we discuss the linear program relaxation of the SNR. By defining nonnegative restricted isometry/orthogonality constants, we give a nonnegative restricted property condition which guarantees that the SNR and its relaxation share a common unique solution. In addition, we show that any solution to the SNR must be one of the extreme points of the underlying feasible set.

1. Introduction

Nowadays, with the rapid development of molecular biology techniques, scientists use compressive sensing microarrays to collect the gene expression changes of patients suffering from specific diseases and test many different drugs on cells to look for medicines able to correct the abnormal gene expression [1, 2]. A DNA microarray is a collection of microscopic DNA spots attached to a solid surface. Each DNA spot contains a string of specific DNA sequences, known as probes. A probe can be a short section of a gene or another DNA element that is used to hybridize an organism's genetic sample under high-stringency conditions. Probe-target hybridization is usually detected and quantified via chemiluminescence-labeled targets in order to infer the genetic makeup of the test sample.

Although the number of DNA sequences is extremely large, not all agents are expected to be present in significant concentration at a given time and location. In traditional microarrays, this results in many inactive probes during sensing. On the other hand, we are often interested in only a small number of certain harmful biological agents. Therefore, it is important not only to detect the presence of agents in a sample but also to estimate the concentrations at which they are present.

Assume that there are $m$ spots and $n$ labeled targets, and we have far fewer spots than target agents, that is, $m \ll n$. Mathematically, one can represent the DNA concentration of each organism as an element in a vector $x \in \mathbb{R}^n$ and the measurements as $b \in \mathbb{R}^m$. For $i = 1, \dots, m$ and $j = 1, \dots, n$, the probe at spot $i$ hybridizes to target $j$ with probability $A_{ij}$. The target $j$ occurs in the tested DNA sample with concentration $x_j$, which is clearly nonnegative. Denoting by $A = (A_{ij}) \in \mathbb{R}^{m \times n}$, the process of DNA microarrays leads to the sparse nonnegative recovery (SNR), which is to find the sparsest solutions subject to an underdetermined system of linear equations and nonnegative constraints, with the mathematical model as follows:
$$(P_0^+)\qquad \min\ \|x\|_0 \quad \text{s.t.} \quad Ax = b,\ x \ge 0,$$
where the variable vector $x \in \mathbb{R}^n$, $\|x\|_0$ denotes the number of the nonzero entries of $x$, $A$ is the measurement matrix with full row rank, $m < n$, and $b \in \mathbb{R}^m$.
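To make the measurement model concrete, the following minimal simulation sketch (ours, not from the original paper) plants a $k$-sparse nonnegative concentration vector and reads off $b = Ax$; the random matrix stands in for hybridization probabilities that a real CSM design would obtain from the probe chemistry.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n, k = 20, 100, 3        # far fewer spots (m) than target agents (n)

# Hypothetical hybridization probabilities A_ij in [0, 1]; a real CSM
# design would come from the probe chemistry, not random sampling.
A = rng.uniform(size=(m, n))

# A k-sparse nonnegative concentration vector: only k agents are present.
x_true = np.zeros(n)
support = rng.choice(n, size=k, replace=False)
x_true[support] = rng.uniform(1.0, 2.0, size=k)

b = A @ x_true              # the measurements read off the m spots
```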

SNR can be regarded as a special case of sparse recovery, which is related to the program
$$(P_0)\qquad \min\ \|x\|_0 \quad \text{s.t.} \quad Ax = b.$$
This program has sparked significant interest and rapid development in recent years [3–5] owing to its wide applications. However, with the nonnegativity prior information about the object to be recovered in various applications such as CSM, solutions of $(P_0^+)$ tend to be closer to the actual situation and lead to substantial improvements in image reconstruction. Moreover, with the nonnegative constraints, the feasible set becomes a polyhedral set instead of an affine subspace. This brings essential hardness in projecting onto the feasible set. Thus, $(P_0^+)$ is likely more difficult to solve. Therefore, SNR deserves specific study.

Problem $(P_0^+)$ has been shown to be NP-hard [6, 7] in general from the perspective of computational complexity. One popular approach is to reconstruct the vector via the $\ell_1$ relaxation, which refers to
$$(P_1^+)\qquad \min\ \|x\|_1 = e^T x \quad \text{s.t.} \quad Ax = b,\ x \ge 0.$$
Since $(P_1^+)$ is a standard linear program, it is easy to solve. An important issue is how to guarantee the equivalence of $(P_0^+)$ and $(P_1^+)$ in the sense that they have the same unique $k$-sparse solution under some conditions. Here, we call a vector $x$ $k$-sparse if the number of its nonzero entries is no more than $k$. There has been increasing interest and activity in this area; see, for example, [8–14]. Donoho and Tanner [9] first proposed that $(P_0^+)$ and $(P_1^+)$ share a common $k$-sparse unique solution if the polytope $AT^{n-1}$ is outwardly $k$-neighborly, where $T^{n-1}$ is the standard simplex in $\mathbb{R}^n$. Zhang [13] proved that $(P_0^+)$ and $(P_1^+)$ share a common $k$-sparse unique solution if the null space of $A$ is strictly half $k$-balanced. Juditsky et al. [11] developed several different necessary and sufficient conditions for the $\ell_0$-$\ell_1$ equivalence in the case of general sign restrictions, including the nonnegative constraints as a special case. When the feasible set of $(P_0^+)$ is a singleton, the unknown vector can be recovered by optimizing any objective function over this constraint set, and $(P_0^+)$ and $(P_1^+)$ definitely get the same unique solution. In this case, Bruckstein et al. [8] obtained the uniqueness of the feasible solution under a sufficient condition that $A$ has a row span intersecting the positive orthant. Furthermore, Wang et al. [14] proved that the above sufficient condition is also necessary for the uniqueness of the feasible solution. Donoho and Tanner [10] proved that the underlying feasible set is a singleton if and only if the polytopes $AT^{n-1}$ and $T^{n-1}$ have the same number of $k$-faces. Khajehnejad et al. [12] gave another equivalent condition for the uniqueness property by characterizing the support size of vectors in the null space of $A$.
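Since $\|x\|_1 = e^T x$ on the nonnegative orthant, $(P_1^+)$ is an ordinary linear program. The sketch below (our illustration; the paper does not prescribe a solver) solves it with `scipy.optimize.linprog`.

```python
import numpy as np
from scipy.optimize import linprog

def solve_p1_plus(A: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Solve (P1+): min e^T x  s.t.  Ax = b, x >= 0, as a linear program."""
    n = A.shape[1]
    res = linprog(c=np.ones(n), A_eq=A, b_eq=b,
                  bounds=[(0, None)] * n, method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    return res.x
```

On instances such as the simulation above, `solve_p1_plus(A, b)` returns a minimizer of $(P_1^+)$; Section 4 gives conditions under which that minimizer coincides with the sparsest nonnegative solution.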

For the $\ell_1$ relaxation of sparse recovery, one of the most significant conditions is the restricted isometry property (RIP), named by Candès and Tao [15] following the groundbreaking work of Donoho et al. [16, 17]. However, to the best of our knowledge, the nonnegative case of the RIP has not been investigated. This paper deals with this issue. We begin by investigating the solution property of the SNR and show that any solution to the SNR must be an extreme point of its feasible set in Section 2. We prove in Section 3 the nonemptiness and boundedness of the solution set of $(P_1^+)$ and show that any solution of $(P_1^+)$ can be stated as a convex combination of its optimal extreme points. In Section 4, by defining the nonnegative restricted isometry/orthogonality constants, we derive a sufficient condition for exact recovery of the sparsest nonnegative image/signal via the linear program relaxation.

Now we give some notations used in the text. We use $\mathrm{Sol}(\cdot)$ and $v(\cdot)$ to denote the solution set and the optimal value of a problem, respectively. $e_i$ denotes the vector with only the $i$th entry 1 and the rest all 0; $e$ is the vector with each entry equal to 1, so that $\|x\|_1 = e^T x$ for $x \ge 0$. The vectors $a_j$ for $j = 1, \dots, n$ denote the column vectors of the matrix $A$, and $A = (a_1, \dots, a_n)$. For any $x \in \mathbb{R}^n$, $x_i$ is the $i$th component of $x$ and $\mathrm{supp}(x)$ is the support set of $x$; that is, $\mathrm{supp}(x) = \{i : x_i \neq 0\}$. For any subset $S \subseteq \{1, \dots, n\}$, $S^c$ denotes the complement set of $S$ in $\{1, \dots, n\}$.

2. Solution Property

Throughout the paper we assume that the feasible set
$$F = \{x \in \mathbb{R}^n : Ax = b,\ x \ge 0\}$$
is nonempty. Apparently, $F$ is a polyhedral set in $\mathbb{R}^n$. According to the representation theorem, any $x \in F$ could be represented as follows:
$$x = \sum_{i=1}^{s} \lambda_i v^i + \sum_{j=1}^{t} \mu_j d^j, \tag{9}$$
where $\lambda_i \ge 0$, $i = 1, \dots, s$, $\sum_{i=1}^{s} \lambda_i = 1$, and $\{v^1, \dots, v^s\}$ is the extreme point set of $F$; $\mu_j \ge 0$, $j = 1, \dots, t$, and $\{d^1, \dots, d^t\}$ is the extreme direction set of $F$. Apparently, $v^i \ge 0$, $Av^i = b$; $d^j \ge 0$, $Ad^j = 0$, $d^j \neq 0$.

Define subsets of $\mathbb{R}^n$ as follows:
$$X_k = \bigcup_{|S| = k} \{x \in \mathcal{L}(S) : \|x\|_0 = k\}, \qquad k = 0, 1, \dots, n,$$
where $\mathcal{L}(S)$ denotes the subspace spanned by the vectors $e_i$, $i \in S$. Clearly, $\{X_0, X_1, \dots, X_n\}$ forms a partition of $\mathbb{R}^n$; that is, $\bigcup_{k=0}^{n} X_k = \mathbb{R}^n$ and $X_k \cap X_l = \emptyset$, $k \neq l$. Moreover, $\|x\|_0 = k$ for any $x \in X_k$. Along with the nonemptiness of $F$, it is easy to see that $F$ must intersect one of these sets, hence $v(P_0^+) = \min\{k : F \cap X_k \neq \emptyset\}$ is attained and $\mathrm{Sol}(P_0^+) = F \cap X_{v(P_0^+)} \neq \emptyset$. Furthermore, we have the following result for the optimal value of $(P_0^+)$.

Lemma 1. Assume that $F \neq \emptyset$; one must have $v(P_0^+) \le m$.

Proof. Suppose that the conclusion is not true; that is, there is $x^* \in \mathrm{Sol}(P_0^+)$ with $\|x^*\|_0 = k > m$. Without loss of generality, let $\mathrm{supp}(x^*) = \{1, \dots, k\}$, $x^*_j > 0$, $j = 1, \dots, k$. We get
$$b = \sum_{j=1}^{k} x^*_j a_j.$$
Meanwhile, $a_1, \dots, a_k \in \mathbb{R}^m$ with $k > m$. Thus, $a_1, \dots, a_k$ must be linearly dependent; that is, there exist $c_1, \dots, c_k$, not all zero, such that
$$\sum_{j=1}^{k} c_j a_j = 0.$$
Assume that $\max_{1 \le j \le k} c_j > 0$ (otherwise, replace $c$ by $-c$). By denoting $\alpha = \min\{x^*_j / c_j : c_j > 0\}$, we get $\alpha > 0$. Taking $\hat{x} = x^* - \alpha (c_1, \dots, c_k, 0, \dots, 0)^T$, it holds that
$$\hat{x} \ge 0, \qquad A\hat{x} = b, \qquad \|\hat{x}\|_0 \le k - 1.$$
This is a contradiction with $x^* \in \mathrm{Sol}(P_0^+)$. We complete the proof.
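The reduction in this proof is constructive: whenever a feasible point uses more than $m$ columns, a null-space vector of the active columns can be cancelled against it until at most $m$ nonzeros remain. The following sketch (ours, not from the paper; the helper name and tolerance handling are illustrative choices) iterates that step.

```python
import numpy as np

def reduce_support(A: np.ndarray, x: np.ndarray, tol: float = 1e-12) -> np.ndarray:
    """Given x >= 0 with Ax = b for some b, return a feasible point with
    at most m nonzeros, following the linear-dependence step of Lemma 1."""
    x = x.copy()
    m = A.shape[0]
    while True:
        S = np.flatnonzero(x > tol)
        if S.size <= m:
            return x
        # More than m active columns are linearly dependent in R^m:
        # take c with A[:, S] @ c = 0 from the smallest singular vector.
        _, _, Vt = np.linalg.svd(A[:, S])
        c = Vt[-1]
        if c.max() <= 0:
            c = -c                          # ensure some positive entry
        mask = c > tol
        alpha = (x[S][mask] / c[mask]).min()  # first coordinate to hit zero
        x[S] -= alpha * c                   # feasibility kept: A_S c = 0
        x[x < tol] = 0.0                    # clean up the zeroed entries
```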

To characterize the solution set $\mathrm{Sol}(P_0^+)$ further, we need the next lemma. In particular, this brand-new result will play a key role in establishing the sufficient condition for the uniqueness of the solution in Section 4.

Lemma 2. Any two distinct solutions of $(P_0^+)$ must have different support sets.

Proof. Assume for contradiction that $x^1$ and $x^2$ are two different solutions of $(P_0^+)$ with $\mathrm{supp}(x^1) = \mathrm{supp}(x^2) = S$. If $i \notin S$, we have $x^1_i = x^2_i = 0$, and $x^1_i > 0$, $x^2_i > 0$, for all $i \in S$. Set $d = x^1 - x^2$. Since $x^1 \neq x^2$, it must hold $d \neq 0$, $Ad = 0$, and $\mathrm{supp}(d) \subseteq S$. When $\{i : d_i > 0\} \neq \emptyset$, take
$$\hat{t} = \min\{x^1_i / d_i : d_i > 0\}, \qquad \hat{x} = x^1 - \hat{t}\, d.$$
It is easy to see that
$$\hat{x} \ge 0, \qquad A\hat{x} = b, \qquad \mathrm{supp}(\hat{x}) \subsetneq S.$$
Thus, we have $\hat{x} \in F$, and $\|\hat{x}\|_0 < \|x^1\|_0$. This is a contradiction with the optimality of $x^1$. When $\{i : d_i > 0\} = \emptyset$, just taking $-d$ instead, we get the contradiction in a similar way. The proof is completed.

Now we are in a position to give the main theorem in this section.

Theorem 3. Any solution of $(P_0^+)$ must be one of the extreme points of $F$.

Proof. Given any solution $x^* \in \mathrm{Sol}(P_0^+)$ with representation (9), where $\lambda_i \ge 0$, $\sum_{i=1}^{s}\lambda_i = 1$, for all $i$; $\mu_j \ge 0$, for all $j$. We only need to prove that in (9), all $\mu_j$ vanish and exactly one $\lambda_i$ is positive, so that $x^* = v^{i_0}$ for some $i_0$. To this end, we have the following three steps.
Firstly, we claim that in (9),
$$\mathrm{supp}(v^i) \subseteq \mathrm{supp}(x^*) \ \text{for any } i \text{ with } \lambda_i > 0, \qquad \mathrm{supp}(d^j) \subseteq \mathrm{supp}(x^*) \ \text{for any } j \text{ with } \mu_j > 0, \tag{10}$$
$$\|v^i\|_0 = \|x^*\|_0 \ \text{for any } i \text{ with } \lambda_i > 0. \tag{11}$$
According to the fact that $\lambda_i \ge 0$, $v^i \ge 0$, and $\mu_j \ge 0$, $d^j \ge 0$, one has $x^* \ge \lambda_i v^i$ for any $i$, which means that $\mathrm{supp}(v^i) \subseteq \mathrm{supp}(x^*)$ whenever $\lambda_i > 0$. This implies $\|v^i\|_0 \le \|x^*\|_0$. Similarly, we get $\mathrm{supp}(d^j) \subseteq \mathrm{supp}(x^*)$ whenever $\mu_j > 0$, and (10) holds. On the contrary, from the optimality of $x^*$ and the feasibility of $v^i \in F$, we know that $\|v^i\|_0 \ge \|x^*\|_0$, and (11) follows.
Secondly, we will show that $\mu_j = 0$, $j = 1, \dots, t$. If this is not true, there is an index $j_0$ such that $\mu_{j_0} > 0$; then $d^{j_0}$ has at least one positive component. Denote $S = \mathrm{supp}(d^{j_0})$, so $S \neq \emptyset$ and $Ad^{j_0} = 0$. Noting that $\mu_{j_0} > 0$, (10) implies the fact that $S \subseteq \mathrm{supp}(x^*)$. Take $\alpha = \min_{i \in S} x^*_i / d^{j_0}_i > 0$, and
$$\hat{x} = x^* - \alpha\, d^{j_0}.$$
Without loss of generality, set $i_0 \in S$ as an index attaining this minimum. It is easy to verify that
$$\hat{x} \ge 0, \qquad A\hat{x} = b, \qquad \hat{x}_{i_0} = 0,$$
hence
$$\hat{x} \in F, \qquad \|\hat{x}\|_0 \le \|x^*\|_0 - 1,$$
which is a contradiction with the optimality of $x^*$.
Thirdly, we will prove that exactly one of $\lambda_1, \dots, \lambda_s$ is positive. Suppose that there exist $i_1 \neq i_2$ with $\lambda_{i_1} > 0$, $\lambda_{i_2} > 0$ and two different extreme points of $F$, say $v^{i_1} \neq v^{i_2}$, appearing in (9). Based on (10) and (11), we have $v^{i_1}, v^{i_2} \in \mathrm{Sol}(P_0^+)$, hence $\mathrm{supp}(v^{i_1}) = \mathrm{supp}(x^*)$, $\mathrm{supp}(v^{i_2}) = \mathrm{supp}(x^*)$. Nevertheless, by Lemma 2, this is impossible. Hence, we show that
$$x^* = v^{i_0} \ \text{for some } i_0 \in \{1, \dots, s\}.$$
We complete the argument.

Theorem 3 tells us that each solution of $(P_0^+)$ lies in the extreme point set of $F$. Here is a concrete example.

Example 4. Let $A = (1, 2)$ and $b = 2$, so that $F = \{x \in \mathbb{R}^2 : x_1 + 2x_2 = 2,\ x \ge 0\}$. Obviously, the solution set and optimal value of $(P_0^+)$ are
$$\mathrm{Sol}(P_0^+) = \{(2, 0)^T, (0, 1)^T\}, \qquad v(P_0^+) = 1,$$
respectively, and $\mathrm{Sol}(P_0^+)$ consists exactly of the two extreme points of $F$. While the solution set and optimal value of $(P_1^+)$ are
$$\mathrm{Sol}(P_1^+) = \{(0, 1)^T\}, \qquad v(P_1^+) = 1,$$
respectively. In this case, $(0, 1)^T$ is the only extreme point of $F$ that solves $(P_1^+)$.
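For toy sizes, Theorem 3 and this example can be checked by brute force: enumerate supports and solve a nonnegative least squares problem on each. The sketch below (ours; exponential in $n$, so for small instances only) recovers $\mathrm{Sol}(P_0^+)$; on this example it returns exactly $(2,0)^T$ and $(0,1)^T$.

```python
from itertools import combinations

import numpy as np
from scipy.optimize import nnls

def sparsest_nonnegative(A: np.ndarray, b: np.ndarray, tol: float = 1e-9):
    """Brute-force (P0+): return all sparsest x >= 0 with Ax = b."""
    n = A.shape[1]
    for k in range(n + 1):
        hits = []
        for S in combinations(range(n), k):
            x = np.zeros(n)
            if k > 0:
                coef, resid = nnls(A[:, list(S)], b)
                if resid > tol:
                    continue
                x[list(S)] = coef
            elif np.linalg.norm(b) > tol:
                continue
            if np.count_nonzero(x > tol) == k:   # support is exactly S
                hits.append(x)
        if hits:
            return hits                           # optimal value is k
    return []

print(sparsest_nonnegative(np.array([[1.0, 2.0]]), np.array([2.0])))
```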

At the end of this section, we consider the $\ell_p$ $(0 < p < 1)$ relaxation of $(P_0^+)$:
$$(P_p^+)\qquad \min\ \|x\|_p^p = \sum_{i=1}^{n} x_i^p \quad \text{s.t.} \quad Ax = b,\ x \ge 0.$$
Clearly, $(P_p^+)$ is a concave relaxation of $(P_0^+)$. For the program $(P_p^+)$, Ge et al. [7] derived the following useful result.

Lemma 5. The set of all extreme points of $F$ is exactly the set of all local minimizers of $(P_p^+)$.

This lemma implies that any global solution of the relaxation $(P_p^+)$ must be one of the extreme points of $F$. From Theorem 3 and Lemma 5, we immediately draw a new proposition.

Proposition 6. For any $p \in (0, 1)$, there exists an extreme point of $F$ that is both an exact solution of $(P_0^+)$ and a local minimizer of $(P_p^+)$.

This is different from the result of Fung and Mangasarian in [18], where they showed that for all sufficiently small $p > 0$, there exists an extreme point of the polyhedral set $\{(x, y) : Ax = b,\ -y \le x \le y\}$, obtained by lifting the set $\{x : Ax = b\}$, whose $x$-part is an exact solution of the $\ell_0$ problem and a global solution of the $\ell_p$ relaxation.

3. Linear Program Relaxation

Consider the linear program relaxation $(P_1^+)$. Since the linear objective function $e^T x$ is bounded below (by zero) over the feasible set $F$, based on the Frank-Wolfe theorem, the minimum of $(P_1^+)$ is attainable. Among all the extreme points $v^1, \dots, v^s$ of $F$, we call $v^i$ an optimal extreme point of $(P_1^+)$ if it also meets $e^T v^i = v(P_1^+)$.

Proposition 7. Any $x \in \mathrm{Sol}(P_1^+)$ could be stated as a convex combination of optimal extreme points of $(P_1^+)$. Hence, $\mathrm{Sol}(P_1^+)$ is bounded.

Proof. Given any $x \in \mathrm{Sol}(P_1^+)$ with representation (9). If there is $i_0$ such that $v^{i_0}$ is not an optimal extreme point of $(P_1^+)$ and $\lambda_{i_0} > 0$, we have $e^T v^{i_0} > v(P_1^+)$, hence
$$v(P_1^+) = e^T x = \sum_{i=1}^{s} \lambda_i\, e^T v^i + \sum_{j=1}^{t} \mu_j\, e^T d^j \ge \sum_{i=1}^{s} \lambda_i\, e^T v^i > v(P_1^+),$$
which is a contradiction.
Similarly, if there is $j_0$ such that $\mu_{j_0} > 0$, we have $e^T d^{j_0} > 0$ since $d^{j_0} \ge 0$ and $d^{j_0} \neq 0$, hence
$$v(P_1^+) = e^T x \ge \sum_{i=1}^{s} \lambda_i\, e^T v^i + \mu_{j_0}\, e^T d^{j_0} > v(P_1^+),$$
which is a contradiction. Hence $x$ is a convex combination of optimal extreme points; since there are finitely many extreme points, $\mathrm{Sol}(P_1^+)$ is bounded. This completes the proof.

From the above proposition, we know that the linear program $(P_1^+)$ has at least one optimal extreme point. Thus, we could use the simplex method or an interior point method to solve $(P_1^+)$.
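Since the extreme points of $F$ are exactly its basic feasible solutions, one can verify numerically that a solver's output is an (optimal) extreme point by checking that the columns of $A$ on its support are linearly independent. A minimal sketch (ours; `is_extreme_point` is an illustrative helper, and simplex-type solvers such as HiGHS typically return solutions passing this check):

```python
import numpy as np

def is_extreme_point(A: np.ndarray, x: np.ndarray, tol: float = 1e-9) -> bool:
    """Check that a feasible x >= 0 is an extreme point of
    F = {x : Ax = b, x >= 0}: the columns of A indexed by supp(x)
    must be linearly independent."""
    S = np.flatnonzero(x > tol)
    if S.size == 0:
        return True                 # x = 0 is extreme whenever it is feasible
    return np.linalg.matrix_rank(A[:, S]) == S.size
```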

4. Nonnegative Restricted Property

In the framework of $\ell_1$ relaxation, a significant problem is how to guarantee the exact recovery of a sparse image/signal via the relaxation. One of the most important qualifications is the restricted isometry property; see [15]. Recall that the $k$-restricted isometry constant (RIC) $\delta_k$ is the smallest scalar satisfying
$$(1 - \delta_k)\|x\|_2^2 \le \|Ax\|_2^2 \le (1 + \delta_k)\|x\|_2^2$$
for all $k$-sparse $x \in \mathbb{R}^n$. Similarly, the $(k, k')$-restricted orthogonality constant (ROC) $\theta_{k,k'}$ for $k + k' \le n$ is defined as the smallest scalar satisfying
$$|\langle Ax, Ax' \rangle| \le \theta_{k,k'}\, \|x\|_2 \|x'\|_2$$
for all $k$-sparse $x$ and $k'$-sparse $x'$, where $x$ and $x'$ have disjoint support sets. The RIC and ROC measure how close each submatrix of $A$ with certain cardinality is to behaving like an orthonormal system. Under some restricted isometry property, one can get the sparse recovery via its $\ell_1$ relaxation. Nevertheless, for the nonnegative case, the sparse recovery may admit new characterizations. To this end, we define the NRIC and NROC.

Definition 8. Let $k, k' \le n$ be positive integers. We define the nonnegative $k$-restricted isometry constant (NRIC) $\delta_k^+$ as the smallest number satisfying
$$(1 - \delta_k^+)\|x\|_2^2 \le \|Ax\|_2^2 \le (1 + \delta_k^+)\|x\|_2^2$$
for all $k$-sparse $x \ge 0$. Similarly, we define the nonnegative $(k, k')$-restricted orthogonality constant (NROC) $\theta_{k,k'}^+$ for $k + k' \le n$ as the smallest number satisfying
$$|\langle Ax, Ax' \rangle| \le \theta_{k,k'}^+\, \|x\|_2 \|x'\|_2$$
for all $k$-sparse $x \ge 0$ and $k'$-sparse $x' \ge 0$ with $\mathrm{supp}(x)$ and $\mathrm{supp}(x')$ being disjoint sets.

Clearly,
$$\delta_k^+ \le \delta_k, \qquad \theta_{k,k'}^+ \le \theta_{k,k'}.$$
Moreover, the numbers $\delta_k^+$ and $\theta_{k,k'}^+$ are nondecreasing in $k$ and $k'$.
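Neither constant admits a closed form, but for small matrices one can probe $\delta_k^+$ numerically. The sketch below (ours) draws random $k$-sparse nonnegative unit vectors and reports the worst observed deviation of $\|Ax\|_2^2$ from 1; this is only a lower bound on $\delta_k^+$ (a certified value would require global optimization over every support), and an analogous sampler over disjoint support pairs would lower-bound $\theta_{k,k'}^+$.

```python
import numpy as np

def nric_lower_bound(A: np.ndarray, k: int, trials: int = 20000,
                     seed: int = 0) -> float:
    """Monte Carlo lower bound on the NRIC delta_k^+: the largest observed
    |  ||Ax||_2^2 - 1  | over random k-sparse nonnegative unit vectors x."""
    rng = np.random.default_rng(seed)
    n = A.shape[1]
    worst = 0.0
    for _ in range(trials):
        x = np.zeros(n)
        S = rng.choice(n, size=k, replace=False)
        x[S] = rng.uniform(size=k)          # nonnegative entries on S
        x /= np.linalg.norm(x)              # normalize so ||x||_2 = 1
        y = A @ x
        worst = max(worst, abs(y @ y - 1.0))
    return worst
```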

By employing the projections of vectors in the null space of $A$ onto the nonnegative orthant $\mathbb{R}^n_+$, we now provide a sufficient condition to determine a solution of $(P_0^+)$.

Theorem 9. Suppose that $x^*$ is such that $Ax^* = b$, $x^* \ge 0$ and $\|x^*\|_0 \le k$ with $\delta_k^+ + \theta_{k,k}^+ < 1$. Then, $x^*$ is a solution of $(P_0^+)$.

Proof. We complete the proof by contradiction. If this is not true, there exists $\hat{x}$ such that $A\hat{x} = b$, $\hat{x} \ge 0$, $\|\hat{x}\|_0 < \|x^*\|_0$. Set $\eta = \hat{x} - x^*$. Clearly, $\eta \neq 0$ and $A\eta = 0$. Take $\eta = \eta^+ - \eta^-$ with $\eta^+$ and $\eta^-$ being the projections of $\eta$ and $-\eta$ onto $\mathbb{R}^n_+$, respectively. We have $\eta^+ \ge 0$, $\eta^- \ge 0$, $\langle \eta^+, \eta^- \rangle = 0$. In particular, $\mathrm{supp}(\eta^-) \subseteq \mathrm{supp}(x^*)$ and $\mathrm{supp}(\eta^+) \subseteq \mathrm{supp}(\hat{x})$, so that
$$\|\eta^-\|_0 \le \|x^*\|_0 \le k, \tag{24}$$
$$\|\eta^+\|_0 \le \|\hat{x}\|_0 \le k - 1. \tag{25}$$
Therefore, $A\eta^+ = A\eta^-$. Thus, we get
$$(1 - \delta_k^+)\max\{\|\eta^+\|_2^2, \|\eta^-\|_2^2\} \le \langle A\eta^+, A\eta^- \rangle \le \theta_{k-1,k}^+ \|\eta^+\|_2 \|\eta^-\|_2 \le \theta_{k,k}^+ \max\{\|\eta^+\|_2^2, \|\eta^-\|_2^2\},$$
in which the first inequality is due to (24), (25), and the fact that $\langle A\eta^+, A\eta^- \rangle = \|A\eta^+\|_2^2 = \|A\eta^-\|_2^2$, and the last inequality is because of the monotonicity of $\theta_{k,k'}^+$ in $(k, k')$. Since $\eta \neq 0$, we have $\max\{\|\eta^+\|_2^2, \|\eta^-\|_2^2\} > 0$, hence $1 - \delta_k^+ \le \theta_{k,k}^+$; that is, $\delta_k^+ + \theta_{k,k}^+ \ge 1$. This is a contradiction. Therefore, $x^* \in \mathrm{Sol}(P_0^+)$.

With the special result that any two solutions of $(P_0^+)$ have different support sets, we next derive a sufficient condition for the uniqueness of the solution to $(P_0^+)$.

Theorem 10. Suppose that $x^*$ is such that $Ax^* = b$, $x^* \ge 0$ and $\|x^*\|_0 \le k$ with $\delta_k^+ + \theta_{k,k}^+ < 1$. Then, $x^*$ is the unique solution of $(P_0^+)$.

Proof. Since the assumption coincides with that of Theorem 9, we know that $x^* \in \mathrm{Sol}(P_0^+)$. Now we just need to verify that $x^*$ is the unique solution of $(P_0^+)$. Assume that this is not true; that is, there is another solution $\tilde{x} \in \mathrm{Sol}(P_0^+)$, $\tilde{x} \neq x^*$. According to Lemma 2, it must hold $\mathrm{supp}(\tilde{x}) \neq \mathrm{supp}(x^*)$; in particular, $\|\tilde{x}\|_0 = \|x^*\|_0 \le k$. Take $\eta = \tilde{x} - x^*$. By the argument similar to that in the proof of Theorem 9 (with (25) replaced by $\|\eta^+\|_0 \le \|\tilde{x}\|_0 \le k$), we get $1 - \delta_k^+ \le \theta_{k,k}^+$ and the contradiction. We conclude the proof.

Now we are ready to give the main result of this paper, which is called the nonnegative restricted property.

Theorem 11. Assume that $x^*$ is such that $Ax^* = b$, $x^* \ge 0$ and $\|x^*\|_0 \le k$ with
$$\delta_k^+ + 2\theta_{k,k}^+ < 1. \tag{31}$$
Then $x^*$ is exactly the common unique minimizer of $(P_0^+)$ and $(P_1^+)$.

Proof. Since (31) implies $\delta_k^+ + \theta_{k,k}^+ < 1$, $x^*$ is the unique solution of $(P_0^+)$ by Theorem 10.
Suppose that $\hat{x}$ is a solution of $(P_1^+)$; recall from Section 3 that $(P_1^+)$ is solvable. Take $\eta = \hat{x} - x^*$. To get $\mathrm{Sol}(P_1^+) = \{x^*\}$, it suffices to verify that $\eta = 0$. The proof includes three steps; the first two steps are parallel to those in [19], and in the third step, we utilize the technique of projecting $\eta$ onto $\mathbb{R}^n_+$; for details, see (42) and the argument around it.
Firstly, we introduce a partition of $\mathrm{supp}(\eta^+)$, where $\eta^+$ and $\eta^-$ are the projections of $\eta$ and $-\eta$ onto $\mathbb{R}^n_+$, respectively. Let $T_0$ be the support set of $x^*$, $T_1$ the index set including the first $k$ large components of $\eta^+$, $T_2$ the index set including the next $k$ large components of $\eta^+$, and so on. Thus,
$$\mathrm{supp}(\eta^+) = T_1 \cup T_2 \cup \cdots, \qquad |T_j| \le k, \qquad T_j \cap T_l = \emptyset \ (j \neq l). \tag{32}$$
Moreover, for any $i \notin T_0$, we have $x^*_i = 0$ and $\hat{x}_i \ge 0$, which is exactly that $\eta_i \ge 0$ for any $i \notin T_0$. Therefore, $\mathrm{supp}(\eta^-) \subseteq T_0$, and
$$\|\eta^-\|_0 \le \|x^*\|_0 \le k. \tag{33}$$
By the construction of the blocks, for any $j \ge 2$,
$$\eta^+_i \le \eta^+_l \quad \text{for all } i \in T_j,\ l \in T_{j-1}, \tag{34}$$
and the vectors $\eta^-, \eta^+_{T_1}, \eta^+_{T_2}, \dots$ are nonnegative, $k$-sparse, and have pairwise disjoint supports, with
$$\eta = \sum_{j \ge 1} \eta^+_{T_j} - \eta^-. \tag{35}$$
Next, we show that $\sum_{j \ge 2} \|\eta^+_{T_j}\|_2$ is bounded by $\|\eta^-\|_2$. Note that for each $j \ge 2$,
$$\|\eta^+_{T_j}\|_2 \le \sqrt{k}\, \max_{i \in T_j} \eta^+_i \le \frac{1}{\sqrt{k}}\, \|\eta^+_{T_{j-1}}\|_1, \tag{36}$$
where the second inequality is because of the monotonicity (34) of the entries of $\eta^+$ across the blocks, and hence
$$\sum_{j \ge 2} \|\eta^+_{T_j}\|_2 \le \frac{1}{\sqrt{k}} \sum_{j \ge 1} \|\eta^+_{T_j}\|_1. \tag{37}$$
This gives
$$\sum_{j \ge 2} \|\eta^+_{T_j}\|_2 \le \frac{1}{\sqrt{k}}\, \|\eta^+\|_1. \tag{38}$$
In fact, the optimality of $\hat{x}$ for $(P_1^+)$ and the feasibility of $x^*$ yield
$$e^T \eta^+ - e^T \eta^- = e^T \hat{x} - e^T x^* \le 0, \tag{39}$$
which implies
$$\|\eta^+\|_1 \le \|\eta^-\|_1 \le \sqrt{k}\, \|\eta^-\|_2. \tag{40}$$
By applying (38) and (40), we have
$$\sum_{j \ge 2} \|\eta^+_{T_j}\|_2 \le \|\eta^-\|_2. \tag{41}$$
Finally, we show that $\eta = 0$. By utilizing the projection $\eta = \eta^+ - \eta^-$ and $A\eta = 0$, we have
$$A\eta^+ = A\eta^-. \tag{42}$$
Moreover, it is easy to see
$$\|A\eta^-\|_2^2 = \langle A\eta^+, A\eta^- \rangle = \sum_{j \ge 1} \langle A\eta^+_{T_j}, A\eta^- \rangle. \tag{43}$$
From (43), we compute two-sided bounds on $\|A\eta^-\|_2^2$. On one hand, based on the definition of the NROC, (35), and (43),
$$\|A\eta^-\|_2^2 \le \theta_{k,k}^+ \|\eta^-\|_2 \Big( \|\eta^+_{T_1}\|_2 + \sum_{j \ge 2} \|\eta^+_{T_j}\|_2 \Big) \le \theta_{k,k}^+ \|\eta^-\|_2 \big( \|\eta^+_{T_1}\|_2 + \|\eta^-\|_2 \big), \tag{44}$$
where the last inequality is by the fact that (41) holds. On the other hand, together with the definition of the NRIC and (33), one has
$$\|A\eta^-\|_2^2 \ge (1 - \delta_k^+)\, \|\eta^-\|_2^2. \tag{45}$$
Therefore, we compute, by (42), (35), and the definitions of the NRIC and NROC,
$$(1 - \delta_k^+)\|\eta^+_{T_1}\|_2^2 \le \|A\eta^+_{T_1}\|_2^2 = \Big\langle A\eta^+_{T_1},\, A\eta^- - \sum_{j \ge 2} A\eta^+_{T_j} \Big\rangle \le \theta_{k,k}^+ \|\eta^+_{T_1}\|_2 \Big( \|\eta^-\|_2 + \sum_{j \ge 2} \|\eta^+_{T_j}\|_2 \Big) \le 2\theta_{k,k}^+ \|\eta^+_{T_1}\|_2\, \|\eta^-\|_2, \tag{46}$$
where the last inequality is from the fact that (41) holds. Hence, noting that (31) gives $\delta_k^+ < 1$,
$$\|\eta^+_{T_1}\|_2 \le \frac{2\theta_{k,k}^+}{1 - \delta_k^+}\, \|\eta^-\|_2. \tag{47}$$
Combining (44), (45), and (47) yields
$$(1 - \delta_k^+)\, \|\eta^-\|_2^2 \le \theta_{k,k}^+ \Big( 1 + \frac{2\theta_{k,k}^+}{1 - \delta_k^+} \Big) \|\eta^-\|_2^2. \tag{48}$$
Then the assumption (31) forces $\eta^- = 0$: otherwise, (48) would give $(1 - \delta_k^+)^2 \le \theta_{k,k}^+ (1 - \delta_k^+) + 2(\theta_{k,k}^+)^2 \le 2\theta_{k,k}^+ (1 - \delta_k^+) < (1 - \delta_k^+)^2$, which is impossible. Thus, $\eta^- = 0$. Therefore, we get $\|\eta^+\|_1 = 0$ by (40), hence $\eta = 0$. This is exactly what we want. We complete the proof.
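The equivalence asserted by Theorem 11 is easy to observe numerically. The sketch below (ours) plants a sparse nonnegative vector and checks exact recovery by $(P_1^+)$; it does not verify condition (31), which is hard to compute, so recovery on a random instance is merely typical rather than guaranteed, and the theory above does not require $A$ itself to be nonnegative, so a Gaussian ensemble is used purely for illustration.

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(1)
m, n, k = 40, 120, 4

A = rng.standard_normal((m, n)) / np.sqrt(m)   # RIP-style column scaling
x_true = np.zeros(n)
x_true[rng.choice(n, size=k, replace=False)] = rng.uniform(1.0, 2.0, size=k)
b = A @ x_true

# Solve (P1+): min e^T x  s.t.  Ax = b, x >= 0.
res = linprog(c=np.ones(n), A_eq=A, b_eq=b,
              bounds=[(0, None)] * n, method="highs")
print("exact recovery:", np.allclose(res.x, x_true, atol=1e-6))
```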

5. Conclusion

In this paper, we have derived a nonnegative restricted property condition, which ensures the exact recovery of a sparse nonnegative image/signal via the linear program relaxation. Since the NRIC and NROC are defined only over nonnegative vectors, there may be more types of measurement matrices satisfying the nonnegative restricted property than in the case of the RIP, whether random or deterministic matrices are considered. As a byproduct of the main result, we have investigated the solution property of the sparse nonnegative recovery and shown that any solution of $(P_0^+)$ must be one of the extreme points of its feasible set. However, it is not clear whether a given extreme point of the feasible set is a solution to $(P_0^+)$. This can serve as a target for future work.

Acknowledgments

The work was supported in part by the National Basic Research Program of China (2010CB732501) and the National Natural Science Foundation of China (11171018).