Abstract

Combining the Rosen gradient projection method with the two-term Polak-Ribière-Polyak (PRP) conjugate gradient method, we propose a two-term Polak-Ribière-Polyak (PRP) conjugate gradient projection method for solving linear equality constraints optimization problems. The proposed method possesses some attractive properties: (1) search direction generated by the proposed method is a feasible descent direction; consequently the generated iterates are feasible points; (2) the sequences of function are decreasing. Under some mild conditions, we show that it is globally convergent with Armijio-type line search. Preliminary numerical results show that the proposed method is promising.

1. Introduction

In this paper, we consider solving the following linear equality constraints optimization problem: where is a smooth function and is a matrix of rank . In this paper, the feasible region and the feasible direction set are defined, respectively, as follows:

Taking the negative gradient for a search direction () is a natural way of solving unconstrained optimization problems. However, this approach does not work for constrained problems, since the gradient may not be a feasible direction. A basic technique to overcome this difficulty was initiated by Rosen [1] in 1960. To obtain a feasible search direction, Rosen projected the gradient into the feasible region; that is, The convergence of Rosen’s gradient projection method can be proved by Du; see [24].

In fact, Rosen’s gradient projection method is an extension of the steepest-descent method. It is well known that the drawback of the steepest-descent method is easy to suffer from zig-zagging specially when the graph of has an “elongated” form. To overcome the zig-zagging, we want to use the conjugate gradient method to modify the projection direction.

It is well known that nonlinear conjugate gradient methods such as the Polak-Ribière-Polyak (PRP) method [5, 6] are very efficient for large-scale unconstrained optimization problems due to their simplicity and low storage. However, it does not necessarily satisfy the descent conditions , .

Recently, Cheng [7] proposed a two-term modified PRP method (called TMPRP), in which the direction is given by An attractive property of the TMPRP method is that the direction generated by the method satisfies which is independent of any line search. The presented numerical results show some potential advantage of the TMPRP method in Cheng [7]. In fact, we can easily rewrite the above direction (4) as a three-term form: In the past few years, researchers have paid increasing attention to the conjugate gradient methods and their applications. Among others, we mention here the following works, for example, [829].

In the past few years, some researchers also paid attention to equality constrained problems. Martínez et al. [30] proposed a spectral gradient method for linearly constrained optimization by the following way to obtain the search direction: is the unique solution of In this algorithm, can be computed by quasi-Newton method in which the approximate Hessians satisfy a weak secant equation. The spectral choice of steplength is embedded into the Hessian approximation and the whole process is combined with a nonmonotone line search strategy.

C. Li and D. H. Li [31] proposed a feasible Fletcher-Reeves conjugate gradient method for solving linear equality constrained optimization problem with exact line search. Their idea is to use original Fletcher-Reeves conjugate gradient method to modify the Zoutendijk direction. The Zoutendijk direction is the feasible steepest descent direction. It is a solution of the following problem: Li et al. [32] also extended the modified Fletcher-Reeves conjugate gradient method in Zhang et al. [33] to solve linear equality constraints optimization problems which combined with the Zoutendijk feasible direction method. Under some mild conditions, Li et al. [32] showed that the proposed method with Armijo-type line search is globally convergent.

In this paper, we will extend the two-term Polak-Ribière-Polyak (PRP) conjugate gradient method in Cheng [7] to solve linear equality constraints optimization problems (1), which combines with the Rosen gradient projection method in Rosen [1]. Under some mild conditions, we show that it is globally convergent with Armijo-type line search.

The rest of this paper is organized as follows. In Section 2, we firstly propose the algorithm and prove the the feasible descent direction. In Section 3, we prove the global convergence of the proposed method. In Section 4, we give some improvement for the algorithm. In Section 5, we report some numerical results to test the proposed method.

2. Algorithm and the Feasible Descent Direction

In this section, we propose a two-term Polak-Ribière-Polyak conjugate gradient projection method for solving the linear equality constraints optimization problem (1). The proposed method is a combination of the well-known Rose gradient projection method and the two-term Polak-Ribière-Polyak (PRP) conjugate gradient method in Cheng [7].

The iterative process of the proposed method is given by and the search direction is defined by where and is a steplength obtained by a line search.

For convenience, we call the method (9) and (10) as EMPRP method. Now we prove that the direction defined by (10) and (11) is a feasible descent direction of at .

Theorem 1. Suppose that , . is defined by (10). If , then is a feasible descent direction of at .

Proof. From (9), (10), and the definition of , we have This implies that provides a descent direction of at .
In what follows, we show that is a feasible descent direction of at . From (9), we have that It follows from (10) and (13) that When , we have It is easy to get from (14) and (15) that, for all , is satisfied. That is, is feasible direction.

In the remainder of this paper, we always assume that satisfies the following assumptions.

Assumption A. (i) The level set is bounded.
(ii) Function is continuously differentiable and bounded from below. Its gradient is Lipschitz continuous, on an open ball containing ; that is, there is a constant such that

Since is decreasing, it is clear that the sequence generated by Algorithm 2 is contained in . In addition, we get from Assumption A that there is a constant , such that Since the matrix is a projection matrix, it is reasonable to assume that there is a constant , such that

We state the steps of the algorithm as follows.

Algorithm 2 (EMPRP method with Armijio-type line search). Step 0. Choose an initial point , . Let .
Step 1. Compute by (10), where is computed by (11).
Step 2. If stop, else go to Step 3.
Step 3. Given , . Determine a stepsize satisfying
Step 4. Let , and . Go to Step 1.

3. Global Convergence

In what follows, we establish the global convergence theorem of the EMPRP method for general nonlinear objective functions. We firstly give some important lemmas of the EMPRP method.

Lemma 3. Suppose that , . is defined by (10) and (11). Then we have , , .

Proof. By the definition of , we have It follows from (12) and (21) that On the other hand, we also have

Lemma 4. Suppose that Assumption A holds. is generated by Algorithm 2. If there exists a constant such that then there exists a constant such that

Proof. It follows from (20) that the function value sequence is decreasing. We also have from (20) that as is bounded from below. In particular, we have By the definition of , we get from (17), (19), and (24) that Since , as , there exist a constant and an integer , such that the following inequality holds for all : Hence, we have, for any , Letting , we can get (25).

We now establish the global convergence theorem of the EMPRP method for general nonlinear objective functions.

Theorem 5. Suppose that Assumption A holds. is generated by Algorithm 2. Then we have

Proof. Suppose that for all . Then there exists a constant such that We now prove (31) by considering the following two cases.
Case (i). . We get from (22) and (27) that . This contradicts assumption (32).
Case (ii). . That is, there is an infinite index set such that When is sufficiently large, by the line search condition, does not satisfy inequality (20). This means By the mean-value theorem and inequality (17), there is a such that and Substituting the last inequality into (34), for all sufficiently large, we have from (12) that Since is bounded and , the last inequality implies This also yields a contradiction. The proof is then complete.

4. Improvement for Algorithm 2

In this section, we propose techniques for improving the efficiency of Algorithm 2 in practical computation which is about computing projections and the stepsize of the Armijo-type line search.

4.1. Computing Projection

In this paper, as in Gould et al. [34], instead of computing a basis for null space of matrix , we choose to work directly with the matrix of constraint gradients, computing projections by normal equations. As the computation of the projection is a key step in the proposed method, following Gould et al. [34], this projection can be computed in an alternative way.

Let We can express this as where is the solution of Noting that (40) is the normal equation. Since is a matrix of rank , is symmetric positive definite matrix. We use the Doolittle (called LU) factorization of to solve (40).

For the matrix is constant, the factorization of needs to be carried out only once at the beginning of the iterative process. Using a Doolittle factorization of , (40) can be computed in the following form: where , is the Doolittle factor of .

4.2. Computing Stepsize

The drawback of the Armijo line search is how to choose the initial stepsize . If is too large then the procedure needs to call much more function evaluations. If is too small then the efficiency of related algorithm will be decreased. Therefore, we should choose an adequate initial stepsize at each iteration. In what follows, we propose a way to generate the initial stepsize.

We first estimate the stepsize determined by the exact line search. Support at the moment that is twice continuously differentiable. We denote by the Hessian of at and abbreviate as . Notice that the exact line search stepsize satisfies This shows that scalar is an estimation to . To avoid the computation of the second derivative, we further estimate by letting where positive sequence satisfies as . Let the initial stepsize of the Armijo line search be an approximation to It is not difficult to see that if and are sufficiently small, then and are good estimation to .

So to improve the efficiency of EMPRP method in practical computation, we utilize the following line search process.

Line Search Process. If inequality, holds, then we let . Otherwise we let be the largest scalar in the set such that inequality (45) is satisfied.

5. Numerical Experiments

This section reports some numerical experiments. Firstly, we test the EMPRP method and compare it with the Rose gradient projection method in [1] on low dimensional problems. Secondly, we test the EMPRP method and compare it with the spectral gradient method in Martínez et al. [30] and the feasible Fletcher-Reeves conjugate gradient method in Li et al. [32] on large dimensional problems. In the line search process, we set , , .

The methods in the tables have the following meanings.(i)“EMPRP” stands for the EMPRP method with the Armijio-type line search (20).(ii)“ROSE” stands for Rose gradient projection method in [1] with the Armijio-type line search (20). That is, in Algorithm 2, the direction .(iii)“SPG” stands for the spectral gradient method with the nonmonotone line search in Martínez et al. [30], where , .(iv)“FFR” stands for the feasible modified Fletcher-Reeves conjugate gradient method with the Armijio-type line search in Li et al. [32].

We stop the iteration if the condition is satisfied, where . If the iteration number exceeds , we also stop the iteration. Then we call it failure. All of the algorithms are coded in Matlab 7.0 and run on a personal computer with a 2.0 GHZ CPU processor.

5.1. Numerical Comparison of EMPRP and ROSE

We test the performance of EMPRP and ROSE methods on the following test problems with given initial points. The results are listed in Table 1. stands for the dimension of tested problem and stands for the number of constraints. We will report the following results: the CPU time Time (in seconds), the number of iterations Iter, the number of gradient evaluations Geval, and the number of function evaluations Feval.

Problem 1 (HS28 [35]). The function HS28 in [35] is defined as follows: with the initial point . The optimal solution and optimal function value .

Problem 2 (HS48 [35]). The function HS48 in [35] is defined as follows: with the initial point . The optimal solution and optimal function value .

Problem 3 (HS49 [35]). The function HS49 in [35] is defined as follows: with the initial point . The optimal solution and optimal function value .

Problem 4 (HS50 [35]). The function HS50 in [35] is defined as follows: with the initial point . The optimal solution and optimal function value . Moreover, we extend the dimension of function HS51 [35] to 10, 20 with the initial point . The optimal solution and optimal function value .

Problem 5 (HS51 [35]). The function HS51 in [35] is defined as follows: with the initial point . The optimal solution and optimal function value .

From Table 1, we can see that the EMPRP method performs better than the Rosen gradient projection method in [1], which implies that the EMPRP method can improve the computational efficiency of the Rosen gradient projection method for solving linear equality constrained optimization problems.

5.2. Numerical Comparison of EMPRP, FFR, and SPCG

In this subsection, we test the EMPRP method and compare it with the spectral gradient method (called SPCG) in Martínez et al. [30] and the feasible Fletcher-Reeves conjugate gradient method (called FFR) in Li et al. [32] on the following large dimensional problems with given initial points. The results are listed in Tables 2, 3, 4, 5, and 6. We will report the following results: the CPU time Time (in seconds), the number of iterations Iter.

Problem 6. Given a positive integer , the function is defined as follows: with the initial point . The optimization function value . This problem comes from Martínez et al. [30].

Problem 7. Given a positive integer , the function is defined as follows: with the initial point . This problem comes from Asaadi [36] and is called MAD6.

Problem 8. Given a positive integer , the function is defined as follows: with the initial point . The optimization function value .

Problem 9. Given a positive integer , the function is defined as follows: with the initial point . The optimization function value .

Problem 10. Given a positive integer , the function is defined as follows: with the initial point .

From Tables 26, we can see the EMPRP method and the FFR method in [32] perform better than the SPCG method in Martínez et al. [30] for solving large-scale linear equality constrained optimization problems, as the EMPRP method and the FFR method all are first order methods. But the SPCG method in Martínez et al. [30] needs to compute with quasi-Newton method in which the approximate Hessians satisfy a weak secant equation. However, as the EMPRP method also needs to compute projection, the FFR method in [32] performs better than the EMPRP method when the test problem becomes large.

6. Conclusions

In this paper, we propose a new conjugate gradient projection method for solving linear equality constrained optimization problem (1), which combines the two-term modified Polak-Ribière-Polyak (PRP) conjugate gradient method in Cheng [7] with the Rosen projection method. The proposed method also can be regarded as an extension of the recently developed two-term modified Polak-Ribière-Polyak (PRP) conjugate gradient method in Cheng [7]. Under some mild conditions, we show that it is globally convergent with the Armijio-type line search.

Conflict of Interests

The author declares that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The author thanks the anonymous referees for their valuable comments and suggestions that improved the presentation. This work is supported by the NSF of China Grants 11301041, 11371154, 71371065, and 71371195 and the Ministry of Education Humanities and Social Sciences Project Grants 12YJC790027, Project funded by China Postdoctoral Science Foundation, Natural Science Foundation of Hunan Province.