A Modified Gradient Based Algorithm for Solving Matrix Equations
In this paper, we develop a modified gradient based algorithm for solving matrix equations . Unlike the gradient based method introduced by Xie et al. (2010), the new algorithm fully exploits the information generated in the first half-iterative step and uses it to construct the approximate solution. Theoretical analysis shows that the new method converges under certain assumptions. Numerical results are given to verify the efficiency of the new method.
Consider a linear matrix equation of the following form: where , , , , and are the given constant matrices and is the unknown matrix to be solved. A Sylvester equation is the special case of (1) with and , where denotes the identity matrix of . Such problems arise frequently in many application areas, including control and system theory, stability of linear systems, analysis of bilinear systems, power systems, and signal and image processing.
The exact solutions of matrix equations, such as Lyapunov and Sylvester matrix equations, can be obtained by matrix inversion using the Kronecker product. The drawback of this approach is its considerable computational cost and storage requirements, so it is only applicable to small-sized Sylvester equations. Some direct methods have also been proposed in [6–9]; they transform the coefficient matrix into a Schur or Hessenberg form, after which the original equation can be solved by back substitution.
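The Kronecker product approach can be illustrated with a minimal NumPy sketch. It assumes the standard Sylvester form A X + X B = C; the function name `solve_sylvester_kron` and the small test matrices are hypothetical and are not taken from the paper.

```python
import numpy as np

def solve_sylvester_kron(A, B, C):
    """Solve A X + X B = C by forming the Kronecker-product linear system."""
    m, n = C.shape
    # With column-major vec: vec(A X) = (I_n kron A) vec(X)
    # and vec(X B) = (B^T kron I_m) vec(X).
    K = np.kron(np.eye(n), A) + np.kron(B.T, np.eye(m))
    x = np.linalg.solve(K, C.reshape(-1, order="F"))
    return x.reshape((m, n), order="F")

# Illustrative data (hypothetical): A and -B share no eigenvalue,
# so the solution is unique.
A = np.array([[4.0, 1.0], [0.0, 3.0]])
B = np.array([[2.0, 0.0], [1.0, 5.0]])
X_true = np.array([[1.0, 2.0], [3.0, 4.0]])
C = A @ X_true + X_true @ B

X = solve_sylvester_kron(A, B, C)
print(np.linalg.norm(X - X_true))
```

The matrix `K` has (mn)² entries, which is why this route is practical only for small problems.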
In the numerical linear algebra community, iterative methods are becoming more and more popular. Several iterative schemes for Sylvester equations have been proposed; see, for example, [10–15]. Recently, some efficient gradient based and least squares based iterative algorithms for solving generalized Sylvester equations and coupled (general coupled) Sylvester equations have been presented [16–28]. These approaches rely on a hierarchical identification principle [16–18], which regards the unknown matrix as the system parameter matrix to be identified and then constructs a recursive formula to approximate the unknown solution. In particular, for general linear matrix equations of form (1), it is illustrated in [3, 4] that the unknown matrix to be identified can be computed by a gradient based iterative algorithm. The convergence properties of the methods are also investigated in . In this paper, a modified gradient based iterative algorithm is proposed for solving linear matrix equations of form (1). The modified method fully exploits the information generated in the first half-iterative step and uses it to construct the approximate solution. The convergence condition of the method is analyzed. The numerical performance of the method is compared with the algorithms in [3, 4]. Numerical results show that the new method is efficient and robust.
The paper is organized as follows. In Section 2, the gradient based iterative method is recalled, and the modified gradient based method is introduced and analyzed in Section 3. In Section 4, performance of the modified gradient based method is compared with the original one. Finally, we conclude the paper in Section 5.
2. A Brief Review of the Gradient Based Iterative Method
We first recall an iterative method proposed by Xie et al. for solving (1). The basic idea is to regard (1) as two linear matrix equations as follows: Then, define two recursive sequences as follows: where is the iterative step size. The above procedures can be regarded as two separate iterative procedures for solving the two matrix equations in (3).
With and at hand, then the th approximate solution can be defined by taking the average of two approximate solutions, that is, By selecting an appropriate initial approximate solution , and using to substitute in (4) and in (5), then the above (4)–(6) constitute the gradient based iterative method proposed in . It is shown in  that the gradient based iterative algorithm converges as long as where is the largest eigenvalue of .
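The averaging scheme described above can be sketched in NumPy. The sketch assumes that equation (1) has the form A X B + C X D = F, which the text above does not show explicitly; the function name `gbi`, the step-size choice, and the test matrices are likewise illustrative assumptions. For this assumed form, a step size below 2 / (‖A‖₂²‖B‖₂² + ‖C‖₂²‖D‖₂²) is a sufficient condition for convergence when the solution is unique, because the averaged update is a gradient descent step on the squared residual.

```python
import numpy as np

def gbi(A, B, C, D, F, mu, iters=300):
    """Gradient based iteration for the ASSUMED form A X B + C X D = F."""
    X = np.zeros((A.shape[1], B.shape[0]))
    for _ in range(iters):
        R = F - A @ X @ B - C @ X @ D    # residual at the current iterate
        X1 = X + mu * A.T @ R @ B.T      # half-step driven by the A X B term
        X2 = X + mu * C.T @ R @ D.T      # half-step driven by the C X D term
        X = 0.5 * (X1 + X2)              # average the two half-steps
    return X

# Illustrative, well-conditioned data (hypothetical, not from the paper).
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[1.0, 0.0], [0.1, 1.0]])
C = np.array([[1.0, 0.0], [0.1, 1.0]])
D = np.array([[1.0, 0.1], [0.0, 1.0]])
X_true = np.array([[1.0, 2.0], [3.0, 4.0]])
F = A @ X_true @ B + C @ X_true @ D

# Half of the sufficient bound stated in the lead-in (spectral norms).
nrm = lambda M: np.linalg.norm(M, 2)
mu = 1.0 / (nrm(A) ** 2 * nrm(B) ** 2 + nrm(C) ** 2 * nrm(D) ** 2)

X = gbi(A, B, C, D, F, mu)
print(np.linalg.norm(X - X_true) / np.linalg.norm(X_true))
```

Both half-steps start from the same iterate, so the second one cannot benefit from the result of the first.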
Extensive numerical experiments indicate that the gradient based iterative (GBI) algorithm is computationally efficient. However, we observe that the GBI algorithm has some limitations: its convergence rate is slow, and stagnation occurs for ill-conditioned problems. Also, in , the authors pointed out that how to choose the best convergence factor remains an open question that deserves further research. In this paper, we present the optimal convergence factor explicitly, and then propose a modified algorithm for solving the linear matrix equations (1).
3. A Modified Gradient Based Iterative Algorithm
The above GBI process can be accomplished by the following algorithm.
Algorithm 1 (see ). The gradient based iterative algorithm (GBI algorithm).(1)Give two initial approximate solutions and (2)for , until converges(3)(4)(5)(6)end.
In the step of computing , the last approximate solution has been computed. Hence, we can use the information of to update the and present a modification of GBI algorithm.
Algorithm 2. The modified gradient based iterative algorithm (MGBI algorithm).(1)Give two initial approximate solutions and (2)for , until converges(3)(4)(5)(6)(7)end.
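One possible reading of the modification is sketched below in NumPy: the freshly computed first half-step is averaged with the previous second half-step before the second half-step is updated. This is a sketch under stated assumptions, not a transcription of Algorithm 2. It assumes that equation (1) has the form A X B + C X D = F; the function name `mgbi`, the exact ordering of the updates, the step size, and the test matrices are all hypothetical. The step size is chosen small enough for these near-identity matrices and is not the convergence condition of Theorem 4.

```python
import numpy as np

def mgbi(A, B, C, D, F, mu, iters=300):
    """Modified gradient based iteration (ASSUMED form A X B + C X D = F)."""
    shape = (A.shape[1], B.shape[0])
    X1 = np.zeros(shape)
    X2 = np.zeros(shape)
    for _ in range(iters):
        X = 0.5 * (X1 + X2)
        # First half-step, driven by the A X B term.
        X1 = X + mu * A.T @ (F - A @ X @ B - C @ X @ D) @ B.T
        # Reuse the new X1 immediately instead of waiting for the next sweep.
        X = 0.5 * (X1 + X2)
        # Second half-step, driven by the C X D term, from the refreshed X.
        X2 = X + mu * C.T @ (F - A @ X @ B - C @ X @ D) @ D.T
    return 0.5 * (X1 + X2)

# Illustrative, well-conditioned data (hypothetical, not from the paper).
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[1.0, 0.0], [0.1, 1.0]])
C = np.array([[1.0, 0.0], [0.1, 1.0]])
D = np.array([[1.0, 0.1], [0.0, 1.0]])
X_true = np.array([[1.0, 2.0], [3.0, 4.0]])
F = A @ X_true @ B + C @ X_true @ D

nrm = lambda M: np.linalg.norm(M, 2)
mu = 1.0 / (nrm(A) ** 2 * nrm(B) ** 2 + nrm(C) ** 2 * nrm(D) ** 2)

X = mgbi(A, B, C, D, F, mu)
print(np.linalg.norm(X - X_true) / np.linalg.norm(X_true))
```

The extra averaging costs one more residual evaluation per sweep than a scheme in which both half-steps share a single residual.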
Let , , , , , and
Lemma 3 (see ). The Sylvester equation given by (1) has a unique solution if and only if , in this case, the unique solution is given by The corresponding homogeneous equation has a unique zero solution .
The following result gives the convergence conditions of Algorithm 2.
Theorem 4. If the linear matrix equation (1) has a unique solution and then the iterative sequence generated by Algorithm 2 converges to ; that is, ; or the error converges to zero for any initial value .
Proof. Throughout the proof, we assume that the Frobenius matrix norm is used. For clarity of the proof, we introduce another variable instead of in the fifth iteration. Define the following error matrices: By using (1) and (10), the following equalities are easily verified: Taking the Frobenius norm of both sides of (11), it follows that From , we have Obviously, . In fact, the iterative sequence , generated by Algorithm 2 can also be viewed as the sequence generated by the double-side iteration in , so . As , , we have It follows that or According to Lemma 3, we have as .
4. Numerical Experiments
Example 1. Consider the matrix equation with From (8), the exact solution is
The coefficient matrices used in this example are taken from . Taking , we apply the GBI algorithm and the MGBI algorithm to compute ; the convergence factor is set to be in the GBI algorithm and to be , in the MGBI algorithm. The relative error is recorded and plotted in Figure 1 using the MATLAB command semilogy. The figure shows that the MGBI algorithm converges faster than the GBI algorithm.
Remark. The choice of the convergence factor is an important issue, so we study its influence on the convergence experimentally. The effect of changing the convergence factor for the MGBI algorithm in Example 1 is illustrated in Figure 2. We see that is a better convergence factor. However, the convergence factor is problem dependent, so finding the best convergence factor is still a difficult task.
Example 2. Suppose that , where Then, the solution of from (8) is The coefficient matrices used in this example are taken from . Taking , we apply the GBI algorithm and the MGBI algorithm to compute ; the convergence factor is set to be in the GBI algorithm and to be in the MGBI algorithm. The relative error is recorded in Figure 3. The figure again shows that the MGBI algorithm converges faster than the GBI algorithm.
5. Conclusions
In this paper, a modified gradient based iteration (MGBI) method is proposed for linear matrix equations. The convergence of the MGBI method is analyzed. The choice of the convergence factor is an important issue, and its influence is studied experimentally. The principal idea of this paper can be extended to more general settings such as generalized (coupled) Sylvester matrix equations.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
References
B. N. Datta, Numerical Methods for Linear Control Systems, Elsevier Academic Press, 2003.
R. H. Bartels and G. W. Stewart, “Algorithm 432: solution of the matrix equation AX - XB = C,” Communications of the ACM, vol. 15, pp. 820–826, 1972.
W. H. Enright, “Improving the efficiency of matrix operations in the numerical solution of stiff ordinary differential equations,” ACM Transactions on Mathematical Software, vol. 4, no. 2, pp. 127–136, 1978.
F. Ding, Y. J. Liu, and B. Bao, “Gradient based and least squares based iterative estimation algorithms for multi-input multi-output systems,” Proceedings of the Institution of Mechanical Engineers I: Journal of Systems and Control Engineering, vol. 226, no. 1, pp. 43–55, 2012.