Research Article  Open Access
Deflated BiCG with an Application to Model Reduction
Abstract
Most calculations in model reduction involve the solutions of a sequence of dual linear systems with multiple righthand sides. To solve such systems efficiently, a new deflated BiCG method is explored in this paper. The proposed algorithm uses harmonic Ritz vectors to approximate left and right invariant subspaces inexpensively via small descenting direction vectors found by subsequent runs of deflated BiCG and then derives the deflated subspaces for the next pair of dual linear systems. This process leads to faster convergence for the next pair of systems. Numerical examples illustrate the effectiveness of the proposed method.
1. Introduction
Large scale simulations play an important role in the study of a great variety of complex physical phenomena, leading often to overwhelming demands on computational resources [1–5]. Hence, the common approach is to produce a surrogate model of much smaller dimension which provides a highfidelity approximation of the original model. For such problems, interpolatory model reduction method combines flexibility and scalability and has proven effectiveness. It transfers function interpolations in the frequency domain to meet various desirable approximation goals. During this process, it requires the solutions of dual linear systems with multiple righthand sides (RHSs):where is a sparse, nonsymmetric matrix and RHSs are not available simultaneously.
For the solutions of primary linear systems , , deflated Krylov methods have been appearing. This is due to the fact that they take advantage of the fact that several systems share the same matrix. In addition, the convergence of Krylov subspace solvers for a linear system, to a great extent, depends on the spectrum of the matrix. If one could project the eigenvectors corresponding to the smallest eigenvalues out from the initial residual and then solve the deflated system it will converge much faster. The process is referred to as deflation. Variants of deflated Krylov solvers for the primary linear systems have been fully studied in the literature [2, 3, 6–14]. However, deflation for dual linear systems has not yet been fully investigated [15, 16].
In this paper we extend this idea to the BiCG algorithm for the dual case. The goal of this paper is to develop a new deflated BiCG method for solving dual linear systems with multiple RHSs. Likewise, for BiCG, it can be shown that if the primal Krylov subspace is deflated with right eigenvectors, the corresponding left eigenvectors are removed from the dual residual. Therefore, while solving a pair of systems, we select approximate left and right invariant subspaces of and then use those to accelerate the convergence of the next pair of systems. The proposed algorithm uses harmonic Ritz vectors to approximate left and right invariant subspaces inexpensively via small descenting direction vectors found by subsequent runs of deflated BiCG and then derives the deflated subspaces for the next pair of dual linear systems. Furthermore, we describe a cheap way to build the deflated subspaces.
In the next section, we describe the outline of the deflated Lanczos algorithm used in the derivation of the deflated BiCG method. In Section 3, we derive a deflated BiCG method using previously computed deflated subspace matrices. How to compute and update such deflated space matrices is given in Section 4. In Section 5, we investigate the deflated BiCG algorithm for solving the dual systems with multiple RHSs. The effectiveness of the proposed method is also demonstrated in Section 6. Finally, some conclusions are summarized in Section 7.
Throughout this paper, is referred to as the transpose conjugate operation of matrix , denotes the inner product, is defined as the zero matrix, and is biorthogonality.
2. The Deflated Lanczos Algorithm
In this section, we describe a deflated Lanczos algorithm that builds two sequences , of vectors such that and , where and are two sets of linearly independent vectors.
We assume that the matrix is nonsingular. Let Since the matrices and are each other’s conjugate transpose, we can apply the standard Lanczos procedure to the above auxiliary matrices. Let unit vectors , be biorthogonal to , , respectively; then we compute the Lanczos vectors which satisfy where , , and are tridiagonal matrices, , are the last element of the last row of , and , respectively.
The Lanczos procedure guarantees that the vectors and . Since and , we have and for , via (5) and (6). Hence the sequences and satisfy the properties of (2).
To make these ideas more concrete, we give a deflated Lanczos algorithm by substituting the righthand sides of (3) and (4).
3. The Deflated BiCG Algorithm
In this section, we derive the deflated BiCG method from Algorithm 1 in exactly the same way as BiCG was derived from the Lanczos biorthogonalization procedure [17]. For convenience, we drop the superscript in (1) and refer toas the primary and the dual system, respectively.

Assume initial guesses and are given such that and . Let , . DefineThe th solutions that are updated in deflated BiCG algorithm become With deflation, the biorthogonality conditions , , and define the Petrov Galerkin conditions
Lemma 1. If , , , and satisfy (9), (10), (11), and (12), respectively, then and with , . Thus the residual is orthogonal to and is orthogonal to .
Proof. From (3) and (5), we have where . Using (9), we get From the orthogonality conditions leads to the following equations: Substituting (15) into (14), we get for some scalar . Similarly, it is able to prove that and .
Now we define , where , , and are diagonal, lower triangular, and upper triangular, respectively. Since a pivotless LDU decomposition of the tridiagonal matrix will lead to a breakdown in BiCG, it can be avoided by performing the LDU decomposition with 2 × 2 block diagonal elements [18]. However, such breakdown rarely happens in practice; hence we will not discuss it for the sake of brevity. Definewhere
Using the above information, twoterm recurrences for dual linear systems can be easily derived from the deflated Lanczos procedure.
Theorem 2 2. The solutions , , the residuals , , and the descent directions , for the dual linear systems satisfy the following recurrence relations:
Proof. It is analogous to the proof of Theorem [7].
Theorem 3 3. The descent direction is orthogonal to . In addition, it is also orthogonal to .
Proof. It is equivalent to prove that is a diagonal matrix and . Consider the following:is diagonal and The proof of goes along the same lines as above. We next find expressions for , , , and with iteration vectors at hand.
Theorem 4. The coefficients in the deflated BiCG method satisfy the following relations: where and .
Proof. The proof follows from [7].
Algorithm 2 provides an outline of the deflated BiCG method.

In order to guarantee and , we take the following forms. Take and as the initial guesses and , as the corresponding initial residuals. Let
4. Computing Approximate Invariant Subspaces
The matrices and are defined as the primary and dual system deflated spaces, respectively. The deflated spaces are fixed to solve the current linear systems; however, the bases of the deflated spaces are updated periodically for the next pair of linear systems. In the literature, approximate invariant subspace of is taken as deflated space. Since we focus on the solutions of the dual linear systems, it is necessary to build the left invariant subspace for the dual and the right invariant for the primary. We use harmonic Ritz vectors [19] to approximate left and right invariant subspaces cheaply. Hence it is needed to solve a small generalized eigenvalue problem whose solution gives the desired approximate invariant subspaces. Although it is cheap to solve the generalized eigenvalue problem, it would be expensive to set up the corresponding matrices in a straightforward manner. For the less matrix vector products, we use the descent directions , to update the deflated subspaces following [7].
Let , For each new system, we define the matrices , as follows: where The harmonic projection method computes the left and right approximate eigenvectors corresponding to the smallest eigenvalues by solving the following generalized eigenproblems: where and . Let , ; then the new deflated subspaces for the next system are defined as For avoiding the matrix multiplication and reducing the number of the matrix vector products in and , we take the same way as [7].
Lemma 5 5. Let , . One has , , andwhere , take and respectively. Then the matrices , can be computed by
Proof. The proof follows from [7].
Lemma 6 6. DefineThen the matrices , , , and are given by
Proof. The proof is analogous to the proofs as given in [7].
5. Systems with Multiple RightHand Sides
In this section, we describe the deflated BiCG algorithm for solving the dual linear systems with multiple RHSs. The algorithm uses descent direction vectors , to improve approximate eigenvectors found by subsequent runs of deflated BiCG and uses deflation to accelerate convergence. The new vectors , are appended to the current deflated subspaces, and . These incrementally built spaces are then used to generate approximate subspaces for deflating the next systems. The resulting algorithm is given in Algorithm 3.

The algorithm requires the storage of , , , , , and for approximating eigenvectors, amounting to vectors of length . In addition, we need to save , , , , and of the first steps of the algorithm.
6. Numerical Examples
In this section we give numerical results which indicate the potential effectiveness of our approach. In all of our runs we use a zero initial guess for the first dual system and take (22) as initial guesses for the remaining systems. Also, we keep the data in first steps and approximate eigenvectors associated with smallest eigenvalues. The stopping criterion is taken as . All the numerical experiments were performed in MATLAB 2011b. The machine we have used is a PC Pentium(R)4, CPU 2.50 GHz, 2.00 GB of RAM.
6.1. Convection Diffusion Problem
In this subsection, we take two different approaches BiCG and deflated BiCG (DBiCG) to solve the same problem for highlighting the efficiency of the deflation. The example is the convection diffusion problem with Dirichlet boundary conditions in the unit square , where , This problem is discretized with a secondorder finite difference scheme for a vertex centered location of unknowns. We adopt to satisfy the Péclet condition, where is the mesh size and is the number of points per direction. Here, , , and are taken; then the problem size is 1024 and has 4992 nonzero entries, which is real, sparse, and nonsymmetric. The primary system righthand side comes from PDE. For convenience, we take the same vector as the dual system righthand side. Their convergence behaviors are plotted in Figure 1.
(a)
(b)
Figure 1 shows the convergence histories of two methods. Using the initial guesses (22), DBiCG (without any deflated subspaces) converges rather slowly in the first run. The numerical behavior of DBiCG will be the same as the ones by applying BiCG to solve the first dual linear systems. The only difference is that the deflated spaces for the next run are generated during the first process of DBiCG. For the second run, we solve the same systems by DBiCGsh with deflated spaces. Compared with the first run, the reduction in iteration is around in the second run.
6.2. Lattice QCD Problem
The second numerical experiments stem from a quantum chromodynamics (QCD) problem. QCD is the gauge theory that describes the strong nuclear force between quarks and gluons [20, 21]. Quark propagators are obtained by solving the inhomogeneous lattice Dirac equation , where is a large but sparse complex nonHermitian matrix representing a periodic nearest neighbour coupling on a fourdimensional Euclidean space time lattice. Following [22], we take a preconditioning process which transforms from the original system to the odd even reduced system. That preconditioning process has onehalf of the dimension and a smaller condition number than the original system; for details see [23, 24]. The number of righthand sides is 4 and the is taken as a vector of all ones and three independent random vectors with entries form .
Figure 2 is helpful for seeing how the deflation works in practise. It is observed that DBiCG leads to less iterations than BiCG.
(a)
(b)
7. Conclusions
In this paper, we have derived a deflated BiCG method for dual linear systems with multiple RHSs. The proposed algorithm uses harmonic Ritz vectors to approximate left and right invariant subspaces inexpensively via small descent direction vectors found by subsequent runs of deflated BiCG and then derives the deflated subspaces for the next pair of dual linear systems. This process leads to faster convergence for the next pair of systems. Numerical examples illustrate the effectiveness of the proposed method, which will encourage us to apply the deflation technique to the BiCGSTAB algorithm in the near future.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Authors’ Contribution
All authors completed the paper together. All authors read and approved the final paper.
Acknowledgments
This research is supported by YBXSZC20131070, the National Natural Science Foundation of China (11026085, 11101071, and 51175443), and the Project sponsored by OATF (UESTC).
References
 J. Bloch and T. Wettig, “Domainwall and overlap fermions at nonzero quark chemical potential,” Physical Review D  Particles, Fields, Gravitation and Cosmology, vol. 76, no. 11, Article ID 114511, 2007. View at: Publisher Site  Google Scholar
 A. Stathopoulos and K. Orginos, “Computing and deflating eigenvalues while solving multiple righthand side linear systems with an application to quantum chromodynamics,” SIAM Journal on Scientific Computing, vol. 32, no. 1, pp. 439–462, 2010. View at: Publisher Site  Google Scholar  Zentralblatt MATH  MathSciNet
 K. Meerbergen and Z. Bai, “The Lanczos method for parameterized symmetric linear systems with multiple righthand sides,” SIAM Journal on Matrix Analysis and Applications, vol. 31, no. 4, pp. 1642–1662, 2010. View at: Publisher Site  Google Scholar  MathSciNet
 J. Cao, “Improved delaydependent stability conditions for MIMO networked control systems with nonlinear perturbations,” The Scientific World Journal, vol. 2014, Article ID 196927, 4 pages, 2014. View at: Publisher Site  Google Scholar
 J. Cao and Z. Lin, “Bayesian signal detection with compressed measurements,” Information Sciences, vol. 289, pp. 241–253, 2014. View at: Publisher Site  Google Scholar  MathSciNet
 A. Chapman and Y. Saad, “Deflated and augmented Krylov subspace techniques,” Numerical Linear Algebra with Applications, vol. 4, no. 1, pp. 43–66, 1997. View at: Google Scholar  MathSciNet
 Y. Saad, M. Yeung, J. Erhel, and F. Guyomarc'h, “A deflated version of the conjugate gradient algorithm,” SIAM Journal on Scientific Computing, vol. 21, no. 5, pp. 1909–1926, 2000. View at: Publisher Site  Google Scholar  MathSciNet
 A. Stathopoulos and K. Orginos, “Computing and deflating eigenvalues while solving multiple righthand side linear systems with an application to quantum chromodynamics,” SIAM Journal on Scientific Computing, vol. 32, no. 1, pp. 439–462, 2010. View at: Publisher Site  Google Scholar  MathSciNet
 A. M. AbdelRehim, R. B. Morgan, D. A. Nicely, and W. Wilcox, “Deflated and restarted symmetric Lanczos methods for eigenvalues and linear equations with multiple righthand sides,” SIAM Journal on Scientific Computing, vol. 32, no. 1, pp. 129–149, 2010. View at: Publisher Site  Google Scholar  Zentralblatt MATH  MathSciNet
 J. Frank and C. Vuik, “On the construction of deflationbased preconditioners,” SIAM Journal on Scientific Computing, vol. 23, no. 2, pp. 442–462, 2001. View at: Publisher Site  Google Scholar  MathSciNet
 J. Cheng, H. Zhu, S. Zhong, Y. Zeng, and X. Dong, “Finitetime ${H}_{\infty}$ control for a class of Markovian jump systems with modedependent timevarying delays via new Lyapunov functionals,” ISA Transactions, vol. 52, no. 6, pp. 768–774, 2013. View at: Publisher Site  Google Scholar
 J. Cheng, H. Zhu, Y. Ding, S. Zhong, and Q. Zhong, “Stochastic finitetime boundedness for Markovian jumping neural networks with timevarying delays,” Applied Mathematics and Computation, vol. 242, pp. 281–295, 2014. View at: Publisher Site  Google Scholar  MathSciNet
 R. B. Morgan and W. Wilcox, “Deflated iterative methods for linear equations with multiple righthand sides,” Tech. Rep. BUHEPP0401, Baylor University, 2004. View at: Google Scholar
 D. Darnell, R. B. Morgan, and W. Wilcox, “Deflated GMRES for systems with multiple shifts and multiple righthand sides,” Linear Algebra and its Applications, vol. 429, no. 10, pp. 2415–2434, 2008. View at: Publisher Site  Google Scholar  MathSciNet
 K. Ahuja, E. de Sturler, S. Gugercin, and E. R. Chang, “Recycling BiCG with an application to model reduction,” SIAM Journal on Scientific Computing, vol. 34, no. 4, pp. A1925–A1949, 2012. View at: Publisher Site  Google Scholar  MathSciNet
 A. M. AbdelRehim, A. Stathopoulos, and K. Orginos, “Extending the eigCG algorithm to nonsymmetric Lanczos for linear systems with multiple righthand sides,” Numerical Linear Algebra with Applications, vol. 21, no. 4, pp. 473–493, 2014. View at: Publisher Site  Google Scholar
 Y. Saad, Iterative Methods for Sparse Linear Systems, SIAM, Philadelphia, Pa, USA, 2003. View at: Publisher Site  MathSciNet
 R. E. Bank and T. F. Chan, “An analysis of the composite step biconjugate gradient method,” Numerische Mathematik, vol. 66, no. 3, pp. 295–319, 1993. View at: Publisher Site  Google Scholar  MathSciNet
 G. L. Sleijpen and H. A. Van der Vorst, “A JacobiDavidson iteration method for linear eigenvalue problems,” SIAM Journal on Matrix Analysis and Applications, vol. 17, no. 2, pp. 401–425, 1996. View at: Publisher Site  Google Scholar  MathSciNet
 T. Muta, Foundations of Quantum Chromodynamics, An Introduction toPerturbative Methods in Gauge Theories, World Scientific, 3rd edition, 2010. View at: MathSciNet
 J. Donoghue, E. Golowich, and B. R. Holstein, Dynamics o f the Standard Model, Cambridge University Press, Cambridge, Mass, USA, 1992.
 A. Frommer, “BiCGStab ($\mathcal{l}$) for families of shifted linear systems,” Computing, vol. 70, no. 2, pp. 87–109, 2003. View at: Publisher Site  Google Scholar  MathSciNet
 B. N. Datta and Y. Saad, “Arnoldi methods for large Sylvesterlike observer matrix equations, and an associated algorithm for partial spectrum assignment,” Linear Algebra and its Applications, vol. 154–156, pp. 225–244, 1991. View at: Publisher Site  Google Scholar  MathSciNet
 A. Frommer and U. Glässner, “Restarted GMRES for shifted linear systems,” SIAM Journal on Scientific Computing, vol. 19, no. 1, pp. 15–26, 1998. View at: Publisher Site  Google Scholar  MathSciNet
Copyright
Copyright © 2015 Jing Meng et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.