Abstract

We study the inverses of block Toeplitz matrices based on the analysis of the block cyclic displacement. New formulas for the inverses of block Toeplitz matrices are proposed. We show that the inverses of block Toeplitz matrices can be decomposed as a sum of products of block circulant matrices. In the scalar case, the inverse formulas are proved to be numerically forward stable, if the Toeplitz matrix is nonsingular and well conditioned.

1. Introduction

Let be an block Toeplitz matrix with blocks of size . We use the shorthand for a block Toeplitz matrix. The block Toeplitz systems arise in a variety of applications in mathematics, scientific computing, and engineering, for instance, image restoration problems in image processing, numerical differential equations and integral equations, time series analysis, and control theory [13]. If we want to solve more than one block Toeplitz linear system with the same coefficient matrix, then we usually solve four or so special block linear systems in order to determine the block Toeplitz inverse formula that expresses as the sum of products of block upper and block lower Toeplitz matrices. For example, Van Barel and Bultheel [4] gave an inverse formula for a block Toeplitz matrix and then derived a weakly stable algorithm to solve a block Toeplitz system of linear equations. The special structure of block Toeplitz matrices has resulted in some closed formulas for their inverses.

In the scalar case, Gohberg and Semencul [5] have shown that if the st entry of the inverse of a Toeplitz matrix is nonzero, then the first and the last columns of the inverse of the Toeplitz matrix are sufficient to reconstruct . In [6], an inverse formula can be obtained by the solutions of two equations (the so-called fundamental equations), where each right-hand side of them is a shifted column of the Toeplitz matrix. Later, Ben-Artzi and Shalom [7], Labahn and Shalom [8], Huckle [9], Ng et al. [10], and Heinig [11] have studied the Toeplitz matrix inverse formulas when the st entry of the inverse of a Toeplitz matrix is zero. In [12], Cabay and Meleshko presented an efficient algorithm (NPADE) for numerically computing Padé approximants in a weakly stable fashion. As an application of NPADE, it has been shown that it can be used to compute stably, in a weak sense, the inverse of a Hankel or Toeplitz matrix.

When , additional problems are encountered in obtaining the inverse formula of a block Toeplitz matrix. A well-known formula of Gohberg and Heinig can construct , provided that the first and last columns together with the first and last rows of the inverse are known [13]. In [14], a set of new formulas for the inverse of a block Hankel or block Toeplitz matrix is given by Labahn et al. The formulas are expressed in terms of certain matrix Padé forms, which approximate a matrix power series associated with the block Hankel matrix. We refer the reader to [15, 16] for the computation of Padé-Hermite and simultaneous Padé systems in detail. In [7], Ben-Artzi and Shalom have proved that each inverse of a Toeplitz matrix can be constructed via three of its columns, and thus, a parametrization of the set of inverses of Toeplitz matrices is obtained. Then they generalized these results to block Toeplitz matrices; see [17]. In [18], Gemignani has shown that the representation of relies upon a strong structure-preserving property of the Schur complements of the nonsingular leading principal submatrices of a certain generalized Bezoutian of matrix polynomials.

In this paper, we focus our attention to the inverses of block Toeplitz matrices with the help of the block cyclic displacement. In [19], Ammar and Gader have shown that the inverse of a Toeplitz matrix can be represented as sums of products of lower triangular Toeplitz matrices and circulant matrices. The derivation of their results is based on the idea of cyclic displacement structure. In [20], Gohberg and Olshevsky also obtained new formulas for representation of matrices and their inverses in the form of sums of products of factor circulant, which are based on the analysis of the factor cyclic displacement of matrices. The results in applications to Toeplitz matrices generalized the Gohberg-Semencul, Ben-Artzi-Shalom, and Heinig-Rost formulas. Motivated by a number of related results on Toepltiz inverse formulas, we study the representation of the inverses of block Toeplitz matrices. Since block Toeplitz matrices have similar displacement structure as Toeplitz matrices, all results about Toeplitz matrices extend quite naturally to block Toeplitz matrices.

At first, for purposes of presentation, we adopt some notations that will be used throughout the paper. We denote the identity matrix by . means a matrix of dimension in which the -th block is the identity matrix , and the remaining blocks are all zeros. is called an block reverse identity matrix, if has the identity matrices on the block antidiagonal and zero elsewhere. Let be a block matrix or a block vector with block size of . is said to be the block transpose of . By we denote the block -circulant with the first column with a block size of ; that is, the matrix of form is defined by

In this work, new formulas for the inverses of block Toeplitz matrices are proposed. As is well known, any block Toeplitz matrix has a property of block persymmetry, that is, , where is the block reverse identity matrix. Unfortunately, the matrix inverse does not hold this property. This can explain why the Gohberg-Heĭnig formula [13] needs the first and last columns together with the first and last rows of the inverse for constructing the inverse. For a block Toeplitz matrix , let and be its block circulant and block skew-circulant parts, respectively. The representation shows that there will exist block circulant matrices and such that . In this paper, by solving four linear equations with the same coefficient matrix , we decompose the inverses of block Toeplitz matrices as a sum of products of block circulant matrices. From the point of view of the computation, the formulas involving block circulants instead of block upper or lower Toeplitz matrices are more attractive. For example, in the scalar case, matrix-vector product by a circulant matrix is roughly twice as fast as matrix-vector product by a triangular Toeplitz matrix of the same size. In fact, the efficient multiplication of a Toeplitz matrix and a vector is achieved by embedding the Toeplitz matrix in a circulant matrix of twice the size. The derivation of the formulas we present is based on the factor cyclic displacement of block matrices. Later on, we give some corollaries about the representations of the inverse matrices. The results generalize the Gohberg-Olshevsky, Heinig-Rost, and Ben-Artzi-Shalom formulas. In the scalar case, we consider the stability of the Toeplitz matrix inverse formulas. We show that if the Toeplitz matrix is nonsingular and well conditioned, the inverse formulas are numerically forward stable.

We summarize briefly the content of our paper. Section 2 reviews some fundamental concepts such as the factor cyclic displacement structure of a block matrix and every square block matrix can be written as a sum of products of block circulant matrices. In Section 3, we decompose the inverses of block Toeplitz matrices as a sum of products of block circulant matrices by solving four equations with the same coefficient matrix . Some corollaries are also obtained in this section. We discuss the stability of the inverse formulas of scalar Toeplitz matrices in Section 4. Some remarks are given in Section 5.

2. The Factor Block Cyclic Displacement

The inverse formulas derived in the next section depend on some results of the factor block cyclic displacement of a block matrix. The focus of this section lies upon the definitions and the properties of the factor block cyclic displacement of a block matrix. We start with the definition of the factor block cyclic displacement.

Let be the block -cyclic lower shift matrix defined by The block -cyclic displacement of an block matrix with blocks size of is defined as The number is referred to as block -cyclic displacement rank of the block matrix . Since all block Toeplitz matrices have a special structure, it is not difficult to obtain that where may be an arbitrary matrix. Therefore, the -cyclic displacement rank of a block Toeplitz matrix is at most .

By straightforward computations, it is easy to see that the equality holds if and only if is a block -circulant matrix. For any invertible block matrix there exists a simple interrelation between the block -cyclic displacement of the inverse matrix and the block -cyclic displacement of . This connection is given by From the previous equation, we know that the block -cyclic displacement rank of the inverse matrix is the same as that of . Moreover, the previous equation also allows us to make use of the block -cyclic displacement technique for the calculation of the inverse matrix. We will study the inverses of block Toeplitz matrices with the help of the block cyclic displacement. We need the following result.

Theorem 1. If the block -cyclic displacement of is given as the sum where and are block vectors with blocks of size , then or where is a block -circulant matrix with the same last block row as that of and is the block -circulant matrix with the same last block column as that of .

Proof. From computing the block -cyclic displacement of matrix defined by (4), it then follows that Note that the -cyclic displacement of any block matrix is a linear operator and two block -circulant matrices can commute; we obtain By computing the block -cyclic displacement of block -cyclic matrix for all , we have where for are the first row of the block -circulant matrices , respectively. Since , it is easy to observe that the sum of the first terms in the previous equality is equal to .
In the following, we need to show that the sum of the last terms in the previous equality is equal to the zero matrix. Using the fact that and , we check Thus we obtain By comparing with , we obtain .
Finally, we discuss why is the block -circulant with the same last block rows as the matrix . We know that the last block row of the matrix is independent of the number . In view of , the last block rows of the matrix and are the same.
Equation (8) is now completely proved. A similar argument shows that (9) can be obtained from (7).

3. Inverses of Block Toeplitz Matrices

In this section, we focus our attention to new formulas on the inverses of block Toeplitz matrices. It is easy to find that any block Toeplitz matrix has a property of block persymmetry, that is, , where is the block reverse identity matrix. Unfortunately, the matrix inverse does not have this property. This can explain why the Gohberg-Heĭnig formula [13] needs the first and last block columns together with the first and last block rows of the inverse to construct it. Unlike the Gohberg-Semencul formula [5], it only needs the first and the last columns of the inverse if the st entry of the inverse is nonzero.

Let

For convenience, we will write and as From (5), (6), (15), and (17), by simple computations we obtain the following equality: Substituting (16) and (18) into the previous equality and computing, we obtain

Theorem 2. Let be a block Toeplitz matrix given by (1). If (15)–(18) are solvable for some and an arbitrary matrix of order , then(a)is invertible;(b) can be denoted as where is an arbitrary number.

Proof. (a) Assume that there exists nonzero vector of dimension such that . Then by computing (16) and (18), we have and (. These imply . Since , we obtain . A similar argument shows that . Premultiplying the equalities by , we can conclude that all elements of the vector are equal to zero. Therefore, the invertibility of is now proved.
(b) According to Theorem 1, we get is the block -circulant matrix with the same last block row as that of . It is not difficult to see that . After computations and arrangement, we obtain the formula (22) on the inverses of block Toeplitz matrices.

Remark 3. In fact, the inverse formula (22) is a generalization of the formula of Gohberg and Olshevsky [20], where they consider scalar Toeplitz matrices. Since the matrix inverse does not hold the property of persymmetry, in the block case, for constructing we need to solve four linear equations with the same coefficient matrix . Labahn et al. also gave this conclusion in terms of certain matrix Padé forms [14].

Remark 4. For the inverse formula (22), we take an interest in two particular cases. When and , the inverse formula (22) can be written as So we obtain that the inverse of a block Toeplitz matrix can be represented as a sum of products of block lower triangular Toeplitz matrices and block circulant matrices. This is a generalization of the formula of Ammar and Gader [19]. And when and , the inverse formula (22) can be written as a sum of products of block skew-circulant matrices and block circulant matrices. This is a generalization of the formula of Ammar and Gader [21], where they considered the Hermitian Toeplitz matrix.

Let , , and be the solutions of the equations , , , and for some integer , respectively. In addition, and are both nonsingular, for and for .

For , we have . Then we obtain For , we have . A similar argument shows that

As a result, we obtain that and are the solutions of (17) and (18), respectively. This result together with Theorem 2 yields the following corollary.

Corollary 5. Let be the block Toeplitz matrix given by (1). If (15) and (16), , , , and are solvable for some integer , in addition, and are both nonsingular, for and for , then is invertible and

Corollary 5 shows how to construct the inverse of a block Toeplitz matrix, provided that the first block column and last block row, along with two successive block columns and block rows of the inverse, are given. More precisely, the choice of gives that and . This means that and . As a consequence, we have the following result.

Corollary 6. Let be the block Toeplitz matrix given by (1). If (15) and (16), , and are solvable, in addition, and are both nonsingular, , then is invertible and

Corollary 6 is a generalization of the well-known formula of Gohberg and Heĭnig. In [13], the authors have shown that the first and last block columns together with the first and last block rows of the inverse are sufficient to construct the inverse . In addition, we decompose the inverses of block Toeplitz matrices as a sum of products of block circulant matrices.

4. Stability Analysis

In the scalar case, the inverse formula (22) in Section 3 is just the well-known formula of Gohberg and Olshevsky; see Theorem 3.1 in [20]. Hence, solving two linear systems are sufficient to reconstruct the inverse of the Toeplitz matrix . Let and be the solutions of (15) and (17), respectively. In the scalar case, formula (22) can be written as where and is an arbitrary number. For convenience, we rewrite as In this section, we want to show that the evaluation of some scalar inverse formulas in Section 3 is forward stable. An algorithm is called forward stable if for all well-conditioned problems, the computed solution is close to the true solution in the sense that the relative error is small. In the matrix computation, roundoff errors occur. Let and ; if we neglect the terms, then for any floating-point arithmetic with machine precision , there are See [22]. According to the floating-point arithmetic, we have the following result.

Theorem 7. Let be a nonsingular Toeplitz matrix given by (1) with and let it be well conditioned then the inversion formula (30) is forward stable for and a given number .

Proof. Assume that we have computed the solutions , in the inverse formula (30) which are perturbed by the normwise relative errors bounded by Let and . Then, we have By using the perturbed solutions , , the inverse formula (30) can be expressed as Here, is the matrix containing the error which results from computing the matrix products, contains the error from subtracting the matrices, and represents the error of the multiplication by . For the error matrices , , , and , we have It follows that After adding all these error bounds, we have Note that and are the solutions of (15) and (17), respectively; then and , where . Thus, the relative error is As is well conditioned, thus, is finite. Obviously, is finite. Therefore, the formula (30) presented in this section is forward stable.

For example, there are two particular cases of the inverse formula (30). When and , the inverse formula , which denotes as a sum of products of skew-circulant and circulant matrices, is numerically forward stable for a well-conditioned and nonsingular Toeplitz matrix . In fact, this is the formula of Ammar and Gader in [21]. And when and , this is the formula of Ammar and Gader in [19]. The inverse formula in which skew-circulant matrices are replaced by upper triangular factors is also forward stable, if is nonsingular and well conditioned.

We also find that even when and a given number , the inverse formula (30) is still numerically forward stable. A similar argument can show that the inverse formulas in Corollaries 5 and 6 for the scalar case are all numerically forward stable when is nonsingular and well conditioned.

5. Concluding Remarks

In this paper, with the help of the block cyclic displacement, the inverses of block Toeplitz matrices are discussed. New formulas for inversion of block Toeplitz matrices are obtained, which can be decomposed in a sum of products of block circulant matrices. In the scalar case, we analyze the stability of the inverse formulas. In addition, it should be remarked that any mention of matrix Padé forms for representing inverses of Toeplitz matrices could also be given in terms of solving block linear systems. Also the proofs of the inverses formulas in [14] could also be extended to prove the inverse formulas in this work.

Acknowledgments

This research is supported by NSFC (61170311), 973 Program (2013CB329404), and Sichuan Province Sci. & Tech. Research Project (2012GZX0080). The authors are grateful to the unknown referee for pertinent comments, especially for suggesting four references, which contributed substantially to this revised work.