Condition Numbers of the Nonlinear Matrix Equation $X^p - A^* e^X A = I$

Chacha, Chacha Stephen; Naqvi, Syed Muhammad Raza Shah

doi:https://doi.org/10.1155/2018/3291867

Journal of Function Spaces


Research Article | Open Access

Volume 2018 | Article ID 3291867 | https://doi.org/10.1155/2018/3291867


Chacha Stephen Chacha¹ and Syed Muhammad Raza Shah Naqvi¹

Academic Editor: Henryk Hudzik

Received 25 Apr 2018

Accepted 19 Jul 2018

Published 01 Aug 2018

Abstract

We explore the condition numbers of the nonlinear matrix equation $X^p - A^* e^X A = I$. Explicit expressions for the normwise, mixed, and componentwise condition numbers are derived, and upper bounds for the mixed and componentwise condition numbers are obtained. The numerical results indicate that our estimates are fairly sharp. Also, the relative upper perturbation bounds give satisfactory results for small perturbations in the input data.

1. Introduction

We consider the nonlinear matrix equation

$$X^p - A^* e^X A = I, \tag{1}$$

where $A$ is a real or complex square matrix, $I$ is an identity matrix, $e^X$ is the matrix exponential function, and $p$ is a positive integer. The basic general form of (1) is , and it occurs in the analysis of ladder networks, dynamic programming, control theory, stochastic filtering, and statistics [1]. Ran and Reurings [2] studied the solutions and perturbation theory for the general matrix equation $X + A^* F(X) A = Q$, where $F$ is a map from the set of all positive semidefinite matrices into a space of complex matrices that satisfies some monotonicity properties. Recently, Gao [3] studied the Hermitian positive definite solution (HPDS) of the nonlinear matrix equation $X - A^* e^X A = I$, which is (1) for $p = 1$, and derived some necessary and sufficient conditions for the existence of the HPDS. In [4], the authors derived explicit expressions for the normwise, mixed, and componentwise condition numbers and their upper bounds for the nonlinear matrix equation , . The authors of [5] presented a perturbation analysis of the matrix equation , for positive integers , and employed Lyapunov majorants and the fixed-point principle to derive both local and nonlocal bounds. For more details about condition numbers, see [6, 7] and the references therein.

To the best of our knowledge, no one has studied the condition numbers of (1). Thus, the objective of this study is to derive the explicit expressions for the normwise, mixed, and componentwise condition numbers as well as the local upper bounds for mixed and componentwise condition numbers of (1). Finally, we give a comparative analysis for the computed condition numbers.

The following notations will be used throughout this paper: stands for normwise condition number; “’’ means equal by definition; stands for a ball with center and radius ; stands for domain of ; denotes the set of all complex Hermitian matrices; if , then means that is positive semidefinite (positive definite) and means that ; the notation (■) stands for spectral radius; signify the transpose conjugate and transpose of matrix , respectively; and denote the Frobenius norm and usual spectral norm, respectively; given and , the Kronecker product is ; the operator is defined by ; means the ratio of the largest singular value to the smallest; stands for the absolute value of .

2. Preliminaries

In this section, we provide useful definitions and lemmas that will be applied in our proofs in the next sections.

Definition 1. The condition number of a matrix function at a point is defined as for any matrix norm. The computed condition number measures the stability or sensitivity of a problem. In this case, the problem is said to be well-structured or well-posed or well-conditioned if the condition number is small and ill-conditioned if the condition number is large, where the definition of large or small condition number is problem dependent.

Definition 2. The Fréchet derivative of a matrix function at a point is a linear operator such that for all . The operator denotes the Fréchet derivative of at in the direction . If such an operator exists, is said to be Fréchet differentiable and
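As an illustration of Definition 2, the sketch below takes the matrix power $f(X) = X^p$ that appears in the equation under study. Its Fréchet derivative in the direction $E$ is $L_f(X, E) = \sum_{k=0}^{p-1} X^k E X^{p-1-k}$ (a standard fact, not quoted from the text), so the remainder $f(X + tE) - f(X) - t L_f(X, E)$ should shrink like $t^2$. The helper names `power_frechet` and `remainder`, the random test matrices, and the use of Python with NumPy are choices made for illustration only.

```python
import numpy as np

def power_frechet(X, E, p):
    """Frechet derivative of f(X) = X^p at X in the direction E:
    L(X, E) = sum_{k=0}^{p-1} X^k E X^(p-1-k)."""
    mp = np.linalg.matrix_power
    return sum(mp(X, k) @ E @ mp(X, p - 1 - k) for k in range(p))

rng = np.random.default_rng(0)
n, p = 4, 3
X = rng.standard_normal((n, n))
E = rng.standard_normal((n, n))
L = power_frechet(X, E, p)

def remainder(t):
    """Frobenius norm of f(X + tE) - f(X) - t L(X, E)."""
    diff = np.linalg.matrix_power(X + t * E, p) - np.linalg.matrix_power(X, p)
    return np.linalg.norm(diff - t * L)

r1, r2 = remainder(1e-2), remainder(1e-3)
# Shrinking t by a factor of 10 should shrink the remainder by roughly 100.
print(r1, r2, r1 / r2)
```

A ratio close to 100 is what a second-order remainder predicts; a ratio close to 10 would indicate that `L` is not the derivative.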

Lemma 3 (see [8], Lemma 4.3.1). Suppose that , , , and . Then,
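Lemma 4.3.1 of Horn and Johnson's Topics in Matrix Analysis is the identity $\operatorname{vec}(AXB) = (B^T \otimes A)\operatorname{vec}(X)$, where vec stacks the columns of a matrix. Assuming this is the statement used as Lemma 3, the following minimal check (sizes and random data chosen arbitrarily) confirms it numerically.

```python
import numpy as np

def vec(M):
    """Stack the columns of M into one long vector (column-major order)."""
    return M.reshape(-1, order="F")

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 4))
X = rng.standard_normal((4, 2))
B = rng.standard_normal((2, 5))

lhs = vec(A @ X @ B)               # vec(AXB), length 3 * 5
rhs = np.kron(B.T, A) @ vec(X)     # (B^T kron A) vec(X)
print(np.allclose(lhs, rhs))
```

Note that the identity relies on column stacking; with NumPy's default row-major flattening the roles of the two Kronecker factors would change.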

Lemma 4 (see [9], p. 178, Theorem 3.3.16(a, b)). Let be given and let . Then the following inequalities hold for the decreasingly ordered singular values of and : (I) , (II)

Lemma 5 (see [10], Theorem 15). Let and . If is an eigenvalue of and is an eigenvalue of , then is an eigenvalue of .
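The property in Lemma 5 is, in its usual form, that every product $\lambda\mu$ of an eigenvalue $\lambda$ of $A$ and an eigenvalue $\mu$ of $B$ is an eigenvalue of $A \otimes B$. A small numerical check, with two triangular matrices picked here only because their eigenvalues can be read off the diagonal:

```python
import numpy as np

A = np.array([[2.0, 1.0], [0.0, 3.0]])   # eigenvalues 2 and 3
B = np.array([[1.0, 4.0], [0.0, 5.0]])   # eigenvalues 1 and 5

# All pairwise products of eigenvalues of A and B.
products = sorted(l * m for l in np.linalg.eigvals(A) for m in np.linalg.eigvals(B))
# Eigenvalues of the Kronecker product.
kron_eigs = sorted(np.linalg.eigvals(np.kron(A, B)).real)

print(products)
print(kron_eigs)
```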

For easy expansion and simplification of matrix polynomials, we need Lemma 6.

Lemma 6 (see [11]). Let denote a set of positive integers including zero and such that . Then,(i), , .(ii), and for .(iii)

3. Normwise, Mixed, and Componentwise Condition Numbers

In this section, we derive explicit expressions for the normwise, mixed, and componentwise condition numbers of (1). To this end, we consider the perturbed equation (6). Replacing by in (6) yields Now, let us make small perturbations in the matrices , and as shown in Subtracting (1) from (8) yields Using Lemma 6, we have where . Because have higher orders of , we omit it and consider only the first term of (10). Then, replacing in (9) by gives Using the fact that is symmetric and is real and applying the operator in (11), we get Combining the terms with vec in (12) yields Then, we have In (14), the term refers to higher order approximation of with respect to and is the vec permutation operator satisfying and it is defined as where and is the column of the identity matrix .
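The vec permutation operator mentioned above can be made concrete. In its common form (assumed here) it is $\Pi = \sum_{i,j} E_{ij} \otimes E_{ij}^T$ with $E_{ij} = e_i e_j^T$, and it satisfies $\Pi \operatorname{vec}(X) = \operatorname{vec}(X^T)$. The sketch below builds $\Pi$ for square matrices and checks that property; the function name `vec_permutation` is hypothetical.

```python
import numpy as np

def vec(M):
    """Stack the columns of M into one long vector."""
    return M.reshape(-1, order="F")

def vec_permutation(n):
    """Pi = sum_{i,j} E_ij kron E_ij^T, where E_ij = e_i e_j^T and
    e_i is the i-th column of the n-by-n identity matrix."""
    I = np.eye(n)
    Pi = np.zeros((n * n, n * n))
    for i in range(n):
        for j in range(n):
            E_ij = np.outer(I[:, i], I[:, j])
            Pi += np.kron(E_ij, E_ij.T)
    return Pi

n = 3
X = np.arange(1.0, n * n + 1).reshape(n, n)
Pi = vec_permutation(n)
print(np.array_equal(Pi @ vec(X), vec(X.T)))
```

Since $\Pi$ is a permutation matrix that swaps index pairs, applying it twice returns the original vector.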

Let us define a map , where represents a real or complex space. We have , where the matrices . According to the implicit function theorem, it is apparent that as , because is a function of .

3.1. Normwise Condition Numbers

In this subsection, we define the two kinds of normwise condition numbers. According to Rice [12], the two kinds of normwise condition numbers of the map are defined by Now, suppose that is differentiable at ; then, using Theorem 4 in [12], we have where is the Fréchet derivative of at . It follows that where Denoting we have

Now, we prove that matrix is nonsingular in Theorem 7.

Theorem 7. Suppose that is the Hermitian positive definite solution of (1) and is a real matrix with such that . Then, is nonsingular.

Proof. Denote Then, we have Using Lemmas 4 and 5, it follows that and From , we know that ; this implies that , which means that 1 < . Therefore, .
Since , we have . Therefore, we conclude that is invertible.
By Theorem 7, since is invertible, we have , and the Kronecker Fréchet derivative

Explicit expressions for the two kinds of normwise condition numbers are derived in Theorem 8.

Theorem 8. Suppose that is the HPD solution of (1) and is nonsingular. Then, ① , ② , where and

Proof. ① Using the fact that and (17), we can easily get ② In this case, we rewrite as , where and ; then we have . It follows that Finally, using (16) and (31) and , we see that

3.2. Mixed and Componentwise Condition Numbers

In this subsection, we derive the explicit expressions for the mixed and componentwise condition numbers of (1). The following distance function is introduced before defining the mixed and componentwise condition numbers. For any vectors Now, let us define a distance function In the rest of this paper, we assume is finite for any pair , and we extend the function to matrices. That is, for any matrices , we have . For , we denote .

Based on the work by Gohberg and Koltracht [13] on mixed, componentwise, and structured condition numbers, we have Definition 9.

Definition 9. Let be a continuous mapping defined on an open set such that and for a given .
(I) The mixed condition number of at is defined by
(II) The componentwise condition number of at is defined by
If is Fréchet differentiable at the point , then the explicit expressions of the mixed and componentwise condition numbers of at are given by Lemma 10.

Lemma 10 (see [13, 14]). Assume is Fréchet differentiable at the point . Then we have (1) if , then (2) if such that , for , then

Now, we derive the explicit expressions for the mixed and componentwise condition numbers and their upper bounds in Theorem 11.
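Before turning to Theorem 11, a small finite-dimensional example may help fix ideas. In the form commonly quoted from [13, 14] (assumed here), the Fréchet-derivative expressions are $m(F, a) = \| |DF(a)|\,|a| \|_\infty / \|F(a)\|_\infty$ for the mixed condition number and $c(F, a) = \| |DF(a)|\,|a| / |F(a)| \|_\infty$ (entrywise division) for the componentwise one, where $F(a)$ has no zero component. The toy map $F(a) = (a_1 + a_2,\; a_1 - a_2)$ and the helper name `mixed_and_componentwise` are illustrative choices, not taken from the paper.

```python
import numpy as np

def mixed_and_componentwise(J, a, Fa):
    """J is the Jacobian DF(a), a the input vector, Fa = F(a).
    Fa is assumed to have no zero entry."""
    s = np.abs(J) @ np.abs(a)                  # |DF(a)| |a|
    mixed = np.max(s) / np.max(np.abs(Fa))     # ratio of infinity norms
    componentwise = np.max(s / np.abs(Fa))     # entrywise ratio, then max
    return mixed, componentwise

# F(a) = (a1 + a2, a1 - a2) is linear, so its Jacobian is constant.
J = np.array([[1.0, 1.0], [1.0, -1.0]])
a = np.array([1.0, 1.0 - 1e-6])               # nearly equal entries
Fa = J @ a

m, c = mixed_and_componentwise(J, a, Fa)
print(m, c)
```

The second output component $a_1 - a_2$ suffers cancellation, so the componentwise number is large while the mixed number stays at 1. This is the same qualitative gap between the two measures that the numerical experiments report.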

Theorem 11. Let be the Hermitian positive definite solution of (1). Define the mapping
(I) Let denote the mixed condition number defined by (35). Then has the explicit expression where
(II) Let denote the componentwise condition number defined by (36). Then, has the explicit expression where is the same as in item (I).
Moreover, we define two simple upper bounds for and given by and respectively.

Proof. We first prove (I) using of Lemma 10. In this case, for , we obtain that the mixed condition number is . It follows from (27) that where It also holds that So it follows from (45) that the upper bound of is given by Now we prove item (II) for the componentwise condition number of (1). Using (27) together with item of Lemma 10 yields where is the same as in item (I).
Likewise, to estimate the upper bound of , we have This completes our proof.

4. Numerical Experiments

In this section, we provide some numerical examples and results. Our tests were carried out in MATLAB mark 22.0 on an Intel(R) Core(TM) i3-4005U CPU @ 1.70GHz with a 64-bit operating system. Four examples are considered. In Example 1, we evaluate the normwise, mixed, and componentwise condition numbers for tridiagonal matrices of different sizes. In Example 2, two cases are considered: in the first case, a badly scaled input matrix is considered and the three kinds of condition numbers are computed; in the second case, a well-known doubly stochastic matrix of different sizes is considered and the computed condition numbers are compared. In Example 3, we make some small random perturbations in the matrices and and evaluate the local upper perturbation bounds for the computed condition numbers. In Example 4, we consider a symmetric matrix with some specified perturbations in the matrices and and evaluate the condition numbers and their local upper perturbation bounds.

In each example a comparison table for the computed condition numbers is provided and a general remark is provided for all results.

For , we define , , , and , are solutions of (1) and (6), respectively. The solutions and are computed by a fixed point algorithm. We obtain the local normwise, mixed, and componentwise condition numbers as follows. , , and . Then, we also define the relative mixed and componentwise local upper perturbation bounds as and , respectively.

Here, we propose a fixed point algorithm to compute the solutions and .

Fixed Point Algorithm
(I) Input an $n \times n$ matrix $A$ with , tolerance error tol = eps, and an initial guess , where $n$ is the size of matrix $A$ and eps is the standard machine precision.
(II) For , compute and relative residual
(III) Exit the loop if res . Otherwise, go to step (II).
(IV) Display the solution .
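One natural way to realise steps (I) to (IV) for the equation $X^p - A^* e^X A = I$ with a real matrix $A$ is the iteration $X_{k+1} = (I + A^T e^{X_k} A)^{1/p}$, obtained by solving the equation for $X^p$. This update, the starting guess $X_0 = I$, the residual normalisation, the tolerance 1e-12 (instead of eps), the iteration cap, and the test matrix are all assumptions of this sketch rather than details taken from the algorithm above. Matrix functions of symmetric matrices are evaluated through an eigendecomposition.

```python
import numpy as np

def sym_fun(S, f):
    """Apply the scalar function f to a real symmetric matrix S
    through its eigendecomposition S = V diag(w) V^T."""
    w, V = np.linalg.eigh(S)
    return (V * f(w)) @ V.T

def residual(X, A, p):
    """Relative residual ||X^p - A^T e^X A - I||_F / ||X^p||_F
    (this normalisation is an assumption)."""
    Xp = sym_fun(X, lambda w: w ** p)
    R = Xp - A.T @ sym_fun(X, np.exp) @ A - np.eye(len(A))
    return np.linalg.norm(R) / np.linalg.norm(Xp)

def fixed_point(A, p, tol=1e-12, max_iter=500):
    """Iterate X <- (I + A^T e^X A)^(1/p) starting from X = I."""
    n = len(A)
    X = np.eye(n)
    for k in range(max_iter):
        M = np.eye(n) + A.T @ sym_fun(X, np.exp) @ A
        M = (M + M.T) / 2                      # remove rounding asymmetry
        X = sym_fun(M, lambda w: w ** (1.0 / p))
        if residual(X, A, p) <= tol:
            break
    return X, k + 1

# Small, well-scaled test matrix (spectral norm below 0.2).
A = 0.05 * (2 * np.eye(4) + np.eye(4, k=1) + np.eye(4, k=-1))
p = 2
X, iters = fixed_point(A, p)
print(iters, residual(X, A, p))
```

The iteration is only expected to converge when $A$ is small enough for the map to be a contraction; for larger $A$ the loop simply stops at `max_iter` without a valid solution, so the returned residual should always be checked.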

Example 1. We consider the tridiagonal matrix , where .

The matrix is generated by the MATLAB command “full(gallery('tridiag', n, 1, 2, 1))”, where $n$ denotes the size of matrix . For and using the fixed point algorithm, we evaluate the solution of (1) and compute the relative normwise, mixed, and componentwise condition numbers. A summary of the results is recorded in Table 1.
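For readers working outside MATLAB, the tridiagonal test matrix with subdiagonal 1, diagonal 2, and superdiagonal 1 can be built as follows. This is a NumPy equivalent written for illustration (the function name `tridiag` is hypothetical), and any scaling applied to the matrix in Example 1 is not reproduced here.

```python
import numpy as np

def tridiag(n, sub=1.0, diag=2.0, sup=1.0):
    """Dense n-by-n tridiagonal matrix with constant sub-, main and
    superdiagonal entries."""
    return (np.diag(np.full(n - 1, sub), -1)
            + np.diag(np.full(n, diag))
            + np.diag(np.full(n - 1, sup), 1))

T = tridiag(4)
print(T)
```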

Example 2.
(I) We consider (1) with . Then, we evaluate the normwise, mixed, and componentwise condition numbers for different -values. We employ our proposed fixed point method with =eye(). A summary of results is displayed in Table 2.
(II) We consider (1) with a well-known doubly stochastic matrix which has applications in communication theory and graph theory [15]. We generate a doubly stochastic matrix such that , where magic, sum, and is the size of . Then, we apply our proposed fixed point method with =eye(). A summary of results is recorded in Table 3.
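The construction in case (II) rests on the fact that every row and column of a magic square has the same sum, so dividing by that sum gives a doubly stochastic matrix. The sketch below assumes this is the normalisation intended; NumPy has no counterpart of MATLAB's `magic`, so a simple Siamese-method generator for odd sizes only is used (the name `magic_odd` is hypothetical).

```python
import numpy as np

def magic_odd(n):
    """Magic square of odd order n built with the Siamese method:
    start in the middle of the top row, move up and to the right,
    and step down one cell whenever the target cell is occupied."""
    if n % 2 == 0:
        raise ValueError("this sketch only handles odd n")
    M = np.zeros((n, n), dtype=int)
    i, j = 0, n // 2
    for k in range(1, n * n + 1):
        M[i, j] = k
        i2, j2 = (i - 1) % n, (j + 1) % n
        if M[i2, j2]:
            i2, j2 = (i + 1) % n, j
        i, j = i2, j2
    return M

n = 5
M = magic_odd(n)
A = M / M[0].sum()          # divide by the common row/column sum
print(A.sum(axis=0), A.sum(axis=1))
```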

Remark. In Table 2, the numerical result indicates that the mixed condition number is much smaller than the normwise condition number.

The fixed point iteration for (1) converges to a Hermitian positive definite solution if , as considered in Theorem 7 and in our proposed fixed point method. In our trials with such matrices, the componentwise condition number was not smaller than the normwise condition number. Only the mixed condition number gave the best (smallest) values.

From Table 3, all the computed condition numbers decrease as the size of the doubly stochastic matrix and the -values increase.

Example 3. In this example, we use the same matrix as in Example 1 for and . We also set and as the perturbations in the matrices and , respectively. and are used as they were previously defined and is a positive integer.

The proposed fixed point algorithm is used to obtain the solutions of (1) and (6), then a summary of results is recorded in Table 4.

Example 4. In this example, we consider (1) and (6) in which and We also suppose that the perturbations in and are where is a positive integer. Then, using the proposed fixed point algorithm, we evaluate the solutions and of (1) and (6), respectively. A summary of results is recorded in Table 5.

Remarks
(I) In Table 1, both the mixed and normwise condition numbers indicate that our nonlinear matrix equation is well-conditioned, since the computed condition numbers are very small for the different matrix sizes used in the experiment. However, the componentwise condition number shows that our equation is ill-conditioned. Moreover, the componentwise and normwise condition numbers increase as the matrix size increases, whereas the mixed condition number remains constant.
(II) In Tables 4 and 5, all the computed condition numbers are fairly sharp, and the local upper perturbation bounds for the mixed and componentwise condition numbers exist, as expected.
(III) In Tables 4 and 5, all condition numbers decrease as the perturbations in the matrices and decrease.

5. Conclusion

In this paper, we studied the normwise, mixed, and componentwise condition numbers of (1) and derived explicit expressions for them. Local upper bounds for the mixed and componentwise condition numbers exist, as expected. A comparative analysis of the studied condition numbers was carried out. The componentwise condition number showed the worst results among the computed condition numbers, as shown in Tables 1 and 3. In general, (1) seems to be well-conditioned, since the computed condition numbers are relatively small. The results in Tables 1 to 5 indicate that the mixed condition number can reveal the true sensitivity of the problem when its input data are badly scaled or sparse.

Data Availability

Due to the nature of our subject, all the necessary steps are included in the submitted manuscript. However, if more details are required, we will provide them.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors would like to thank Professor Hyun-Min Kim for his remarkable guidance during this research work. This work was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korean Government (MSIP) (NRF-2017R1A5A1015722).

References

[1] R. Loxton, K. L. Teo, and V. Rehbock, “Positive definite solutions of the nonlinear matrix equation ,” Applied Mathematics and Computation, vol. 217, no. 14, pp. 9182–9188, 2011.
[2] A. C. Ran and M. C. Reurings, “On the nonlinear matrix equation $X + A^* F(X) A = Q$: solutions and perturbation theory,” Linear Algebra and Its Applications, vol. 346, pp. 15–26, 2002.
[3] D. Gao, “On Hermitian positive definite solutions of the nonlinear matrix equation $X - A^* e^X A = I$,” Applied Mathematics and Computation, vol. 50, no. 1, pp. 109–116, 2016.
[4] I. P. Popchev, M. M. Konstantinov, P. H. Petkov, and V. A. Angelova, “Norm-wise, mixed and component-wise condition numbers of matrix equation , , ,” International Journal of Applied and Computational Mathematics, vol. 13, no. 1, pp. 18–30, 2014.
[5] M. Konstantinov, P. Petkov, I. Popchev, and V. Angelova, “Perturbation bounds for the matrix ,” Comptes Rendus de l’Académie Bulgare des Sciences, vol. 61, no. 9, pp. 1111–1120, 2008.
[6] I. G. Ivanov, “Perturbation analysis for solutions of ,” Linear Algebra and Its Applications, vol. 395, pp. 313–331, 2005.
[7] S. Wang, H. Yang, and H. Li, “Condition numbers for the nonlinear matrix equation and their statistical estimation,” Linear Algebra and Its Applications, vol. 482, pp. 221–240, 2015.
[8] R. A. Horn and C. R. Johnson, Topics in Matrix Analysis, Cambridge University Press, Cambridge, UK, 1991.
[9] R. A. Horn and C. R. Johnson, Matrix Analysis, Cambridge University Press, Cambridge, UK, 1991.
[10] B. J. Broxson, “The Kronecker Product,” UNF, Theses and Dissertations, Paper 25, http://digitalcommons.unf.edu/etd/25, 2006.
[11] J.-H. Seo and H.-M. Kim, “Convergence of pure and relaxed Newton methods for solving a matrix polynomial equation arising in stochastic models,” Linear Algebra and Its Applications, vol. 440, pp. 34–49, 2014.
[12] J. R. Rice, “A theory of condition,” SIAM Journal on Numerical Analysis, vol. 3, pp. 287–310, 1966.
[13] I. Gohberg and I. Koltracht, “Mixed, componentwise, and structured condition numbers,” SIAM Journal on Matrix Analysis and Applications, vol. 14, no. 3, pp. 688–704, 1993.
[14] F. Cucker, H. Diao, and Y. Wei, “On mixed and componentwise condition numbers for Moore-Penrose inverse and linear least squares problems,” Mathematics of Computation, vol. 76, no. 258, pp. 947–963, 2007.
[15] R. A. Brualdi, “Some applications of doubly stochastic matrices,” Linear Algebra and Its Applications, vol. 107, pp. 77–100, 1988.

Copyright

Copyright © 2018 Chacha Stephen Chacha and Syed Muhammad Raza Shah Naqvi. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
