On the Solution of a Class of Nonlinear Systems Governed by an M-Matrix

Themistoclakis, Woula; Vecchio, Antonia

doi:https://doi.org/10.1155/2012/412052

Discrete Dynamics in Nature and Society

Research Article | Open Access

Volume 2012 | Article ID 412052 | https://doi.org/10.1155/2012/412052

Woula Themistoclakis¹ and Antonia Vecchio¹

Academic Editor: Rigoberto Medina

Received 15 Nov 2011

Accepted 13 Mar 2012

Published 08 Jun 2012

Abstract

We consider a weakly nonlinear system of the form $(I + \varphi(x)A)x = p$, where $\varphi(x)$ is a real function of the unknown vector $x$, and $(I + \varphi(x)A)$ is an $M$-matrix. We propose to solve it by means of a sequence of linear systems defined by the iteration procedure $(I + \varphi(x_r)A)x_{r+1} = p$, $r = 0, 1, \ldots$. The global convergence is proved by considering a related fixed-point problem.

1. Introduction

In recent years [1, 2], we approached the numerical solution of the following nonstandard integrodifferential problems arising in the study of the kinetic theory of dusty plasmas: whose discretization, by means of a difference scheme and a quadrature rule [2], leads to a particular kind of nonlinear system of equations. Solving it by means of a fixed-point (FP) iteration process, we noted that such a procedure seems to converge globally, that is, it converges independently of the choice of the starting point. Our aim here is to explain the reason for this “nice” behavior. Throughout the paper, the notation , with , means that each element of is nonnegative, whereas means and . The nonlinear system under investigation is $(I + \varphi(x)A)x = p$ (1.2), where $I$ is the $n$-dimensional identity matrix, $x$ is an unknown vector, and we make the following assumptions, which will be sufficient for the global convergence of the iterative process we propose (see Theorem 3.3):
(i) with ;
(ii) $A$ has nonnegative main diagonal and nonpositive off-diagonal entries (-matrix);
(iii) $A$ is row-wise weakly diagonally dominant;
(iv) $\varphi$ is a differentiable function such that , for all and it is homogeneous of degree , that is, for all ;
(v) , where $\nabla\varphi$ denotes the (row vector) gradient of $\varphi$.

Systems of type (1.2) fall into the class of weakly nonlinear systems (see, e.g., [3, 4]) and they also arise from the discretization, by finite-difference methods, of mathematical models less cumbersome than (1.1), as, for instance, the differential problems of the form:

A central role in our results is played by the matrix appearing in (1.2) which, as can be immediately seen from assumptions (ii)–(iv), is an M-matrix (see, e.g., [5, page 137, prop. ]).

The paper is organized as follows. In Section 2, we investigate a particular class of one-variable functions such that the FP iterations converge independently of the choice of the starting value . By exploiting this result jointly with the properties of the matrix , we are led to state that, under the assumptions (i)–(v), the FP iteration procedure globally converges to a solution of (1.2). This constitutes our main result, which is contained in Section 3. Finally, some numerical experiments showing the sharpness of the required conditions and the performance of the iteration scheme are reported in Section 4.

2. The One Variable Case

In order to prove our main theorem, we need the following result on a particular class of numerical sequences.

Theorem 2.1. Let be a one-variable scalar function such that (a) , with , closed and ; (b) continuous; (c) strictly increasing. Then, starting from any , the sequence obtained by the functional iteration process converges to , a fixed point of .

In this section, we are going to prove this theorem.

Denote by the function , then we have Of course, if there exists a point such that , then for all and the statement is trivial. Similarly, if the sequence is ultimately nonnegative (nonpositive), we have that the sequence is ultimately decreasing (increasing) and it trivially converges by hypothesis (a). Therefore, from now on, we assume that the sequence satisfies The following lemma holds.

Lemma 2.2. If and , , then

Proof. Let us prove (2.4) by induction on . Assume , that is, we are assuming that and we want to prove From the first of (2.5) and (2.2), we immediately get (2.6) with , that is, we have and so, from the hypothesis (c), we deduce which, recalling that , implies and, therefore, (2.6) holds with , that is, By using (c) once again, (2.10) gives or equivalently But the second of (2.5) implies , so that from (2.11), we obtain which assures that (2.6) holds for too.
Now, we assume that the lemma holds for , that is, suppose that implies In order to prove the lemma for , assume Of course (2.14) holds, and the second of (2.15) implies But from (2.14) with and (c), we have which in view of (2.16) leads to Hence, , that is, (2.14) holds for too, and the desired result is proved.

Now, let us partition the sequence into two subsequences and defined by

In order to prove Theorem 2.1, we are going to show that these subsequences converge to the same limit. First of all, the following result holds.

Lemma 2.3. The sequences and are strictly increasing and strictly decreasing, respectively.

Proof. Let us prove that the sequence is strictly increasing; the proof for is analogous and is omitted. Consider such that . Then, two cases may occur: (I) or (II) . (I) In this case, it is clear that belongs to the same subsequence and we immediately have (II) From (2.3), there exists such that , that is, we have Then, Lemma 2.2 with assures that which, for , gives The desired result follows from (2.20) and (2.23).

Lemma 2.4. The sequences and provide two separate sets, that is,

Proof. Suppose, for example, and precisely let . If , we get (2.24) directly from the assumption . If , observe that the numerical sequence presents at least one change of sign, in view of (2.19). Let us examine the case of a unique change of sign, by supposing that there exists an integer , with , such that the other cases being analogous.
By Lemma 2.3, we get and by applying Lemma 2.2 with , we deduce Hence, by taking , we conclude

By the previous lemmas, we get the next result which completes the proof of the convergence of the FP iteration process.

Theorem 2.5. For any choice of , the sequence defined in (2.1) converges.

Proof. It is sufficient to prove that the subsequences and defined in (2.19) converge to the same limit, satisfying By the previous lemmas, we partitioned the sequence into two subsequences, and , which are separate, strictly monotone, and bounded sequences. Hence, they converge and Assume, by contradiction, that Let us consider the subsequence of and the subsequence of , such that In other words, and are those elements of and , respectively, whose previous elements, in the main sequence , belong to the other subsequence and respectively, that is, we have that Of course, as is a subsequence of the convergent sequence , it follows that and the same is true for , that is, where and are given in (2.30). On the other hand, recalling that , from (b) and (2.33), we have But by the hypothesis (c), the assumption (2.31) implies , that is, in view of (2.36), we have , which is a contradiction. Thus, the result is achieved.
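The behaviour described by Lemmas 2.3 and 2.4 and by Theorem 2.5 can be observed on a toy example. The sketch below is only an illustration under stated assumptions: the function `f(x) = 2/(1 + x)` on `[0, 2]`, the starting value `0.2`, and the helper name `iterate_and_split` are hypothetical and not taken from the paper, and the iterates are split according to the sign of `f(x) - x`, which is how the partition into two subsequences is read here.

```python
def f(x):
    # Hypothetical test function (not from the paper): continuous and
    # decreasing on [0, 2], mapping [0, 2] into itself, with fixed point x = 1.
    return 2.0 / (1.0 + x)


def iterate_and_split(f, x0, n_steps):
    """Run x_{r+1} = f(x_r) and split the iterates by the sign of f(x_r) - x_r."""
    rising, falling = [], []  # iterates with f(x) > x, and with f(x) < x
    x = x0
    for _ in range(n_steps):
        g = f(x) - x
        if g > 0:
            rising.append(x)
        elif g < 0:
            falling.append(x)
        else:
            break  # x is exactly a fixed point
        x = f(x)
    return rising, falling, x


rising, falling, last = iterate_and_split(f, 0.2, 20)
print("first iterates with f(x) > x:", rising[:3])
print("first iterates with f(x) < x:", falling[:3])
print("last iterate:", last)
```

For this particular `f`, the first list increases, the second decreases, every element of the first stays below every element of the second, and both approach the fixed point 1.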

3. The Solution of the Nonlinear System

In this section, we come back to the nonlinear system (1.2). First of all, we recall that, under the hypotheses (ii)–(iv), for any , the matrix appearing in (1.2) is an M-matrix and it satisfies (see, e.g., [5, prop. ]) Now, we define the function as and we are going to prove that it satisfies the properties stated in the following theorem, where is the homogeneity degree of appearing in (iv) and the classical notation is used.

Theorem 3.1. Assume that (i)–(v) hold. Then, is a continuous, differentiable function, and for any , it satisfies

Proof. Since the inverse of is defined for all , is clearly a continuous function. Moreover, it can be easily seen that hence, (3.4) is immediately true. In view of Theorem A.1 in [6], we observe that (ii) and (iii) ensure Therefore, for any , the vector satisfies and because of (v), we get . Consequently, assertion (3.3) holds taking into account (iv). Finally, again in view of (iv), we have where, recalling (3.6), we have Thus, by (v) and (3.1), we deduce (3.5) from

Using this theorem, we can easily claim that the function in (3.2) has the same characteristics as the function introduced in the previous section.

Corollary 3.2. Assume that (i)–(v) hold. Then, the function verifies all the hypotheses (a)–(c).

Proof. Assertions (a) and (b) directly follow from Theorem 3.1. Moreover, in view of (3.5), (c) is immediately obtained by , recalling that implies strictly increasing.

Now, we are ready to state our main result about the convergence of the sequence defined as

Theorem 3.3. Assume that the hypotheses (i)–(v) hold. Then, the sequence converges to a solution of (1.2).

Proof. Let us start from an arbitrary and put From (3.11), we have that (3.12) can also be written as Therefore, Corollary 3.2 assures that the sequence is convergent. Denoting by its limit, we set , and we obtain the statement by observing that
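As an illustration of the iteration analysed above, the following sketch takes the scheme to be $(I + \varphi(x_r)A)x_{r+1} = p$ and runs it on a small hypothetical instance. Everything specific here is an assumption made for the example and not taken from the paper: $A = \mathrm{tridiag}(-1, 2, -1)$, $\varphi(x) = x_1 + \dots + x_n$ (homogeneous of degree 1 with nonnegative gradient), $p = (1, \dots, 1)$, the stopping rule on the maximum change between iterates, and the helper names `solve_tridiagonal` and `fp_iteration`.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm for a tridiagonal system (no pivoting).

    lower[i] and upper[i] are the sub- and super-diagonal entries of row i;
    lower[0] and upper[-1] do not affect the result.
    """
    n = len(diag)
    c = [0.0] * n  # modified super-diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def phi(x):
    # Hypothetical choice: phi(x) = x_1 + ... + x_n.
    return sum(x)


def fp_iteration(p, x0, tol=1e-12, max_iter=1000):
    """Iterate (I + phi(x_r) A) x_{r+1} = p with A = tridiag(-1, 2, -1)."""
    n = len(p)
    x = list(x0)
    for r in range(1, max_iter + 1):
        t = phi(x)
        # I + t*A is tridiagonal: 1 + 2t on the diagonal, -t next to it.
        x_new = solve_tridiagonal([-t] * n, [1.0 + 2.0 * t] * n, [-t] * n, p)
        if max(abs(a - b) for a, b in zip(x_new, x)) <= tol:
            return x_new, r
        x = x_new
    raise RuntimeError("FP iteration did not converge within max_iter steps")


p = [1.0] * 5
for start in ([0.0] * 5, [10.0] * 5):
    x, n_iter = fp_iteration(p, start)
    print(n_iter, [round(v, 6) for v in x])
```

Each step only requires the scalar `phi(x)` and one linear solve, which is why the vector iteration and its scalar counterpart cost essentially the same per step.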

Remark 3.4. Of course, in view of (3.7), any solution of (1.2) satisfies , but nothing seems to imply its uniqueness. Nevertheless, we conjecture that the solution is unique, because in a large variety of experimental tests (including high-dimensional systems) we never found more than one solution. Of course, a sufficient condition for uniqueness is , for any , which assures , for any , by virtue of (3.4) and (3.7). However, as can be observed in the next section, the solution seems to be unique even when this condition is far from being satisfied.

4. Numerical Experiments

In order to verify our theoretical results and to check the performance of the FP iteration scheme on problem (1.2), we carried out a large variety of numerical experiments. Here, the most significant are reported. From the previous section, it is clear that the iterative scheme (3.11), which furnishes a vector at each iteration, requires the same computational effort as (3.12), which instead provides scalar values. Hence, for the sake of simplicity, all the numerical tests reported here refer to (3.12), that is, we test the iterative procedure: In particular, throughout this section, we assume that convergence is reached whenever and we put .

Let us consider the following two problems: which, as can be easily checked, satisfy the hypotheses of Theorem 3.3. In Tables 1 and 2, we report the number of iterations we performed in order to compute the fixed point of problems (4.2) and (4.3), respectively, for different choices of the starting value . Moreover, we also report the number of iterations performed by the MATLAB routine fzero. This is given by the sum of the number of iterations to find an interval containing the zero (first addend in Table 1) and the number of zero-finding iterations (second addend in Table 1).

From these two tests, we observe that, in agreement with Theorem 3.3, the FP iteration method converges for any in both cases. Of course, when we know the interval where a zero lies, fzero does converge and it requires far fewer iterations, but in many cases this information is unavailable and, see Table 2, fzero fails to provide a value. In fact, in the third line of Table 2, the symbol AB stands for “aborted” and means that the MATLAB routine does not succeed in finding an interval containing a sign change. The last line of Table 2 shows that the best results can be obtained by a combined solution strategy, which starts with a certain number of FP iterations (first addend) and takes the final steps using fzero iterations (second addend). In the fifth column, we take the right balance between FP and fzero iterations, while in the remaining columns, we display the minimum number of FP iterations which are necessary to get a value from fzero. Thus, we can fruitfully exploit the global convergence of our FP iteration method in order to create a more efficient combined strategy with fzero. Of course, this could also be done by means of other well-known methods such as Newton’s method.
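The idea of a combined strategy can be sketched as follows. This is not the procedure used for Table 2: plain bisection is swapped in for MATLAB's fzero, and the bracket is taken from two consecutive FP iterates whose residuals `F(t) - t` have opposite signs. The map `F(t) = 2/(1 + t)`, the starting value, and the name `hybrid_fixed_point` are hypothetical choices made only for this illustration.

```python
def hybrid_fixed_point(F, t0, max_fp=50, tol=1e-12, max_bisect=200):
    """FP steps until g(t) = F(t) - t changes sign, then bisection on g."""
    def g(t):
        return F(t) - t

    a = t0
    for n_fp in range(1, max_fp + 1):
        b = F(a)                    # one FP step
        if g(a) * g(b) <= 0.0:      # consecutive iterates bracket a zero of g
            break
        a = b
    else:
        raise RuntimeError("no sign change found within max_fp FP steps")

    lo, hi = min(a, b), max(a, b)
    n_bis = 0
    while hi - lo > tol and n_bis < max_bisect:
        mid = 0.5 * (lo + hi)
        if g(lo) * g(mid) <= 0.0:   # keep the half that still brackets a zero
            hi = mid
        else:
            lo = mid
        n_bis += 1
    return 0.5 * (lo + hi), n_fp, n_bis


def F(t):
    # Hypothetical scalar map (not one of the paper's test problems).
    return 2.0 / (1.0 + t)


root, n_fp, n_bis = hybrid_fixed_point(F, 50.0)
print(root, n_fp, n_bis)
```

The FP phase is used only to reach a region where a bracket exists; any bracketing root finder could replace the bisection loop.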

In order to verify whether the convergence of the FP procedure could be ascribed to the Banach contraction theorem, we plotted the absolute value of the derivative of the function in (4.1) (not reported) and we noted that the starting values in bold, that is, , in Table 1 and , in Table 2, belong to intervals where . Hence we can assert that, at least in these cases, the convergence of the FP method is not “helped” by the contraction principle.
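A check of this kind can be reproduced numerically. The sketch below is a hedged illustration on a hypothetical map `F(t) = 2/(1 + t)`, not on the paper's problems (4.2) and (4.3): it estimates `|F'(t)|` at the starting value by a central finite difference (the step `h` and the helper name `abs_derivative` are assumptions) and then runs the FP iteration from that same value.

```python
def F(t):
    # Hypothetical scalar map (not one of the paper's test problems).
    return 2.0 / (1.0 + t)


def abs_derivative(F, t, h=1e-6):
    """Central finite-difference estimate of |F'(t)|."""
    return abs(F(t + h) - F(t - h)) / (2.0 * h)


t = 0.0
# For this F the exact derivative at 0 is -2, so F is not a contraction there.
print("|F'(t0)| is approximately", abs_derivative(F, t))
for _ in range(60):
    t = F(t)  # plain FP iteration started where |F'| > 1
print("t after 60 FP steps:", t)
```

For this map the iterates still approach the fixed point 1, even though the derivative at the starting value exceeds 1 in absolute value.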

As we already noted in Remark 3.4, in these numerical experiments and many others, starting from very different values of , the method seems to converge to the same fixed point. For example, in problems (4.2) and (4.3), we obtain and , respectively, for a very large number of initial guesses (much larger than the ones reported in Tables 1 and 2). This numerical evidence leads us to conjecture that the assumptions in Theorem 3.3 are also sufficient to ensure the uniqueness of the solution of problem (1.2).

In Figures 1 and 2, we plotted the first 100 elements of the error sequence , where is computed by applying the FP iterations to the following two problems: It can be easily seen that neither problem satisfies the hypotheses of Theorem 3.3. To be more precise, (4.4) does not fulfill (v), whereas (4.5) does not verify (iv), having . The figures clearly show that the method fails to converge, suggesting that the assumptions of our main theorem are not too restrictive.

Moreover, we point out that, in our experience, some problems arising in the kinetic theory of dusty plasmas are more cumbersome than (1.1), and therefore, the application of Newton or other available iterative procedures is not always simple, convenient, or possible. A comparison between FP and other iteration processes on problems of type (1.1) will be the subject of future investigations. Finally, in order to test the performance of our method for larger systems arising from the applications, we discretized a problem of type (1.1) with stepsize on a sufficiently large interval , with . In this way, we get a nonlinear system of type (1.2), which has dimension . By applying the FP iteration method to such a system for different values of , with , we observe that the number of iterations does not vary with and so with , but only with the choice of the starting point . Moreover, we underline that in this case the linear systems arising from the FP iterations are tridiagonal and therefore very easy to solve.

References

[1] M. Basile, E. Messina, W. Themistoclakis, and A. Vecchio, “Some investigations on a class of nonlinear integro-differential equations on the half line,” Submitted.
[2] M. Basile, E. Messina, W. Themistoclakis, and A. Vecchio, “A numerical method for a class of nonlinear integro-differential equations on the half line,” Submitted.
[3] E. Galligani, “The arithmetic mean method for solving systems of nonlinear equations in finite differences,” Applied Mathematics and Computation, vol. 181, no. 1, pp. 579–597, 2006.
[4] E. Galligani, “On solving a special class of weakly nonlinear finite-difference systems,” International Journal of Computer Mathematics, vol. 86, no. 3, pp. 503–522, 2009.
[5] A. Berman and R. J. Plemmons, Nonnegative Matrices in the Mathematical Sciences, vol. 9 of Classics in Applied Mathematics, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, Pa, USA, 1994.
[6] J. Fuhrmann, “Existence and uniqueness of solutions of certain systems of algebraic equations with off-diagonal nonlinearity,” Applied Numerical Mathematics, vol. 37, no. 3, pp. 359–370, 2001.

Copyright

Copyright © 2012 Woula Themistoclakis and Antonia Vecchio. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
