A Numerical Approach to Solving an Inverse Heat Conduction Problem Using the Levenberg-Marquardt Algorithm

Min, Tao; Chen, Xing; Sun, Yao; Huang, Qiang

doi:https://doi.org/10.1155/2014/626037

Mathematical Problems in Engineering

On this page

Abstract Introduction Conclusions Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2014 | Article ID 626037 | https://doi.org/10.1155/2014/626037

A Numerical Approach to Solving an Inverse Heat Conduction Problem Using the Levenberg-Marquardt Algorithm

Tao Min,^1,2Xing Chen,¹Yao Sun,¹and Qiang Huang²

Academic Editor: Hua Fan

Received14 Apr 2014

Accepted08 Jun 2014

Published24 Jun 2014

Abstract

This paper is intended to provide a numerical algorithm involving the combined use of the Levenberg-Marquardt algorithm and the Galerkin finite element method for estimating the diffusion coefficient in an inverse heat conduction problem (IHCP). In the present study, the functional form of the diffusion coefficient is unknown a priori. The unknown diffusion coefficient is approximated by the polynomial form and the present numerical algorithm is employed to find the solution. Numerical experiments are presented to show the efficiency of the proposed method.

1. Introduction

The numerical solution of the inverse heat conduction problem (IHCP) requires determining diffusion coefficient from additional information. Inverse heat conduction problems have many applications in various branches of science and engineering; mechanical and chemical engineers, mathematicians, and specialists in many other science branches are interested in inverse problems, each with different application in mind [1–15].

In this work, we propose an algorithm for numerical solving of an inverse heat conduction problem. The algorithm is based on the Galerkin finite element method and Levenberg-Marquardt algorithm [16, 17] in conjunction with the least-squares scheme. It is assumed that no prior information is available on the functional form of the unknown diffusion coefficient in the present study; thus, it is classified as the function estimation in inverse calculation. Run the numerical algorithm to solve the unknown diffusion coefficient which is approximated by the polynomial form. The Levenberg-Marquardt optimization is adopted to modify the estimated values.

The plan of this paper is as follows. In Section 2, we formulate a one-dimensional IHCP. In Section 3, the numerical algorithm is derived. Calculation of sensitivity coefficients will be discussed in Section 4. In order to discuss some numerical aspects, two examples are given in Section 5. Section 6 ends this paper with a brief discussion on some numerical aspects.

2. Description of the Problem

The mathematical formulation of a one-dimensional heat conduction problem is given as follows: with the initial condition and Dirichlet boundary conditions where , , , , and are continuous known functions. We consider the problem (1)–(3) as a direct problem. As we all know, if , , are continuous functions and is known, the problem (1)–(3) has a unique solution.

For the inverse problem, the diffusion coefficient is regarded as being unknown. In addition, an overspecified condition is also considered available. To estimate the unknown coefficient , the additional information on the boundary , , is required. Let the taken at over the time period be denoted by It is evident that for an unknown function , the problem (1)–(3) is underdetermined and we are forced to impose additional information (4) to provide a unique solution pair to the inverse problem (1)–(4).

We note that the measured overspecified condition should contain measurement errors. Therefore, the inverse problem can be stated as follows: by utilizing the above-mentioned measured data, estimate the unknown function .

In this work, the polynomial form is proposed for the unknown function before performing the inverse calculation. Therefore, is approximated as where are constants which remain to be determined simultaneously. The unknown coefficients can be determined by using least-squares method. The error in the estimate is to be minimized. Here, are the calculated results. These quantities are determined from the solution of the direct problem which is given previously by using an approximated for the exact . The estimated values of , , are determined until the value of is minimum. Such a norm can be written as where denotes the vector of unknown parameters and the superscript above denotes transpose. The vector is given by is real-valued bounded function defined on a closed bounded domain . The function may have many local minima in , but it has only one global minimum. When and have some attractive properties, for instance, being a differentiable concave function and being a convex region, then a local maximum problem can be solved explicitly by mathematical programming methods.

3. Overview of the Levenberg-Marquardt Method

The Levenberg-Marquardt method, originally devised for application to nonlinear parameter estimation problems, has also been successfully applied to the solution of linear ill-conditioned problems. Such a method was first derived by Levenberg (1944) by modifying the ordinary least-squares norm. Later Marquardt (1963) derived basically the same technique by using a different approach. Marquardt’s intention was to obtain a method that would tend to the Gauss method in the neighborhood of the minimum of the ordinary least-squares norm and would tend to the steepest descent method in the neighborhood of the initial guess used for the iterative procedure.

To minimize the least-squares norm (7), we need to equate to zero the derivatives of with respect to each of the unknown parameters ; that is, Let us introduce the sensitivity or Jacobian matrix, as follows:or The elements of the sensitivity matrix are called the sensitivity coefficients, and the results of differentiation (9) can be written down as follows: For linear inverse problem, the sensitivity matrix is not a function of the unknown parameters. Equation (12) can be solved then in explicit form as follows: In the case of a nonlinear inverse problem, the matrix has some functional dependence on the vector . The solution of (12) requires an iterative procedure, which is obtained by linearizing the vector with a Taylor series expansion around the current solution at iteration . Such a linearization is given by where and are the estimated temperatures and the sensitivity matrix evaluated at iteration , respectively. Equation (14) is substituted into (13) and the resulting expression is rearranged to yield the following iterative procedure to obtain the vector of unknown parameters : The iterative procedure given by (15) is called the Gauss method. Such method is actually an approximation for the Newton (or Newton-Raphson) method. We note that (13) and the implementation of the iterative procedure given by (15) require the matrix to be nonsingular, or where is the determinant.

Formula (16) gives the so-called identifiability condition; that is, if the determinant of is zero, or even very small, the parameters , for , cannot be determined by using the iterative procedure of (15).

Problems satisfying are denoted as ill-conditioned. Inverse heat transfer problems are generally very ill-conditioned, especially near the initial guess used for the unknown parameters, creating difficulties in the application of (13) or (15). The Levenberg-Marquardt method alleviates such difficulties by utilizing an iterative procedure in the form where is a positive scalar named damping parameter and is a diagonal matrix.

The purpose of the matrix term is to damp oscillations and instabilities due to the ill-conditioned character of the problem, by making its components large as compared to those of if necessary. is made large in the beginning of the iterations, since the problem is generally ill-conditioned in the region around the initial guess used for iterative procedure, which can be quite far from the exact parameters. With such an approach, the matrix is not required to be nonsingular in the beginning of iterations and the Levenberg-Marquardt method tends to the steepest descent method; that is, a very small step is taken in the negative gradient direction. The parameter is then gradually reduced as the iteration procedure advances to the solution of the parameter estimation problem, and then the Levenberg-Marquardt method tends to the Gauss method given by (15). The following criteria were suggested in literature [13] to stop the iterative procedure of the Levenberg-Marquardt method given by (17): where , , and are user prescribed tolerances and denotes the Euclidean norm. The criterion given by (18) tests if the least-squares norm is sufficiently small, which is expected in the neighborhood of the solution for the problem. Similarly, (19) checks if the norm of the gradient of is sufficiently small, since it is expected to vanish at the point where is minimum. The last criterion given by (20) results from the fact that changes in the vector of parameters are very small when the method has converged. Generally, these three stopping criteria need to be tested and the iterative procedure of the Levenberg-Marquardt method is stopped if any of them is satisfied.

Different versions of the Levenberg-Marquardt method can be found in the literature, depending on the choice of the diagonal matrix and on the form chosen for the variation of the damping parameter . In this paper, we choose as Suppose that the vectors of temperature measurements are given at times , , and an initial guess is available for the vector of unknown parameters . Choose a value for , say , and . Then, consider the following.

Step 1. Solve the direct problem (1)–(3) with the available estimate in order to obtain the vector .

Step 2. Compute from (7).

Step 3. Compute the sensitivity matrix from (11) and then the matrix from (21), by using the current value of .

Step 4. Solve the following linear system of algebraic equations, obtained from (17).
in order to compute .

Step 5. Compute the new estimate as .

Step 6. Solve the exact problem (1)–(3) with the new estimate in order to find . Then compute .

Step 7. If , replace by and return to Step 4.

Step 8. If , accept the new estimate and emplace by .

Step 9. Check the stopping criteria given by (18). Stop the iterative procedure if any of them is satisfied; otherwise, replace by and return to Step 3.

4. Calculation of Sensitivity Coefficients

Generally, there have been two approaches for determining the gradient; the first is a discretize-then-differentiate approach and the second is a differentiate-then-discretize approach.

The first approach is to approximate the gradient of the functional by a finite difference quotient approximation, but, in general, we cannot determine the sensitivities exactly, so this method may lead to larger error.

Here, we intend to use differentiate-then-discretize approach which we refer to as the sensitivity equation method. This method can be determined more efficiently with the help of the sensitivities

We first differentiate the flow system (1)–(3) with respect to each of the design parameters , to obtain the continuous sensitivity systems: for There are () equations; we can make them in one system equation and use the finite element methods to solve the system of equation. Here, we give the vector form of the equation as follows: where

We use the Galerkin finite element method approximation for discretizing problem (24). For this, we multiply (24) by a test function , and integrate the obtained equation in space form to . We obtain the following equation: integrating by parts gives We can change the first derivative in time and the integral. We have , because . This leads to an equivalent problem : , find satisfying for all . To simplify the notation, we use the scalar product in : We also can define the following bilinear form.Finally, we obtain with this notation the weak problem of :

4.1. Space Discretization with the Galerkin Method

In this section, we search a semidiscrete approximation of the weak problem , using the Galerkin finite element method. This leads to a first-order Cauchy problem in time.

Let be an -dimensional subspace of and . Then, the following problem is an approximation of the weak problem; find that satisfies for all , where .

The choice of is completely arbitrary. So, we can choose it the way that, for later treatment, it will be as easy as possible. For example, we subdivide the interval into partitions of equal distances : Note that the finite dimension allows us to build a finite base for the corresponding space. In the case of , we have , where .

Consider while we add for the two functions and defined as so that we can write as a linear combination of the basic elements: where and . Knowing that is bilinear form and that (32) is valid for each element of the base , we obtain This equation can be written in a vector form. For this, we define the vectors , , and with components and matrices and as Note that , , and . So that (37) is equal to the Cauchy problem the Crank-Nicolson method can be applied to (40) at time , resulting in where , , .

Equation (41) can be written in simple form as The algebraic system (42) is solved by Gauss elimination method.

5. Numerical Experiment

In this section, we are going to demonstrate some numerical results for in the inverse problem (1)–(4). Therefore, the following examples are considered and the solutions are obtained.

Example 1. Consider (1)–(3) with We obtain the unique exact solution We take the observed data as The unknown function is defined as the following form: where , , and are unknown coefficients.

Table 1 shows how the Levenberg-Marquardt algorithm can find the best parameters after 12 iterations when it is initialized in four different points.

Figures 1, 2, 3, and 4 show the fitness of the estimated parameters and the rate of convergence.

Figures 5, 6, 7, and 8 show the comparison between the inversion results and the exact value .

Table 2 shows the values of and in with all the initial values being set to 1.

Example 2. Consider (1)–(3) with We obtain the unique exact solution We take the observed data as The unknown function is defined as the following form.
, where are unknown coefficients.

Table 3 shows how the Levenberg-Marquardt algorithm can find the best parameters after 20 iterations when it is initialized in four different points.

Figures 9, 10, 11, and 12 show the fitness of the estimated parameters and the rate of convergence.

Figures 13, 14, 15, and 16 show the comparison between the inversion results and the exact value .

Table 4 shows the values of and in with all the initial values being set to 1.

6. Conclusions

A numerical method to estimate the temperature and the coefficient is proposed for an IHCP and the following results are obtained.(1)The present study successfully applies the numerical method involving the Levenberg-Marquardt algorithm in conjunction with the Galerkin finite element method to an IHCP.(2)From the illustrated example, it can be seen that the proposed numerical method is efficient and accurate to estimate the temperature and the coefficient .

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The work of the author is supported by the Special Funds of the National Natural Science Foundation of China (nos. 51190093 and 51179151). The author would like to thank the referees for constructive suggestions and comments.

References

A. Shidfar and G. R. Karamali, “Numerical solution of inverse heat conduction problem with nonstationary measurements,” Applied Mathematics and Computation, vol. 168, no. 1, pp. 540–548, 2005.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
A. Shidfar, G. R. Karamali, and J. Damirchi, “An inverse heat conduction problem with a nonlinear source term,” Nonlinear Analysis: Theory, Methods & Applications, vol. 65, no. 3, pp. 615–621, 2006.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
A. Shidfar and R. Pourgholi, “Numerical approximation of solution of an inverse heat conduction problem based on Legendre polynomials,” Applied Mathematics and Computation, vol. 175, no. 2, pp. 1366–1374, 2006.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
A. Shidfar and H. Azary, “An inverse problem for a nonlinear diffusion equation,” Nonlinear Analysis: Theory, Methods & Applications, vol. 28, no. 4, pp. 589–593, 1997.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
K. Kurpisz and A. J. Nowak, “BEM approach to inverse heat conduction problems,” Engineering Analysis with Boundary Elements, vol. 10, no. 4, pp. 291–297, 1992.
View at: Google Scholar
H. Han, D. B. Ingham, and Y. Yuan, “The boundary element method for the solution of the backward heat conduction equation,” Journal of Computational Physics, vol. 116, no. 2, pp. 292–299, 1995.
View at: Publisher Site | Google Scholar
J. Skorek, “Applying the least squares adjustment technique for solving inverse heat conduction problems,” in Proceedings of the 8th Conference on Numerical Methods in Laminar and Turbulent Flow, C. Taylor, Ed., pp. 189–198, Pinerdage Press, Swansea, U.K, August 1993.
View at: Google Scholar
R. Pasquetti and C. le Niliot, “Boundary element approach for inverse heat conduction problems: application to a bidimensional transient numerical experiment,” Numerical Heat Transfer B: Fundamentals, vol. 20, no. 2, pp. 169–189, 1991.
View at: Google Scholar
D. B. Ingham and Y. Yuan, “The solution of a nonlinear inverse problem in heat transfer,” IMA Journal of Applied Mathematics, vol. 50, no. 2, pp. 113–132, 1993.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
E. Hensel, Inverse Theory and Applications for Engineers, Prentice Hall, New Jersey, NJ, USA, 1991.
M. N. Özisik, Inverse Heat Transfer: Fundamentals and Applications, Taylor & Francis, New York, NY, USA, 2000.
R. Pourgholi, N. Azizi, Y. S. Gasimov, F. Aliev, and H. K. Khalafi, “Removal of numerical instability in the solution of an inverse heat conduction problem,” Communications in Nonlinear Science and Numerical Simulation, vol. 14, no. 6, pp. 2664–2669, 2009.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
A. Shidfar, R. Pourgholi, and M. Ebrahimi, “A numerical method for solving of a nonlinear inverse diffusion problem,” Computers & Mathematics with Applications, vol. 52, no. 6-7, pp. 1021–1030, 2006.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
J. Wang and N. Zabaras, “A Bayesian inference approach to the inverse heat conduction problem,” International Journal of Heat and Mass Transfer, vol. 47, no. 17-18, pp. 3927–3941, 2004.
View at: Publisher Site | Google Scholar
M. Dehghan, “Determination of an unknown parameter in a semi-linear parabolic equation,” Mathematical Problems in Engineering, vol. 8, no. 2, pp. 111–122, 2002.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
J. Lihua and M. Changfeng, “On L-M method for nonlinear equations,” Journal of Mathematics, vol. 29, no. 3, pp. 253–259, 2009.
View at: Google Scholar
P. Chen, “Why not use the Levenberg-Marquardt method for fundamental matrix estimation?” The Institution of Engineering and Technology, vol. 4, no. 4, pp. 286–288, 2010.
View at: Google Scholar

Copyright

Copyright © 2014 Tao Min et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1788

Downloads

1173

Citations