Continuous Analog of Accelerated OS-EM Algorithm for Computed Tomography

Tateishi, Kiyoko; Yamaguchi, Yusaku; Abou Al-Ola, Omar M.; Yoshinaga, Tetsuya

doi:https://doi.org/10.1155/2017/1564123

Mathematical Problems in Engineering

On this page

Abstract Introduction Numerical Example Conflicts of Interest Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2017 | Article ID 1564123 | https://doi.org/10.1155/2017/1564123

Continuous Analog of Accelerated OS-EM Algorithm for Computed Tomography

Kiyoko Tateishi,¹Yusaku Yamaguchi,²Omar M. Abou Al-Ola,³and Tetsuya Yoshinaga⁴

Academic Editor: Kishin Sadarangani

Received06 Mar 2017

Revised29 May 2017

Accepted20 Jun 2017

Published06 Aug 2017

Abstract

The maximum-likelihood expectation-maximization (ML-EM) algorithm is used for an iterative image reconstruction (IIR) method and performs well with respect to the inverse problem as cross-entropy minimization in computed tomography. For accelerating the convergence rate of the ML-EM, the ordered-subsets expectation-maximization (OS-EM) with a power factor is effective. In this paper, we propose a continuous analog to the power-based accelerated OS-EM algorithm. The continuous-time image reconstruction (CIR) system is described by nonlinear differential equations with piecewise smooth vector fields by a cyclic switching process. A numerical discretization of the differential equation by using the geometric multiplicative first-order expansion of the nonlinear vector field leads to an exact equivalent iterative formula of the power-based OS-EM. The convergence of nonnegatively constrained solutions to a globally stable equilibrium is guaranteed by the Lyapunov theorem for consistent inverse problems. We illustrate through numerical experiments that the convergence characteristics of the continuous system have the highest quality compared with that of discretization methods. We clarify how important the discretization method approximates the solution of the CIR to design a better IIR method.

1. Introduction

In computed tomography (CT), iterative image reconstruction (IIR) [1–4] gives better quality images compared with filtered backprojection, which is a transform reconstruction method based directly on the Radon inversion formula. Among IIR methods, the maximum-likelihood expectation-maximization (ML-EM) [4] algorithm shows a better performance with respect to cross-entropy minimization or likelihood maximization [5, 6] and is most suitable for the reconstruction of CT since its derivation from underlying the Poisson distribution. To accelerate the ML-EM algorithm, which has a drawback of slow convergence, the ordered-subset variation of the ML-EM, known as the ordered-subsets expectation-maximization (OS-EM) [7, 8], is an effective and popular method. In addition to the OS-EM method, a larger power factor that does not cause divergence in the iterative process was introduced [9–13] to further accelerate the convergence rate. It was asserted [12] that a power-based ML-EM algorithm with increased power (step size) resulted in not only accelerating the convergence rate but also maximizing the likelihood values. However, there is currently no general formula for optimizing the convergence rate by choosing appropriate values of the subsets number and the step size, which depend on the tomography system condition.

In this paper, we propose a continuous analog to the power-based accelerated OS-EM, which is based on the approach of continuous-time dynamical optimization [14–19]. The system is described by a switched nonlinear differential equation with piecewise smooth vector fields. Reconstructed image pixels were obtained by using the initial value problem of differential equations describing the dynamical system, for example, a continuous-time image reconstruction (CIR) system. The proposed hybrid dynamical system has a different vector field from that of our previously presented continuous system [20–22]. We indicated that discretizing the differential equations using the geometric multiplicative first-order expansion of the nonlinear vector field leads to the exact same iterative formula of the power-based OS-EM with the scaling parameter as a step size of discretization. Iterative algorithms with the Runge-Kutta (RK), as well as Euler methods that are applied to multiplicatively and additively discretize the CIR system, can be used for image reconstruction. This paper shows that the discretization of a differential equation system can construct an IIR algorithm for solving bound-constrained inverse problems in CT.

The proposed CIR system has an advantage in that the convergence of nonnegatively constrained solutions to a stable equilibrium is globally guaranteed by the Lyapunov stability theorem [23] for consistent inverse problems. Specifically, we prove theoretically that a weighted Kullback-Leibler (KL) divergence [24] between a solution and the equilibrium can become a common Lyapunov function for a continuous OS-EM system with an arbitrary number of subsets and under arbitrary switching signals. A cross-entropy function of the KL-divergence between projection and backprojection monotonically decreases along the solution to the continuous ML-EM system. This means the likelihood function monotonically increases in time.

To study the cross-entropy convergence property in the case of inconsistent inverse problems, we applied numerical simulations using a dataset acquired from a clinical SPECT scanner for the CIR system and three kinds of discretization methods: the Euler discretization or equivalently the power-based OS-EM method, its rescaled method, and the third-order RK discretization method. We see that the convergence of solutions to the CIR system has the highest quality compared with that of other iterative methods. We clarify how important the discretization method approximates the solution of the CIR in designing a better IIR method.

2. System Description

Image reconstruction in CT is a problem to obtain unknown variable for pixel values satisfyingwhere and , respectively, denote the projection and a projection operator representing the discretization of the Radon transform ( and , resp., indicate the set of nonnegative and positive real numbers). If the system in (1) has a nonnegative solution, it is consistent; otherwise, it is inconsistent. The problem can be reduced to finding using an optimization scheme that minimizes an objective function with respect to the system of (1).

Before describing the proposed system, we will prepare a number of definitions and notations. Let and be, respectively, a subvector consisting of partial elements of and a submatrix of with the same corresponding rows of for , with indicating the total number of divisions. We also definewhere is the element of and . To simplify the description, we denote vector-valued functions of each element in vector and of , respectively.

The objective function considered in this paper is expressed in the generalized KL-divergencefor the given nonnegative vectors and with and denoting the th elements of and , respectively, and indicating an all-ones vector. The divergence , known as Csiszár’s -divergence measure [25], for the vectors and in is nonnegative with if and only if .

The proposed method for solving the tomographic inverse problem is based on the use of an initial value problem of the switched nonlinear dynamical system [26, 27], which consists of a family of subsystemsfor , , and a series of times , where and . Note that each subsystem is described by autonomous differential equations with a sufficiently smooth vector field. The solutions to the hybrid dynamical system [27] are constructed through the connection of the last state of the previous th subsystem and the initial state of the next th subsystem at every , for , and setting to be for cyclic switching process.

3. Theoretical Results

In this section, the theoretical results of the solutions to the hybrid dynamical system in (4) are given.

We first show that a solution can be made to stay positive. This property is preserved whether or not the inverse problem is consistent and indicates that the CIR system does not produce images with unphysical negative pixel values.

Proposition 1. If the initial value of the switched dynamical system in (4) is chosen in , its solution behaves in for all .

Proof. The vector field of the th element of the th subsystem can be rewritten as with the function . Therefore, holds for any on the subspace satisfying , which means the subspace becomes invariant, and, on the basis of the uniqueness criteria of solutions to the initial value problem, any flow cannot pass through every invariant subspace. This leads to the proof.

Assuming that the individual subsystems have the common equilibrium satisfying for any , we can prove the existence of a Lyapunov function for all subsystems in (4), which guarantees that the corresponding switched system has a stable equilibrium .

The OS-EM [7, 8] was introduced to accelerate the computation time of ML-EM. However, even in the consistent case, the convergence proof in [7] requires the values of to be independent of the subset , which is referred to as the “subset balance.” This condition is restrictive in practical applications. Byrne [6] presented a more general sufficient condition called “a generalized subset balance,” meaning that is separable; positive values exist such thatwhere . Our convergence proof on the stability of an equilibrium observed in the switched system in (4) needs the same condition as (5).

One of the main results is a stability theorem for the continuous OS-EM system as follows.

Theorem 2. If there exists a unique solution to the system with the matrix satisfying the condition of (5), the equilibrium of the dynamical system in (4) is asymptotically stable.

Proof. This follows from Lyapunov’s stability theorem. We define a possible candidate for a Lyapunov function as a weighted KL-divergence,which is positive definite and is well-defined via Proposition 1 when initial value is chosen in . It can be written asUsing the concavity of the log function and Jensen’s inequality, we then calculate its derivative with respect to the dynamical system in (4) aswhere , and in . The derivative equals zero at . Consequently, the hybrid system consisting of the family of subsystems in (4) has a common Lyapunov function defined by (6), so the equilibrium of the system is asymptotically stable under arbitrary switching signals.

The proof of Theorem 2 can be rephrased as follows. If the system has a unique solution , the objective function in (6) decreases monotonically in time for the solution to the system in (4) with .

We use another Lyapunov function as an objective function that has to be minimized.

Theorem 3. If the system possesses a unique solution , the following objective function decreases monotonically in time for solutions of the system in (4) with and :

Proof. We show that the functionwhich is positive definite for , can be a Lyapunov function. Because for , we can obtain the derivative with respect to the system in (4) with as follows:where and indicate all-ones vectors. By puttingfor simplicity, (11) can be written asFor the first and third terms of the right-hand side in (13), one gets and obtains for the second and fourth terms Therefore,and the Lyapunov function decreases along the flow. This concludes the proof.

4. Discretization

Consider the numerical discretization of the piecewise differential equations describing the CIR hybrid system. The integration of differential equations by discretization via the multiplicative calculus is derived as a multiplicative counterpart of classical Euler and RK methods [28, 29]. The following shows the geometric multiplicative first-order expansion of a nonlinear function that appeared in the vector field. See [29] for details on constructing a multiplicative RK method.

We describe the th equation for the th subsystem in (4) as at for , and nonnegative integer . When applying the multiplicative Euler method to the th subsystem, we obtain a single step at a time:where denotes a step size depending on . For the block continuous-time hybrid system, by choosing and by connecting solutions to the subsystems at the discrete time , where andfor , with being the floor of , we have the block-iterative formThrough this discretization procedure, the discrete time is identical to the switching time with in the hybrid dynamical system under the corresponding special switching signals.

The iterative formula for the continuous-time system in (4) is equivalent to the power-based OS-EMwhere in (20) is substituted with . Here, the collection is a partition or a subset of the index set ; the definition is the same as the derivation of and for . The step size in (20) and (21) corresponds to the scaling parameters in the OS-EM algorithm. Note that because the step size is derived in the discretization procedure of the hybrid dynamical system, its value does not affect the theoretical results of the solutions for the CIR system defined in Section 2.

5. Numerical Example

This section numerically illustrates the convergence property of continuous solutions to the CIR system and iterative solutions to certain IIR procedures derived from its discretizations. We used a set of SPECT projection data available from a publicly accessible database of Monte Carlo simulated datasets for Emission Tomography (the MC-ET database) [30]. The sinogram of a brain scan is shown in Figure 1. The numbers of projections and pixels of the reconstructed images were 7,500 (125 bins and 60 directions in 180 degrees) and ( pixels), respectively. The projection data from 60 directions were divided into nonoverlapping subsets as uniformly as possible with being 2, 4, and 8. When and 4, (5) holds with for ; namely, the subsets into which the angles are uniformly split satisfy the subset balance [7] condition. Whereas, nonuniform division into eight subsets gives an unbalanced dataset. The step size of IIR was set to 2.2 for any . Although bigger step size is effective for accelerating IIR method, the algorithm diverges or oscillates if it is too big [12, 13]. The value of was chosen to characterize the difference between lower- and higher-order discretization methods (i.e., OS-EM and RK, resp.) in the convergence process.

(a)

(b)

The IIR procedures considered here are the OS-EM (or equivalently the multiplicative Euler method) in (21), a rescaled OS-EM, and the multiplicative third-order RK method of the CIR. The rescaled OS-EM was achieved by multiplying the state in (21) by the rescaling coefficientat every th iteration step with ; namely, the mappingwas applied as a postprocess to the OS-EM procedure. The multiplicative RK method with third-order is defined as follows.where with and , for and . To solve the differential equation in (4) for CIR, we used the solver ode45, which is a variable-step RK method in MATLAB (MathWorks, Natick, USA).

Figure 2 plots the objective function in (9), which is a cross-entropy function and a Lyapunov function for the CIR system with , obtained by using CIR, the OS-EM, its rescaling, and the RK at time and the th discrete points for , with the number of subsets being equal to 1, 2, 4, and 8. Since there is no theory available to guarantee the convergence of iterative solutions to IIR with a step size greater than one, the iterative points diverge or oscillate depending on the conditions set, including the selection of . We see that the OS-EM, having the relatively large step size, accumulates an error per iteration because of the insufficient accuracy of the computation by using first-order approximation. As shown in Figure 3, which represents the reconstructed images at , the OS-EM algorithm with and the rescaled OS-EM with failed to reconstruct images corresponding to the divergence of the objective function . However, a further experiment indicated that the iterative sequence generated by the third-order RK algorithm with did not diverge nor oscillate for . One can confirm that the OS-EM and the other iterative procedures originate from the meaning of convergence from the CIR system and the third-order RK has the best robust performance among three IIR algorithms. It is important when designing a better IIR method to choose a discretization method that approximates the solution of the CIR.

(a)

(b)

(c)

(d)

6. Concluding Remarks

We proposed a hybrid dynamical system (CIR system), a multiplicative Euler discretization that is exactly the same as a power-based accelerated OS-EM. We have theoretically shown the positiveness of solutions and the stability of a common equilibrium corresponding to the exact solution of the consistent CT inverse problem. The proof of the convergence to the stable equilibrium was made using the Lyapunov stability theorem. As a result, the common Lyapunov function or the KL-divergence measure monotonically decreases along the time course.

We confirmed through numerical experiments that the convergence characteristics of the CIR system had the highest quality compared with that of any discretization method. This fact illustrates that the OS-EM method and its rescaled method are considered to be derived from discretization of the CIR system.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was partially supported by JSPS KAKENHI Grant no. 15K06110.

References

R. Gordon, R. Bender, and G. T. Herman, “Algebraic reconstruction techniques (ART) for three-dimensional electron microscopy and X-ray photography,” Journal of Theoretical Biology, vol. 29, no. 3, pp. 471–481, 1970.
View at: Publisher Site | Google Scholar
A. C. Kak and M. Slaney, Principles of Computerized Tomographic Imaging, IEEE Service Center, Piscataway, NJ, USA, 1988.
View at: MathSciNet
H. Stark, Ed., Image Recorvery Theorem and Application, Academic Press, Orlando, Fla, USA, 1987.
View at: MathSciNet
L. A. Shepp and Y. Vardi, “Maximum likelihood reconstruction for emission tomography,” IEEE Transactions on Medical Imaging, vol. 1, no. 2, pp. 113–122, 1982.
View at: Publisher Site | Google Scholar
C. L. Byrne, “Iterative image reconstruction algorithms based on cross-entropy minimization,” IEEE Transactions on Image Processing, vol. 2, no. 1, pp. 96–103, 1993.
View at: Publisher Site | Google Scholar
C. Byrne, “Block-iterative algorithms,” International Transactions in Operational Research, vol. 16, no. 4, pp. 427–463, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
H. M. Hudson and R. S. Larkin, “Accelerated image reconstruction using ordered subsets of projection data,” IEEE Transactions on Medical Imaging, vol. 13, no. 4, pp. 601–609, 1994.
View at: Publisher Site | Google Scholar
C. L. Byrne, “Accelerating the EMML algorithm and related iterative algorithms by rescaled block-iterative methods,” IEEE Transactions on Image Processing, vol. 7, no. 1, pp. 100–109, 1998.
View at: Publisher Site | Google Scholar | MathSciNet
S. Vishampayan, J. Stamos, R. Mayans, K. Koral, N. Clinthorne, and W. L. Rogers, “Maximum likelihood image reconstruction for SPECT,” Journal of nuclear medicine, vol. 26, no. 5, p. 20, 1985.
View at: Google Scholar
E. Tanaka, N. Nohara, T. Tomitani, and M. Yamamoto, “Utilization of non-negativity constraints in reconstruction of emission tomograms,” in Information Processing in Medical Imaging, L. B. Stephen, Ed., vol. 1, Springer, Netherlands, pp. 379–393, 1986.
View at: Google Scholar
E. Tanaka, “A fast reconstruction algorithm for stationary positron emission tomography based on a modified em algorithm,” IEEE Transactions on Medical Imaging, vol. 6, no. 2, pp. 98–105, 1987.
View at: Publisher Site | Google Scholar
D. Hwang and G. L. Zeng, “Convergence study of an accelerated ML-EM algorithm using bigger step size,” Physics in Medicine and Biology, vol. 51, no. 2, pp. 237–252, 2006.
View at: Publisher Site | Google Scholar
H.-M. Huang and I.-T. Hsiao, “Accelerating an ordered-subset low-dose X-ray cone beam computed tomography image reconstruction with a power factor and total variation minimization,” PLoS ONE, vol. 11, no. 4, Article ID e0153421, 2016.
View at: Publisher Site | Google Scholar
J. Schropp, “Using dynamical systems methods to solve minimization problems,” Applied Numerical Mathematics. An IMACS Journal, vol. 18, no. 1-3, pp. 321–335, 1995.
View at: Publisher Site | Google Scholar | MathSciNet
R. G. Airapetyan, A. G. Ramm, and A. B. Smirnova, “Continuous analog of the Gauss-Newton method,” Mathematical Models and Methods in Applied Sciences, vol. 9, no. 3, pp. 463–474, 1999.
View at: Publisher Site | Google Scholar | MathSciNet
R. G. Airapetyan and A. G. Ramm, “Dynamical systems and discrete methods for solving nonlinear ill-posed problems,” in Applied mathematics reviews, G. A. Anastassiou, Ed., vol. 1, pp. 491–536, World Scientific Publishing Company, Singapore, 2000.
View at: Publisher Site | Google Scholar | MathSciNet
R. G. Airapetyan, A. G. Ramm, and A. B. Smirnova, “Continuous methods for solving nonlinear ill-posed problems,” in Operator theory and its applications, A. G. Ramm, Shivakumar P. N., and A. V. Strauss, Eds., vol. 25 of Fields Inst Comm, pp. 111–136, American Mathematical Society, Providence, RI, USA, 2000.
View at: Google Scholar | MathSciNet
A. G. Ramm, “Dynamical systems method for solving operator equations,” Communications in Nonlinear Science and Numerical Simulation, vol. 9, no. 4, pp. 383–402, 2004.
View at: Publisher Site | Google Scholar | MathSciNet
L. Li and B. Han, “A dynamical system method for solving nonlinear ill-posed problems,” Applied Mathematics and Computation, vol. 197, no. 1, pp. 399–406, 2008.
View at: Publisher Site | Google Scholar | MathSciNet
K. Fujimoto, O. Abou Al-Ola, and T. Yoshinaga, “Continuous-time image reconstruction using differential equations for computed tomography,” Communications in Nonlinear Science and Numerical Simulation, vol. 15, no. 6, pp. 1648–1654, 2010.
View at: Publisher Site | Google Scholar
O. M. Abou Al-Ola, K. Fujimoto, and T. Yoshinaga, “Common Lyapunov function based on Kullback-Leibler divergence for a switched nonlinear system,” Mathematical Problems in Engineering, Article ID 723509, Art. ID 723509, 12 pages, 2011.
View at: Publisher Site | Google Scholar | MathSciNet
K. Tateishi, Y. Yamaguchi, O. M. Abou Al-Ola, T. Kojima, and T. Yoshinaga, “Continuous analog of multiplicative algebraic reconstruction technique for computed tomography,” in Proceedings of the SPIE, Medical Imaging 2016: Physics of Medical Imaging, USA, March 2016.
View at: Publisher Site | Google Scholar
A. M. Lyapunov, Stability of Motion, Academic Press, New York, NY, USA, 1966.
View at: MathSciNet
S. Kullback and R. A. Leibler, “On information and sufficiency,” Annals of Mathematical Statistics, vol. 22, pp. 79–86, 1951.
View at: Publisher Site | Google Scholar | MathSciNet
I. Csiszár, “Why least squares and maximum entropy? an axiomatic approach to inference for linear inverse problems,” The Annals of Statistics, vol. 19, no. 4, pp. 2032–2066, 1991.
View at: Publisher Site | Google Scholar | MathSciNet
Z. Sun and S. S. Ge, Stability Theory of Switched Dynamical Systems, Communications and Control Engineering Series, Springer-Verlag, London, 2011.
View at: Publisher Site | MathSciNet
R. Goebel, R. G. Sanfelice, and A. R. Teel, Hybrid Dynamical Systems: Modeling, Stability, and Robustness, Princeton University Press, 2012.
View at: MathSciNet
M. Rybaczuk, A. Kȩdzia, and W. Zielinski, “The concept of physical and fractal dimension II, The differential calculus in dimensional spaces,” Chaos, Solitons & Fractals, vol. 12, no. 13, pp. 2537–2552, 2001.
View at: Publisher Site | Google Scholar | MathSciNet
D. Aniszewska, “Multiplicative Runge-Kutta methods,” Nonlinear Dynamics, vol. 50, no. 1-2, pp. 265–272, 2007.
View at: Publisher Site | Google Scholar
I. Castiglioni, I. Buvat, G. Rizzo, M. C. Gilardi, J. Feuardent, and F. Fazio, “A publicly accessible Monte Carlo database for validation purposes in emission tomography,” European Journal of Nuclear Medicine and Molecular Imaging, vol. 32, no. 10, pp. 1234–1239, 2005.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2017 Kiyoko Tateishi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1725

Downloads

695

Citations