International Journal of Antennas and Propagation
Volume 2014 (2014), Article ID 495057, 8 pages
http://dx.doi.org/10.1155/2014/495057
Research Article

An Efficient Algorithm for EM Scattering from Anatomically Realistic Human Head Model Using Parallel CG-FFT Method

Center for Computational Science and Engineering, School of Mathematics and Statistics, Jiangsu Normal University, Xuzhou 221116, China

Received 1 December 2013; Revised 2 February 2014; Accepted 16 February 2014; Published 24 March 2014

Academic Editor: Gaobiao Xiao

Copyright © 2014 Lei Zhao and Gen Chen. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

An efficient algorithm is proposed to analyze electromagnetic scattering from a high-resolution head model in pixel data format. The algorithm is based on parallel techniques and the conjugate gradient (CG) method combined with the fast Fourier transform (FFT). With the parallel CG-FFT method, the proposed algorithm can solve electrically very large problems that cannot be solved with the conventional CG-FFT method on a personal computer. The accuracy of the proposed algorithm is verified by comparing numerical results with analytical Mie-series solutions for dielectric spheres. Numerical experiments demonstrate that the proposed method achieves good parallel efficiency.

1. Introduction

In recent years, there has been an increasing effort to achieve efficient numerical analysis of large-scale electromagnetic problems, which usually require long computation times and large amounts of memory. An efficient numerical method for large and complex bodies is very important for many practical applications. The method of moments (MoM) [1] has become one of the most popular methods for computing scattering problems in a variety of applications [2–6]. However, MoM requires $O(N^2)$ memory and $O(N^3)$ computational load to solve the matrix equation using LU decomposition or Gaussian elimination, where $N$ is the number of unknowns. To reduce the computational time, CG-FFT is employed to solve the MoM matrix equation; it is one of the most efficient ways to solve the volume integral equation for dielectric targets and reduces the computational complexity to $O(N \log N)$ in each iteration [7–10]. For most practical EM problems, a regular computer is not sufficient because of its limited memory and performance. New developments in parallel-processing techniques and high-performance computer (HPC) systems make it possible to solve large problems that were unattainable in the past. It is therefore increasingly important to develop and parallelize fast algorithms that can benefit from the large memory and the many parallel processors of an HPC system [11–13].
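As a rough illustration of why these scalings matter, the following Python sketch compares the cost of a dense direct solve with that of one FFT-based iteration. It assumes the standard estimates of $O(N^2)$ storage and $O(N^3)$ operations for a dense MoM solve and $O(N \log N)$ operations per CG-FFT iteration; the 16-byte complex word and the function names are assumptions made for illustration, and all constant factors are ignored.

```python
import math

BYTES_PER_COMPLEX = 16  # assumed: double-precision complex matrix entries


def dense_matrix_bytes(n):
    """Storage for a full n-by-n complex MoM matrix, O(N^2)."""
    return BYTES_PER_COMPLEX * n * n


def direct_solve_ops(n):
    """Order-of-magnitude operation count for LU or Gaussian elimination, O(N^3)."""
    return n ** 3


def fft_iteration_ops(n):
    """Order-of-magnitude operation count for one FFT-based iteration, O(N log N)."""
    return n * math.log2(n)


n_unknowns = 40_000_000  # scale of the head problem reported in this paper
print(f"dense matrix: {dense_matrix_bytes(n_unknowns) / 1e15:.1f} PB")
print(f"direct solve: {direct_solve_ops(n_unknowns):.1e} operations")
print(f"one CG-FFT iteration: {fft_iteration_ops(n_unknowns):.1e} operations")
```

Under these assumptions, storing the dense matrix for 40 million unknowns alone would take about 25.6 PB, which is why the matrix is never formed explicitly in an FFT-based scheme.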

In the past few decades, the energy absorption in the human head exposed to radio-frequency (RF) electromagnetic radiation has raised increasing concern about the possible consequences of electromagnetic radiation for human health. Many studies have been performed to calculate the power absorbed in a human body exposed to the electromagnetic (EM) field emitted by radio-communication equipment [14–17]. In this paper, the EM scattering problem from a high-resolution 3D anatomically realistic model of the human head is considered. Volume integral equations are applied to describe the problem. MoM is then used to discretize the coupled integral equations, and a CG-FFT algorithm is proposed to solve the resulting discrete linear system. Parallelization techniques are applied to speed up the FFT calculation, the vector-vector product, and the matrix-vector product during the CG iteration. The paper presents a detailed review of the proposed parallel implementation of the CG-FFT algorithm with pulse basis functions. The different stages of the parallel algorithm are described, and its overall parallel performance is analyzed carefully. With this implementation, we have run a benchmark model test with more than 400 million unknowns and solved a practical EM scattering problem with more than 40 million unknowns using an HPC system with 27 nodes. Each node of the cluster has two Intel Xeon E5520 CPUs and 12 GB of memory, and the nodes are connected by a 10 Gbps Ethernet high-speed network. We have verified the accuracy and efficiency of the algorithm by comparing the numerical results with analytical results for dielectric spheres. Numerical results show that the proposed method has good parallel performance.

2. Theory and Methods

Consider a 3D dielectric object of arbitrary shape located in a homogeneous space characterized by relative permittivity ; we take the homogeneous space to be free space . The arbitrarily shaped dielectric object with complex permittivity is enclosed by a cuboid . The time dependence of   is assumed and suppressed. Under the illumination of the incident electric field, the total electric field inside the dielectric object can be determined through the following volume integral equation: where is the dyadic Green’s function in homogeneous space, in which the corresponding elements are given by

The equivalent version for the induced current can be approximately obtained by where , and are the normalized electric current inside the dielectric object and the equivalent incident current, respectively.

A box with the size of is used to bound the considered dielectric target and is discretized into cuboidal cells. Then the volume of each cell is , where and is the division number in the -direction. Choosing pulse function as the basis and testing function, we obtain the discrete forms of (4) as in which

We remark that the above formulations (6)-(7) actually describe the scattering by small particles with the size of because of the use of pulse basis functions, although the dielectric targets may be continuous. We can convert (6) into a linear system of equations where is an system matrix, is a column vector with the coefficients of the unknown currents, and is a column vector associated with the incident fields in the dielectric object. Here is the total number of unknowns. However, the inner products in (6) are all 3D summations of the products of discrete Green’s functions and discrete electric currents, which are quite time and memory consuming. For electrically large electromagnetic problems, is very large and it is very difficult to solve (8) directly. In order to rapidly calculate the products of Green’s functions and electric currents, the discrete Green’s functions are extended in a larger computational domain as where ,

The signs of the expanded discrete Green’s functions are directly related to the even and odd nature of the components with respect to the coordinates in different extended subdomains. After defining the extended Green’s functions, the equivalent electric current can be defined in the extended domain by zero padding as

Using the convolution theorem and the FFT, we can obtain the discrete form of the integral equation (6) [18, 19]: where are the discrete Fourier transforms (DFTs) of , respectively. Similarly, the corresponding adjoint operations can also be performed using the FFT. As a consequence, we can solve (12) rapidly through the CG-FFT algorithm [18]. In order to speed up the calculation, a parallel FFT is used to obtain the forward and inverse FFT results. In the proposed algorithm, both the forward and the inverse FFT are implemented using the FFTW library, which is a subroutine library for computing the discrete Fourier transform in one or more dimensions and supports a distributed-memory implementation based on the message passing interface (MPI). For example, the vector-vector product in the CG-FFT method can be parallelized by calling MPI_Allreduce(), as shown in Algorithm 1.
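The zero-padding and convolution-theorem steps described above can be illustrated in one dimension. The sketch below is not the authors' 3D implementation: it uses a small pure-Python radix-2 FFT instead of FFTW, and the kernel `toy_kernel` is a hypothetical even function standing in for one component of the discrete Green’s function. It only checks that the product of the extended kernel and the zero-padded current, computed with FFTs, matches the direct summation.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 FFT (unnormalized); len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out


def ifft(x):
    """Inverse FFT with the 1/n normalization applied once at the top level."""
    n = len(x)
    return [v / n for v in fft(x, inverse=True)]


def toy_kernel(d):
    """Hypothetical kernel depending only on the index offset d (even in d)."""
    return cmath.exp(-0.7j * abs(d)) / (1 + abs(d))


def direct_product(kernel, x):
    """O(n^2) reference: y[m] = sum_k kernel(m - k) * x[k]."""
    n = len(x)
    return [sum(kernel(m - k) * x[k] for k in range(n)) for m in range(n)]


def fft_product(kernel, x):
    """Same product via a length-2n circular convolution, O(n log n)."""
    n = len(x)
    # Extended kernel: offsets 0..n-1, one unused slot, then offsets -(n-1)..-1.
    g_ext = ([kernel(d) for d in range(n)] + [0j]
             + [kernel(j - 2 * n) for j in range(n + 1, 2 * n)])
    x_pad = list(x) + [0j] * n  # zero padding of the current
    spectrum = [a * b for a, b in zip(fft(g_ext), fft(x_pad))]
    return ifft(spectrum)[:n]  # keep only the physical domain


current = [complex(k + 1, -0.5 * k) for k in range(8)]
reference = direct_product(toy_kernel, current)
fast = fft_product(toy_kernel, current)
max_error = max(abs(a - b) for a, b in zip(reference, fast))
print(f"max difference between direct and FFT products: {max_error:.2e}")
```

In three dimensions the same embedding is applied along each axis, so the extended arrays hold eight times as many samples as the original grid; this is the price paid for replacing the triple summation with FFTs.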

Algorithm 1
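The listing of Algorithm 1 is not reproduced in this text. As a hedged sketch of the idea only, the following Python code simulates the distributed vector-vector product inside a single process: each simulated rank computes the inner product of its own block, and the function `allreduce_sum` plays the role that MPI_Allreduce() with the MPI_SUM operation would play in a real MPI code. The block partition and all function names are assumptions made for illustration.

```python
def split_blocks(vec, nprocs):
    """Contiguous block partition: the slice of the vector each rank would own."""
    base, extra = divmod(len(vec), nprocs)
    blocks, start = [], 0
    for rank in range(nprocs):
        size = base + (1 if rank < extra else 0)
        blocks.append(vec[start:start + size])
        start += size
    return blocks


def local_dot(a, b):
    """Inner product of the local blocks (first argument conjugated)."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def allreduce_sum(partials):
    """Stand-in for MPI_Allreduce with MPI_SUM: every rank gets the global sum."""
    total = sum(partials)
    return [total] * len(partials)


def parallel_dot(a, b, nprocs):
    """Simulated parallel vector-vector product; returns the value seen by each rank."""
    a_blocks = split_blocks(a, nprocs)
    b_blocks = split_blocks(b, nprocs)
    partials = [local_dot(x, y) for x, y in zip(a_blocks, b_blocks)]
    return allreduce_sum(partials)  # the only communication step


u = [complex(k, -k) for k in range(10)]
v = [complex(1, 0.5 * k) for k in range(10)]
serial = local_dot(u, v)
per_rank = parallel_dot(u, v, nprocs=3)
print(serial, per_rank[0])
```

Only one scalar per rank is exchanged, so the cost of this step is dominated by network latency rather than bandwidth, which is consistent with the latency discussion in Section 3.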

3. Numerical Results

To illustrate the accuracy and efficiency of the proposed parallel CG-FFT algorithm, we first consider the EM scattering by a dielectric sphere illuminated by plane waves, which has a closed-form solution. In the following examples, the background is just free space. The dielectric sphere with  m is illuminated by a plane wave. The incident wave is polarized in the direction and propagating in the direction, in which the operating frequency is 0.3 GHz. The comparison of numerical results of the internal electric fields between parallel CG-FFT and analytical results is illustrated in Figure 1, which shows that the numerical results have good agreement with the analytical results. We have also computed the scattered electric fields from the dielectric object on the observation plane  m and compared such results with the exact solutions as shown in Figure 2.

Figure 1: Electric field distribution on the center line of plane .
Figure 2: Electric fields on the plane  m. (a) Parallel CG-FFT results. (b) Analytical results.

Then, we test the parallel performance on an HPC system with 27 nodes, described in Table 1, in which the nodes are connected by 10 Gbps Ethernet. The benchmark model is a homogeneous cubic dielectric object with , and the edge length of the cube is 0.4 m. The incident wave is the same plane wave as that in Figure 1. We compare the network latency within a node and between nodes; that is, we test the latency on one node and between two nodes, respectively. Figure 3 shows the test results for both cases. From Figure 3, we can see that the network speed within a node is about 4 times that between nodes, so internode communication will be a bottleneck for the parallel CG-FFT method. To evaluate the performance of the parallel CG-FFT code, we define the performance as follows:

Table 1: The HPC hardware information.
Figure 3: Network latency of nodes.

The performance test results of the parallel CG-FFT method are shown in Figure 4, and the detailed data are listed in Table 2. From Figure 4, we can see that the performance increases as long as no more than 8 nodes are used and decreases when 10 nodes are used. The reason is that network latency plays an important role when more than 8 nodes are used. The parallel efficiency is also tested, which is defined as $\eta = T_1 / (p\,T_p)$, where $p$ is the number of processes, $T_1$ is the running time used by one process, and $T_p$ is the running time used by $p$ processes. Figure 5 shows the parallel efficiency of the parallel CG-FFT method for different discretizations and numbers of processes, and the detailed results are listed in Table 3. From Figure 5 and Table 3, we can see that the parallel efficiency is above 60% when no more than 8 nodes are used.
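As a small worked example, the sketch below evaluates the parallel efficiency under the usual definition, the one-process running time divided by the product of the number of processes and the running time on that many processes. The timings are hypothetical values chosen for illustration, not measurements from Table 3, and the function name is an assumption.

```python
def parallel_efficiency(t_one, t_parallel, n_procs):
    """Efficiency = T_1 / (p * T_p); 1.0 corresponds to ideal linear speedup."""
    if n_procs <= 0 or t_parallel <= 0:
        raise ValueError("n_procs and t_parallel must be positive")
    return t_one / (n_procs * t_parallel)


# Hypothetical timings in seconds (not taken from the paper's tables).
t_one = 800.0
t_eight = 160.0
efficiency = parallel_efficiency(t_one, t_eight, 8)
print(f"speedup: {t_one / t_eight:.1f}x, efficiency: {efficiency:.1%}")
```

With these made-up numbers, a 5x speedup on 8 processes corresponds to an efficiency of 62.5%, just above the 60% level quoted in the text.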

Table 2: Performance test.
Table 3: Parallel efficiency.
Figure 4: The performance of parallel CG-FFT.
Figure 5: Parallel efficiency of different case.

Finally, we use the proposed parallel CG-FFT method to simulate the EM scattering from a 3D anatomically realistic human head model exposed to a plane wave at 900 MHz. The popular HUGO model [20] with a resolution of   mm, as shown in Figure 6, includes 16 different tissues and organs. The electromagnetic properties (  and  ) of the 16 tissues in the model can be obtained from FCC published data [21], as listed in Table 4. In our simulation, 4 nodes are used and the computation time is about 65 minutes. Figure 7 shows the electric field on the head surface. With the object-oriented HUGO model, the field distribution over a specific object can be investigated. The electric field distributions on the eyes, bone, and brain are shown in Figures 8, 9, and 10, respectively.

Table 4: Tissue parameters for HUGO model.
Figure 6: A cut plane of the HUGO human head model.
Figure 7: The total electric field distribution on head surface.
Figure 8: The total electric field distribution on eyes.
Figure 9: The total electric field distribution on bone.
Figure 10: The total electric field distribution on brain.

4. Conclusion

In this paper, we have analyzed the performance of an efficient MPI parallel implementation of the CG-FFT algorithm on HPC systems. The code can run not only on shared-memory machines but also on distributed-memory ones, and it scales well. Special attention was paid to communication during the matrix-vector product and the vector-vector product, which is a key factor in the parallel performance. We solved a problem with more than 400 million unknowns on an HPC system with 27 nodes.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported in part by the National Science Foundation of China under Grant no. 61372057, in part by Natural Science Foundation of the Jiangsu Higher Education Institutions under Grant no. 10KJD180004, and in part by Postgraduate Innovation Project of Jiangsu Province under Grant no. CXZZ13_0973.

References

  1. R. F. Harrington, Field Computation by Moment Methods, MacMillan, New York, NY, USA, 1968.
  2. P. M. Goggans, A. A. Kishk, and A. W. Glisson, “Electromagnetic scattering from objects composed of multiple homogeneous regions using a region-by-region solution,” IEEE Transactions on Antennas and Propagation, vol. 42, no. 6, pp. 865–871, 1994.
  3. R. D. Graglia, P. L. E. Uslenghi, and R. S. Zich, “Moment method with isoparametric elements for three-dimensional anisotropic scatterers,” Proceedings of the IEEE, vol. 77, no. 5, pp. 750–760, 1989.
  4. J. M. Jarem, “Method-of-moments solution of a parallel-plate waveguide aperture system,” Journal of Applied Physics, vol. 59, no. 10, pp. 3566–3570, 1986.
  5. D. E. Livesay and K. Chen, “Electromagnetic field induced inside arbitrarily shaped biological bodies,” IEEE Transactions on Microwave Theory and Techniques, vol. 22, no. 12, pp. 1273–1280, 1974.
  6. T. K. Sarkar and E. Arvas, “An integral equation approach to the analysis of finite microstrip antennas: volume/surface formulation,” IEEE Transactions on Antennas and Propagation, vol. 38, no. 3, pp. 305–312, 1990.
  7. H. Gan and W. C. Chew, “A discrete BCG-FFT algorithm for solving 3D inhomogeneous scatterer problems,” Journal of Electromagnetic Waves and Applications, vol. 9, no. 10, pp. 1339–1357, 1995.
  8. T. J. Cui, “Fast algorithm for electromagnetic scattering by buried 3-D dielectric objects of large size,” IEEE Transactions on Geoscience and Remote Sensing, vol. 37, no. 5, pp. 2597–2608, 1999.
  9. L. Zhao and T. J. Cui, “CG-FFT algorithm for EM scattering by small dielectric particles with high permittivity and permeability,” Microwave and Optical Technology Letters, vol. 49, no. 2, pp. 305–310, 2007.
  10. L. Zhao, T. J. Cui, and W. D. Li, “An efficient algorithm for em scattering by electrically large dielectric objects using MR-QEB iterative scheme and CG-FFT method,” Progress in Electromagnetics Research, vol. 67, pp. 341–355, 2007.
  11. W. Yu, R. Mittra, T. Su, Y. Liu, and X. Yang, Parallel Finite Difference Time Domain Method, Artech House, Norwood, Mass, USA, 2006.
  12. W. Yu, X. Yang, Y. Liu et al., “New development of parallel conformal FDTD method in computational electromagnetics engineering,” IEEE Antennas and Propagation Magazine, vol. 53, no. 3, pp. 15–41, 2011.
  13. J. M. Taboada, M. G. Araujo, F. O. Basteiro, J. L. Rodriguez, and L. Landesa, “MLFMA-FFT parallel algorithm for the solution of extremely large problems in electromagnetic,” Proceedings of the IEEE, vol. 101, no. 2, pp. 350–363, 2013.
  14. O. P. Gandhi, G. Lazzi, and C. M. Furse, “Electromagnetic absorption in the human head and neck for mobile telephones at 835 and 1900 MHz,” IEEE Transactions on Microwave Theory and Techniques, vol. 44, no. 10, pp. 1884–1897, 1996.
  15. G. Lazzi and O. P. Gandhi, “Realistically tilted and truncated anatomically based models of the human head for dosimetry of mobile telephones,” IEEE Transactions on Electromagnetic Compatibility, vol. 39, no. 1, pp. 55–61, 1997.
  16. A. K. Lee, H. D. Choi, and J. I. Choi, “Study on SARs in head models with different shapes by age using SAM model for mobile phone exposure at 835 MHz,” IEEE Transactions on Electromagnetic Compatibility, vol. 49, no. 2, pp. 302–312, 2007.
  17. Q.-X. Li and O. P. Gandhi, “Thermal implications of the new relaxed IEEE RF safety standard for head exposures to cellular telephones at 835 and 1900 MHz,” IEEE Transactions on Microwave Theory and Techniques, vol. 54, no. 7, pp. 3146–3154, 2006.
  18. T. J. Cui and W. C. Chew, “Fast algorithm for electromagnetic scattering by buried 3-D dielectric objects of large size,” IEEE Transactions on Geoscience and Remote Sensing, vol. 37, no. 5, pp. 2597–2608, 1999.
  19. J. Weaver, Applications of Discrete and Continuous Fourier Analysis, John Wiley & Sons, New York, NY, USA, 1983.
  20. P. Bernardi, M. Cavagnaro, S. Pisa, and E. Piuzzi, “Specific absorption rate and temperature increases in the head of a cellular-phone user,” IEEE Transactions on Microwave Theory and Techniques, vol. 48, no. 7, pp. 1118–1126, 2000.
  21. http://www.fcc.gov/fcc-bin/dielec.sh.