Abstract

Clinical diagnosis places high requirements on the visual quality of medical images. To obtain fused medical images with rich detail features and clear edges, and to address the poor clarity of edge details in current algorithms, which hinders preserving the details of the source images, an image fusion algorithm named FFST-SR-PCNN, based on the fast finite shearlet transform (FFST) and sparse representation, is proposed. Firstly, the source image is decomposed into low-frequency and high-frequency coefficients by FFST. Secondly, the K-SVD method is used to train the low-frequency coefficients to obtain an overcomplete dictionary D, and the OMP algorithm then sparsely encodes the low-frequency coefficients to complete their fusion. Then, the high-frequency coefficients are applied to excite a pulse-coupled neural network, and the fused high-frequency coefficients are selected according to the number of ignitions. Finally, the fused low-frequency and high-frequency coefficients are reconstructed into the fused medical image by the inverse FFST. The experimental results show that the fusion result of the proposed algorithm is about 35% higher than those of the comparison algorithms on the edge information transfer factor QAB/F index and achieves good results in both subjective visual effects and objective evaluation indicators.

1. Introduction

With the development of imaging devices, different sensors can acquire different information from images of the same scene [1–4]. In medicine, images of different modalities are properly fused so that the source images complement each other, thus yielding more informative images [5, 6].

In recent years, image fusion methods based on multiscale geometric analysis have been widely used in image processing due to their multiresolution characteristics [7]. The wavelet transform [8, 9] is the most typical multiscale analysis method, but it offers only three directions (horizontal, vertical, and diagonal) when decomposing an image; it therefore cannot represent well a two-dimensional image with curve singularities or a high-dimensional function with surface singularities, and it easily produces the pseudo-Gibbs phenomenon. To solve this problem, multiscale geometric analysis methods such as the contourlet transform [10] and the shearlet transform [11] have been proposed successively; they have good anisotropy and directional selectivity. Among them, the non-subsampled contourlet transform (NSCT) performs best for image fusion: it is translation invariant and attenuates the Gibbs effect produced by earlier transforms. However, its computational data volume is too large, its computational complexity is high, and its real-time performance is poor. Compared with NSCT, the shearlet transform [12] fusion algorithm has a more flexible structure, higher computational efficiency, and better fusion effect. However, it uses subsampling in the discretization process, so it lacks translation invariance and easily produces the pseudo-Gibbs phenomenon near singular points during image fusion. By cascading a non-subsampled pyramid filter and a shear filter, the fast finite shearlet transform (FFST) [13] retains all the advantages of the shearlet transform, avoids the subsampling process, and gains translation invariance. However, FFST exhibits a problem: the low-frequency coefficients it produces are not sparse. Sparse representation (SR) can express the deeper structural characteristics of the low-frequency coefficients and closely approximate them by a linear combination of a small number of atoms in a dictionary [14]. To extract fine contour information from image edges, highlight edge features, and obtain richer information, this paper proposes FFST-SR-PCNN, a medical image fusion algorithm based on the fast finite shearlet transform (FFST) and sparse representation (SR).

2. Medical Image Fusion Algorithm Based on FFST-SR-PCNN

The FFST-SR-PCNN first decomposed the registered source images into low-frequency coefficients and high-frequency coefficients $H_{j,k}$ by FFST, where $j$ was the scale of decomposition and $k$ was the number of directions of decomposition. Then, the low-frequency coefficients were fused by the SR fusion algorithm, and the high-frequency coefficients were fused by the fusion algorithm of the simplified PCNN model. Finally, the fused low-frequency and high-frequency coefficients were reconstructed by the inverse FFST to obtain the fused images. The process of FFST-SR-PCNN is illustrated in Figure 1.
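For orientation, the pipeline can be summarized in a short sketch. This is a minimal outline in Python, assuming hypothetical helpers ffst_decompose, ffst_reconstruct, fuse_low_sr, and fuse_high_pcnn for the stages named above (sketches of the last three appear in later sections); it is not the authors' reference implementation.

```python
import numpy as np

def fuse_medical_images(img_a: np.ndarray, img_b: np.ndarray) -> np.ndarray:
    """Fuse two registered source images following the FFST-SR-PCNN pipeline."""
    # 1. Multiscale, multidirectional decomposition of both sources by FFST.
    low_a, highs_a = ffst_decompose(img_a)   # low-frequency band + high-frequency bands
    low_b, highs_b = ffst_decompose(img_b)

    # 2. Fuse the (non-sparse) low-frequency bands with sparse representation:
    #    K-SVD dictionary training + OMP coding + coefficient selection.
    low_f = fuse_low_sr(low_a, low_b)

    # 3. Fuse each high-frequency band with a simplified PCNN driven by the
    #    coefficients, selecting per-pixel winners by firing counts.
    highs_f = [fuse_high_pcnn(ha, hb) for ha, hb in zip(highs_a, highs_b)]

    # 4. Inverse FFST reconstructs the fused image from the fused bands.
    return ffst_reconstruct(low_f, highs_f)
```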

2.1. Shearlet Transform

Set $A_a$ as the dilation matrix and $S_s$ as the shear matrix. They are defined as

$$A_a = \begin{pmatrix} a & 0 \\ 0 & \sqrt{a} \end{pmatrix}, \quad S_s = \begin{pmatrix} 1 & s \\ 0 & 1 \end{pmatrix}, \tag{1}$$

where $a \in \mathbb{R}^{+}$, $s \in \mathbb{R}$.

For $\psi \in L^{2}(\mathbb{R}^{2})$, the shearlet function defined by the dilation, shear, and translation of $\psi$ is

$$\psi_{a,s,t}(x) = a^{-3/4}\,\psi\!\left(A_{a}^{-1} S_{s}^{-1} (x - t)\right), \quad t \in \mathbb{R}^{2}. \tag{2}$$

For $f \in L^{2}(\mathbb{R}^{2})$, its continuous shearlet transform and the corresponding Parseval equation are

$$\mathcal{SH}_{\psi} f(a,s,t) = \langle f, \psi_{a,s,t} \rangle, \qquad \|f\|_{2}^{2} = \int_{\mathbb{R}^{2}} \int_{\mathbb{R}} \int_{\mathbb{R}^{+}} \left|\langle f, \psi_{a,s,t} \rangle\right|^{2} \, \frac{\mathrm{d}a}{a^{3}} \, \mathrm{d}s \, \mathrm{d}t. \tag{3}$$

Specially, define the wavelet function $\psi_1$ and the impulse (bump) function $\psi_2$, whose Fourier transforms are $\hat{\psi}_1(\omega_1)$ and $\hat{\psi}_2(\omega_2)$, respectively.

Set

$$\hat{\psi}(\omega) = \hat{\psi}(\omega_1, \omega_2) = \hat{\psi}_1(\omega_1)\,\hat{\psi}_2\!\left(\frac{\omega_2}{\omega_1}\right); \tag{4}$$

then $\psi$ fulfills the admissibility condition. Choosing different $a$ and $s$ separates the frequency domain into different areas, including the horizontal cone $\mathcal{C}^{h}$ and the vertical cone $\mathcal{C}^{v}$.

2.2. FFST

The shearlet transform generated shearlet functions with different features by scaling, shearing, and translating basis functions. Image decomposition based on the shearlet transform included the following: (1) decompose images into low-frequency and high-frequency subbands at different scales with the Laplacian pyramid algorithm; (2) directionally subdivide the subbands of different scales with the shear filter to realize multiscale and multidirectional decomposition and to keep the size of the decomposed subband images consistent with the source images [15].

To obtain a discrete shearlet transform, this algorithm discretized the scaling, shearing, and translation parameters in formula (2):

$$a_{j} = 2^{-2j} = \frac{1}{4^{j}}, \quad s_{j,k} = k\,2^{-j}, \quad t_{m} = \left(\frac{m_1}{M}, \frac{m_2}{N}\right), \tag{5}$$

where $j = 0, \ldots, j_0 - 1$ and $j_0$ represented the scale of decomposition and the number of scales, $k$ with $|k| \le 2^{j}$ was the direction index, and $m = (m_1, m_2)$ ran over the $M \times N$ image grid; thus a discrete shearlet was obtained:

$$\psi_{j,k,m}(x) = \psi_{a_{j}, s_{j,k}, t_{m}}(x). \tag{6}$$

The expression in the frequency domain was

$$\hat{\psi}_{j,k,m}(\omega) = \hat{\psi}_1\!\left(4^{-j}\omega_1\right)\,\hat{\psi}_2\!\left(2^{j}\,\frac{\omega_2}{\omega_1} + k\right) e^{-2\pi i \langle \omega, t_{m} \rangle}, \tag{7}$$

where $\omega = (\omega_1, \omega_2)$ denoted the discrete frequency variable.

To obtain shearlets covering the whole frequency domain, the shearlets at the intersection of the cones, i.e., on the seam lines $|\omega_1| = |\omega_2|$, were defined as the sum of the horizontal-cone and vertical-cone parts:

$$\hat{\psi}^{h \times v}_{j,k,m}(\omega) = \hat{\psi}^{h}_{j,k,m}(\omega)\,\chi_{\mathcal{C}^{h}}(\omega) + \hat{\psi}^{v}_{j,k,m}(\omega)\,\chi_{\mathcal{C}^{v}}(\omega), \quad |k| = 2^{j}, \tag{8}$$

where $\chi_{\mathcal{C}}$ is the characteristic function of cone $\mathcal{C}$.

Thus, the discrete shearlet can be expressed as

$$\psi_{j,k,m}(x) = \begin{cases} \phi(x - t_{m}), & \text{low-frequency part}, \\ \psi^{h}_{j,k,m}(x), & \omega \in \mathcal{C}^{h}, \\ \psi^{v}_{j,k,m}(x), & \omega \in \mathcal{C}^{v}, \\ \psi^{h \times v}_{j,k,m}(x), & |k| = 2^{j}, \end{cases} \tag{9}$$

where $\phi$ is the scaling function generating the low-frequency subband.

The shearlet defined by formula (9) can be realized by a two-dimensional fast Fourier transform algorithm with high computational efficiency. Since FFST involves no subsampling, it possesses translation invariance, as well as excellent localization characteristics and high directional sensitivity.
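To make the frequency-domain construction concrete, the sketch below decomposes an image into directional subbands by pointwise multiplication in the FFT domain. It is a heavily simplified illustration: Gaussian windows stand in for the Meyer-type windows $\hat{\psi}_1$ and $\hat{\psi}_2$, only the horizontal cone is handled, and no claim is made that this matches the exact FFST filters.

```python
import numpy as np

def shearlet_subbands(img: np.ndarray, j: int):
    """Simplified frequency-domain shearlet decomposition at scale j.

    Each subband keeps the source image size (no subsampling), which is
    what gives FFST its translation invariance.
    """
    rows, cols = img.shape
    w1 = np.fft.fftfreq(cols)[None, :] * cols      # horizontal frequency axis
    w2 = np.fft.fftfreq(rows)[:, None] * rows      # vertical frequency axis
    spectrum = np.fft.fft2(img)

    subbands = []
    for k in range(-2 ** j, 2 ** j + 1):           # shear (direction) index
        # Radial window ~ psi_1(4^{-j} w1): selects the scale-j annulus.
        radial = np.exp(-((np.abs(w1) * 4.0 ** (-j) - 1.0) ** 2) / 0.5)
        # Angular window ~ psi_2(2^{j} w2/w1 + k): selects one direction wedge.
        slope = np.divide(w2, w1, out=np.zeros_like(w2 * w1), where=w1 != 0)
        angular = np.exp(-((2.0 ** j * slope + k) ** 2) / 0.5)
        # No subsampling: multiply in frequency and invert at full resolution.
        subbands.append(np.real(np.fft.ifft2(spectrum * radial * angular)))
    return subbands
```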

2.3. Sparse Representation

The basic idea of sparse representation is to represent or approximately represent any signal by the linear combination of a small number of nonzero atoms in a given dictionary [16]. If a signal $y \in \mathbb{R}^{n}$ can be represented or approximated by the linear combination of a small number of atoms in the dictionary $D$, then the mathematical model of sparse representation [14] can be obtained by the following formula:

$$\min_{\alpha}\,\|\alpha\|_{0} \quad \text{s.t.} \quad \|y - D\alpha\|_{2} \le \varepsilon, \tag{10}$$

where the dictionary $D \in \mathbb{R}^{n \times m}$ $(m > n)$ is an overcomplete set; $\alpha$ is the coefficient vector of the sparse representation of signal $y$; $\|\alpha\|_{0}$ is the $\ell_{0}$ norm of $\alpha$, i.e., the number of its nonzero entries; and $\varepsilon$ is the margin of approximation error.
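As a concrete illustration of model (10), the following snippet codes a synthetic signal over a random overcomplete dictionary with scikit-learn's OMP solver; the dictionary and signal here are stand-ins, not the trained dictionary of Section 3.

```python
import numpy as np
from sklearn.linear_model import orthogonal_mp

rng = np.random.default_rng(0)
D = rng.standard_normal((64, 256))             # overcomplete dictionary: 256 atoms in R^64
D /= np.linalg.norm(D, axis=0)                 # OMP assumes unit-norm atoms
alpha_true = np.zeros(256)
alpha_true[rng.choice(256, size=5, replace=False)] = rng.standard_normal(5)
y = D @ alpha_true                             # synthetic 5-sparse signal

# Greedy approximation of: min ||alpha||_0  s.t.  ||y - D alpha||_2 <= eps.
alpha = orthogonal_mp(D, y, n_nonzero_coefs=5)
print(np.count_nonzero(alpha))                 # -> 5 nonzero coefficients
```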

In FFST-SR-PCNN, the K-singular value decomposition (K-SVD) method was first used to train on the low-frequency coefficients and obtain an overcomplete dictionary matrix. Then, the orthogonal matching pursuit (OMP) optimization algorithm was used to approximate the original signal through local optimal solutions and estimate the sparse representation coefficients [17]. Finally, the sparse coefficients were fused adaptively according to image features.

With the overcomplete dictionary $D$, the objective function of the K-SVD algorithm can be written as follows:

$$\min_{D, X}\,\|Y - DX\|_{F}^{2} \quad \text{s.t.} \quad \forall i,\ \|x_{i}\|_{0} \le T_{0}, \tag{11}$$

where $Y$ is the training sample matrix, $X$ is the sparse coefficient matrix with columns $x_{i}$, and $T_{0}$ is the maximum number of nonzero entries allowed in each coefficient vector, i.e., the maximum sparsity.

Formula (11) is solved by an iterative process. First, suppose the dictionary $D$ is fixed and use the orthogonal matching pursuit (OMP) algorithm to get the sparse matrix $X$; next, fix $X$ and update the dictionary column by column, which means only one atom of the dictionary is updated at a time.
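The iteration just described might be sketched as follows, assuming numpy and scikit-learn's OMP; the rank-one SVD update of each atom follows the column-by-column scheme above, though details such as stopping rules and replacement of unused atoms are omitted.

```python
import numpy as np
from sklearn.linear_model import orthogonal_mp

def ksvd_iteration(Y, D, sparsity):
    """One K-SVD pass over Y (signal_dim x n_samples, n_samples > 1)."""
    # Sparse coding stage: OMP with the dictionary held fixed.
    X = orthogonal_mp(D, Y, n_nonzero_coefs=sparsity)   # (n_atoms, n_samples)
    # Dictionary update stage: refine one atom at a time.
    for k in range(D.shape[1]):
        users = np.flatnonzero(X[k, :])                 # samples that use atom k
        if users.size == 0:
            continue
        # Residual of those samples with atom k's contribution removed.
        E = Y[:, users] - D @ X[:, users] + np.outer(D[:, k], X[k, users])
        # Rank-one approximation: best new atom and its coefficients.
        U, s, Vt = np.linalg.svd(E, full_matrices=False)
        D[:, k] = U[:, 0]
        X[k, users] = s[0] * Vt[0, :]
    return D, X
```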

The fusion process of the low-frequency coefficients based on sparse representation is illustrated in Figure 2.

In Figure 2, $L_{A}$ and $L_{B}$ are the low-frequency coefficients of the two source images; $n \times n$ is the size of the sliding window.

2.4. Pulse-Coupled Neural Network

Pulse-coupled neural network (PCNN) can combine the input high-frequency coefficients with human visual characteristics to obtain detailed information such as texture, edge, and contour [18]. The mathematical expression of the simplified model is

$$\begin{cases} F_{ij}(n) = S_{ij}, \\ L_{ij}(n) = e^{-\alpha_{L}} L_{ij}(n-1) + V_{L} \sum_{kl} W_{ijkl}\, Y_{kl}(n-1), \\ U_{ij}(n) = F_{ij}(n)\left(1 + \beta L_{ij}(n)\right), \\ \theta_{ij}(n) = e^{-\alpha_{\theta}} \theta_{ij}(n-1) + V_{\theta}\, Y_{ij}(n), \\ Y_{ij}(n) = \begin{cases} 1, & U_{ij}(n) > \theta_{ij}(n), \\ 0, & \text{otherwise}, \end{cases} \end{cases} \tag{12}$$

where $n$ is the number of iterations; $S_{ij}$ is the external stimulation signal; $U_{ij}$ is the internal state; $F_{ij}$ is the feedback input; $L_{ij}$ is the link input; $Y_{ij}$ is the pulse output; $W_{ijkl}$ is the connection weight coefficient between neurons; $\beta$, $\theta_{ij}$, and $\alpha_{\theta}$ are the link strength, the variable threshold input, and the time constant of variable threshold attenuation, respectively; and $V_{L}$ and $V_{\theta}$ are amplification coefficients of the link input and the threshold.
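A compact sketch of model (12) follows; the 3×3 link weights and the decay and amplification constants are illustrative assumptions, not the paper's tuned settings.

```python
import numpy as np
from scipy.ndimage import convolve

def pcnn_fire_counts(S, beta, n_iter=200, alpha_L=1.0, alpha_theta=0.2,
                     V_L=1.0, V_theta=20.0):
    """Run the simplified PCNN of equation (12); return per-pixel firing counts."""
    S = np.asarray(S, dtype=float)             # feedback input F = stimulus S
    W = np.array([[0.5, 1.0, 0.5],
                  [1.0, 0.0, 1.0],
                  [0.5, 1.0, 0.5]])            # illustrative 3x3 link weights W
    L = np.zeros_like(S)                       # link input
    Y = np.zeros_like(S)                       # pulse output (all neurons off)
    theta = np.zeros_like(S)                   # dynamic threshold
    T = np.zeros_like(S)                       # accumulated ignition counts
    for _ in range(n_iter):
        L = np.exp(-alpha_L) * L + V_L * convolve(Y, W, mode='constant')
        U = S * (1.0 + beta * L)               # internal state via modulation
        Y = (U > theta).astype(float)          # fire where activity beats threshold
        theta = np.exp(-alpha_theta) * theta + V_theta * Y
        T += Y                                 # count ignitions per neuron
    return T
```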

High-frequency coefficient fusion used each high-frequency coefficient (pixel) as the neuronal feedback input to stimulate the simplified PCNN model, with the neighborhood spatial frequency SF as the link strength. SF was

$$SF_{ij} = \sqrt{RF_{ij}^{2} + CF_{ij}^{2}}, \tag{13}$$

where the window size was $M \times N$; the row frequency $RF$ and the column frequency $CF$ were

$$RF_{ij} = \sqrt{\frac{1}{MN}\sum_{m=1}^{M}\sum_{n=2}^{N}\left[H(m,n) - H(m,n-1)\right]^{2}}, \qquad CF_{ij} = \sqrt{\frac{1}{MN}\sum_{m=2}^{M}\sum_{n=1}^{N}\left[H(m,n) - H(m-1,n)\right]^{2}}, \tag{14}$$

where $H$ denotes the high-frequency coefficients within the window centered at pixel $(i, j)$.

Ignition maps were obtained through PCNN iteration, and the fusion coefficients were selected according to the number of ignitions.
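Equations (13) and (14) evaluated over a sliding neighborhood can be written as below; the 3×3 window and reflective border handling are assumptions for illustration. The resulting SF map serves as the per-neuron link strength $\beta$ in Section 3.2.

```python
import numpy as np
from scipy.ndimage import uniform_filter

def neighborhood_spatial_frequency(H, size=3):
    """Per-pixel spatial frequency SF = sqrt(RF^2 + CF^2) over a local window."""
    H = np.asarray(H, dtype=float)
    dcol = np.zeros_like(H)
    drow = np.zeros_like(H)
    dcol[:, 1:] = (H[:, 1:] - H[:, :-1]) ** 2    # squared column differences (RF)
    drow[1:, :] = (H[1:, :] - H[:-1, :]) ** 2    # squared row differences (CF)
    # Mean of squared differences over the size x size neighborhood.
    rf2 = uniform_filter(dcol, size=size, mode='reflect')
    cf2 = uniform_filter(drow, size=size, mode='reflect')
    return np.sqrt(rf2 + cf2)
```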

3. Implementation of FFST-SR-PCNN

3.1. Rules of Low-Frequency Coefficient Fusion

The process was implemented as follows:

Step 1. Decompose the registered source images A and B of the same size by FFST to obtain the low-frequency coefficients and the high-frequency coefficients.

Step 2. Using a sliding window with a step size of one pixel and a size of $n \times n$, partition the low-frequency coefficients $L_{A}$ and $L_{B}$ into image subblocks, and convert the subblocks into column vectors to obtain the sample training matrices $V_{A}$ and $V_{B}$.

Step 3. Perform the iterative K-SVD operation on the sample matrices and obtain the overcomplete dictionary matrix $D$ of the low-frequency coefficients.

Step 4. Estimate the sparse coefficients of $V_{A}$ and $V_{B}$ with the OMP algorithm and obtain the sparse coefficient matrices $\alpha_{A}$ and $\alpha_{B}$. Their $i$-th columns $\alpha_{A}^{i}$ and $\alpha_{B}^{i}$ are fused as follows.

Case 1. If the $\ell_{1}$ norm of $\alpha_{A}^{i}$ is larger than that of $\alpha_{B}^{i}$, then fuse with equation (15):

$$\alpha_{F}^{i} = \alpha_{A}^{i}. \tag{15}$$

Case 2. If the $\ell_{1}$ norm of $\alpha_{A}^{i}$ is smaller than that of $\alpha_{B}^{i}$, then fuse with equation (16):

$$\alpha_{F}^{i} = \alpha_{B}^{i}. \tag{16}$$

Case 3. If the $\ell_{1}$ norm of $\alpha_{A}^{i}$ is equal to that of $\alpha_{B}^{i}$, then fuse with equation (17):

$$\alpha_{F}^{i} = \frac{\alpha_{A}^{i} + \alpha_{B}^{i}}{2}. \tag{17}$$

where $\alpha_{A}^{i}$ and $\alpha_{B}^{i}$ are the $i$-th columns of the sparse coefficient matrices $\alpha_{A}$ and $\alpha_{B}$, respectively, and $\alpha_{F}^{i}$ is the $i$-th column of the fused sparse coefficient matrix $\alpha_{F}$.

Step 5. Multiply the overcomplete dictionary matrix $D$ and the fused sparse coefficient matrix $\alpha_{F}$. The fused sample training matrix is

$$V_{F} = D\,\alpha_{F}. \tag{18}$$

Step 6. Turn the columns of $V_{F}$ back into data subblocks, reconstruct the data subblocks, and obtain the low-frequency fusion coefficients (a condensed sketch of Steps 1–6 is given below).
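Steps 1–6 condensed into code, assuming a dictionary D already trained by K-SVD (Step 3) and scikit-learn's patch utilities; overlapping blocks are averaged on reconstruction. This is a sketch of the rules above, not the authors' implementation.

```python
import numpy as np
from sklearn.feature_extraction.image import (extract_patches_2d,
                                              reconstruct_from_patches_2d)
from sklearn.linear_model import orthogonal_mp

def fuse_low_sr(low_a, low_b, D, n=8, sparsity=5):
    """Fuse low-frequency bands by OMP coding over D and the max-L1 rule."""
    shape = low_a.shape
    # Steps 1-2: slide an n x n window (stride 1) and vectorize the blocks.
    Va = extract_patches_2d(low_a, (n, n)).reshape(-1, n * n).T
    Vb = extract_patches_2d(low_b, (n, n)).reshape(-1, n * n).T
    # Step 4: sparse-code both sample matrices over the trained dictionary D.
    Aa = orthogonal_mp(D, Va, n_nonzero_coefs=sparsity)
    Ab = orthogonal_mp(D, Vb, n_nonzero_coefs=sparsity)
    # Per-column selection by L1 norm, eqs. (15)-(17); ties take the average.
    la, lb = np.abs(Aa).sum(axis=0), np.abs(Ab).sum(axis=0)
    Af = np.where(la > lb, Aa, Ab)
    Af[:, la == lb] = 0.5 * (Aa + Ab)[:, la == lb]
    # Steps 5-6: rebuild samples (V_F = D alpha_F) and average overlapping blocks.
    Vf = (D @ Af).T.reshape(-1, n, n)
    return reconstruct_from_patches_2d(Vf, shape)
```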
3.2. Rules of High-Frequency Coefficient Fusion

The process was implemented as follows:

Step 1. Calculate the neighborhood spatial frequencies $SF_{A}$ and $SF_{B}$ of the high-frequency coefficients $H_{A}$ and $H_{B}$ according to equation (13) and use them as the link strength values of the neurons.

Step 2. Initialization: $L_{ij}(0) = U_{ij}(0) = \theta_{ij}(0) = 0$. All neurons are in the off state, i.e., the resulting pulse is $Y_{ij}(0) = 0$.

Step 3. Compute $L_{ij}(n)$, $U_{ij}(n)$, $\theta_{ij}(n)$, and $Y_{ij}(n)$ according to equation (12).

Step 4. Compare the firing counts (ignition frequencies) $T_{A,ij}$ and $T_{B,ij}$ of the fire mapping images at each pixel, where $T_{ij}(n) = T_{ij}(n-1) + Y_{ij}(n)$; the high-frequency fused coefficient is

$$H_{F}(i,j) = \begin{cases} H_{A}(i,j), & T_{A,ij} \ge T_{B,ij}, \\ H_{B}(i,j), & T_{A,ij} < T_{B,ij}. \end{cases} \tag{19}$$
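Putting the pieces together, Steps 1–4 might look as follows, reusing the assumed helpers neighborhood_spatial_frequency and pcnn_fire_counts from the earlier sketches.

```python
import numpy as np

def fuse_high_pcnn(H_a, H_b, n_iter=200):
    """Select high-frequency coefficients by comparing PCNN firing counts."""
    # Step 1: neighborhood spatial frequency as per-neuron link strength beta.
    beta_a = neighborhood_spatial_frequency(H_a)
    beta_b = neighborhood_spatial_frequency(H_b)
    # Steps 2-3: stimulate the simplified PCNN with |H| and iterate eq. (12).
    T_a = pcnn_fire_counts(np.abs(H_a), beta_a, n_iter=n_iter)
    T_b = pcnn_fire_counts(np.abs(H_b), beta_b, n_iter=n_iter)
    # Step 4: per pixel, keep the coefficient whose neuron fired more often.
    return np.where(T_a >= T_b, H_a, H_b)
```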

4. Experimental Results and Analysis

In order to verify the effectiveness of FFST-SR-PCNN, five representative algorithms were selected as controls for the medical image fusion experiments. Five indicators, including spatial frequency (SF), average gradient (AG), mutual information (MI), the edge information transfer factor QAB/F (a high-weight evaluation indicator) [19–22], and running time (RT), were used for objective evaluation. Comparison algorithm 1 was an image fusion algorithm based on PCNN proposed in [23]. Comparison algorithm 2 was an improved medical image fusion algorithm based on NSCT and adaptive PCNN proposed in [24]. Comparison algorithm 3 was a medical image fusion algorithm based on SR and a neural network proposed in [25]. Comparison algorithm 4 was a multimodal medical image fusion algorithm based on NSCT and Log-Gabor energy proposed in [26]. Comparison algorithm 5 was a medical image fusion algorithm based on the non-subsampled shearlet transform and a parameter-adaptive pulse-coupled neural network proposed in [27].

4.1. Gray Image Fusion Experiment

In this experiment, six pairs of brain images in different states were selected for fusion. The first three pairs are CT/MR-T2 images and the last three pairs are MR-T1/MR-T2 images. The resulting images fused by different algorithms are shown in Figures 3–8, and their objective quality evaluation indicators are listed in Tables 1–6.

According to Figures 3–8, comparison algorithm 1 performed poorly relative to the source images in presenting detailed feature information and produced horizontal and vertical blocking effects (Figures 3(c), 4(c), 5(c), 6(c), 7(c), and 8(c)). Comparison algorithm 2 performed poorly relative to the source MR-T2 images in presenting detailed edge information and produced blurry edge details (Figures 3(d), 4(d), 5(d), 6(d), 7(d), and 8(d)). Comparison algorithm 3 had low overall contrast and blurred edge details (Figures 3(e), 4(e), 5(e), 6(e), 7(e), and 8(e)). Comparison algorithm 4 had blurry edge details (Figures 3(f), 4(f), 5(f), 6(f), 7(f), and 8(f)). Comparison algorithm 5 had low contrast in the upper right corner (Figures 3(g), 4(g), 5(g), 6(g), 7(g), and 8(g)). FFST-SR-PCNN fully retained the feature information of the source images, without dark lines or low contrast (Figures 3(h), 4(h), 5(h), 6(h), 7(h), and 8(h)). From the evaluation indicators in Tables 1–6, FFST-SR-PCNN outperformed the other five comparison algorithms on QAB/F by an average increase of 15.5%. FFST-SR-PCNN was not always the best on every individual evaluation indicator, but it always ranked in the top three. Its computational efficiency was lower than that of comparison algorithm 5 (34.8% lower on average) but higher than those of the other four methods (34.6%, 65%, 63.7%, and 48.5% higher on average, respectively). This is because comparison algorithm 5 needs relatively few iterations, yet its other indicators were not as good as those of FFST-SR-PCNN. Overall, FFST-SR-PCNN had the best effect and can provide better fused medical images at a relatively low computing cost.

4.2. Color Image Fusion Experiment

In this experiment, six pairs of brain images in different states were selected for fusion. The first three pairs are MR-T2/PET images and the last three pairs are MR-T2/SPECT images. The resulting images fused by different algorithms are shown in Figures 9–14, and their objective quality evaluation indicators are listed in Tables 7–12.

According to Figures 9–14, comparison algorithm 1 performed poorly relative to the source images in presenting detailed feature information and produced widespread blocking effects (Figures 9(c), 10(c), 11(c), 12(c), 13(c), and 14(c)). Comparison algorithm 2 retained most feature information from the source images, but the fused images had low overall contrast (Figures 9(d), 10(d), 11(d), 12(d), 13(d), and 14(d)). Comparison algorithm 3 had blurred edge contours compared with the source images (Figures 9(e), 10(e), 11(e), 12(e), 13(e), and 14(e)). Comparison algorithm 4 retained most of the feature information from the source images, but the edge contours were blurred (Figures 9(f), 10(f), 11(f), 12(f), 13(f), and 14(f)). Comparison algorithm 5 gave clearer details than the other four algorithms (methods 1 to 4), but its contrast was still somewhat low (Figures 9(g), 10(g), 11(g), 12(g), 13(g), and 14(g)). FFST-SR-PCNN fully retained the feature information from the source images, without low contrast or blocking effects (Figures 9(h), 10(h), 11(h), 12(h), 13(h), and 14(h)). From the evaluation indicators in Tables 7–12, FFST-SR-PCNN outperformed the other five comparison algorithms on QAB/F by an average increase of 31.7%. FFST-SR-PCNN was not always the best on every individual evaluation indicator, but it always ranked in the top two. Its computational efficiency was lower than that of comparison algorithm 5 (17.7% lower on average) but higher than those of the other four methods (40.35%, 76.8%, 69.8%, and 64.4% higher on average, respectively). This is because comparison algorithm 5 needs relatively few iterations, yet its other indicators were not as good as those of the proposed algorithm. Overall, FFST-SR-PCNN had the best effect and can provide better fused medical images at a relatively low computing cost.

Taking the gray-image and color-image fusion results together, FFST-SR-PCNN achieves better fusion performance in edge sharpness, intensity variation, and contrast.

5. Conclusion

To improve the fusion performance for multimodal medical images, this paper proposed FFST-SR-PCNN, an algorithm based on FFST, sparse representation, and a pulse-coupled neural network. It delineates details excellently and can efficiently extract the feature information of images, thus enhancing the overall performance of the fusion results. The performance of FFST-SR-PCNN was evaluated by several experiments. In the comparisons with five algorithms, every single-evaluation index of our algorithm ranked in the top three; the comprehensive evaluation gave the best result, and its QAB/F is higher than that of all five comparison algorithms. Subjectively, FFST-SR-PCNN can efficiently express the marginal information of images and makes the details of the fused image clearer, with smoother edges. Thus, it has better subjective visual effects.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.