Image Recovery Algorithm Based on Learned Dictionary

Zhu, Xinghui; Kui, Fang

doi:https://doi.org/10.1155/2014/964835

Mathematical Problems in Engineering

Research Article | Open Access

Volume 2014 | Article ID 964835 | https://doi.org/10.1155/2014/964835

Xinghui Zhu¹ and Fang Kui¹

Academic Editor: Binxiang Dai

Received 28 May 2014

Accepted 25 Jul 2014

Published 12 Aug 2014

Abstract

We propose a recovery scheme for image deblurring. The scheme works within the framework of sparse representation and makes three main contributions. First, exploiting the sparsity of natural images, nonlocal overcomplete dictionaries are learned for image patches. Second, the patches in each nonlocal cluster are coded with the corresponding learned dictionary to recover the whole latent image. Third, for practical applications, we also propose a method to estimate the blur kernel, which makes the algorithm usable for blind image recovery. The experimental results demonstrate that the proposed scheme is competitive with some current state-of-the-art methods.

1. Introduction

Image recovery has been widely studied over the past decades and remains an active area in low-level image processing [1, 2]. Many factors may cause blur, such as imperfections of the imaging device, atmospheric turbulence (for remote sensing images), and relative motion between the camera and the scene. Assuming that the blur model is linear and space invariant, the observation can be expressed as
$$y = k \otimes x + n, \tag{1}$$
where $y$ and $x$ denote the blurry image and the latent image, respectively, $k$ is the blur kernel, and $\otimes$ and $n$ represent the convolution operation and the additive white Gaussian noise. The task of deblurring is to recover the latent image $x$ from the degraded observation $y$; because the problem is ill-posed, it is usually solved by regularization. The key idea of regularization is to penalize solutions that violate prior knowledge about the latent image. Many kinds of prior knowledge have been studied in the literature [3–5]. In recent years, sparse-regularization-based modeling has been employed to solve recovery problems and has been shown to be more effective than conventional regularization. Jalobeanu et al. proposed to deblur satellite images with complex wavelet packets, establishing a Gaussian model for the sparse coefficients [6]. Cho et al. [7] proposed a regression model that exploits the sparsity of natural images to mitigate ringing artifacts in the deblurred image, which can be seen as a natural generalization of [8]. Based on dictionary learning, Foi et al. proposed a shape-adaptive DCT for high-quality image deblurring [9]. Recently, the NLM (nonlocal means) method, combined with sparse representation, has been exploited successfully in many image processing tasks; for example, [10, 11] both achieve impressive recovery results based on this model.

In this paper, inspired by the sparse representation and NLM techniques, we propose a novel scheme for image deblurring, which trains many subdictionaries to better represent patches with different features and recovers the whole latent image based on the sparse model.

The remainder of the paper is organized as follows. In Section 2, we review the regularization framework for image deblurring and propose a novel optimization based on sparse representation. In Section 3, we present an iterative method that solves the optimization of Section 2 with the alternating direction method (ADM) and, additionally, develop a dictionary learning method with NLM to better represent the image. The numerical experiments are shown in Section 4, and we conclude the paper in Section 5.

2. Deblurring Framework with Sparse Model

To begin, following the conventional notation in the literature, we denote the latent image by $x$ and the degraded image by $y$ and write the blur model as
$$y = k \otimes x + n, \tag{2}$$
where $\otimes$ indicates the convolution operator and $k$ and $n$ are the blur kernel and the additive white Gaussian noise, respectively.
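The degradation model can be simulated in a few lines. The following is a minimal sketch, not the authors' code: it assumes periodic (circular) boundary conditions so that the convolution can be evaluated with the FFT, and the helper names `gaussian_kernel`, `pad_kernel`, and `blur`, the 9 × 9 kernel support, and the toy test image are all illustrative assumptions.

```python
import numpy as np

def gaussian_kernel(size=9, sigma=1.5):
    """Normalized 2-D Gaussian blur kernel (entries sum to 1)."""
    ax = np.arange(size) - (size - 1) / 2.0
    g = np.exp(-ax**2 / (2.0 * sigma**2))
    k = np.outer(g, g)
    return k / k.sum()

def pad_kernel(k, shape):
    """Zero-pad k to `shape` and move its center to index (0, 0)."""
    out = np.zeros(shape)
    kh, kw = k.shape
    out[:kh, :kw] = k
    return np.roll(out, (-(kh // 2), -(kw // 2)), axis=(0, 1))

def blur(x, k, noise_std=0.0, seed=0):
    """Return k (*) x + n, with circular convolution and white Gaussian n."""
    K = np.fft.fft2(pad_kernel(k, x.shape))
    y = np.real(np.fft.ifft2(K * np.fft.fft2(x)))
    rng = np.random.default_rng(seed)
    return y + noise_std * rng.standard_normal(x.shape)

# Toy latent image: a bright square on a dark background (assumed example).
x = np.zeros((32, 32))
x[12:20, 12:20] = 1.0
k = gaussian_kernel()
y = blur(x, k, noise_std=0.01)
```

Because the kernel is normalized, the noise-free blur preserves the total image intensity, which is a quick sanity check for any implementation of the model.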

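Based on regularization, the solution of (2) can be obtained from the optimization
$$\hat{x} = \arg\min_{x} \|k \otimes x - y\|_F^2 + \lambda\,\Phi(x), \tag{3}$$
where $\|\cdot\|_F$ denotes the Frobenius norm and $\Phi(x)$ represents the regularization of the latent image $x$, such as the TV norm or a smoothness prior, weighted by the regularization parameter $\lambda$.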

To solve the optimization in (3), the blur kernel must be known; in blind image deblurring, however, it is unknown in practice. A popular estimation method was developed in [12]; the kernel estimation model proposed in that work is referred to here as (4).

In our scheme, we incorporate the models in (3) and (4) into a novel unified variational framework:
$$\min_{x,\,k} \|k \otimes x - y\|_F^2 + \lambda\,\Phi(x) + \gamma\,\Psi(k), \tag{5}$$
where $\Phi(x)$ and $\Psi(k)$ denote the regularization penalty terms of the desired latent sharp image and the blur kernel, respectively, which stabilize the solution, and $\lambda$ and $\gamma$ are the regularization parameters.

Furthermore, denoting the fixed-size image patch at location $i$ by $x_i = R_i x$ and the dictionary by $D$, we propose a novel framework based on the sparse model in the following form:
$$\min_{x,\,k,\,\{\alpha_i\}} \|k \otimes x - y\|_F^2 + \lambda \sum_i \|R_i x - D\alpha_i\|_2^2 + \gamma \|k\|_2^2, \tag{6}$$
where $R_i$ is a matrix that extracts the patch from the latent image at location $i$ and $\alpha_i$ is the sparse code of that patch. The first term in (6) is the fidelity constraint, which guarantees that the latent image is consistent with the degraded observation. The second term constrains the patches of the image to be sparsely represented over a suitable dictionary $D$. The third term is the $\ell_2$-norm constraint on the blur kernel, which preserves the stability of the solution.

The optimization in (6) can be solved with the alternating direction method (ADM) [13], a popular method for multivariable optimization. In addition, the choice of dictionary deserves study, because the better the dictionary fits the image, the more effective the restoration is. In the next section, we therefore present an iterative algorithm based on ADM to solve the minimization in (6).

3. Iterative Algorithm for Image Deblurring

3.1. Local Dictionary Learning

In recent years, there has been growing interest in the sparse model and its application to image processing. A key technique for the sparse model is to design a dictionary that fits the model well, and dictionaries are generated by two main methods. The first combines predefined bases, such as wavelets and the DCT, to generate a comprehensive dictionary. A dictionary generated in this way, however, is not data dependent; in other words, it can be termed "fixed." The second learns a dictionary adapted to a set of training examples taken from the degraded image itself or from some sample images. In our framework, we adopt the second method, which makes the dictionary more adaptive and flexible. A recent method of this kind, termed K-SVD, was introduced by Aharon et al. [14]. In their work, the authors learn a single global dictionary that best represents the patches of the image. However, this training generates one group of atoms that must represent every patch in the image, which sacrifices some ability to represent local structure.

Note that patches with similar structure in an image can ideally be represented by the same dictionary. Motivated by the NLM technique, our learning scheme trains many local dictionaries, one for the similar patches in each cluster. For low complexity, the distance between two patches is measured by (7), where $x_i$ and $x_j$ denote the patches at locations $i$ and $j$ and $h$ is the scale parameter that controls the discrimination. We then take the distance in (7) as the measurement and cluster the patches with the $K$-means algorithm.
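The clustering step can be illustrated as follows. This is a minimal sketch under stated assumptions rather than the authors' implementation: it uses the plain squared Euclidean distance between vectorized patches in place of the scaled distance (7), and the helper names `extract_patches` and `kmeans`, the 8 × 8 patch size, and the stride are hypothetical choices.

```python
import numpy as np

def extract_patches(img, size=8, stride=4):
    """Collect size x size patches on a regular grid, one vectorized patch per row."""
    H, W = img.shape
    rows = range(0, H - size + 1, stride)
    cols = range(0, W - size + 1, stride)
    return np.array([img[r:r + size, c:c + size].ravel()
                     for r in rows for c in cols])

def kmeans(P, n_clusters=5, n_iter=20, seed=0):
    """Plain Lloyd iterations with squared Euclidean distance (an assumption)."""
    rng = np.random.default_rng(seed)
    centers = P[rng.choice(len(P), n_clusters, replace=False)].astype(float)
    for _ in range(n_iter):
        # Squared distance from every patch to every cluster center.
        d = ((P[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        for c in range(n_clusters):
            members = P[labels == c]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[c] = members.mean(axis=0)
    return labels, centers

# Toy usage on a random image (stand-in for a real photograph).
rng = np.random.default_rng(1)
img = rng.random((64, 64))
patches = extract_patches(img)
labels, centers = kmeans(patches, n_clusters=5)
```

Each resulting label selects the group of patches from which one local dictionary would be trained.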

Next, given the training set $X_c$ corresponding to cluster $c$, we learn the local dictionary $D_c$ adapted to this cluster by the following minimization:
$$\min_{D_c,\,A_c} \|X_c - D_c A_c\|_F^2, \tag{8}$$
where $A_c$ denotes the coding matrix and $\alpha_i$ is its $i$th column, the sparse code corresponding to patch $x_i$.

3.2. Deblurring Algorithm

In this section, we propose an iterative algorithm to solve the framework in (6). Following the ADM, we solve the minimization alternately and divide the optimization into three major parts.

First, fixing all variables except the blur kernel $k$, (6) reduces to the following form:
$$\min_{k} \|x \otimes k - y\|_F^2 + \gamma \|k\|_2^2. \tag{9}$$

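The minimization (9) is a strictly convex problem, which leads to the closed-form solution
$$k = \mathcal{F}^{-1}\!\left( \frac{\overline{\mathcal{F}(x)} \circ \mathcal{F}(y)}{\overline{\mathcal{F}(x)} \circ \mathcal{F}(x) + \gamma I} \right), \tag{10}$$
where $\mathcal{F}$ and $\mathcal{F}^{-1}$ denote the fast Fourier transform (FFT) and its inverse, $\overline{(\cdot)}$ denotes the complex conjugate operator, "$\circ$" represents the pointwise product (the division is likewise pointwise), and $I$ is a unit matrix.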
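A frequency-domain kernel update of this kind can be sketched in NumPy as follows. The sketch assumes circular convolution and a squared $\ell_2$ penalty on the kernel with weight `gamma`; the function name `estimate_kernel` and the toy data are illustrative assumptions, and the estimate is returned as a full-size array with the kernel origin at index (0, 0).

```python
import numpy as np

def estimate_kernel(x, y, gamma=1e-3):
    """Minimize ||k (*) x - y||^2 + gamma * ||k||^2 over k (circular convolution)."""
    X = np.fft.fft2(x)
    Y = np.fft.fft2(y)
    # Pointwise Tikhonov-regularized division in the Fourier domain.
    K = np.conj(X) * Y / (np.conj(X) * X + gamma)
    return np.real(np.fft.ifft2(K))

# Toy check: blur a random image with a known kernel, then recover the kernel.
rng = np.random.default_rng(0)
x = rng.standard_normal((32, 32))
k_true = np.zeros((32, 32))
k_true[:3, :3] = 1.0 / 9.0  # 3x3 box blur anchored at the origin
y = np.real(np.fft.ifft2(np.fft.fft2(k_true) * np.fft.fft2(x)))
k_est = estimate_kernel(x, y, gamma=1e-8)
```

With noise-free data and a very small `gamma`, the estimate should match the true kernel closely; in the noisy case a larger `gamma` trades accuracy for stability.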

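Second, fixing the code set $\{\alpha_i\}$ and the estimated blur kernel $k$, we obtain the second major part:
$$\min_{x} \|k \otimes x - y\|_F^2 + \lambda \sum_{c} \sum_{i \in \Omega_c} \|R_i x - D_c \alpha_i\|_2^2, \tag{11}$$
where $D_c$ denotes the local learned dictionary for the patches in cluster $c$, $\Omega_c$ is the set of patch locations assigned to cluster $c$, and the subscript $c$ marks the variables corresponding to cluster $c$.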

Given the sparse coding set and the blur kernel, (11) can be solved with the fast deconvolution method [15]; the resulting update of the latent image is referred to as (12).
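To see why this image update is inexpensive, consider the following sketch, which is not the update of [15] itself. It assumes that the subproblem is a quadratic of the form $\|\mathbf{K}x - y\|_2^2 + \lambda\sum_i \|R_i x - D_{c(i)}\alpha_i\|_2^2$, where $\mathbf{K}$ is a hypothetical matrix form of convolution with $k$ and $c(i)$ is the cluster of patch $i$. Setting the gradient with respect to $x$ to zero gives the normal equations:

```latex
\Bigl(\mathbf{K}^{T}\mathbf{K} + \lambda \sum_i R_i^{T} R_i\Bigr)\, x
  \;=\; \mathbf{K}^{T} y + \lambda \sum_i R_i^{T} D_{c(i)}\, \alpha_i .
```

The matrix $\sum_i R_i^{T} R_i$ is diagonal, since each diagonal entry counts how many patches cover the corresponding pixel. If every pixel is covered equally often and the convolution is treated as periodic, the whole system is diagonalized by the FFT and can be solved with pointwise divisions.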

Finally, we code the patches of $x$ with the local dictionaries $D_c$ to update the sparse coding set $\{\alpha_i\}$ for the next iteration; this sparse coding minimization is referred to as (13).

The comprehensive deblurring scheme is summarized in Algorithm 1.

Algorithm 1: Image deblurring with locally learned dictionaries.
Input: the patch set from the degraded image $y$.
Initialization: the coarse estimate $x^{(0)}$ of the latent image by the method in [11];
For $t = 1, \dots, T$:
    Learn the local dictionaries $D_c$ by (8) with the training set from $x^{(t-1)}$;
    Estimate the new kernel $k^{(t)}$ by (10) with $x^{(t-1)}$;
    For $c = 1, \dots, K$: ($K$ is the maximum clustering number)
        Compute the latent image $x^{(t)}$ by (12);
        Compute the coding set of $x^{(t)}$ by solving (13) via the method in [12];
    end
    Update $x^{(t)}$;
end
Output the comprehensive latent image $\hat{x}$.

In Algorithm 1, the coding step (13) can be implemented with standard sparse coding algorithms, such as Basis Pursuit and Orthogonal Matching Pursuit. Algorithm 1 thus solves the unified variational formulation by alternating between dictionary learning, kernel estimation, image update, and sparse coding.
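As an illustration of the coding step, the following is a minimal sketch of Orthogonal Matching Pursuit, one of the coders named above. It is a generic textbook version, not the authors' implementation; the function name `omp`, the sparsity budget `n_nonzero`, and the random dictionary are assumptions made for the example.

```python
import numpy as np

def omp(D, p, n_nonzero):
    """Greedy sparse coding: approximate p with at most n_nonzero atoms of D."""
    residual = p.astype(float).copy()
    support = []
    alpha = np.zeros(D.shape[1])
    coef = np.zeros(0)
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:
            break
        support.append(j)
        # Refit all selected coefficients by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], p, rcond=None)
        residual = p - D[:, support] @ coef
    alpha[support] = coef
    return alpha

# Toy usage: code one random "patch" over a random unit-norm dictionary.
rng = np.random.default_rng(0)
D = rng.standard_normal((16, 32))
D /= np.linalg.norm(D, axis=0)
patch = rng.standard_normal(16)
alpha = omp(D, patch, n_nonzero=3)
```

In the proposed scheme, each patch would be coded over the dictionary of the cluster it belongs to, so `D` would be replaced by the corresponding local dictionary.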

4. Numerical Results and Analysis

We conduct numerical experiments to demonstrate the performance of the proposed scheme. The compared methods are BM3D [16], SA-DCT [9], FISTA [17], and our scheme. All test images are of size 256 × 256. A Gaussian blur kernel with standard deviation 1.5 was used in our experiments, and additive white noise with different standard deviations was also applied. The maximum number of iterations is fixed, and, for low computational complexity, the clustering number $K$ is set to 5 empirically. Zoomed local visual comparisons are shown in Figures 1 and 2; due to the limited space, we only show the deblurring results of Parrots and Cameraman at one noise level. The PSNR for the test images is reported in Tables 1 and 2 as the objective evaluation.
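For reference, PSNR is computed from the mean squared error between the reference image and the estimate. The sketch below is a standard definition rather than the authors' evaluation script; the function name `psnr` and the peak value of 255 (8-bit images) are assumptions.

```python
import numpy as np

def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB; `peak` is the assumed maximum pixel value."""
    diff = reference.astype(float) - estimate.astype(float)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(peak ** 2 / mse)

# Toy usage: a constant error of 25.5 gray levels on an 8-bit scale.
ref = np.zeros((4, 4))
est = np.full((4, 4), 25.5)
value = psnr(ref, est)
```

A higher PSNR indicates a smaller pixelwise error, which is why it is reported alongside, and not instead of, the visual comparison.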

From the visual comparison, we can see that our scheme outperforms the other methods in suppressing the staircase effect and also shows some improvement in recovering texture and sharper edges. The PSNR results also demonstrate that our method is competitive with the compared methods and remains robust at higher noise standard deviations. Although BM3D obtains the highest PSNR, its visual results still leave some staircase effect in smooth areas. The PSNR of our scheme is only slightly lower than that of BM3D, but the visual quality appears better, for example in textured regions, which suggests that our method offers an overall advantage over BM3D. In addition, in the experiments we found that the PSNR decreased with the increasing iterations, which we take as empirical evidence of the convergence of our scheme.

5. Conclusions

In this paper, we addressed the image deblurring problem and proposed a scheme based on sparse representation. To better represent the image, we learned many dictionaries from patches with nonlocal self-similarity. We then incorporated the kernel estimation and the sparse model into a novel unified framework and provided an iterative algorithm that solves this framework with the ADM. The experimental results demonstrate that the proposed scheme is competitive with some leading methods. For future work, several extensions can be explored, such as extending the framework to video deblurring and considering other kinds of noise.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

This project was supported in part by the National Key Technology R&D Program of China (2013BAD15B02 and 2012BAD35B07).

References

[1] F. Li, C. M. Shen, J. S. Fan, and C. L. Shen, “Image restoration combining a total variational filter and a fourth-order filter,” Journal of Visual Communication and Image Representation, vol. 18, no. 4, pp. 322–330, 2007.
[2] J. Yang, Y. Zhang, and W. Yin, “An efficient TVL1 algorithm for deblurring multichannel images corrupted by impulsive noise,” SIAM Journal on Scientific Computing, vol. 31, no. 4, pp. 2842–2865, 2009.
[3] A. M. Bruckstein, D. L. Donoho, and M. Elad, “From sparse solutions of systems of equations to sparse modeling of signals and images,” SIAM Review, vol. 51, no. 1, pp. 34–81, 2009.
[4] X. Guo, F. Li, and M. K. Ng, “A fast $\ell_1$-TV algorithm for image restoration,” SIAM Journal on Scientific Computing, vol. 31, no. 3, pp. 2322–2341, 2009.
[5] S. Dai, M. Han, W. Xu, Y. Wu, Y. Gong, and A. K. Katsaggelos, “SoftCuts: a soft edge smoothness prior for color image super-resolution,” IEEE Transactions on Image Processing, vol. 18, no. 5, pp. 969–981, 2009.
[6] A. Jalobeanu, L. Blanc-Féraud, and J. Zerubia, “Hyperparameter estimation for satellite image restoration using a MCMC maximum-likelihood method,” Pattern Recognition, vol. 35, no. 2, pp. 341–352, 2002.
[7] T. S. Cho, N. Joshi, C. L. Zitnick, S. B. Kang, R. Szeliski, and W. T. Freeman, “A content-aware image prior,” in Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '10), pp. 169–176, June 2010.
[8] A. Levin, R. Fergus, F. Durand et al., “Deconvolution using natural image priors,” ACM Transactions on Graphics, vol. 26, no. 3, 2 pages, 2007.
[9] A. Foi, V. Katkovnik, and K. Egiazarian, “Pointwise shape-adaptive DCT for high-quality denoising and deblocking of grayscale and color images,” IEEE Transactions on Image Processing, vol. 16, no. 5, pp. 1395–1411, 2007.
[10] K. Dabov, A. Foi, V. Katkovnik, and K. Egiazarian, “Image denoising by sparse 3-D transform-domain collaborative filtering,” IEEE Transactions on Image Processing, vol. 16, no. 8, pp. 2080–2095, 2007.
[11] W. Dong, L. Zhang, and G. Shi, “Centralized sparse representation for image restoration,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV '11), pp. 1259–1266, Barcelona, Spain, November 2011.
[12] S. Cho and S. Lee, “Fast motion deblurring,” ACM Transactions on Graphics, vol. 28, no. 5, pp. 145–145, 2009.
[13] E. Esser, “Applications of Lagrangian-based alternating direction methods and connections to split Bregman,” CAM Rep 09-31, UCLA, 2009.
[14] M. Aharon, M. Elad, and A. Bruckstein, “K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311–4322, 2006.
[15] Q. Shan, J. Jia, and A. Agarwala, “High-quality motion deblurring from a single image,” ACM Transactions on Graphics, vol. 27, no. 3, p. 73, 2008.
[16] K. Dabov, A. Foi, V. Katkovnik, and K. Egiazarian, “Image restoration by sparse 3D transform-domain collaborative filtering,” in Image Processing: Algorithms and Systems VI, vol. 6812 of Proceedings of SPIE, 2008.
[17] A. Beck and M. Teboulle, “Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems,” IEEE Transactions on Image Processing, vol. 18, no. 11, pp. 2419–2434, 2009.

Copyright

Copyright © 2014 Xinghui Zhu and Fang Kui. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
