Research Article  Open Access
A Weighted TwoLevel Bregman Method with Dictionary Updating for Nonconvex MR Image Reconstruction
Abstract
Nonconvex optimization has shown that it needs substantially fewer measurements than l_{1} minimization for exact recovery under fixed transform/overcomplete dictionary. In this work, two efficient numerical algorithms which are unified by the method named weighted twolevel Bregman method with dictionary updating (WTBMDU) are proposed for solving l_{p} optimization under the dictionary learning model and subjecting the fidelity to the partial measurements. By incorporating the iteratively reweighted norm into the twolevel Bregman iteration method with dictionary updating scheme (TBMDU), the modified alternating direction method (ADM) solves the model of pursuing the approximated l_{p}norm penalty efficiently. Specifically, the algorithms converge after a relatively small number of iterations, under the formulation of iteratively reweighted l_{1} and l_{2} minimization. Experimental results on MR image simulations and real MR data, under a variety of sampling trajectories and acceleration factors, consistently demonstrate that the proposed method can efficiently reconstruct MR images from highly undersampled kspace data and presents advantages over the current stateoftheart reconstruction approaches, in terms of higher PSNR and lower HFEN values.
1. Introduction
Fast acquisition is an important issue in magnetic resonance imaging (MRI) for avoiding physiological effects and reducing scanning time on patients. Acquisition time is proportional to the number of acquired kspace samples [1]. Unfortunately, accelerating acquisition by reducing kspace samples leads to noise amplification, blurred object edges, and aliasing artifacts in MR reconstructions. Then improving reconstruction accuracy from highly undersampled kspace data becomes a complementary tool to alleviate the above side effects of reducing acquisition.
To cope with the loss of image quality, some prior information should be used in the reconstruction procedure. One process of introducing prior information in the reconstruction is known as “regularization” [2–11]. Tikhonov regularization, a commonly used method that pursuits reconstructions by a norm minimization, leads to a closedform solution that can be numerically implemented in an efficient way [6–8, 11]. Moreover, with the advent of compressed sensing (CS) theory, sparsitypromoting regularization has gained popularity in MRI (e.g., the based regularization) [1–3, 5, 10, 12]. The CS theory states that sparse, or more generally, compressible signals, incoherently acquired in an appropriate sense, can be recovered from a reduced set of measurements that are largely below the Nyquist sampling rate. Exact reconstruction can be achieved by nonlinear algorithms, using minimization or orthogonal matching pursuit (OMP) [1–3, 5, 10, 12, 13]. Usually, nonconvex optimization like () minimization will guarantee a better recovery by directly attacking the minimization problem [14–16]. Chartrand et al. investigated the nonconvex () norm optimization as a relaxation of the norm for MRI reconstruction [17, 18] and showed that it often outperforms the minimization method. Inspired by viewing as a scale parameter, Trzasko and Manduca [15] developed a homotopic minimization strategy to reconstruct MR image and generalized the nonconvex () norm to a family of nonconvex functions as alternatives. Candès et al. presented the iteratively reweighted minimization (IRL1) algorithm [19], which amounts to linearly minimize the log penalty of the total variation. Wong et al. [16] further incorporated a seminonlocal priority into the homotopic minimization for breast MRI.
Besides the priorinducing penalty function, another important issue in CSMRI reconstruction, is the choice of a sparsifying transform, since one of the important requirements in CSMRI is that the image to be reconstructed has a sparse representation in a given transform domain. At the start, sparse representation was usually realized by total variation (TV) regularization and/or wavelet transform [1, 12, 13, 15, 20, 21]. As it is known, TV prior assumes the target images consist of piecewise constant areas which may not be valid in many practical MRI applications. Other analytically designed bases, such as wavelets and shearlets, also involve their intrinsic deficiencies [20]. Some advanced approaches were developed to address these issues. Knoll et al. introduced the total generalized variation (TGV) for MR image reconstruction problems [22]. Liang et al. [23] applied the nonlocal total variation (NLTV) regularization to improve the signaltonoise ratio (SNR) in parallel MR imaging, by replacing the gradient functional in conventional TV using a weighted nonlocal gradient function to reduce the blocky effect of TV regularization. Since fixed bases might not be universally optimal for all images, some methods utilizing sparsity prior under adaptive transform/dictionary were developed recently [24–29]. The sparsity in this framework is enforced on overlapping image patches to emphasize local structures. Additionally, the global dictionary consisting of local atoms (prototype) is adapted to the particular image instance, which thereby provides better sparsity. Our work also adopts the similar concept as a penalty term for MRI reconstruction.
After reviewing two important components in CS, an interesting question is whether a better sparsityinducing function (e.g., () norm) combined with a better transform (e.g., adaptive dictionary) will lead to better reconstruction from the same set of measurements. To the best of our knowledge, there is only one paper by Shi et al. preliminarily studying the approximation sparsity under the learned dictionary [14], where the penalty is relaxed by the nonconvex minimax concave (MC) penalty. The MC penalty approximates the penalty by gradually increasing a scale parameter. However, their work focused on image processing and the homotopic strategy they used is computationally demanding [14].
In this paper, we propose a novel method to solve the nonconvex minimization problem in CSMRI by incorporating the reweighting scheme with our recently developed twolevel Bregman method with dictionary updating (TBMDU). An ultimate advantage of our proposed method is that, by incorporating the iteratively reweighted scheme into the twolevel Bregman method/augmented Lagrangian (AL) with dictionary updating, the based updating step is conducted in each inner iteration of the AL scheme, and both the minimization of the sparsity of image patches and the datafidelity constraint can be solved in a unified formalism. The modified alternating direction method (ADM) efficiently solves the model of pursuing the approximated ()norm penalty. Specifically, by applying iteratively reweighted or minimization scheme at each AL inner iteration, the proposed algorithm utilizes adaptive weights to encourage the large coefficients to be nonzero and the small ones to be zero under learned dictionary, hence resulting in more compact and discriminative representation for reconstructed image patches. Numerical experiments show that the performance of the derived method is superior to other existing methods under a variety of sampling trajectories and kspace acceleration factors. Particularly, it achieves better reconstruction results than those by using TBMDU without adding any computational complexity.
The rest of this paper is organized as follows. We start with a brief review on TBMDU and iteratively reweighting scheme for minimization in Section 2. Consequently, the proposed method WTBMDU is presented in Section 3. Several numerical simulation results are illustrated in Section 4 to show the superiority of the proposed method. Finally, conclusions and discussions are given in Section 5.
2. Theory
Plenty of papers, with the theme of compressed sensing (CS) [2, 3], demonstrated that MR images can be reconstructed from very few linear measurements [1–3, 5, 10, 12]. These approaches take advantage of the sparsity inherent in MR images. The wellknown example is the success of TV regularization which implies that MR images can be approximated by those having sparse gradients. Generally, if a sparsifying transform can be readily defined, ideally one could choose the estimation with the sparsest representation in that still matches the limited measurements. That is, by minimizing transformdomain sparsitypromoting energy that is subject to the data consistency , where is noise, it yields where is a vector representing the pixel 2D complex image. denotes the partially sampled Fourier encoding matrix and represents the acquired data in kspace. is the standard deviation of the zeromean complex Gaussian noise, and is the standard deviation of both real and imaginary parts of the noise. Practically, the quasinorm in the regularization term of (1) is usually relaxed. In this paper, we propose some efficient algorithms for nonconvex relaxation such as quasinorm [17, 18].
The choice of the sparsifying transform is an important consideration in minimizing functional (1). Besides the fixed transforms such as finitedifference and wavelet transform, modeling image patches (signals) by sparse and redundant representations have been drawing considerable attention in recent years [24]. Considering the image patches set consisting of signals, denotes a vectored form of the patch extracted from the image of size . The sparseland model for image patches suggests that each image patch, , could be sparsely represented over a learned dictionary [27]; that is, where stands for the sparse coefficient of the th patch. The combination of sparse and redundant representation modeling of signals, together with a learned dictionary using signal examples, has shown its promise in a series of image processing applications [24–29]. For example, in [25, 26], the authors used reference MRI image slices to train the dictionary and achieved limited improvement. Considering that a dictionary learnt from a reference image would not be able to effectively sparsify new features in the current scan; Ravishankar and Bresler [29] suggested learning dictionary from the target image itself and developed a twostep alternative method to remove aliasing artifacts and noise in one step and subsequently fills in the kspace data in the other step.
In the following, to make the paper selfcontained, we, respectively, review the TBMDU method for MRI reconstruction and reweighted scheme for solving nonconvex relaxation of the quasinorm minimization.
2.1. The TBMDU Framework for MRI Reconstruction
Recently, we proposed a series of dictionary learning methods [30–32] based on augmented Lagrangian/Bregman iterative scheme [33]. Particularly, by using a twolevel Bregman iterative method and the relaxed version of sparseland model (i.e., ) as the regularization term for MRI reconstruction [31], the TBMDU method solves the objective function equation (1) by iterating the following: where and . stands for the sparse level of the image patches in the “optimal” dictionary. For many natural or medical images, the value of can be determined empirically with robust performance in our work. , in which measures the degree of the overcompleteness of the dictionary.
The proposed TBMDU method consists of a twolevel Bregman iterative procedure, where (3) is the outerlevel of the method. On the other hand, the innerlevel Bregman iterative is employed to solve the subproblem of sparse representation in (3), that is, adding auxiliary variables to convert the unconstrained problem (2) into a constrained formulation:
Then split Bregman method/augmented Lagrangian is used to solve problem (4) as follows:
We minimize the corresponding AL function (5) alternatively with respect to one variable at a time. This scheme decouples the minimization process and simplifies the optimization task. At the dictionary updating step, by updating (5) with respect to along the gradient descent direction and constraining the norm of each atom to be unit [24], we can get the following update rule:
At the sparse coding stage of updating , the optimization problem for each is derived:
By applying the iterative shrinkage/thresholding algorithm (ISTA) [33, 34] (i.e., performing a gradient descent of the first functional term in (8) and then solving a proximal operator problem) and using the immediate variable to represent the updating scheme compactly (i.e., the formulation derived from (6), it attains the solution of as follows: where and is the solution of .
The corresponding procedure of TBMDU is summarized in Algorithm 1. Line 10 is the frequency interpolation step for updating the image . AL scheme and ADM applied to the constrained problem result in an efficient twostep iterative mechanism, which alternatively updates the solution and imagepatch related coefficients (, and ). For a more detailed description of the derivations, the interested readers can refer to [31].

2.2. Reviews of Reweighted Scheme on Solving Nonconvex Relaxation of the QuasiNorm
In (1), the penalty can be relaxed into several tractable alternatives. For example, some nonconvex relaxations such as quasinorm [17, 18], log penalty [19], and smoothly clipped absolute deviation (SCAD) penalty [35] were usually used. Compared to norm, function with is a smooth relaxation of and the geometry of ball gives better approximation on sparsity. Recent numerical results showed that adapting nonconvex optimization technique can reduce the required number of measurements for reconstruction [14–16].
Most current approaches for solving nonconvex quasinorm optimization can be classified into two categories: reweighted and reweighted [36–41]. The common idea of these methods is to utilize an optimization transfer technique to iteratively surrogate the nonconvex potential function by a convex function [37, 39, 41]. Usually, the convex function is chosen as a local quadratic approximation or linear local approximation. A technique drawing much interest is the iteratively reweighted least squares (IRLS) method which is widely used in compressed sensing and image processing [36, 40]. Another technique closely related to IRLS is the iteratively reweighted norm (IRN) approach introduced by Rodríguez and Wohlberg for generalized total variation functional [42]. The IRN method pursues to minimize the norm for by using a weighted norm in an iterative manner. At iteration , the solution is the minimizer of The sequence of solution converges to the minimizer of as . It has been proven that IRN is one kind of majorizationminimization (MM) method [42], which involves good property of . In order to extend the iterative reweighting approach IRLS to the general nonconvex function, Mourad and Reilly developed the quadratic local approximation of the nonconvex function as follows [39]: where is a suitably chosen parameter and is the first derivative of . stands for the th index number of vector .
A similar work conducted by Elad and Aharon is the iteratively reweighted minimization (IRL1) algorithm [27], which iteratively minimizes the linearization of a quasilogarithmic potential function. Zou and Li [41] presented the linear local approximation of the general nonconvex function and the upper bound of Equation (12) aims to approximate the penalty function with its firstorder Taylor expansion at the current iteration. Its majorization properties can be easily obtained by exploiting the concavity of the function , which always lies below its tangent. The numerical tests in [17] by Chartrand and Yin demonstrated that IRLS is comparable with IRL1. In [38], the authors presented weighted nonlinear filter for compressed sensing (WNFCS) for general nonconvex minimization by integrating the IRL1 algorithm shown in (12) and their previously developed NFCS framework [43].
3. Materials and Methods
As discussed in previous section, dictionary updating and sparse coding to (5) are performed sequentially. Therefore, an interesting question is that will the better sparsity inducing function in the coefficient matrix lead to better image reconstruction under the learned dictionary? In the following, we investigate two variations at the sparse coding step. By employing the seminorms (), it is feasible to expect that utilizing approximations that are closer to the quasinorm than will correspondingly reduce the required number of measurements for accurate reconstruction.
3.1. Proposed Method: WTBMDU
By replacing norm with nonconvex norm, (3) can be rewritten as
After some manipulations similar to those in Section 2.1, updating coefficients at the sparse coding stage are deduced as follows:
When other variables are fixed, numerical strategies are proposed in the following to solve the corresponding nonconvex minimization problems with regard to , by iteratively minimizing a convex majorization of the nonconvex objective function.
3.1.1. Algorithm 1: Reweighed
By inserting the local linear approximation in the sparse coding step, it is possible to transform the nonconvex unconstrained minimization problem into a convex unconstrained problem, which can be solved iteratively. In the penalty functional of (14), like the wellknown method ISTA, a proximal operator is employed to approximate the first term, and the reweighted approximation is employed to the second term: where stands for the th index number of vector . Following the definition in (12), where and , it attains the solution of : where . is a small parameter to prevent division by zeros.
3.1.2. Algorithm 2: Reweighed

Compared to the previous reweighted strategy, the only difference is that the reweighted approximation in (10) is employed to the second term:
Then it attains the solution of :
The minimizer of this leastsquare problem is given by where , . Similar to that in (14), is a small parameter to prevent numerical instabilities.
Now, we summarize our proposed method for MRI reconstruction here, which we call weighted TBMDU. The detailed description of the proposed method is listed in Algorithm 2. Similar to TBMDU, the proposed WTBMDU method alternatively updates the target solution , image patch related coefficients (, and ), and auxiliary variables. The difference between the plain TBMDU and the weighted TBMDU mainly lies on the weights used in updating the sparse coefficients. In WTBMDU, the weights are obtained by evaluating the function of variables at the solution of the previous step. Specifically, whether the in (16) or in (19), the weighting scheme decreases as the absolute value of increases, indicating that it penalizes more on the coefficients with small magnitude value. Therefore, this operation strongly encourages the large coefficients to be nonzero and the small ones to be zero. In line 8 for reweighted and line 10 for reweighted , the formulations denote operating every element in the matrix componentwise. In the following content of this paper, we denote the reweighted and reweighted in WTBMDU as WTBMDUL1 and WTBMDUL2, respectively.
The strategy we used in WTBMDU is similar to that used in WNFCS [38], that is, combining the penalized proximal splitting strategy and reweighting strategy. The difference is that WNFCS focus on updating in the simple CS domain and the proximal splitting strategy is employed to the Fourierrelated datafidelity term, while our WTBMDU devotes to updating the coefficients in the adaptive dictionary and the proximal splitting strategy is employed to the dictionaryrelated sparse representation error term. Additionally, we also derive the updating scheme based on reweighted .
3.2. Parameter Values, Continuation Strategy, and Algorithm Convergence
The proposed method involves four parameters: , , , and . The setting of these parameters is similar to that in TBMDU. Firstly, both parameters and are the Bregman (or augmented Lagrangian) positive parameters associated with the small image patches and the whole image itself, respectively. One is for the overlapping image patches and the other is for the image solution itself. It has been mathematically proven that the choice of the Bregman parameter has little effect on the final reconstruction quality as long as it is sufficiently small [31, 44, 45]. In our work, the smaller the value of the Bregman parameter is, the more iterations the Bregman method need to reach the stopping condition. Moreover, since and are with different orders of magnitude, we set in the AL formalism for the balance between various “penalty” terms. Secondly, stands for the sparse level of the image patches and can be determined empirically. Finally, the step size in the dictionary updating stage can be set to be a small positive number, for example, 0.01.
In summary, there is only the parameter that should be carefully chosen in order to enable the efficiency of the algorithm. In our method, by taking advantage of the typical structure of the problem in this paper, we propose a similar rule to that used in [21]. Specifically, we set and so as to achieve condition numbers and that result in fast convergence of the algorithm. Since and , . Consequently is a decreasing function of . Choosing such that would require a large and accordingly, the influence of the weight diminishes and the solution of trends to the leastsquare solution (). In our experiments, we observed that this phenomenon would result in an “immature” dictionary that is not learned enough. On the other hand, taking would increase which makes numerically unstable. The same trend also applies to as a function of . We found that empirically choosing such that generally provided good convergence speed in all experiments. Additionally, we can estimate in a more practical way. Specifically, since the number of data samples is huge, we can set , where is a robust estimate. Together with the fact that , it leads to . As for the parameter , since , where when the patch overlapping is , it concludes that setting such that is a reasonable choice, under the assumption that the kspace data is significantly undersampled (e.g., ).
As for the convergence, because of the unconvexity and nonlinearity of the problem in the case of updating dictionary, the global solution may not to be found easily like in TBMDU. Nevertheless, our dictionary is updated by a gradient descent of the AL scheme which leads to a monotonic decrease in the cost function. At the sparse coding stage, since WTBMDU merges the reweighted and penalization strategies, it still maintains the descent property of the latter and local minimum must be attained as demonstrated in [38]. Therefore, both the value of the objective function and the norm of the reconstruction difference between successive iterations can be chosen as the stopping criterion. The convergence property of the algorithm will be presented in the numerical section.
4. Experiment Results
In this section, we evaluate the performance of the proposed method using a variety of sampling schemes, with different undersampling factors. Sampling schemes used in our experiments include 2D random sampling [15], Cartesian approximation of multishot variabledensity spiral sampling [15], Cartesian sampling with random phase encodings (1D random) [1, 15], and pseudo radial sampling [15, 29]. Reconstruction results on simulated MRI data, a complex phantom, and real MRI data were presented. The MR images tested in the synthetic experiments are from in vivo MR scans of size (many of which are courtesy (2009, American Radiology Services [Online]. Available: http://www3.americanradiology.com/pls/web1/wwimggal.vmg/) and used in [29]), and the real MRI data examples reported here are of size (except in Figures 6 and 7 where a phantom of size was used). According to many prior work on CSMRI [1, 13, 29], the CS data acquisition was simulated by subsampling the 2D discrete Fourier transform of the MR images (except in the second subsection where real acquired data was used). Our proposed method WTBMDU was compared with the leading DLMRI (the code is available in http://www.ifp.illinois.edu/~yoram/DLMRILab/DLMRI.html#Intro) [29] and TBMDU methods [31], which have been shown to substantially outperform other CSMRI methods such as LDP (Matlab codes are available in http://www.stanford.edu/~mlustig/) [1], and the zerofilling reconstruction. DLMRI directly solves the minimization by OMP while TBMDU is devoted to the induced sparse minimization. In each given example, the parameters for the DLMRI method were set to be default values.
In the experiments, the nominal values of various parameters were set as patch size , the overcompleteness of the dictionary (correspondingly ), and the patch overlap ; thereby the number of data samples were for and for . , , , , , and . To avoid dividing by zero, the parameter used in the weighted matrix decreased by 2% after each inner iteration with an initial value of 5. The setting of parameters and is very similar to that in, [31, 32] and not discussed here due to the limit of paper space. Realvalued dictionaries were used for the simulated experiments with realvalued images, where the overcomplete discrete cosine transform (DCT) was chosen as the initial dictionary. Complexvalued dictionaries were used for real MR data, where both the real and imaginary parts were the same DCT matrix. The quality of the reconstruction was quantified using the peak signaltonoise ratio (PSNR (the PSNR is defined as , where the RMSE is the root mean error estimated between the ground truth and the reconstructed image)) and highfrequency error norm (HFEN) [29]. All algorithms were implemented in MATLAB 7.1 on a PC equipped with AMD 2.31 GHz CPU and 3 GByte RAM.
4.1. Reconstruction of Simulated Data (RealValued)
We first investigate the performance of WTBMDU with the noiseless measurements. Figure 1 involves an axial T2weighted reference image of the brain with pseudo radial sampling under 85% and 95% undersampling percentages, respectively (i.e., only acquiring 15% and 5% kspace data with corresponding acceleration factors of 6.67 and 20). The plots of PSNR and HFEN values as functions of iteration number under the undersampling percentage of 85% are presented in Figures 1(d) and 1(e). It can be observed that the quantitative measures of both TBMDU and WTBMDU change quickly during the first few iterations. In other words, these measure values only need less iterations to reach the convergence zone and hence the iterative convergence property of our method is better than that of DLMRI. The higher PSNR values and lower HFEN values after convergence also confirm the superiority of our method to DLMRI and TBMDU. The reconstructed results shown in Figures 1(f), 1(g), 1(h), and 1(i) reveal that the method WTBMDUL1 with provides a more accurate reconstruction on image contrast and sharper anatomical depiction. Compared to DLMRI, the magnitude images of the reconstruction error shown in Figures 1(h) and 1(i) indicate that our method exhibits crisper reconstruction of object edges (the large anatomical structure in the middle region) and preserves finer texture information (the gray matter regions in the bottomright of the reconstruction). In general, our proposed method provides better intensity fidelity to the fully sampled image.
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
(j)
(k)
More obvious differences in visual quality can be observed in the case of 95% undersampling as shown in Figures 1(j) and 1(k). The obtained PSNRs of DLMRI and WTBMDUL1 with are 26.30 dB and 28.13 dB, respectively. The DLMRI reconstruction in Figure 1(j) based on kSVD dictionary updating and greedy pursuit of coefficients shows a large number of spurious oscillations, although it gives much improvement than zerofilling and LDP (not shown in this paper). In contrast, WTBMDU shown in Figure 1(k) results in much fewer spurious oscillations and better preservation of edges through iterative weighting of the coefficients under the dataadaptive dictionary. This is especially noticeable for the brain’s fissures with sharper anatomical edges.
Figure 2 compares the results generated by DLMRI and WTBMDU using four sampling trajectories roughly under the same undersampling percentage: variable density random with 87% undersampling [15], Cartesian approximation of multishot spiral with 86% undersampling [15], 1D Cartesian trajectory [29], and pseudo radial sampling with 86% undersampling (i.e., 7.11fold acceleration). The test image is the axial T2weighted brain image shown in Figure 1(a). As can be observed from the error images, WTBMDUL1 performs better in reducing aliasing artifacts and maintaining fine details than DLMRI with all verified sampling trajectories.
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
(j)
(k)
(l)
Table 1 lists the PSNR values of the axial T2weighted brain image at different sampling trajectories with the same undersampling percentage using DLMRI, TBMDU, and WTBMDU. Generally, the improvements gained by WTBMDU over other methods are different for four kinds of trajectories although under the same undersampling rate. The largest and smallest improvements were achieved with the 2D random and radial sampling, respectively, where roughly 5 dB and 2 dB were obtained. This indicates that the efficiency of dictionary learning methods may depend on the incoherence of the data acquisition. In the family of WTBMDU algorithms, the optimal value is also different at various trajectories for both WTBMDUL1 and WTBMDUL2. To balance the numerical calculation and the selection of trajectories, or is a good option.

Figure 3 illustrates the performance of DLMRI, TBMDU, and WTBMDU at a range of acceleration factors including 2.5, 4, 6, 8, 10, and 20, where zeromean complex white Gaussian noise with standard deviation was added to the 2D random sampled kspace [29]. Since the stopping rule for the outer loop of both TBMDU and WTBMDU is determined by , the number of outer iterations of TBMDU and WTBMDUL2 with take the values of 3, 3, 6, 7, 7, 11 and 4, 8, 8, 10, 11, 11 for the above six acceleration factors, respectively. For the quantitative comparison, the values of PSNR and HFEN as functions of acceleration factor are shown in Figures 3(b) and 3(c). It can be seen that WTBMDU performs best at all tested acceleration factors. The reconstruction results and the corresponding error images under 10fold acceleration using three methods are displayed in Figures 3(d), 3(e), and 3(f) and 3(g), 3(h), and 3(i), respectively.
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
As shown in the error images, the brighter the error image appears, the larger the deviation between the reconstruction and the reference image will be. WTBMDU (Figure 3(i)) presents less pixel errors and structure loss than that of DLMRI in Figure 3(g) and TBMDU in Figure 3(h), especially in regions indicated by red arrows. In general, it can be concluded that both WTBMDUL2 and TBMDU offer strong preservation of details compared with that of DLMRI, while WTBMDUL2 provides a crisper result of fine structures than that of TBMDU.
Figure 4 depicts the reconstruction results of a transverse slice of a noncontrast MR angiography (MRA) of the circle of Willis (COW) at the same experiment setting as in Figure 3. The MRA of the circle of Willis has much textural information such as the vessels on the middle region and finescale details on the bottom region. As can be seen from Figure 4(b), when the acceleration factor increased until 10fold, the PSNR gap between TBMDU and WTBMDU increases synchronously. The PSNR improvement is also reflected in Figures 4(e) and 4(f) where much less errors appeared in the WTBMDU error image, indicating WTBMDU performs better in maintaining fine details. On the other hand, similar as observed in Figure 3, when the acceleration factor increased to as large as 20fold, the advantage of the nonconvex optimization degraded and none of these methods can faithfully reconstruct the original image from such fewer kspace samples with additional noise.
(a)
(b)
(c)
(d)
(e)
(f)
To investigate the sensitivity of various methods to different levels of complex white Gaussian noise, DLMRI, TBMDU, and WTBMDU were applied to reconstruct a T2weighted sagittal view of the lumbar spine under pseudo radial sampling at 6.09fold acceleration. Figure 5(c) plots the PSNR values of the recovered MR images by DLMRI (blue curves), TBMDU (green curves), and WTBMDU (red curves) at a sequence of different standard deviations (, 5, 8, 10, 14.2). In the case of , the PSNR of the image obtained by DLMRI is only 38.22 dB, TBMDU is 39.49 dB, and WTBMDUL2 with reaches 40.52 dB. Obviously, the difference gap between three methods is significant at low noise levels. The reconstructions and corresponding magnitudes of the error images with are shown in Figures 5(d), 5(e), 5(f), 5(g), 5(h), and 5(i). It can be observed that the skeletons in the top half part of the TBMDU reconstruction appear less obscured than those in the DLMRI results. Meanwhile, the reconstruction by WTBMDU is clearer and sharper than that by DLMRI and TBMDU and is relatively devoid of aliasing artifacts. This reveals that our method provides a more accurate reconstruction of image contrast and sharper anatomical depiction in noisy case.
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(a)
(b)
(c)
(d)
(e)
4.2. Reconstruction of Experimental MRI Data (ComplexValued)
Figure 6 shows the comparison between different methods on a physical phantom, which is often used to assess the resolution of MR reconstruction. Figure 6(a) displays the fully sampled reconstruction of the physical phantom. Figures 6(c), 6(d), 6(e), and 6(f) exhibit the results of DLMRI, TBMDU, WTBMDUL1, and WTBMDUL2 with at 80% undersampling ratio. The corresponding PSNR values are 18.66 dB, 26.82 dB, 28.89 dB, and 29.12 dB, respectively. We can find that the WTBMDU reconstructions exhibit higher resolution than those with DLMRI and TBMDU and are almost devoid of aliasing artifacts especially in zoomedin regions.
To investigate the noise sensitivity of proposed method on the complex kspace data, complex Gaussian noise of was added to the kspace with 5fold acceleration. In this case, the PSNR values of DLMRI, TBMDU, and WTBMDUL2 with methods are 17.93 dB, 22.94 dB, and 25.42 dB, respectively. The reconstructions of three methods are shown in Figure 7. The enlargements of two regionofinterests (ROIs) are presented in Figures 7(d) and 7(e). As can be observed, the DLMRI reconstruction exhibits more oscillating artifacts than that of the other two methods. Besides, the spot and circle in the bottom right of the WTBMDU reconstruction appear less obscured than those in the DLMRI and TBMDU results.
In Figures 8 and 9, two real brain data sets containing more finedetailed structures [44, 46] were used for comparison. The images in Figures 8(a) and 9(a) are both the fullysampled reconstruction of size 256 × 256 as references. A variable density Cartesian sampling trajectory with 80% undersampling was employed as shown in Figure 8(b). The reconstructions from two data sets using DLMRI, TBMDU and WTBMDUL2 with are displayed in Figures 8(b), 8(c), and 8(d) and Figures 9(b), 9(c), and 9(d), respectively. The corresponding PSNR values of three methods are 31.35 dB, 32.04 dB, and 32.69 dB for reconstructions in Figure 8 and 30.58 dB, 31.59 dB, and 31.82 dB for those in Figure 9. Some regions of error in these subplots have been illustrated with red arrows, indicating that the reconstruction using WTBMDU shows better fidelity to the reference image than that of DLMRI and TBMDU. The enlargements of the reconstruction results with these methods are shown in Figures 8(f) and 9(e). It can be observed that visible aliasing artifacts along the phase encoding direction (horizontal in the image plane) in the DLMRI reconstruction is more pronounced than those of TBMDU reconstruction, while there are almost no artifacts in that of WTBMDU with . Close inspection of the zoomedin images indicates that the weighted sparse regularization enhances the reconstruction quality.
(a)
(b)
(c)
(d)
(e)
(f)
(a)
(b)
(c)
(d)
(e)
In summary, the results from our work clearly demonstrate the advantages of introducing iterative reweighting scheme and alternating direction method in nonconvex optimization over conventional methods, for constrained image reconstruction from undersampled kspace data. Not only for the brain image containing large piecewise constant regions, as shown in Figures 1, 2, and 3, but also for the circle of Willis and lumbar spine composed of many texture features as shown in Figures 4 and 5, the proposed method visually provides more pleasant results than existing methods. By means of the Bregman iteration/AL methodology, it is notable that the proposed method consistently supports superior results without any parameters tuned manually regardless of different sampling trajectories, varying sampling factors, and the existence of noise or not. We note that, in some circumstances, the PSNR value started to decrease a little bit after achieving the highest value. The reason of this phenomenon is that the solution of problem (1) does not necessarily have higher PSNR than the intermediate iterates. In realtime imaging applications where the acquisition and reconstruction speed is crucial, a numerical algorithm which converges fast at the beginning iterations is highly appreciated and useful.
5. Conclusions and Discussions
This work presents a new regularizer combining the nonconvex pseudonorm and adaptive dictionary for MRI reconstruction from undersampled kspace data. Based on the potential success of TBMDU in the AL framework for CSMRI, the proposed WTBMDU method extends the plain TBMDU to handle nonconvex regularization by weighting the sparse coding of the coefficients. The strategy of employing iteratively reweighted or technique results in the WTBMDUL1 and WTBMDUL2 algorithms, respectively. Highaccuracy reconstructions are obtained from significantly undersampled kspace data by minimizing the adaptively sparsitypromoting regularization criterion subject to the dataconsistency constraint. In particular, compared to the DLMRI method that minimizes the regularization by the greedy algorithm of matching pursuit, the iteratively reweighted strategy combined with AL framework yields much higher PSNR values. On the other hand, compared to the plain TBMDU for norm minimization, the adaptively weighted method achieves 0.3–2.5 dB PSNR improvement and visually provides more pleasure results. Various experimental results demonstrate the superior performance of the method under a variety of sampling trajectories and kspace acceleration factors.
WTBMDU is very general since it represents an algorithmic framework that can be easily adapted to different reweighting strategies and nonconvex sparsityinducing functions proposed in the previous literatures. Some extensions will be studied in future as follows: (i) for the reweighted strategy, using different nonconvex objective functions as conducted in [38, 41]; besides, the shrinkage can be generalized as in [47, 48]; (ii) for the reweighted strategy, trying different nonconvex objective functions as in [39]. Additionally, the proposed scheme can be easily extended to incorporate other prior (e.g., spike and slab prior used in [49], where the regularization consists of the norm and norm terms). It may be better for stabling the numerical process; (iii) the current iteratively reweighted strategies can be combined with the homotopic technique described in [14–16], where the value of slowly decreases from 1 to 0.
Another extension we will consider in future is to extend our proposed model to parallel MRI reconstruction. We note that a very recently published paper [21], under the work of Ramani and Fessler, addressed the regularized SENSEreconstruction using AL methods. In order to effectively solve the unconstrained SENSEreconstruction consisting of TV/wavelets regularization term and datafidelity term using the AL formalism, they split not only the regularization term, but also the Fourier encoding and spatial components in the datafidelity term, by introducing auxiliary variables to decouple the datadomain components, and the regularization component, respectively. We believe that the regularizer of the sparse representations of overlapping image patches in our work can be naturally incorporated into their work in a unified AL framework.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
The authors would like to thank Dr. Leslie Ying at the State University of New York at Buffalo for helpful discussion and thank Ravishankar et al. for sharing their experiment materials and source codes. This work was partly supported by the National Natural Science Foundation of China (61362001, 51165033, 61340025, 61261010, 81120108012, and 11301508), the Natural Science Foundation of Jiangxi Province (20132BAB211030, 20121BBE50023, and 20122BAB211015), Technology Foundation of Department of Education in Jiangxi Province (nos. GJJ12006 and GJJ14196), Young Scientists Training Plan of Jiangxi province (no. 20142BCB23001), the Shenzhen Peacock Plan KQCX20120816155710259, and the Basic Research Program of Shenzhen JC201104220219A.
References
 M. Lustig, D. Donoho, and J. M. Pauly, “Sparse MRI: the application of compressed sensing for rapid MR imaging,” Magnetic Resonance in Medicine, vol. 58, no. 6, pp. 1182–1195, 2007. View at: Publisher Site  Google Scholar
 E. J. Candes, J. Romberg, and T. Tao, “Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information,” IEEE Transactions on Information Theory, vol. 52, no. 2, pp. 489–509, 2006. View at: Publisher Site  Google Scholar  MathSciNet
 D. L. Donoho, “Compressed sensing,” IEEE Transactions on Information Theory, vol. 52, no. 4, pp. 1289–1306, 2006. View at: Publisher Site  Google Scholar  MathSciNet
 S. L. Keeling, C. Clason, M. Hintermüller, F. Knoll, A. Laurain, and G. von Winckel, “An image space approach to Cartesian based parallel MR imaging with total variation regularization,” Medical Image Analysis, vol. 16, no. 1, pp. 189–200, 2012. View at: Publisher Site  Google Scholar
 D. Liang, B. Liu, J. Wang, and L. Ying, “Accelerating SENSE using compressed sensing,” Magnetic Resonance in Medicine, vol. 62, no. 6, pp. 1574–1584, 2009. View at: Publisher Site  Google Scholar
 F.H. Lin, K. K. Kwong, J. W. Belliveau, and L. L. Wald, “Parallel imaging reconstruction using automatic regularization,” Magnetic Resonance in Medicine, vol. 51, no. 3, pp. 559–567, 2004. View at: Publisher Site  Google Scholar
 F.H. Lin, F.N. Wang, S. P. Ahlfors, M. S. Hämäläinen, and J. W. Belliveau, “Parallel MRI reconstruction using variance partitioning regularization,” Magnetic Resonance in Medicine, vol. 58, no. 4, pp. 735–744, 2007. View at: Publisher Site  Google Scholar
 P. Qu, J. Luo, B. Zhang, J. Wang, and G. X. Shen, “An improved iterative SENSE reconstruction method,” Concepts in Magnetic Resonance Part B: Magnetic Resonance Engineering, vol. 31, no. 1, pp. 44–50, 2007. View at: Publisher Site  Google Scholar
 X. Qu, Y. Hou, F. Lam, D. Guo, J. Zhong, and Z. Chen, “Magnetic resonance image reconstruction from undersampled measurements using a patchbased nonlocal operator,” Medical Image Analysis, vol. 18, no. 6, pp. 843–856, 2014. View at: Publisher Site  Google Scholar
 J. A. Tropp and A. C. Gilbert, “Signal recovery from random measurements via orthogonal matching pursuit,” IEEE Transactions on Information Theory, vol. 53, no. 12, pp. 4655–4666, 2007. View at: Publisher Site  Google Scholar  MathSciNet
 L. Ying, D. Xu, and Z.P. Liang, “On Tikhonov regularization for image reconstruction in parallel MRI,” in Proceedings of the IEEE Engineering in Medicine and Biology Conference (EMBC ’04), pp. 1056–1059, September 2004. View at: Google Scholar
 K. T. Block, M. Uecker, and J. Frahm, “Undersampled radial MRI with multiple coils. Iterative image reconstruction using a total variation constraint,” Magnetic Resonance in Medicine, vol. 57, no. 6, pp. 1086–1098, 2007. View at: Publisher Site  Google Scholar
 S. Ma, W. Yin, Y. Zhang, and A. Chakraborty, “An efficient algorithm for compressed MR imaging using total variation and wavelets,” in Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '08), pp. 1–8, June 2008. View at: Publisher Site  Google Scholar
 J. Shi, X. Ren, G. Dai, J. Wang, and Z. Zhang, “A nonconvex relaxation approach to sparse dictionary learning,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR '11), pp. 1809–1816, June 2011. View at: Publisher Site  Google Scholar
 J. Trzasko and A. Manduca, “Highly undersampled magnetic resonance image reconstruction via homotopic ${\mathcal{l}}_{\text{0}}$minimization,” IEEE Transactions on Medical Imaging, vol. 28, no. 1, pp. 106–121, 2009. View at: Publisher Site  Google Scholar
 A. Wong, A. Mishra, P. Fieguth, and D. A. Clausi, “Sparse reconstruction of breast MRI using homotopic ${l}_{0}$ minimization in a regional sparsified domain,” IEEE Transactions on Biomedical Engineering, vol. 60, no. 3, pp. 743–752, 2013. View at: Publisher Site  Google Scholar
 R. Chartrand and W. Yin, “Iteratively reweighted algorithms for compressive sensing,” in Proceedings of the IEEE 33rd International Conference on Acoustics, Speech and Signal Processing (ICASSP '08), pp. 3869–3872, Las Vegas, Nev, USA, April 2008. View at: Publisher Site  Google Scholar
 R. Chartrand, “Exact reconstruction of sparse signals via nonconvex minimization,” IEEE Signal Processing Letters, vol. 14, no. 10, pp. 707–710, 2007. View at: Publisher Site  Google Scholar
 E. J. Candès, M. B. Wakin, and S. P. Boyd, “Enhancing sparsity by reweighted ${\mathcal{l}}_{\text{1}}$ minimization,” Journal of Fourier Analysis and Applications, vol. 14, no. 56, pp. 877–905, 2008. View at: Publisher Site  Google Scholar  MathSciNet
 M. Akcakaya, S. Nam, P. Hu et al., “Compressed sensing with wavelet domain dependencies for coronary MRI: a retrospective study,” IEEE Transactions on Medical Imaging, vol. 30, no. 5, pp. 1090–1099, 2011. View at: Publisher Site  Google Scholar
 S. Ramani and J. A. Fessler, “Parallel MR image reconstruction using augmented lagrangian methods,” IEEE Transactions on Medical Imaging, vol. 30, no. 3, pp. 694–706, 2011. View at: Publisher Site  Google Scholar
 F. Knoll, K. Bredies, T. Pock, and R. Stollberger, “Second order total generalized variation (TGV) for MRI,” Magnetic Resonance in Medicine, vol. 65, no. 2, pp. 480–491, 2011. View at: Publisher Site  Google Scholar
 D. Liang, H. Wang, Y. Chang, and L. Ying, “Sensitivity encoding reconstruction with nonlocal total variation regularization,” Magnetic Resonance in Medicine, vol. 65, no. 5, pp. 1384–1392, 2011. View at: Publisher Site  Google Scholar
 M. Aharon, M. Elad, and A. Bruckstein, “KSVD: an algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311–4322, 2006. View at: Publisher Site  Google Scholar
 A. Bilgin, Y. Kim, F. Liu, and M. Nadar, “Dictionary design for compressed sensing MRI,” in Proceedings of the 18th Annual Meeting on International Society for Magnetic Resonance in Medicine (ISMRM '10), p. 4887, 2010. View at: Google Scholar
 Y. Chen, X. Ye, and F. Huang, “A novel method and fast algorithm for MR image reconstruction with significantly undersampled data,” Inverse Problems and Imaging, vol. 4, no. 2, pp. 223–240, 2010. View at: Publisher Site  Google Scholar  MathSciNet
 M. Elad and M. Aharon, “Image denoising via sparse and redundant representations over learned dictionaries,” IEEE Transactions on Image Processing, vol. 15, no. 12, pp. 3736–3745, 2006. View at: Publisher Site  Google Scholar  MathSciNet
 R. Otazo and D. Sodickson, “Adaptive compressed sensing MRI,” in Proceedings of the 18th Annual Meeting on International Society for Magnetic Resonance in Medicine (ISMRM ’10), p. 4867, May 2010. View at: Google Scholar
 S. Ravishankar and Y. Bresler, “MR image reconstruction from highly undersampled kspace data by dictionary learning,” IEEE Transactions on Medical Imaging, vol. 30, no. 5, pp. 1028–1041, 2011. View at: Publisher Site  Google Scholar
 Q. Liu, S. Wang, and J. Luo, “A novel predual dictionary learning algorithm,” Journal of Visual Communication and Image Representation, vol. 23, no. 1, pp. 182–193, 2012. View at: Publisher Site  Google Scholar
 Q. Liu, S. Wang, K. Yang, J. Luo, Y. Zhu, and D. Liang, “Highly Undersampled magnetic resonance image reconstruction using twolevel Bregman method with dictionary updating,” IEEE Transactions on Medical Imaging, vol. 32, no. 7, pp. 1290–1301, 2013. View at: Publisher Site  Google Scholar
 Q. Liu, J. Luo, S. Wang, M. Xiao, and M. Ye, “An augmented Lagrangian multiscale dictionary learning algorithm,” EURASIP Journal on Advances in Signal Processing, vol. 1, pp. 1–16, 2011. View at: Google Scholar
 W. Yin, S. Osher, D. Goldfarb, and J. Darbon, “Bregman iterative algorithms for $l1$minimization with applications to compressed sensing,” SIAM Journal on Imaging Sciences, vol. 1, no. 1, pp. 143–168, 2008. View at: Publisher Site  Google Scholar  MathSciNet
 I. Daubechies, M. Defrise, and C. De Mol, “An iterative thresholding algorithm for linear inverse problems with a sparsity constraint,” Communications on Pure and Applied Mathematics, vol. 57, no. 11, pp. 1413–1457, 2004. View at: Publisher Site  Google Scholar  MathSciNet
 A. Mehranian, H. Saligheh Rad, M. R. Ay, A. Rahmim, and H. Zaidi, “Smoothly clipped absolute deviation (SCAD) regularization for compressed sensing MRI using an augmented Lagrangian scheme,” in Proceedings of the IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC '12), pp. 3646–3653, Anaheim, Calif, USA, November 2012. View at: Publisher Site  Google Scholar
 I. F. Gorodnitsky and B. D. Rao, “Sparse signal reconstruction from limited data using FOCUSS: a reweighted minimum norm algorithm,” IEEE Transactions on Signal Processing, vol. 45, no. 3, pp. 600–616, 1997. View at: Publisher Site  Google Scholar
 Q. Lyu, Z. Lin, Y. She, and C. Zhang, “A comparison of typical ℓp minimization algorithms,” Neurocomputing, vol. 119, pp. 413–424, 2013. View at: Publisher Site  Google Scholar
 L. B. Montefusco, D. Lazzaro, and S. Papi, “A fast algorithm for nonconvex approaches to sparse recovery problems,” Signal Processing, vol. 93, no. 9, pp. 2636–2647, 2013. View at: Publisher Site  Google Scholar
 N. Mourad and J. P. Reilly, “Minimizing nonconvex functions for sparse vector reconstruction,” IEEE Transactions on Signal Processing, vol. 58, no. 7, pp. 3485–3496, 2010. View at: Publisher Site  Google Scholar  MathSciNet
 B. D. Rao and K. KreutzDelgado, “An affine scaling methodology for best basis selection,” IEEE Transactions on Signal Processing, vol. 47, no. 1, pp. 187–200, 1999. View at: Publisher Site  Google Scholar  Zentralblatt MATH  MathSciNet
 H. Zou and R. Li, “Onestep sparse estimates in nonconcave penalized likelihood models,” The Annals of Statistics, vol. 36, no. 4, pp. 1509–1533, 2008. View at: Publisher Site  Google Scholar  MathSciNet
 P. Rodríguez and B. Wohlberg, “Efficient minimization method for a generalized total variation functional,” IEEE Transactions on Image Processing, vol. 18, no. 2, pp. 322–332, 2009. View at: Publisher Site  Google Scholar  MathSciNet
 L. B. Montefusco, D. Lazzaro, and S. Papi, “Fast sparse image reconstruction using adaptive nonlinear filtering,” IEEE Transactions on Image Processing, vol. 20, no. 2, pp. 534–544, 2011. View at: Publisher Site  Google Scholar  MathSciNet
 B. Liu, K. King, M. Steckner, J. Xie, J. Sheng, and L. Ying, “Regularized sensitivity encoding (SENSE) reconstruction using bregman iterations,” Magnetic Resonance in Medicine, vol. 61, no. 1, pp. 145–152, 2009. View at: Publisher Site  Google Scholar
 S. Osher, M. Burger, D. Goldfarb, J. Xu, and W. Yin, “An iterative regularization method for total variationbased image restoration,” A SIAM Interdisciplinary Journal, vol. 4, no. 2, pp. 460–489, 2005. View at: Publisher Site  Google Scholar  MathSciNet
 J. X. Ji, B. S. Jong, and S. D. Rane, “PULSAR: a MATLAB toolbox for parallel magnetic resonance imaging using array coils and multiple channel receivers,” Concepts in Magnetic Resonance Part B: Magnetic Resonance Engineering, vol. 31, no. 1, pp. 24–36, 2007. View at: Publisher Site  Google Scholar
 R. Chartrand, “Fast algorithms for nonconvex compressive sensing: MRI reconstruction from very few data,” in Proceedings of the IEEE International Symposium on Biomedical Imaging (ISBI '09), 2009. View at: Google Scholar
 S. Voronin and R. Chartrand, “A new generalized thresholding algorithm for inverse problems with sparsity constraints,” in Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '13), pp. 1636–1640, Vancouver, Canada, May 2013. View at: Publisher Site  Google Scholar
 X. Lu, Y. Yuan, and P. Yan, “Sparse coding for image denoising using spike and slab prior,” Neurocomputing, vol. 106, pp. 12–20, 2013. View at: Publisher Site  Google Scholar
Copyright
Copyright © 2014 Qiegen Liu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.