A Coordinate Descent Method for Total Variation Minimization
Total variation (TV) is a well-known image model with extensive applications in various images and vision tasks, for example, denoising, deblurring, superresolution, inpainting, and compressed sensing. In this paper, we systematically study the coordinate descent (CoD) method for solving general total variation (TV) minimization problems. Based on multidirectional gradients representation, the proposed CoD method provides a unified solution for both anisotropic and isotropic TV-based denoising (CoDenoise). With sequential sweeping and small random perturbations, CoDenoise is efficient in denoising and empirically converges to optimal solution. Moreover, CoDenoise also delivers new perspective on understanding recursive weighted median filtering. By incorporating with the Augmented Lagrangian Method (ALM), CoD was further extended to TV-based image deblurring (ALMCD). The results on denoising and deblurring validate the efficiency and effectiveness of the CoD-based methods.
Total variation (TV), also known as the ROF model , was introduced by Rudin et al. The TV model is effective in preserving sharp and salient edges while suppressing noise and has been extensively adopted as a regularizer in various image restoration applications, for example, deblurring [2, 3], superresolution [4, 5], inpainting [6, 7], and compressed sensing [8, 9].
Recently, other image models, such as dictionary-based sparse coding [10–12] and nonlocal similarity [13–17], have been developed. Compared with these models, TV is much more efficient to be solved, making TV-based methods remain active in image and vision studies [17–24]. Moreover, TV may be complementary with the other models, and thus proper combination of them can lead to better performance [25, 26]. Besides, extensions of TV regularizer were also studied. For color images, TV can be extended to a class of vectorial TV (VTV) [27, 28], where interchannel correlation is taken into account to reduce the uneven color effects. While TV only considers first-order gradients, Total Generalized Variation (TGV)  was proposed to involve higher-order derivatives. For structure extraction, relative TV  was employed to distinguish structure from textures. Considering that the gradient distribution of each pixel is actually spatially variant, nonlocal extension of TV model [17, 31] was presented to leverage the similar patches for adaptive distribution estimation.
A basic TV minimization problem is TV-based image denoising formulated aswhere is the TV regularizer, is the trade-off parameter, and and are the latent clear image and the noisy observation, respectively. Various methods to solve TV denoising problem had been proposed and can be roughly categorized from three directions, that is, gradient based, Markov Random Fields (MRF) based, and CoD-based methods. First, gradient descent-based algorithms have been widely adopted in image processing tasks [18, 32–37]. As to TV minimization, gradient projection based PDE methods  originally were adopted to solve the associated nonlinear Euler-Lagrange equation. Following this line, a number of methods tried to directly solve primal variables [38–42]. To avoid nonsmoothness trap, the dual formulation of TV minimization was proposed and several variants came forward [43–45]. Recently, a hybrid primal dual scheme that alternatively solves primal and dual variables had been developed [46–48]. Most specially, Chambolle’s fixed point algorithm  solving dual variable is the most successful, which has been widely adopted in general image restoration methods, for example, TwIST , FISTA , and SALSA . Second, TV minimization can be mapped to a class of binary MRFs [52–54], such that it can be solved by graph-cut techniques. Third, another entirely different direction is to employ CoD method, decomposing optimization problem with respect to each pixel and updating coordinate variables via some appropriate patterns. For the high efficiency of decomposed scalar optimizations, the CoD-based methods are usually efficient. However, the sole attempt based on CoD to solve TV minimization  only considers the anisotropic TV minimization, while isotropic TV minimization is unreachable for CoD-based methods, since it cannot be decomposed with respect to each pixel.
In this paper, we systematically study the CoD-based methods for TV minimization problem. First, we provide a unified formulation of anisotropic and isotropic TV minimization problem based on multidirectional gradients representation, via which the isotropic TV regularizer can also be decomposed into a sequence of scalar convex problems with respect to each pixel. The scalar convex problem can be efficiently solved, and by sequentially updating each pixel, the CoD-based denoising (CoDenoise) algorithm converges fast. Due to the nondifferentiability of TV regularizer, CoDenoise may get stuck at nonstationary points [55–57]; however fortunately it is experimentally verified that CoDenoise can bypass nonstationary points and converge to optimal solution by adding small random perturbations. The CoDenoise algorithm only requires updating the pixels poisoned by noises, due to which the CoDenoise algorithm is more efficient than other methods, especially for low noise levels. Interestingly, the CoDenoise algorithm can be interpreted as the recursive weighted median operations on noisy images. Based on the more recent progress in weighted median filter [58, 59], the CoDenoise algorithm should be much more improved in terms of efficiency. Then, by combining variable splitting strategy and Augmented Lagrangian Method (ALM), we further embed CoDenoise algorithm to solve general image restoration problem, for example, image deblurring, resulting in the ALMCD algorithm. In deblurring problems, the blurry images are usually poisoned by relatively low level noises, and thus the incorporated CoDenoise algorithm for denoising subproblem contributes significantly to efficiency improvement of the ALMCD algorithm. Compared with TwIST, FISTA, and SALSA, ALMCD can obtain satisfactory results but is more efficient.
Our contribution can be summarized from two aspects:(i)We systematically study the CoD-based methods for TV minimization and develop an extremely simple unified CoD-based solution for both anisotropic and isotropic TV minimization. The resulting CoDenoise algorithm is more efficient than gradient based and MRF based methods and achieves satisfactory denoising results.(ii)By incorporating with ALM, CoDenoise is extended to image deblurring problem. In the deblurring problems, the blurry images usually suffer from severe blur and relatively low level noises, and thus the proposed ALMCD algorithm with CoDenoise embedded for denoising subproblem is much more efficient and can concurrently provide satisfactory deblurring quality compared with several state-of-the-art methods.
This paper is organized as follows: Section 2 presents some preliminaries, including definition of TV regularizers and multidirectional gradient approximation of TV regularizers. The CoDenoise algorithm together with its convergence proof and computational complexity is proposed in Section 3. In Section 4, we embed CoDenoise to image deblurring. Section 5 demonstrates experimental results, and Section 6 ends this paper with some concluding remarks.
In this section, we first present the definitions of the discrete anisotropic and isotropic TV operators. In previous studies, CoD-based solution is only available for anisotropic TV minimization problem. To address this, we then introduce the multidirectional gradient representation to establish the connection between the anisotropic and isotropic TV models, making it possible to use the unified CoD method for TV minimization.
2.1. The Discrete TV Operators
For an image with pixels, the discrete gradient operators including both horizontal gradient operator and vertical gradient operator are defined aswhere and . The anisotropic TV regularizer [50, 60] is defined asWith this definition, it is easy to obtain the anisotropic TV regularization with respect to coordinate asThus, the CoD method can be directly used to solve the anisotropic TV minimization problem. Similarly, the isotropic TV regularizer [50, 60] is defined asApparently the isotropic TV cannot be decomposed with respect to coordinate since the quadratic interactions with horizontal and vertical gradients, making the CoD method unfeasible to solve isotropic TV minimization problem. Therefore, to extend the results of CoD to isotropic TV minimization problem, we tempt to find a connection between and .
2.2. Multidirectional Gradients Approximation
The isotropic TV regularizer can be approximated by multidirectional gradients representation, and thus the anisotropic and isotropic TV models can be connected in a unified formulation . For any pair of real numbers and , the identityalways holds, which can be discretized by Riemannian approximation. Now, let be a set of points uniformly distributed in . Equation (6) can then be discretized as
Thus we can approximate TV regularizer aswhere and and . Equation (8) provides a unified formulation of anisotropic and isotropic TV models,In later context, we will use to represent the TV regularizers.
3. The Unified Coordinate Descent Method for TV-Based Denoising
With regularizer, anisotropic and isotropic TV denoising models are reformulated in the unified formwhich is exactly anisotropic TV-based denoising when and infinitely approximates isotropic TV-based denoising when increases. We thus can decompose the objective function into a sequence of one-dimensional subproblems, which can be solved efficiently via simple convex optimization. With simple sequential updating pattern, we then obtain the unified CoD denoising algorithm for both anisotropic and isotropic TV minimization.
3.1. The Coordinate Subproblem
Let first present equivalent decomposition of the image denoising objective function with respect to each pixel ,where and . Vectors and are both of length , which are the coefficients of and the combinations of its 4 neighbourhoods, respectively.
3.2. Solving Subproblem
For simplicity, we unify the formulation of subproblems as
The scalar optimization problem is convex but nonsmooth. We assume (the case can be easily generalized). Let be the permutation of according to the ascending order of . Let be an ascending sequence with , , and . Let . Thus, (12) is transformed to
The solution to (13) can be obtained by making its first-order derivative be 0,
We then discuss the solution with different cases of :(1)When , Then is the optimal solution to (13), if .(2)When , Then is the optimal solution to (13), if .(3)When , Then is the optimal solution to (13), if .(4)When ,
Since , for , we thus haveand . Then is the optimal solution to (13), if .
As a summary, we notate procedures as an operator,
Therefore, the subproblem with respect to (11) can be solved by
The following question is how to choose coordinate updating pattern. Li and Osher adopted the checkerboard pattern , in which the pixels are divided into black and white blocks. The pixels in the same group are not neighbors, and then the pixels in two blocks can be alternatively updated. Another greedy strategy is also popular , in which the selected coordinate makes the biggest contribution to the decrease of the energy function. And by the divide and conquer strategy, the corresponding coordinate can be searched with complexity .
The proposed CoDenoise algorithm adopted the simple cyclic updating pattern, sequentially sweeping each pixel. If the computed solution at new selected coordinate makes a big progress than that in last iteration (evaluated by a tolerance ), then it will be updated. In our implementation, we use a binary mask matrix to indicate whether a pixel will be updated or not. If any four neighbor of pixel is updated, is marked as , and the pixel will be updated in the next iteration, otherwise 0. For the nondifferentiability of TV norm, the solution generated by CoDenoise may get stuck at nonstationary points, which can be easily bypassed by adding small random perturbations. The perturbations decrease along with the increasing iteration number.
To stop the CoDenoise algorithm, we check whether the relative difference between two iterations is below tolerance ; that is,
The CoDenoise algorithm is summarized as Algorithm 1.
3.4. Convergence and Complexity
We first discuss the convergence of the CoDenoise algorithm and then analyze its computational complexity.
Theorem 1. For the optimization problem equation (12), one can obtain its optimal solution using ; then holds for any .
Proof. First, observe that Suppose ; then . (1)If , we have . Therefore,(2)If , then . So for some , we have . Therefore, We hence conclude that, for any , always holds.
Theorem 2. The sequence generated by the CoDenoise algorithm converges.
3.4.2. Computational Complexity
First, we present the analysis of computational complexity of the operator . The operation with the heaviest computational cost is to sort vectorwhich can be done by existing sorting algorithms, for example, max-heap sort, and thus the sorting the vectors in (11) can be done with computational complexity . Then, the optimal solution can be searched in at worst. And thus, the complexity of proximal operator at worst is .
Then, CoDenoise requires calling operator times in each iteration, where is the number of nonzero entries of mask matrix , proportional to the noise level, and thus the computational complexity of CoDenoise is .
We discuss the equivalence of (28) and (20) using the following two cases:(1)Suppose that . The optimal solution is Now we have , so In the sequence , there are elements less than or equal to and elements greater than or equal to , and from (29), in the sequence , there are elements less than or equal to and elements greater than or equal to . And thus, is the median (28). Specially, when , all the elements in the sequence are greater than or equal to , and elements in the sequence are less than or equal to , and thus is the median (28). Also when , the same conclusion can be similarly drawn.(2)Suppose that , and it lies in . Similarly, in the sequence , there are elements less than or equal to and elements greater than or equal to , and from (29), in the sequence , there are elements less than or equal to and elements greater than or equal to . And thus, is the median (28).
With the equivalence of proposed operator (20) and finding median value (28), the CoDenoise algorithm provides an interesting interpretation of the recursive weighted median operations on noisy image. By the recently great progress of studies on median filter or reweighted filter [58, 59], the computational efficiency of the proposed CoDenoise algorithm should be further improved.
4. CoD for TV-Based Image Deblurring
based image deblurring problem is formulated aswhere is convolution matrix, which is an ill-posed problem. By combining variable splitting and ALM, we employ ADMM to solve this problem.
First, by introducing an auxiliary variable , the -based image deblurring problem is reformulated as
Then the augmented Lagrangian function of (33) iswhere is a positive penalty parameter and is related to Lagrangian vector. Then, the two variables and can be updated alternatively until some convergence criterion is satisfied. Given , the -subproblem can be efficiently solved in Fourier domain,where and are Fourier and inverse Fourier transformation, respectively. Given , the -subproblemcan be directly solved by the proposed CoDenoise algorithm. Finally, the parameters and are updated. The overall algorithm is summarized as Algorithm 2.
5. Experimental Results
In this section, we report the experimental results on image denoising and deblurring to validate the proposed CoD-based methods. First, as to the image denoising, CoDenoise is compared with three Chambolle’s works, that is, fixed points (CFP) algorithm on dual variables , first-order primal dual (CPD) algorithm , and graph cut- (CGC-) based algorithm . Then, we compare ALMCD with several state-of-the-art deblurring algorithms with the denoising subproblem embedded, including accelerated IST algorithms, that is, TwIST  and FISTA , and ALM-based algorithm, that is, SALSA . All the experiments ran on a 2.40 GHz Core(TM) i7-4700MQ processor. The CoDenoise algorithm is coded in C/C++, and ALMCD is coded in Matlab. We provide Matlab wrapper of CoDenoise which can be called by ALMCD. And for the parameter settings, the updating tolerances and are both set as . We set for anisotropic TV minimization and for isotropic TV minimization.
5.1. Image Denoising
As to the competing denoising algorithms, CFP and CPD are both only designed for isotropic TV minimization, and by modifying the projection step, CFP and CPD are easily applied to anisotropic TV minimization. Since only anisotropic TV model can be mapped to binary MRF, CGC is only feasible to anisotropic TV minimization. The denoising experiments were conducted on four 512 × 512 images, that is, Couple, Man, Hill, and Boat, shown in Figure 1, which were degraded by different Gaussian noise levels, with standard deviation (std.) as 0.05, 0.10, 0.15, and 0.20. Corresponding to each noise level, the trade-off parameter was chosen as 0.04, 0.09, 0.16, and 0.23, respectively, with best PSNR values.
For the convexity of TV minimization problem, CFD and CPD are proved to converge to global optimal solution. From the PSNR and SSIM comparison of different algorithms, shown in Tables 2 and 3, CoDenoise can also converge to the same solutions with CFD and CPD for each noise level. As to the computational efficiency shown in Table 1, for anisotropic TV denoising CoDenoise is much faster than all the competing algorithms, especially for low level noises, and for isotropic TV denoising CoDenoise has to handle more extra entries, thus being little computational expensive than CFP and comparable to CPD. Figure 2 delivers the visual denoising effect of different algorithms, and CoDenoise can achieve satisfactory denoising results.
(a) Noised image (16.48, 0.508)
(b) CPD (26.61, 0.795)
(c) CFP (26.58, 0.792)
(d) CoD (26.62, 0.795)
5.2. Image Deblurring
The proposed ALMCD algorithm is compared with TwIST, FISTA, and SALSA, where CFP is adopted to solve the involved denoising subproblem. In the experiments, the test images were degraded with Gaussian kernel with std. 7 and Gaussian noise with zero mean value and std. . The trade-off parameter is set as for all the four algorithms.
Since blurry images usually suffer from severe blur and low level noise, the subproblem involved should be more efficiently solved by CoDenoise. Table 4 presents the deblurring results comparison for anisotropic TV deblurring, and one can see that ALMCD is significantly faster than all the competing algorithms. Even though CoDenoise is slower than CFP for isotropic TV minimization, ALMCD is instead more efficient than the competing algorithms, shown in Table 5. Particularly, SALSA adopted the same variable splitting strategy with ALMCD, generating the same subproblems, so the efficiency superiority of ALMCD over SALSA heavily confirms that CoD-based method contributes more to efficiency improvement of ALMCD. In terms of deblurring quality, both PSNR and SSIM for anisotropic and isotropic TV models, shown in Tables 4 and 5, achieved by ALMCD are comparable to all the other competing algorithms. Figure 3 presents the visual deblurring results of Boat, from which one can see that the ALMCD can obtain visually plausible deblurring results. As a summary, CoD-based methods can provide comparable solutions compared with competing algorithms, while CoDenoise for anisotropic TV model is much more efficient than all the competing denoising algorithms, and ALMCD with CoDenoise embedded to solve denoising subproblem is much faster than state-of-the-art deblurring algorithms for both anisotropic and isotropic TV models.
(a) Original image
(b) TwIST (28.11, 0.907)
(c) FISTA (27.60, 0.890)
(d) Blurred image (22.15, 0.601)
(e) SALSA (28.32, 0.899)
(f) ALMCD (28.67, 0.911)
In this paper, we propose a novel unified solution based on CoD method to solve TV minimization problems. With the unified formulation, both anisotropic and isotropic TV minimization can be decomposed into scalar problems that can be efficiently solved by convex optimization. With simple cyclic updating pattern and random perturbations, CoDenoise can empirically converge to the optimal solution. Also when applied in image deblurring, the CoDenoise algorithm embedded in ALMCD makes significant contributions in terms of efficiency, compared with competing deblurring algorithms. In terms of deblurring quality, ALMCD can provide comparable or superior results, validating the effectiveness of CoD-based methods. Furthermore, with the great improvements in weighted median filter or parallel implementation, the CoDenoise algorithm should be much more efficient. Also, CoD could be extended to other TV variants, for example, nonlocal TV and vectorial TV.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
J. Xu, D. Ren, L. Zhang, and D. Zhang, “Patch Group Based Bayesian Learning for Blind Image Denoising,” in Computer Vision – ACCV 2016 Workshops, vol. 10116 of Lecture Notes in Computer Science, pp. 79–95, Springer International Publishing, Berlin, Germany, 2017.View at: Publisher Site | Google Scholar
T. Zeng, Study on the variational models and dictionary learning [Ph.D. thesis], Université Paris-Nord-Paris XIII, France, 2007.
M. Zhu and T. Chan, “An efficient primal-dual hybrid gradient algorithm for total variation image restoration,” UCLA CAM Report, pp. 8–34, 2008.View at: Google Scholar