Abstract

Single-image super-resolution (SISR) is a resolution enhancement technique and is known as an ill-posed problem. Motivated by the idea of pan-sharping, we propose a novel variational model for SISR. The structure tensor of the input low-resolution image is exploited to obtain the gradient of an imaginary panchromatic image. Then, by constraining the gradient consistency, the image edges and details can be better recovered during the procedure of restoration of high-resolution images. Besides, we resort to the nonlocal sparse and low-rank regularization of image patches to further improve the super-resolution performance. The proposed variational model is efficiently solved by ADMM-based algorithm. We do extensive experiments in natural images and remote sensing images with different magnifying factors and compare our method with three classical super-resolution methods. The subjective visual impression and quantitative evaluation indexes both show that our method can obtain higher-quality results.

1. Introduction

Image super-resolution (SR) is one of the most fundamental problems in the field of image processing, which aims to reconstruct clear and accurate high-resolution images from degraded low-resolution images. In other words, SR technology can recover a high-resolution (HR) image from one or more low-resolution (LR) input images [1]. In recent years, image SR has attracted great attention from academia and industry. The quality of the reconstructed SR image also greatly affects the accuracy of other computer vision tasks (such as image classification, image segmentation, and target detection). At present, SR technology has made great progress; it has been widely used in high-definition digital television, remote sensing monitoring, medical image reconstruction, image restoration, military reconnaissance, and other fields [2]. However, as an ill-posed problem, the existing image SR technology cannot always obtain satisfactory reconstruction results. Therefore, the study of SR is still a challenging but significant research topic.

According to the number of input LR images, the SR can be classified into multi-image super-resolution (MISR) [35] and single-image super-resolution (SISR) [68]. MISR uses the complementary information provided by multiple images of the same scene to enhance resolution. However, in a practical application, it is much difficult to obtain image sequences of the same scene and achieve precise subpixel registration. Meanwhile, SISR is suitable for more scenarios, which enhances the resolution of one input image based on some prior information [9, 10]. To date, SISR methods can be broadly classified into interpolation-based methods, reconstruction-based methods, and learning-based methods [11].

Interpolation methods like nearest-neighbor interpolation, bilinear interpolation, and bicubic Interpolation [12, 13] estimate the intensity at a point using the information of adjacent pixels. While these methods are easier calculated and very speedy, they tend to generate excessive smoothing and jagged artifacts. Reconstruction-based SISR methods [1416] generally utilize image priors to restrict the possible solution space with an advantage of recovering sharp details. However, these methods are usually time-consuming, and their performance suffers a rapid degradation when the amplification factor increases.

Learning-based SISR methods include example-based methods, neighbor-embedding-based methods, sparse-representation-based methods, and deep-learning-based methods [17, 18]. The basic idea of these approaches is to obtain the correspondence between the LR image and the HR image by a priori knowledge training and then reconstruct an HR image from the input LR image using the correspondence that has been learned. Freeman et al. [19] presented an example-based method with a Markov network. Chang et al. [20] proposed a neighbor-embedding-based method that needs less training set. Yang et al. [21] proposed a sparse-representation-based algorithm to obtain more favorable results and further improve their work by jointly training coupled dictionaries for LR and HR image patch pairs [22]. Very recently, deep-learning-based SISR methods, using convolutional network [23, 24], residual network [25, 26], generative adversarial network [27], attention network [28, 29], and so on, have become popular and demonstrated good performance with the rapid development of deep learning technology.

Pan-sharpening [30] is a multispectral image fusion and super-resolution technique; the panchromatic (PAN) image has higher spatial resolution than the corresponding multispectral image but only has a single band. Pan-sharpening fuses PAN image and multispectral image together to enhance both spatial and spectral resolutions of the data. Motivated by the idea of pan-sharpening, we present a novel variational approach for single-image super-resolution. In this paper, we assume that there is a PAN image of the input LR image. By utilizing the structure tensor [31] of input image, we can construct the gradient of the PAN image; then we build a variational model combined with low-rank and sparse representation of similar patches to effectively fuse the constructed PAN information with LR image and obtain HR image. The proposed approach addresses the super-resolution problem of both single nature image and the multispectral data without panchromatic image.

The remainder of this paper is organized as follows. In Section 2, we present the proposed SISR model in detail. The numerical algorithm for solving our model is given in Section 3. The experimental results and analysis are discussed in Section 4. Finally, conclusion is presented in Section 5. In addition, the descriptions of the acronyms used in this paper are listed in Table 1.

2. Variational Model

Drawing on the idea of pan-sharpening, we construct a variational fusion model to realize single-image super-resolution. As panchromatic image contains the edges and details needed for resolution enhancement, we first construct the information of PAN image which does not actually exist. According to [31], the key information of the assumed PAN image can be exploited from the structure tensor of the input image. Then the gradient of PAN image can be fused into the LR image to enhance the HR details. The similar image patch pairs extracted from LR and HR images are constrained by low-rank and sparse regularization to improve the SR performance.

2.1. Image Degradation Model

We consider that the observed low-resolution image is a downsampled version of the high-resolution image , and the degradation model [32] can be written aswhere represents a downsampling operator and is the additive Gaussian white noise.

2.2. Construct the Gradient of PAN Image

We construct the gradient of the PAN image corresponding to input LR image using the knowledge of structure tensor [31]. Firstly, we apply quadratic interpolation to the given LR image to obtain the enlarged image in desired scale. Then, matrix known as the structure tensor [31] can be expressed aswhere denotes the number of spectral bands; denotes the weight and it can be determined by  = .

As a symmetric matrix, can be decomposed aswhere is an orthogonal matrix and the columns of which are the eigenvectors of ; is a diagonal matrix and the diagonal elements of which are the eigenvalues of . Equation (3) can also be written aswhere is the maximum eigenvalue and it gives the maximum rate of change of ; is the minimum eigenvalue and it gives the minimum rate of change. The corresponding eigenvectors and give the directions of change.

The PAN image can theoretically capture the basic geometry of the image . Consider image with  =  and  = 0, whose structure tensor should approximately be equal to . Thus, we have the following equation:

In order to solve , equation (5) can be rewritten as follows:

To specify the sign of the eigenvectors, the target gradient of the constructed PAN image is thus obtained as

2.3. Variational Model Based on Gradient Consistency

In this subsection, we build a variational model to fuse the low-resolution image and the gradient of its corresponding PAN image ; then the model is optimized together with the nonlocal sparse and low-rank regularization to obtain the high-resolution image .

2.3.1. Consistency of Gradient

Bands of an ideal HR image often have similar structural information with the PAN image ; namely, bands of HR image closely approximate the PAN image in gradient information. Therefore, we can use constraint about the consistency of gradient to establish a relationship model. We use to denote the -th band of image and then set up the gradient consistency regularization term as follows:where denotes the weight of the -th band in , and we generally set in the absence of special requirements.

2.3.2. Sparse and Low-Rank Decomposition

Motivated by the effective work of optical flow estimation with nonlocal sparse and low-rank regularization in [33], we adopt a joint sparse and low-rank decomposition term formulated as follows:where denotes the operator that extracts the -th patch of image and groups its nonlocal similar patches together. Then the grouped similar patches are decomposed to the low-rank component and the sparse component , and denotes the Gaussian white noise.

2.3.3. Total Energy

Combining the equality constraint in (1), (8), and (9), the proposed variational SISR model can be formulated as the following optimization objective:where is the number of image patches that are extracted from the -th band of image , is the number of spectral bands, are parameters, denotes the rank of low-rank component, and norm denotes the sparse regularization. It is well known that the rank-solution and norm are discrete combinatorial optimization problems; they are both NP-hard. Therefore, in order to make the minimization tractable, we resort to their convex relaxed modification, and model (10) can be reformulated as the following convex optimization problem:where denotes the nuclear norm, which is the convex envelope (tightest convex surrogate) of rank, and norm is the nearest convex norm to norm.

3. Numerical Algorithm

In this section, the numerical procedure of the proposed model will be implemented. We adopt the alternating direction method of multipliers (ADMM) [34] to solve the minimization problem (11). The flexibility of the ADMM lies in the fact that it splits the initial optimization problem into several subproblems.

3.1. Solution of the Sparse Matrix

For each band, we fix and the low-rank matrix ; the optimization problem of sparse matrix is as follows:

We can obtain the solution of equation (12) by soft threshold algorithm aswhere .

3.2. Solution of the Low-Rank Matrix

For each band, we fix and the sparse matrix ; the optimization problem of low-rank matrix is as follows:

We can obtain the solution of equation (14) by singular value threshold algorithm aswhere , are singular values of matrix , and are left and right singular vectors of , and .

3.3. Solution of the SR Image

In order to separately solve the fidelity term and two regularization terms in (11), we introduce two intermediate variables and which are close to . Thus, we need to iteratively solve the three following subproblems to obtain the solution of SR image.(1)We use the intermediate variable to replace in . The optimization problem for is as follows:where denotes the inner product, denotes the Lagrange multiplier matrix, and is a positive penalty parameter. We set the derivative of (16) with respect to to 0:which yieldsThe Lagrange multiplier is updated as follows:(2)We introduce an intermediate variable to replace in the gradient consistency term. The optimization problem for is as follows:where denotes the Lagrange multiplier matrix. We set the derivative of (20) with respect to to 0:Using the fast Fourier transform (FFT), we can obtain the closed-form solution of from (21) directly:where and denote the FFT and inverse FFT, respectively. The Lagrange multiplier is updated as follows:(3)After and are substituted into equation (11), the optimization subproblem of becomesWe set the derivative of (24) with respect to to 0, which yields

Overall, taking all above analyses into account, we can summarize the complete numerical procedure for the proposed method. The detailed descriptions are shown in Algorithm 1.

(i)Input:
 The LR image ;
 Magnification factor.
Output:
 The SR image .
Initialization:
 Compute an initial SR image via bicubic
 interpolation;
 Calculate the target gradient via equation (7).
For:
For:
  1. For each extracted image patch , search
   for a set of similar patches across the whole
   image by patch matching.
  2. Decompose the grouped patches into low-rank
   and sparse components.
   (a) Calculate the sparse matrix via (13);
   (b) Calculate the low-rank matrix via (15).
  3. Reconstruct the SR image .
   (a) Update the image via (25);
   (b) Update the intermediate variable and via (18) and (22);
   (c) Update the Lagrange multiplier and via (19) and (23);
   (d)  =  ;
End;
End;
=

4. Experimental Results and Analysis

In this section, we first evaluate the sensitivity of the proposed algorithm with respect to the main parameters, that is, , , , and . Then, to demonstrate the SR effectiveness on both natural images and remote sensing images with different magnifying factors (2 and 4), we compare our method with the Bicubic interpolation method, sparse-coding-based method (SC) [22], and its back-projection enhanced version (SCBP) [35]. We evaluate the outcome of various methods by using quantitative indexes: peak signal-to-noise ratio (PSNR), root mean square error (RMSE), and structure similarity index (SSIM) [36]; they are calculated on RGB channels for nature image and on each band for multispectral image. The higher PSNR and SSIM values and lower RMSE value represent the better SR performance.

4.1. Adjustment of Parameters

The parameter is the weight to the consistency of gradient, controls the low-rank and sparse regularization of the similar patches, and and control the contribution of low-rank and sparse regularization terms. We adjust the parameter values in proper ranges and demonstrate the average quantitative metrics as functions of the parameters on the experimental dataset; then the optimal parameter combination is found with extensive numerical tests. Figure 1 shows the average PSNR, RMSE, and SSIM results when the parameter varies from 0 to 0.5 with fixed  = 1.6,  = 0.2, and . From Figure 1, we can see that when , all quantitative metrics get the worst values, which illustrate the effectiveness of the gradient consistency constraint. Meanwhile, Figure 1 shows that our method achieves the best performance when . Figure 2 shows the average PSNR, RMSE, and SSIM results under various combinations of and with fixed and . The analysis of and shows that the trend of the overall performance gets better and then worse. When the maximal PSNR is selected, we have  = 1.8 and  = 0.3; when the maximal SSIM is selected, we have  = 1.4 and  = 0.1. In order to synthesize the performance of our algorithm in these two indexes, we normalize PSNR and SSIM by introducing E = 0.8PSNR + 0.2SSIM, and E gets its maximum when  = 1.6 and  = 0.2. Besides, we find that the performance of the proposed method is insensitive to the parameter , and we simply set .

4.2. SR Results and Analysis
4.2.1. Natural Images Super-Resolution

To evaluate the performance of the proposed super-resolution algorithm, we use bicubic interpolation and SC and SCBP algorithms for comparison. Experimental results of the images “Flower,” “Leaf,” and “Lena” with magnification factor of 2 are shown in Table 2 and Figure 3, while the results with magnification factor of 4 are shown in Table 3 and Figure 4.

As we can see from the visual comparison, our method could obtain desired results; it effectively reduces zigzag and distortion of the recovered SR images so that it better preserves the image details compared to other competitors. From the close-ups of the regions in red boxes, we can see clearly that the SR images obtained by bicubic interpolation and SC methods are more blurred than ours, and SCBP method could restore clear edges but generate some obvious artifacts in SR images. In summary, our algorithm is superior to the contrast methods in terms of reconstructing clearer SR images with sharper edges and without artifacts. The above observations can be quantitatively confirmed by the Tables 2 and 3 which record the PSNR, RMSE, and SSIM values of the SR results of the three images with magnification factors of 2 and 4, respectively. The proposed method obtains the best metrics values compared with the other three methods.

4.2.2. Multispectral Images Super-Resolution

The adopted multispectral images contain four bands (red, blue, green, and infrared bands), and our algorithm is executed at each band. Same as the nature image experiments, we compared the performance of our method with bicubic interpolation, SC, and SCBP. Experimental results for the multispectral datasets “Field,” “Tree,” and “Beach” with magnification factor of 2 are shown in Figure 5 and Table 4, while the results for magnification factor of 4 are shown in Table 5.

The observation of visual comparison is close to that of natural image experiment. Bicubic interpolation and SC algorithms tend to reconstruct overly smooth images that lack detailed information and have fuzzy image quality, while the SR images obtained by SCBP method are overenhanced and have obvious artifacts. Comparatively speaking, our algorithm can better restore the detailed information of images and make a better texture effect. By comparing the numerical results in Tables 4 and 5, our algorithm also gets the best results for all indicators. It is worth noting that, because of the lack of training set, the results of SCBP algorithm are worse than bicubic interpolation. In conclusion, our proposed algorithm is superior to other algorithms compared in the subjective visual impression and objective indexes.

5. Conclusion

This paper presented an efficient variational method for single-image super-resolution. Motivated by image fusion and pan-sharpening, we exploit the gradient of imaginary PAN image from the input image via structure tensor and then construct a gradient consistency constraint which is supposed to fuse HR edges and details into the optimization target image. Meanwhile, we adopt the regularization of sparse and low-rank decomposition for similar image patches to further improve the SR accuracy. The variables of our variational model are iteratively optimized by ADMM-based algorithm. Extensive experiment results demonstrate the edge recovery and artifacts suppression capabilities of our method.

Data Availability

The datasets used to support the findings of this study are public available and included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

Yingying Xu was supported by the School of Electronics and Information Engineering, Taizhou University, Zhejiang, China. This research was founded by Zhejiang Provincial Natural Science Foundation of China under Grant no. LQ21F020001, Research Project of Education Department of Zhejiang Province under Grant no. Y201840245, and Agricultural Science and Technology Project of Taizhou under Grant no. 20ny13.