Abstract

It was proposed to develop a better multiscale learning dictionary picture de-noising technique. The approach improves the adaptive threshold curvilinear transform, which can divide an image into different scale information and be used to build a curvilinear multiscale learning dictionary. The method finished the dictionary and sparse coefficient updates in the picture through circular iterations and then superimposed the matching curvilinear wave domain image blocks and performed the curvilinear inverse transform to generate the denoised image. The test was carried out by adding additive Gaussian noise to a standard grayscale image, and the results revealed that the peak signal-to-noise ratio of the grayscale image de-noising result of this paper’s method was improved by 56.6% on average, and the structural similarity was improved by 0.44 on average, compared to the conventional de-noising algorithm. It was determined that the enhanced approach preserved the picture’s edge and texture information well, that image quality was greatly improved, and that the algorithm’s execution efficiency was superior to that of the conventional de-noising algorithm.

1. Introduction

To better read and recognize the legitimate information in a picture, the noise included in it must be eliminated, but the detailed information such as edges and textures of the image structure must be preserved as much as possible. Hence, image de-noising was carried out using several approaches today. For example, in the spatial domain, there were median filtering methods [1, 2], Gaussian filtering methods [3], nonlocal mean filtering methods [4], bilateral filtering methods [5], etc. In the transform domain, there were wavelet thresholding methods [6], singular value decomposition methods [7], curvilinear wave transform methods [8], BM3D algorithms [9], etc. However, the spatial domain de-noising method was limited by the principle, and it was more difficult to remove the noise in complex images. The transform domain de-noising method made empirical assumptions on the transform domain coefficients, and it was difficult for a single fixed basis function to adapt to complex image features, which can easily damage the detailed information of image features and make the image blurred. Therefore, digital images require an efficient noise removal method that can both better eliminate image noise and retain maximum image edge and texture detail information [10].

In recent years, there have been more de-noising methods in the field of digital images, and Pei et al. proposed a multiinformation combined dictionary method in 2018 for improving the problem that conventional dictionary learning methods easily lead to blurred de-noising results [11], and the de-noising results need to be improved due to the poor anti-interference ability of the algorithm. Khmag et al. in 2018 used noisy images and clustered batches of Hidden Markov Models (HMMs) to remove noise from images, but the results need to be improved because the NLM algorithm is computationally intensive and time-consuming when estimating pixels [12]. Tao et al. fused dictionary learning and a variational model in 2019, which can improve the structural similarity of the resultant images [13]. However, the de-noising effect was poor when the noise was strong. Lu et al. proposed a gradient-domain nonparametric Bayesian dictionary learning method in 2020, which was better for image detail protection [14]. However, the optimal model parameters of the algorithm were difficult to determine, making the de-noising results less than ideal. The curvilinear wave transform has directional and multiscale characteristics, which can better represent the local features of the signal; however, the determination of the appropriate threshold was more difficult, and the de-noising results need to be improved. Khmag et al. in 2022 used a generative adversarial network model and a semisoft thresholding method to achieve image noise removal, but it was susceptible to interference from external factors when carrying out noise removal due to the inability to produce diverse results when considering independent samples alone [15].

To address the abovementioned situation, this paper combined the advantages of the K-SVD algorithm and curvilinear wave transform, improved the dictionary training and construction method of the K-SVD algorithm, constructed multiscale learning dictionaries in the curvilinear wave domain, created multiscale adaptive dictionaries, suppressed the noise in different scale cases in multiple iterative updates, and had good de-noising ability while protecting effective information such as effective edges and textures of images. The proposed method can overcome the limitation of sparse image representation at a single scale and made the new dictionary the advantage of multiscale analysis, which can protect the effective information of the image, especially the small-scale details and edges, and other features will not be oversmoothed so that the noise and effective information at different scales can be identified and distinguished. The dictionary and the sparse coefficients were updated by iterative cycles to remove the noise at different scales so that the effective information of the image can be expressed adaptively and completely.

A series of simulations showed that the revised de-noising algorithm can successfully improve the de-noising effect, better protect picture features such as edges and textures, and significantly improve image quality.

2. Curve-Wave Transform and Dictionary Learning Principle

2.1. Curve-Wave Transformation

The curvilinear transform was first proposed by Candes et al. [16] and has undergone two developments. At present, the second-generation curve-wave transform is widely used, and its expression is

In formula (1), represent scale, angle, and position parameters, respectively; represents the objective function; represent the curvilinear function; and the expression in the frequency domain is

In formula (2), represents the objective function in the frequency domain, represents the frequency window, and represents the angle matrix. For the image size of M × N, the scale layers of the curvilinear transform are

The curvilinear coefficients may be split into three layers after the curvilinear wave transform: a large-scale coefficient layer, a medium-scale coefficient layer, and a fine small-scale layer. The image-effective signal was represented by a few large-scale curvilinear coefficients, whereas the noisy signal was mixed in the medium- and small-scale curvilinear coefficients [17]. As an example, following the curvilinear wave transforms, an image with a size of 256256 had its curvilinear wave coefficients separated into five scales, and their scales, orientations, and other information are given in Table 1.

The curvilinear coefficients in formula (1) were thresholded, the small-value coefficients corresponding to the noisy signal were discarded, the large-value coefficients corresponding to the valid signal were retained, and the de-noising results were obtained by reconstructing the valid signal using the curvilinear inverse transform.

2.2. K-SVD Dictionary Learning Algorithm

The K-SVD algorithm mainly included two steps: sparse coding and dictionary updating. For a given original noise-free image and noisy image , set learning dictionary , the de-noising of sparse representation of images can be expressed as an optimization problem, which can be expressed as

Here, represents the optimal sparse representation coefficient, represents the image after de-noising, and represent the balance parameter, represents the sparse representation coefficient corresponding to the image containing noise, and represents the image block matrix.

When solving formula (4), it needed to set up an initial learning dictionary . The initial dictionary was usually composed of random matrix or discrete cosine transform coefficients, where was the number of iterations. Then, the sparse coding coefficient is calculated, and the updated sparse representation coefficient formula is

The Least Angle Regression (LARS) algorithm was used to solve the abovementioned formula and obtain the sparse representation coefficient of the image. Then, the updated dictionary formula is

Formula (6) was adopted to complete dictionary updating, and then the sparse coefficient matrices and were fixed. Then, the objective function of formula (4) can be further transformed into the following:

Rewrite formula (7) into matrix form, and then the denoised image expression is

In formula (8), represents the identity matrix, and represents the image result after de-noising. Since the image’s effective signal had good sparsity while the noise did not, according to this feature, K-SVD achieved the separation of the effective signal and the noise signal by reconstructing the sparse representation signal of the image.

3. De-Noising Algorithm for Multiscale Learning Dictionary

In this paper, based on the curvilinear wave transform and K-SVD algorithm, a learning dictionary de-noising algorithm with multiscale features were proposed, which combined the advantages of the curvilinear wave transform and K-SVD algorithm to effectively improve the de-noising results and protects details such as edges and textures. The algorithm’s principle is as follows.

3.1. Curvilinear Transform and Threshold Preprocessing

The input image was subjected to a discrete curvilinear transform, and the curvilinear coefficients at different scales were thresholded and updated. The updated coefficients are

In formula (9), represents the updated curvilinear coefficient, represents the curvilinear coefficient before the update, represents the threshold function, and represent scale, angle, and position parameters, respectively. In this paper, the improved Bayes soft threshold algorithm was used to preprocess the noise coefficient of the curving coefficient, and the threshold calculation steps under various scales are as follows:(1)Estimated the noise variance at each scale (2)Estimated the variance of the curvature coefficient of the image

Then, the final threshold expression is

However, the real noise variance was difficult to estimate, and it usually took several debugs to obtain reliable threshold parameters. Therefore, a sparse inversion model is established in this paper to solve the optimal threshold.

First, a noise-containing model was established, and its expression is

Here, represents effective image information, represents noise information, represent the curvilinear function, and represents the curvilinear coefficient. When the noise information is known, the effective information of the image can be obtained by solving the following formula to achieve noise removal. The expression is as follows:

In formula (14), represents the noise variance. Since it was difficult to estimate the real noise of the actual image, a sparse inversion model was established in this paper for solving the problem, and its expression is as follows:

Here, is the regularization parameter, and the effective information after de-noising is obtained by solving the formula (15).

The key to achieving the abovementioned goal was determining the appropriate and optimal regularization parameters. In general, when the image contained weak noise, a smaller value can be selected; when there was strong noise, a larger value should be selected; and the best de-noising effect can be achieved only when the optimal value is selected. Therefore, for curvilinear coefficients at different scales, different ones should be selected to effectively remove the noise information at this scale.

For formula (15), this paper used an improved iterative approximation with a soft thresholding method to find the regularization parameter λ. First, apply a larger λ parameter to solve formula (15), obtain the initial solution for the next iteration, and then gradually reduce the regularization parameter λ and solve until the optimal regularization parameter is obtained. To achieve an adaptive threshold finding, this paper defined an intermediate variable parameter with the following formula:

According to formula (16), as the regularization parameter gradually decreased, increased and decreased, the intermediate variable parameter increased first and then decreased. The experiment showed that the corresponding threshold was the optimal threshold only when was the maximum value.

3.2. Initialization of the Online Dictionary

According to the sparse representation theory, the curvilinear coefficient matrix of each scale in formula (12) was partitioned into n pieces according to a given specification, and each small piece of data became a sample .

Before dictionary learning, an initial learning dictionary was set for each layer of curvilinear wave coefficients, then the initial dictionary at the current scale, angle, and position could be set as , under which was the number of iterations. This initial dictionary was typically set up as a random matrix.

To avoid the occurrence of small values in the sparse representation coefficient matrix, the binary norm of the initial dictionary column vector is usually constrained, and the constraint formula is

Here, represents the convex set of the lexicographical matrix, represents the real matrix, represents the column vectors of the lexicographical matrix, and represents the transposition of the column vectors of the lexicographical matrix.

3.3. Online Dictionary Learning and Updating
3.3.1. Cyclic Sparse Coding

According to formula (5), LARS was used to calculate the sparse coding coefficient:

Here, represents the sparse coefficient calculated under the current number of cycles, represents the curving coefficient matrix under the current number of cycles, represents the sparse coefficient of the curving coefficient of the original noisy image, and represents the set number of cycles.

3.3.2. Intermediate Update

is the intermediate variable, and . The purpose was to facilitate dictionary updates, and the initial quantity was usually set to zero matrices. During each iteration, the data block matrix of the current cycle number contained several valid signals , then, after the iterative update, is

Defined intermediate variables , and . The initial quantity was usually set to zero matrices. In each iteration process, the data block matrix of the current cycle number contains effective signals, and then the iteration update quantity is

3.3.3. Dictionary Update

The formulas (20) and (21) were used to complete dictionary updating, and the dictionary of the current iteration number is

Solved formula (22) and differentiate both sides concerning , then, formula (22) became

Here, was the identity matrix and the was the symmetric matrix, then formula (23) can be further deduced as

Taking the column vectors of the dictionary as the target, took the column derivatives one by one, then, formula (24) became

Combining formula (25) with the Newton iteration method, the updated value of the m column of the dictionary can be obtained as follows:

Normalizing the above-updated values, the dictionary value of the final m column after updating is

The loop ended with the final dictionary of the current scale, angle, and position .

3.4. De-Noising

Finally, the sparse coefficient was multiplied by the final dictionary , and the meander coefficient after de-noising is

3.5. Inverse Transformation of Meander

If you superimpose the curve-wave coefficients and perform the inverse curve-wave transformation, then the image result after final processing can be obtained, and the inverse transformation formula is

The algorithm flow is shown in the following Figure 1.

The corresponding pseudocode of the algorithm is shown in Figure 2.

4. Algorithm Testing and Application

To verify the algorithm presented in this paper, a standard grayscale image of 512 × 512 containing additive noise was used for testing. The test machine was a Windows system. The code was compiled and developed using the open-source library Python. To prevent the threshold value from changing too quickly and missing the extreme value of the intermediate variables, the exponential method was used to reduce the threshold value during the continuation of the iteration, and then the iteration was continued. Through the test, it can be seen that the number of cycles was usually around the number of layers of the scale of the curvilinear transform to search for the optimal threshold, and usually, the maximum number of cycles can be set to two times the number of layers of the scale of the curvilinear transform. Therefore, the operation efficiency was mainly influenced by the multiscale layers.

The original noise-free and noise-containing images are shown in Figure 3, and then tested using the K-SVD algorithm, the bilateral filtering method, the nonlocal mean filtering method, the curvilinear wave transform algorithm, the BM3D algorithm, the literature [13] method, the literature [14] method, the literature [15] method, and the method in this paper. Using the noisy image shown in Figure 2 as the input image, the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) values of the de-noising results of different methods were counted, and the results are shown in Table 2.

It can be seen from Table 2that in 90 groups of test results, the average improvement of PSNR and SSIM of K-SVD was 39.7% and 0.36, respectively. The average PSNR and SSIM of bilateral filtering were improved by 45.5% and 0.38, respectively. The average PSNR and SSIM of nonlocal mean filtering are improved by 49% and 0.394, respectively. The average PSNR and SSIM of the curve transformation results were increased by 50.1% and 0.402, respectively. BM3D results showed an average increase of 56.62% in PSNR and 0.442 in SSIM. The average improvement of PSNR and SSIM in the literature [13] was 52% and 0.406, respectively. The average PSNR and SSIM of the literature [14] improved by 53.7% and 0.418, respectively. The average improvement of PSNR and SSIM in the literature [15] was 54.8% and 0.424, respectively. The PSNR and SSIM of this paper method were improved by 56.6% and 0.436 on average. Among the above results, the results with the highest PSNR and SSIM were generated by the BM3D method, and the results in this paper were the closest to those of the BM3D method.

To better evaluate the quality of the images, a completely nonreference algorithm (Natural image quality evaluator, NIQE) can be used to evaluate the de-noising results, and the expert evaluation values (percentage system) were counted together, then the evaluation values of the nine methods average are shown in Table 3. It can be seen from Table 3 that the quality of the de-noised image obtained using the de-noising algorithm proposed in this paper was the same as that of the image obtained using the BM3D de-noising algorithm, which can retain the details and texture features of the image relatively well.

Then, the noisy images shown in Figure 3(d) are denoised using different methods, and the results of the nine treatments are shown in Figure 4. For a detailed analysis of the detail retention, the images of the boxed area in Figure 4 are displayed enlarged, and the enlargement results were located at the top left position of the respective image.

To further verify the effectiveness of the algorithm proposed in this paper, a standard data set was used for testing, and some of the results are shown in Figures 5 and 6. Figure 5(a) shows the original noise-free image, and Figure 5(b) shows the noisy image. Figure 6 shows the effect of different methods of de-noising.

From Figures 4 and 6, it can be seen that the image edge information was blurred and texture details were damaged in the de-noising result of the K-SVD method. In the bilateral filtering result, the phenomenon of “ringing” appeared. In the nonlocal mean de-noising result, the degree of de-noising was good, but the edges were blurred. The noise removal effect of the curvilinear transform was slightly better than that of the K-SVD results, bilateral filtering results, and nonlocal mean results, but the local details of the noise removal results were still blurred and some “line” noise existed. The details were shown in Figures 4(d) and 6(d). The results of the literature [13], literature [14], and literature [15] showed that the image quality was significantly better compared to the results of the abovementioned four methods, but the quality of the local details of the images was still blurry or “ringing” compared to the results of the BM3D method and this paper. The processing results of the method in this paper and the BM3D results were the best. The edge/texture details in the results of both methods were better maintained, and the overall image was closest to the original noise-free image, as shown in Figures 4(e)/6(e) and Figures 4(i)/6(i).

In the following, the time spent on different images using different processing methods was counted, and the average time was obtained using 30 experiments, respectively, as shown in Figure 7. It can be seen from Figure 7 that the running time of the proposed method in this paper was shorter and the efficiency of the algorithm was higher compared to other de-noising methods. From Figure 7, it can be seen that the execution efficiency of the de-noising algorithm proposed in this paper was better than several other de-noising algorithms.

5. Summary

This paper proposes an improved multiscale learning dictionary de-noising algorithm by combining the adaptive nature of the K-SVD method with the advantages of multiscale and directionality of the curvilinear wave transform, and standard gray-scale image testing confirmed that the method in this paper had good de-noising ability and detail preservation effects such as edge and texture. The results were better than the conventional algorithm, the de-noising results were similar to the BM3D results, and the de-noising ability was about the same as the BM3D technique. However, in terms of computing efficiency, it was much superior than the BM3D technique, which had a certain extension impact on picture processing. In this research, the original picture was separated into three scales for dictionary generation, and the dictionary and sparse coefficients were updated using thresholding. In fact, the scale was still restricted for more complex pictures, and the following work will need to work on various scales with adaptive thresholds under varied scale coefficients.

Data Availability

All the data used to support the findings of the study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This study was funded by the Ankang University Education Teaching Reform Research Project in 2022 (No. JG202225) and the Shaanxi Provincial Department of Education Special Project (No. 21JK0007).