Abstract

Spatial images are inevitably mixed with different levels of noise and distortion. The contourlet transform can provide multidimensional sparse representations of images in a discrete domain. Because of its filter structure, the contourlet transform is not translation-invariant. In this paper, we use a nonsubsampled pyramid structure and a nonsubsampled directional filter to achieve multidimensional and translation-invariant image decomposition for spatial images. A nonsubsampled contourlet transform is used as the basis for an improved Bayesian nonlocal means (NLM) filter for different frequencies. The Bayesian model adds a sigma range in image a priori operations, which can be more effective in protecting image details. The NLM filter retains the image edge content and assigns greater weight to similarities for edge pixels. Experimental results both on standard images and spatial images confirm that the proposed algorithm yields significantly better performance than nonsubsampled wavelet transform, contourlet, and curvelet approaches.

1. Introduction

In spatial rendezvous and docking, spatial images are obtained by multisource remote sensors. Spatial images are inevitably mixed with different levels of noise and distortion. The accurate image feature extraction will be helpful for spatial object recognition and can directly influence the success of spatial rendezvous and docking [1, 2]. Image feature extraction of spatial images is based on the definition of image features; to some extent, it can be said that it is based on sensitivity changes to image grayscale values for the human eye. Multidimensional image representation can process images for the sparsest representation, especially for 2D image signals [3, 4]. This approach identifies optimal high-dimensional function representation for an image and yields superior image-processing results for an effective solution. A nonlocal means (NLM) filter uses redundant image information on the basis that structural similarity superimposed on pixel noise is random and noise can be effectively removed using weighted averages [5, 6]. Compared to traditional statistical filtering methods, NLM filtering overcomes the constraint of the local neighborhood and extends pixel similarity to block-based similarity, so it is very suitable to deal with spatial images.

In this paper, we use a nonsubsampled pyramid structure and a nonsubsampled directional filter to achieve multidimensional and translation-invariant image decomposition for spatial images. A nonsubsampled contourlet transform is used as the basis for an improved Bayesian nonlocal means (NLM) filter for different frequencies. The Bayesian model adds a sigma range in image a priori operations, which can be more effective in protecting image details. The NLM filter retains the image edge content and assigns greater weight to similarities for edge pixels. Experimental results both on standard images and spatial images confirm that the proposed algorithm yields significantly better performance than nonsubsampled wavelet transform, contourlet, and curvelet approaches.

The rest of this paper is organized as follows. Section 2 describes multidimensional image decomposition, with a focus on contourlet and nonsubsampled contourlet transforms (NSCTs). Section 3 outlines application of an NLM filter and proposes an improved NLM algorithm based on a Bayesian model. Section 4 applies the improved NLM filter to NSCT, especially NSDFB, to process image features for further extraction. Section 5 compares feature extraction results for the proposed algorithm and other algorithms. Section 6 concludes the paper.

2. Contourlet Transform Decomposition

2.1. Multidimensional Image Decomposition

The target of image multidimensional representation is to provide a description of image with less characteristic information. The wavelet transform is a classic image multidimensional representation algorithm that has a good effect on image edge points [7, 8]. However, the wavelet transform can capture only limited direction information in the horizontal, vertical, and diagonal directions, as shown in the left side of Figure 1. It is difficult to express image smoothness contours; a better image representation is shown in the right side of Figure 1.

Other well-known multidimensional image decomposition algorithms include bandlets, brushlets, edge multidimensional transform, complex wavelets, and wedgelet. However, these algorithms require image edge detection and then summarize a representative adaptive coefficient. A decomposition algorithm that can transform an image into fixed decomposition coefficients is desirable. These coefficients can then be used in a broader context that does rely on edge detection alone but also includes better directional image decomposition.

In 2004, CandΓ¨s and Donoho proposed a curvelet transform that uses a value approximation algorithm for a continuous 2D spatial domain and adds a smooth signal on the basis of a 1D Fourier transform [9]. The best approximation deviation is 𝑂((log𝑀)3π‘€βˆ’2) for curvelet and 𝑂(π‘€βˆ’1) for wavelet transforms. The curvelet transform is first applied to a continuous signal and then combines a multidimensional filter and ridgelet transformation. A second curvelet transform is based on frequency segments and extreme judgment. The curvelet transform is universally applicable to continuous signals, but there will be parallel noise in discrete fields [10]. It is also biased in directional image decomposition. The reason is that the typical rectangular sampling mode leads to a priori geometric deviation in decomposition of discrete image signals, especially in the horizontal and vertical directions. This limitation prompted researchers to develop a new multiscale decomposition algorithm that does not depend on edge detection and can decompose images in cross-scale multidimensions.

2.2. Contourlet Transform

The contourlet transform is a multidimensional decomposition algorithm proposed by Do and Vetterli in 2005 [11]. The transform can be directly used for multidimensional decomposition of discrete image signals. It has a dual filter for image decomposition and yields a smoother sparse representation of the original image. The two filters are a Laplacian pyramid (LP) filter [12, 13] and a directional filter bank (DFB) [14]. The LP yields nonconsecutive image points, and then the DFB connects consecutive points into a nonlinear structure. The process is shown in Figure 2.

A subsample contourlet transform uses a relevance factor 𝑀 for image subsampling at each decomposition level. A 2D filter is evolved from the 1D filter. For complete image reconstruction, the following relationship holds for the 1D filter:𝑀0(𝑧)𝑁0(𝑧)+𝑀1(𝑧)𝑁1(𝑧)=2,π‘†π‘š=𝐷2π‘™βˆ’1,2ξ€Έ,for0β‰€π‘šβ‰€2π‘™βˆ’1βˆ’1,𝐷2,2π‘™βˆ’1ξ€Έ,for2π‘™βˆ’1β‰€π‘šβ‰€2π‘™βˆ’1,(2.1) where 𝑀0(𝑧) and 𝑀1(𝑧) represent low- and high-pass analysis filters, and 𝑁0(𝑧) and 𝑁1(𝑧) represent low- and high-pass synthesis filters, respectively. Downsampling matrices π‘†π‘š are shown above. For 2D complete decomposition,𝑀0(𝑀′(𝑧))𝑁0(𝑀′(𝑧))+𝑀1(𝑀′(𝑧))𝑁1(𝑀′(𝑧))=2.(2.2)

π‘€ξ…ž(𝑧) represents discrete properties of the heterogeneous domain, which can help to reduce the filter complexity from O(𝑁2) to O(𝑁). In two dimensions, the first DFB step is construction of the spectrum in the frequency domain using two-channel quincunx filter banks to decompose an image into horizontal and vertical directions [15]. The DFB equivalent parallel family is𝐷(𝑙)π‘˜ξ‚ƒπ‘›βˆ’π‘†(𝑙)π‘šξ‚„ξ‚‡0β‰€π‘šβ‰€2𝑙,π‘šβˆˆπΏ2.(2.3)

Cutting operations on both directions for the decomposition spectrum provide 2D directional and segmental image decomposition. Like the discrete wavelet transform, the discrete downsampling contourlet transform is shift-invariant [16].

2.3. Nonsubsampled Contourlet Transform

NSCT is a fast implementation of the contourlet transform that provides a shift-invariant and multidimensional image representation [17]. Compared with subsampled contourlet transforms, NSCT is closer to the nonredundant wavelet transform [18]. NSCT uses a 2D nonsubsampled filter bank and can be expressed as𝑀0(𝑧)𝑁0(𝑧)+𝑀1(𝑧)𝑁1(𝑧)=1,(2.4) where 𝑀(𝑧) represents a 2D filter of the 𝑧 transform, 𝑀0(𝑧) and 𝑀1(𝑧) represent 2D low- and high-pass analysis filters, and 𝑁0(𝑧) and 𝑁1(𝑧) represent 2D low- and high-pass synthesis filters, respectively. There are also other limitations for the filter design.

NSCT involves two steps: multidimensional representation and directional decomposition. Multidimensional representation is achieved by nonsubsampled pyramid decomposition. This step is similar to the 1D discrete nonsubsampled wavelet transform (NSWT), which uses the Γ  trous method [19]. Compared to NSWT, NSCT uses a nonsubsampled 2D filter. The frame bound of an NSCT directional decomposition is𝑃1≀||𝑀0(π‘’πœ€)||+||𝑀1(π‘’πœ€)||ξ„Ώξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…ƒξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…€ξ…Œπ‘‘(π‘’πœ€)≀𝑃2,𝑃1=essinf𝑑(π‘’πœ€),𝑃2=esssup𝑑(π‘’πœ€),πœ€βˆˆ[βˆ’πœ‹,πœ‹]2.(2.5)

After decomposition of the first layer, the sampling filter banks provide multiscale decomposition of the underlying properties. The process for two-layer nonsubsampled pyramid decomposition is shown in Figure 3.

The frequency domain for layer 𝑗 supported by a low-pass filter is [βˆ’(πœ‹/2𝑗),βˆ’(πœ‹/2𝑗)]; the replacement domain is from [βˆ’(πœ‹/2π‘—βˆ’1),βˆ’(πœ‹/2π‘—βˆ’1)] to [βˆ’(πœ‹/2𝑗),βˆ’(πœ‹/2𝑗)], which is supported by a high-pass filter.

Each step in NSWT image decomposition involves three directions. The total image redundancy is 3𝐽+1; in NSP, the result redundancy is 𝐽+1 [20]. The second NSCT step provides directional information via the nonsubsampled filter, which combines two-channel quincunx sampling filters and a resampling operation for 2D frequency division on directional edges [21]. More accurate directional details can be sampled discretely on a sample stage. Sampling uses a quincunx matrix 𝑄 and considers image direction alignment. The process is shown in Figure 4.

3. NLM Filter Based on a Bayesian Approach

3.1. NLM Filter

Different frequency components play different roles in an image structure. Low-frequency components account for most image energy, forming basic local gradation areas, but play a small role in image content or structure. High-frequency components form the main image edges and determine its basic content or structure and are thus the most important components. Changes in high-frequency information lead to changes in the basic image content or structure, and information extracted from the image by the human eye will thus be subject to major changes. Thus, high-frequency components play the most important role in image perception by the human eye.

At present, many image filters only consider adjacent pixels; some filters take into account information for neighboring pixels, such as Yaroslavsky neighborhood filters [22] and bilateral filters [23]. A nonlinear filter involves additive white noise and can effectively handle image redundancy [24].

The NLM algorithm takes advantage of grayscale image redundancy and structural redundancy through a weighted average of pixel values to estimate the current pixel value. The value of each pixel is calculated using the Gaussian-weighted Euclidean distance between subblocks; a pixel as taken as the right center of the corresponding subblock. This ensures that pixels with a similar structure are assigned greater weight. For an original image 𝑣={𝑣(𝑖)βˆ£π‘–βˆˆπΌ}, the expression for the image processed using the NLM filter, 𝑁𝐿(𝑣), is𝑁𝐿(𝑣,𝑖)=ξ“π‘—βˆˆπΌπ‘€(𝑖,𝑗)𝑣(𝑗),(3.1) where 𝑀(𝑖,𝑗) is the Gaussian-weighted Euclidean distance between pixels 𝑖 and 𝑗, which represents the similarity of the image subblocks with 𝑖 and 𝑗 as centers:𝑀(𝑖,𝑗)=1𝑍(𝑖)expβŽ‘βŽ’βŽ£βˆ’β€–β€–π‘£(𝑁𝑖)π‘‘Γ—π‘‘βˆ’π‘£(𝑁𝑗)𝑑×𝑑‖‖2β„Ž2⎀βŽ₯⎦,𝑍(𝑖)=𝑗expβŽ‘βŽ’βŽ’βŽ£βˆ’β€–β€–π‘£ξ€·π‘π‘–ξ€Έπ‘‘Γ—π‘‘βˆ’π‘£ξ€·π‘π‘—ξ€Έπ‘‘Γ—π‘‘β€–β€–22π‘Žβ„Ž2⎀βŽ₯βŽ₯⎦,β„Ž2=1𝜎2ξ€·1+π‘Ÿ2ξ€Έπ‘ξ“π‘–π‘ξ“π‘—ξ€·π‘Ÿπ‘π‘–βˆ’π‘π‘—ξ€Έ2,(3.2) where 𝑍(𝑖) is a normalization factor, 𝑣(𝑁𝑖) is a set of subblocks with pixel 𝑖 as the center, β€–β‹…β€–2π‘Ž is a Gaussian-weighted Euclidean distance function, π‘Ž is Gaussian kernel standard deviation, and β„Ž is a filter parameter that controls the degree of smoothing. The weight 𝑀(𝑖,𝑗) meets 0≀𝑀(𝑖,𝑗)≀1, if neighborhood pixels are more similar with 𝑣(𝑁𝑖), the weight of center pixel is greater.

3.2. NLM Filter Combined with a Bayesian Method

The traditional NLM algorithm is very similar in both grayscale and structure content for smooth neighborhood areas. The algorithm yields the best results in flat areas, where better denoising effects can be obtained. At image edges and in texture-rich regions, the algorithm performs poorly because these regions have many repeat structures, the difference in grayscale content is greater, and the larger Euclidean distance makes the weights very small and reduces denoising capability, especially the ability to retain image detail [25]. To improve the edge retention capacity of the NLM algorithm, a Bayesian algorithm was added to make use of image edge information and adjust the similarity of the neighborhood structure so that the center pixel of edge contents that are similar can be given greater weight. This provides a more effective approach for protecting image detail.

The Bayesian NLM filter is expressed as𝑛(π‘₯)=βˆ‘π‘¦βˆˆΞ”(π‘₯)𝑝(𝑣(π‘₯)βˆ£π‘’(𝑦))𝑝(𝑒(𝑦))𝑒(𝑦)βˆ‘π‘¦βˆˆΞ”(π‘₯)𝑝(𝑣(π‘₯)βˆ£π‘’(𝑦))𝑝(𝑒(𝑦)),(3.3) where 𝑣(π‘₯) represents noise data, 𝑒(π‘₯) represents nonnoise image data, 𝑛(π‘₯) is the average pixel precision weight for gray value 𝑒(𝑦) at a rate of change Ξ”π‘₯, and 𝑝(𝑣(π‘₯)βˆ£π‘’(𝑦))𝑝(𝑒(𝑦)) is the similarity between 𝑣(π‘₯) and 𝑒(𝑦). Equation (5) can then be rewritten as𝑀(𝑖,𝑗)=1𝑍(𝑖)expβŽ‘βŽ’βŽ’βŽ£βˆ’β€–β€–π‘£ξ€·π‘π‘–ξ€Έπ‘‘Γ—π‘‘βˆ’π‘£ξ€·π‘π‘—ξ€Έπ‘‘Γ—π‘‘β€–β€–22π‘Žβ„Ž2βˆ’β€–β€–π‘›ξ€·π‘₯π‘–ξ€Έπ‘‘Γ—π‘‘βˆ’π‘›ξ€·π‘₯𝑗𝑑×𝑑‖‖22π‘Žβ„Žξ…žβŽ€βŽ₯βŽ₯⎦.(3.4)

3.3. Improved Bayesian NLM Filter

We propose an improved Bayesian NLM filter in which a sigma range is added to the prior image operation for more effective protection of image details [26]. The first step is analysis of the probability density for pixel levels, which takes the average variance for the improved Bayesian filter. Considering the independence and integrity of an image, its conditional probability distribution can be expressed as𝑝(𝑣(π‘₯)βˆ£π‘’(𝑦))=π‘€Γ—π‘€ξ“π‘š=1π‘ξ€·π‘£π‘š(π‘₯)βˆ£π‘’π‘š(𝑦)ξ€Έ,(3.5) where π‘£π‘š(π‘₯) and π‘’π‘š(𝑦) are the image probability densities at pixel π‘š and π‘£π‘š(𝑦) is a subset of π‘’π‘š(𝑦). For L-level image decomposition, the conditional probability density function isπ‘ξ€·π‘£π‘š(π‘₯)βˆ£π‘’π‘š(𝑦)ξ€Έ=π‘£π‘š(π‘₯)πΏβˆ’1𝐹(𝐿)ξ‚΅πΏπ‘’π‘š(𝑦)ξ‚Άexpξ‚΅βˆ’πΏπ‘£π‘š(π‘₯)π‘’π‘š(𝑦)ξ‚Ά,(3.6)𝐹(𝐿)=βˆšπΊπ‘Ž(𝐿)𝐿𝑙expξ€·π‘’π‘š(𝑦)π‘™βˆ’1βˆ’π‘£π‘š(π‘₯)π‘™βˆ’1ξ€Έ.(3.7)

Because of multiscale features, we assume that the prior probability 𝑝(𝑒(𝑦)) is continuous and uniform, 𝑝(𝑒(𝑦))=1/|Ξ”π‘₯|. The proposed algorithm uses an iterative technique, which takes the observed value 𝑣(π‘₯) as the initial value 𝑒(𝑦). This treatment can process data directly but takes longer, and details can become fuzzy. If the frequency window is too large, the result will have too much edge details and point targets become even more blurred. Experimental results confirmed that 3 Γ— 3 window is an appropriate choice. The algorithm uses an a priori estimate mean 𝑒′(𝑦) to replace 𝑒(𝑦) to reduce image noise bias and Ξ”x is replaced by 𝑁(π‘₯). The new Bayesian filter can be expressed as𝑛(π‘₯)=βˆ‘π‘¦βˆˆΞ”(π‘₯)𝑝(𝑣(π‘₯)βˆ£π‘’β€²(𝑦))𝑝(𝑒′(𝑦))𝑒′(𝑦)βˆ‘π‘¦βˆˆΞ”(π‘₯)𝑝(𝑣(π‘₯)βˆ£π‘’β€²(𝑦))𝑝(𝑒′(𝑦)).(3.8)

𝑁(π‘₯) can be expressed as 𝑁(π‘₯)=Ξ”π‘₯βˆ©π‘1(π‘₯)βˆ©π‘2(π‘₯), where 𝑁1(π‘₯) and 𝑁2(π‘₯) are a priori regional image characteristics and pixel features, respectively. The a priori regional characteristic is image region Ξ”π‘₯, and unrelated points are removed using a region similarity algorithm. The a priori pixel feature is the set obtained by comparing the similarity of adjacent pixels [27]. A priori pixel characteristics are generally always overlooked in NLM filter processes. In fact, a priori pixel characteristics are good for excluding pixel noise [28].

The sigma range between pixel π‘₯ and the a priori mean 𝑒′(π‘₯) can be defined as (𝑒′(π‘₯)𝐼1,𝑒′(π‘₯)𝐼2) and the range (𝐼1,𝐼2) meets πœ‰=∫𝐼2𝐼1𝑝(𝑠)𝑑𝑠, where 𝑝(𝑠) is the image probability density function. For different sigma values πœ‰βˆˆ{0.1,0.2,…,0.9}, the range can be calculated by pixel search [29].

It is desirable to have a greater sigma weight; however, under conditional probability, the sigma range cannot be greater than the maximum upper boundary, 𝑒′(π‘₯)𝐼2<𝑉max, where 𝑉max is the maximum image density. It has been demonstrated that a priori pixel characteristics can have a good effect on retention of image edges, but there will be some situations in which isolated pixels are ignored. To solve this problem, the proposed algorithm uses a threshold 𝑇=𝑉max/2 to separate two pixels [30]. For a priori pixels, only 𝑒′(π‘₯)<𝑇 are retained.

4. Image Feature Extraction Based on the Contourlet Transform

The proposed algorithm can improve the accuracy and completeness of image feature extraction based on direct contourlet decomposition. An image is first processed by the contourlet transform to yield a multidimensional domain, with multiple-resolution decomposition coefficients for large-scale details (low-frequency signal) and finer image details (high-frequency signal). Next, the algorithm applies deeper decomposition to the large-scale approximation. The whole process can be repeated until the algorithm yields the detail required.

Figure 5 shows the two-layer decomposition, where 𝐼 is the observed image, 𝐼′ is the image processed using the contourlet transform, LP is a Laplacian pyramid decomposition filter, DFB is a direction filter bank, LF is the low-frequency signal and HF is the high-frequency signal.

The decomposition coefficients for different frequencies are processed using the Bayesian-based NLM filter with a decomposition threshold. In particular, we use the wavelet threshold approach for the low-frequency part and the NLM approach for the high-frequency part. The specific steps in the algorithm are as follows.

Step 1. Decompose image I using the nonsubsampled contourlet transform.

Step 2. Apply the decomposition threshold method to the low-frequency part for noise suppression and feature extraction.

Step 3. Apply the improved Bayesian NLM filter to the high-frequency part for feature extraction.

Step 4. Reconstruct the high-frequency and low-frequency parts of the image processed using the contourlet transform.

For the low-frequency part, threshold decomposition is used to remove image noise. First, the threshold value 𝑇 is set. Decomposition coefficients smaller than 𝑇 are considered to be noise and thus are set to zero; coefficients greater than 𝑇 are reserved. The decomposition threshold is𝑇=πœƒπ‘›2π‘˜βˆš2log𝑁,πœƒπ‘›=𝑀(𝑑1)0.6745,𝑀(𝑑1)=𝑁𝑖=0ξ“π‘›βˆˆπ‘§2𝑑1ξ€Ί2𝑛+π‘˜π‘–ξ€»,(4.1) where π‘˜ is the number of layers for wavelet decomposition, πœƒπ‘› is a function for estimating mean absolute deviation, where 𝑑1 is the high-frequency coefficient for first-layer contourlet decomposition. The high-frequency part of the first layer usually contains few signal components and comprises mainly noise. For the high-frequency part, 𝐺[π·π‘˜(π‘₯)] and 𝐺[π·π‘˜(𝑦)] are two blocks for image I.𝐺[π·π‘˜(𝑝)] represents the rectangular neighborhood around 𝑝 as center. The proposed algorithm uses an improved Bayesian NLM image filter and a Euclidean distance to represent similarity between high-frequency image blocks. The similarity is represented by 𝑀(π‘₯,𝑦), which is defined as𝑀(π‘₯,𝑦)=1𝑍(π‘₯)π‘’βˆ’(‖𝐺[π·π‘˜(π‘₯)]βˆ’πΊ[π·π‘˜(𝑦)]β€–2/β„Ž2),𝑍(π‘₯)=π‘₯,π‘¦βˆˆπΌπ‘’βˆ’(‖𝐺[π·π‘˜(π‘₯)]βˆ’πΊ[π·π‘˜(𝑦)]β€–2/β„Ž2),(4.2) where β„Ž is a constant used to control the exponential decay rate. Compared with the original NLM filter, the contourlet transform decomposes the image at different resolutions and the proposed approach uses different algorithm to process an image: threshold decomposition is used for the low-frequency part, and an improved Bayesian NLM filter is used for the high-frequency part. The NLM filter involves time-consuming calculations. If an image is decomposed by the contourlet transform for π‘˜ levels, the improved algorithm only has to process 1/2π‘˜ of the original size. This not only reduces the computational complexity but also greatly improves the accuracy of feature extraction.

5. Experimental Results

Spatial image can directly influence the success of spatial rendezvous and docking. We must ensure that the most accurate image feature can be extracted from spatial images. The proposed algorithm can also be applied to general image processing. To verify the performance of the algorithm, we carried out experiments both on Spatial images and standard images.

5.1. Performance Evaluation Based on Spatial Images

Tests were carried out on image I and image II for an image size of 512 Γ— 512. Multidimensional contourlet and curvelet decomposition algorithms were used for comparison. The same parameters were used for all algorithms.

Image feature extraction results were evaluated according to subjective and objective standards. Figure 6 shows the processing results for image I, and Figure 7 shows the results for image II. It is evident that the proposed algorithm yields a better subjective visual effect compared with the curvelet and contourlet algorithms.

The results show that feature extraction with the curvelet transform leads to confusion for some background information, and the contourlet transform yields a blurry image. By contrast, the proposed algorithm effectively suppresses noise and displays the main features of the image. Finer image details are shown in Figure 8.

5.2. Performance Evaluation Based on Standard Images

Figures 9 and 10 show the processing results for image as III, IV. Image as III and IV are standard images for an image size of 512 Γ— 512. It is shown that our algorithm also provides better performance than that of Curvelet transform and Contourlet transform.

Results are objectively evaluated using the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM), defined as follows:PSNR=10log10(πΏβˆ’1)2βˆ‘π‘€π‘–=1βˆ‘π‘π‘—=1[𝑅(𝑖,𝑗)βˆ’πΉ(𝑖,𝑗)]2,SSIM(π‘₯,𝑦)=ξ€·2𝑒π‘₯𝑒𝑦+𝐢1ξ€Έξ€·2𝜎π‘₯𝑦+𝐢2𝑒2π‘₯+𝑒2𝑦+𝐢1ξ€Έξ€·πœŽ2π‘₯+𝜎2𝑦+𝐢2ξ€Έ,MSSIM(𝑋,π‘Œ)=1π‘Šπ‘Šξ“π‘Ÿ=1SSIMξ€·π‘‹π‘Ÿ,π‘Œπ‘Ÿξ€Έ,(5.1) where 𝑒π‘₯ and 𝑒𝑦 are the mean and 𝜎π‘₯ and πœŽπ‘¦ are the standard deviation for the original and processed images, respectively, 𝜎π‘₯𝑦 is the covariance for the original and processed images, and 𝐢1 and 𝐢2 are constants. MSSIM is mean SSIM, and π‘Š is the number of image subblocks. For greater PSNR and MSSIM (0≀MSSIM≀1), the processed image is closer to the original.

Table 1 compares the feature extraction performance for images I and II for several algorithms and different noise levels. The PSNR results show that the contourlet transform is superior to the curvelet transform for image I by almost 0.5 dB. Both the contourlet and curvelet transforms provide good edge detection. The proposed CT+NLM algorithm showed even better performance (~1.1 dB) compared with the contourlet transform but retained the good edge detection of the latter method (Table 1).The proposed algorithm uses an NLM filter for adaptive image expression. The MSSIM results show that the proposed algorithm yields the best performance for Gaussian, Poisson, Salt and Pepper and Speckle noise (Table 2).

Our algorithm uses a nonsubsampled key point filter for which 𝐽+1 redundancy is the most efficient. In pyramid decomposition, a lesser extent of image loss can be considered as an effective means to reduce redundancy. The proposed algorithm, which uses a nonsubsampled pyramid filter and a directional filter, leads to some image loss in reducing redundancy. Search windows of 16Γ—16, 32Γ—32, and 64Γ—64 were applied to images I and II. The size of the search window can affect the computational complexity of the NLM filter.

Comparison of the experimental results for different window sizes reveals that the proposed algorithm delivers better noise suppression and feature extraction than the other algorithms. It provides a maximum PSNR value and a minimum MSSIM value for all windows (Table 3).

6. Conclusions

Focusing on the actual needs of spatial images analysis, an improved contourlet transform, consisting of a nonsubsampled pyramid transform and nonsubsampled directional filter banks, was used to reduce the filter design problem of spatial images. The improved contourlet transform uses a mapping approach to solve the 2D filter design problem. The algorithm uses a Bayesian NLM filter for high-frequency information to suppress noise and improve the accuracy of image feature extraction. Experimental results confirm that the NLM filter can effectively retain structural information and reduce the residual structure. In the NSCT domain, the proposed algorithm showed better denoising and enhancement effects compared with the contourlet transform. Moreover, in comparison with NSWT, the algorithm is a more mature and sophisticated image-processing method.

Acknowledgments

This work was supported by the National Basic Research Program of China (973 Program) (2012CB821206), the National Natural Science Foundation of China (no. 91024001, no. 61070142), and the Beijing Natural Science Foundation (no. 4111002).