Abstract

We propose a novel automatic side-scan sonar image enhancement algorithm based on curvelet transform. The proposed algorithm uses the curvelet transform to construct a multichannel enhancement structure based on human visual system (HVS) and adopts a new adaptive nonlinear mapping scheme to modify the curvelet transform coefficients in each channel independently and automatically. Firstly, the noisy and low-contrast sonar image is decomposed into a low frequency channel and a series of high frequency channels by using curvelet transform. Secondly, a new nonlinear mapping scheme, which coincides with the logarithmic nonlinear enhancement characteristic of the HVS perception, is designed without any parameter tuning to adjust the curvelet transform coefficients in each channel. Finally, the enhanced image can be reconstructed with the modified coefficients via inverse curvelet transform. The enhancement is achieved by amplifying subtle features, improving contrast, and eliminating noise simultaneously. Experiment results show that the proposed algorithm produces better enhanced results than state-of-the-art algorithms.

1. Introduction

Acoustic remote sensing technologies, such as high-resolution multibeam and side-scan sonars imaging in water, are widely used in marine geology, commercial fishing, offshore oil prospecting and drilling, and so forth [14]. Due to transmission loss and acoustic wave scattering, sonar images are notorious for low contrast, edge-blurring, and being full of noise. Therefore it is necessary to amplify faint edges and eliminate noise in sonar images simultaneously for further image processing, such as image segmentation and object detection and classification.

Image enhancement approaches can generally be divided into two categories: spatial domain methods and transform domain methods. Spatial domain enhancement methods deal with the image pixels. Desired enhancement can be achieved by manipulating the pixel values. Commonly-used spatial techniques are linear stretch, histogram equalization (HE) [5], convolution mask enhancement, adaptive histogram equalization, and so forth. The conventional histogram equalization has received considerable attention due to its simple and straightforward implementation, but it often amplifies noise, blurs subtle edges, and tends to over-enhance the image contrast if there are high peaks in the histogram [6]. These spatial domain methods usually cannot effectively discriminate edges from noise, because edges and noise have similar properties in spatial domain.

One way to solve this problem is to use multiscale geometric analysis (MGA) to decompose the image into different frequency bands and process the image in each band independently. It belongs to transform domain methods, the second category. Multiscale wavelet-based image enhancement algorithms have achieved promising results over the last decades [7, 8]. However, two-dimensional (2D) wavelet transform commonly used is a separable extension of 1D wavelet transform, which does not work very well in capturing the image’s geometric edges because of its isotropy. To overcome the limitation of the wavelet transform, other multiscale analyses have been developed during the past decade, including curvelet transform [9] and nonsubsampled contourlet transform (NSCT) [10]. These approaches capture edges better than the wavelet transform owing to their high directional sensitivity and anisotropy. The curvelet transform therefore has been widely applied in the image processing field [1115]. A contrast enhancement method based on curvelet transform has been developed, which uses a gain function with four parameters to modify the curvelet transform coefficients [11]. However, it requires appropriate manual parameter settings for different images that might otherwise result in image degradations. Lu et al. [16] proposed a piecewise function based enhancement method in curvelet transform domain (PFBE) to enhance the sonar image’s contrast. This method reduces the complexity of parameter adjustment by using an improved gain function with only one parameter, but it still requires parameter selection, which is manually set according to the input sonar images. In order to avoid manual parameter tuning, an automatic image enhancement method based on NSCT (AIE-NSCT) is proposed, which adjusts the NSCT coefficients by using a nonlinear mapping function [17]. This state-of-the-art image enhancement method has achieved good results in both grayscale and colour images. When processing the sonar image which has very low signal-to-noise ratio and strong noise, AIE-NSCT cannot sufficiently adjust contrast and eliminate noise. Furthermore, owing to the high redundancy of NSCT, NSCT-based methods are more time-consuming than curvelet-based methods.

Curvelet transform is better in representing edges and removing noise than classical wavelet transform for its anisotropy and multidirectional decomposition capabilities, and it is also faster than many other multiscale geometric transforms for its less redundancy. Moreover, curvelet transform well coincides with the sparse coding mechanism and the multichannel processing mechanism of the human visual system (HVS), which is composed of a series of parallel channels with each channel corresponding to a specific range of image spatial frequencies. Therefore, we propose an automatic side-scan sonar image enhancement method based on curvelet transform in this paper. The proposed algorithm utilizes the curvelet transform to model a multichannel enhancement structure based on the HVS and adopts a new adaptive nonlinear mapping scheme to modify the curvelet transform coefficients in each channel independently and automatically. Experiment results show that the proposed method can effectively enhance the contrast while eliminating noise and preserving edges in side-scan sonar images. The proposed method outperforms the state-of-the-art enhancement techniques in both qualitative and quantitative assessments.

The remainder of this paper is organized as follows. Section 2 describes the curvelet transform. Section 3 presents the curvelet-based multichannel enhancement structure, which is inspired by the multichannel processing mechanism of the HVS. Also in this section, an adaptive nonlinear mapping integrating noise removal with feature enhancement is proposed for each independent channel. The experimental results and performance evaluation are given in Section 4. The conclusions are drawn finally in Section 5.

2. Curvelet Transform

Curvelets were first introduced by Candès and Donoho in 1999 [9], which broke an inherent limit of wavelet in representing the geometry of image edges. The first-generation curvelet transform based on multiscale ridgelets combines with a spatial bandpass filtering operation to isolate different scales. However, this transform is very complicated and redundant, including many steps, such as subband decomposition, smooth partitioning, renormalization, and ridgelet analysis [15]. Later, a considerably simpler second-generation curvelet transform based on frequency partition technique was proposed by the same authors [1820]. The frequency domain of image is separated into disjoint wedge regions. Then, the local Fourier transform is implemented on these regions.

Assume that we work throughout in two dimensions, that is, , and set as spatial variable, as a frequency domain variable, and as polar coordinates in the frequency domain [21]. Let and be a pair of nonnegative, real-valued, and smooth window functions, called “radial window” and “angular window,” respectively. These windows will always satisfy the admissibility conditions:For each scale , we introduce the frequency window defined in the Fourier domain bywhere is the integer part of .

Define a “mother” curvelet as , and its Fourier transform . Then all curvelets at scale are obtained by rotations and translations of .

Introduce the equispaced sequence of rotation angles , with , , and the sequence of translation parameters . Then define curvelets at scale , orientation , and position bywhere is the rotation by radians. A curvelet coefficient is then given byAccording to Plancherel’s theorem, the curvelet transform can be expressed as the integral over the frequency plane Figure 1 summarizes the key components of the construction in second-generation continuous-time curvelet transform. The figure on the left represents the induced tiling of the frequency plane. The figure on the right represents the spatial Cartesian grid associated with a given scale and direction [21].

In practical implementation, we define Cartesian window bywhere is defined as the product of lowpass one-dimensional windows:Introduce the set of equispaced slopes: , , and definewhere is the shear matrix,The family implies a concentric tiling whose geometry is pictured in Figure 2.

Thus, the discrete curvelet transform is defined as where takes on the discrete values .

Two versions of fast discrete curvelet transform (FDCT), namely, FDCT via Unequispaced FFTs (USFFT) and FDCT via Wrapping, were developed [21]. They were simpler, faster, and less redundant than the first-generation curvelet transform. In this paper, the Wrapping-based version, which is faster than USFFT-based version, is chosen to implement the digital curvelet transform.

3. Automatic Enhancement and Denoising Algorithm

3.1. Multichannel Enhancement Structure Based on HVS

It is well known that for the HVS, the receptive fields of simple cells in primary visual cortex can be characterized as being spatially localized, oriented, and bandpass [22]. The HVS captures the essential information of a natural scene using a least number of visual active cells, which is called sparse coding on a natural scene. This result suggests that for an efficient computational image representation, it should be based on a local, directional, and multiresolution expression. In addition, the HVS is a multichannel processing mechanism. It is composed of a series of parallel channels and each channel runs independently of the visual cortex. There is an agreement that each channel is sensitive to a specific range of image spatial frequencies. Therefore, these channels usually can be divided into a lowpass channel and several bandpass channels.

As a multiscale multidirectional transform, the curvelet transform allows an almost optimal nonadaptive sparse representation of objects with edges. Because the curvelet transform exactly coincides with the mechanism of human visual perception, we can model the multichannel enhancement structure using the curvelet transform. The low frequency subband of the curvelet transform corresponds to the lowpass channel of the HVS, and each high frequency directional subband corresponds to each bandpass channel. Figure 3 shows a multiscale decomposition of a side-scan sonar image using the curvelet transform, from which we can see the outstanding capability of multiscale edges representation of the curvelet transform.

3.2. Adaptive Nonlinear Mapping Scheme

Because sonar images are full of strong noise, the critical problem for sonar image enhancement is to effectively remove noise while adaptively adjusting dynamic range and amplifying weak edges. After curvelet decomposition on a sonar image, the low frequency subband, which is almost noiseless, contains overall contrast information. While the high frequency subband in each scale and direction contains not only edges but also noise. Edges are geometric structures, while noise is not, so we can use the curvelet transform to distinguish edges from noise. Consequently, the low frequency subband, which corresponds to the lowpass channel of the HVS, needs to be stretched appropriately. However each high frequency subband, which corresponds to each bandpass channel of the HVS, needs to be sufficiently enhanced with denoising. According to the fundamental requirements proposed by Laine et al. in [23], an ideal design of a nonlinear mapping function should satisfy the following rules:(1)low contrast area should be enhanced more than high contrast area;(2)sharp edges should not be blurred;(3)the nonlinear function should be monotonically increasing, in order to maintain the location of local extrema and avoid generating new extrema;(4)the nonlinear function should be antisymmetric, for example, , in order to preserve the polarity of phases and avoid the phenomenon of ring.

To achieve the adaptive multichannel enhancement, we propose a nonlinear mapping scheme to modify the curvelet transform coefficients in each channel independently.

3.2.1. Adaptive Nonlinear Mapping of Low Frequency Channel

For the low frequency subband (), the nonlinear mapping function is defined as wherewhere is an original curvelet transform coefficient at location in the subband indexed by scale and direction . is the curvelet transform coefficient processed. For the low frequency subband, . Considerwhere denotes the maximum absolute coefficient amplitude in the subband indexed by scale and direction . One haswhere denotes the mean value of absolute coefficient amplitude in the subband indexed by scale and direction . Considerwhere denotes the standard deviation of absolute coefficient amplitude in the subband indexed by scale and direction .

Subjective brightness (intensity as perceived by the HVS) is a nonlinear logarithmic function of the light intensity incident on the eye, as shown in Figure 4 [5]. Figure 5(a) shows the nonlinear mapping curve representing the enhanced curvelet transform coefficients versus the original coefficients in the low frequency subband. From Figure 5(a), it is observed that the proposed nonlinear mapping function is an approximate logarithmic mapping, which is consistent with the characteristics of the HVS perception. Therefore, the dynamic range in the low frequency subband is well stretched by the proposed mapping function as shown in Figure 5(a).

3.2.2. Adaptive Nonlinear Mapping of High Frequency Channel

For the high frequency subband in each scale and direction , the nonlinear mapping function is defined as where and are given by (12) and (14), respectively, in Section 3.2.1. For each high frequency subband, . is a hard threshold in the corresponding high frequency subband for adaptive denoising. In this paper, the threshold for each high frequency subband is given by which has been termed as -sigma thresholding [24]. We set for the finest scale and for the others. The noise standard deviation of the original image is estimated using the robust median operator [25]; that is, , where refers to the curvelet transform coefficients in the finest subband. An approximate value of the individual variances in the th directional subband of the th scale is calculated using Monte-Carlo simulations [26].

Because the shapes of mapping curves in all the high frequency subbands are similar, the mapping curve of the high frequency subband indexed by and is given as an example in Figure 5(b). As expected, noise has been effectively suppressed by setting the coefficients which are smaller than the threshold to be zeros. The absolute slope of the mapping curve in Figure 5(b) is decreasing with the increasing of the absolute coefficient amplitude in the enhanced intervals . Therefore, compared with strong edges, weak edges which correspond to smaller coefficients will be enhanced more obviously. The proposed mapping function is in accord with the nonlinear logarithmic property of the HVS and can well balance noise reduction and edge enhancement, as shown in Figure 5(b). Moreover, the proposed mapping function also well satisfies the rules of monotonicity and antisymmetry.

In summary, the proposed nonlinear mapping achieves the following targets: preserving strong edges by keeping the large coefficients, enhancing weak edges by amplifying the small coefficients, and removing noise by eliminating noise coefficients using thresholding in the high frequency subbands . Meanwhile, it increases overall contrast by adjusting the dynamic range in the low frequency subband . Therefore, it adequately improves the visual quality of side-scan sonar images.

3.3. Block Diagram of the Proposed Algorithm

A block diagram of the proposed image enhancement algorithm is shown in Figure 6. To summarize, the proposed algorithm can be described as follows.

Input: original image .(1)Implement the curvelet transform of and obtain the curvelet transform coefficients in the low frequency subband and each high frequency directional subband.(2)Calculate the enhanced curvelet transform coefficients in the low frequency subband using (11).(3)Estimate the noise standard deviation and the individual variances in each high frequency directional subband to obtain the threshold .(4)Calculate the enhanced curvelet transform coefficients in each high frequency directional subband using (17).(5)Reconstruct the enhanced image from the coefficients via inverse curvelet transform.

Output: enhanced image .

4. Experimental Results and Discussion

In this section, the effectiveness of the proposed algorithm is validated through computer simulation. We compare our algorithm with three image enhancement algorithms: HE [5], AIE-NSCT [17], and PFBE [16]. HE and AIE-NSCT are free of parameter selection for optical image enhancement, but PFBE requires parameter tuning for sonar image enhancement. We use the fast discrete curvelet transform via Wrapping in our experiments [21].

4.1. Qualitative Assessment

Four typical side-scan sonar images, shown in Figure 7, are chosen as test images and enhancement results on sonar images are given in Figures 8(a)8(d), 9(a)9(d), 10(a)10(d), and 11(a)11(d) respectively. In order to further evaluate the performance of the different enhancement algorithms, we also use the Canny edge detector to extract the corresponding edges of the enhanced images, as shown in Figures 8(e)8(h), 9(e)9(h), 10(e)10(h), and 11(e)11(h). The gray-level mapping curves of the enhanced image versus the original image are shown in Figure 12.

The original image Sand in Figure 7(a) is a dark side-scan sonar image of the seafloor sand ripple. Figures 8(a)8(d) show the enhanced images by using HE, AIE-NSCT, PFBE, and the proposed algorithm, respectively. HE overenhances the image. This over enhancement is represented as a sharp change of its mapping curve in Figure 12(a). Moreover, it produces the brightness distortion of the sand ripple’s bottom; that is, the dark region at the bottom becomes too bright. The mapping curve verifies the observation, where input gray levels in the range of 0–20 are mapped to output gray levels in the range of 36–138. AIE-NSCT is able to enhance the contrast to some extent, but the overall contrast enhancement is not satisfactory and some details are still faint. It can also be seen from the mapping curve that the stretching of the dynamic range is insufficient. For PFBE, we choose the parameter to obtain the best enhanced image. PFBE increases the overall brightness of the image but cannot provide an adequate contrast enhancement. This is verified by its mapping curve, which is linear and almost parallel to the no-change mapping. However, the proposed algorithm efficiently emphasizes the subtle features. Meanwhile, it improves the overall contrast and removes noise. This is verified by its mapping curve in Figure 12(a), where input low gray-level range is compressed and high gray-level range is stretched sufficiently.

The original image Plane in Figure 7(b) is a side-scan sonar image of the plane wreckage in Lake Washington obtained by Marine Sonic Technology, Ltd. Figures 9(a)9(d) show the enhancement results by adopting HE, AIE-NSCT, PFBE, and the proposed algorithm, respectively. HE again overenhances the image accompanied with annoying amplified noise, resulting in the overbrightness of the fuselage. This is verified by the mapping curve in Figure 12(b), where the rising slope of the curve is the largest. AIE-NSCT enhances the contrast of some of the image structures, but the boundary of the plane’s shadow area is still unclear. For PFBE, we choose the parameter to get the optimal enhanced image. PFBE improves the overall contrast, but the edges of the plane are a little fuzzy. The proposed algorithm well distinguishes the background, the plane, and the shadow area, simultaneously yielding visually pleasing brightness. It can be seen from Figure 12(b) that our mapping curve is located between the mapping curve of HE and that of PFBE.

The original image River landform in Figure 7(c) is a side-scan sonar image of the riverbed in Changzhou city section of China Grand Canal. Figures 10(a)10(d) show the enhancement results by using the four algorithms, respectively. HE obviously amplifies noise, resulting in the undesired blur of weak edges. AIE-NSCT can preserve edges and reduce noise partly, but the overall contrast enhancement is not noticeable for its limited dynamic range adjustment. For PFBE, we choose the parameter to obtain the best enhanced image. Although the noise is effectively suppressed by PFBE, the perceived contrast is not significantly improved. The proposed algorithm efficiently enhances the dynamic range and edges while simultaneously removing noise. Furthermore, it can recover the details of small silt pits at the left bottom. This is verified by the mapping curves in Figure 12(c), where the input gray levels are stretched either excessively in HE or inadequately in AIE-NSCT and PFBE but appropriately in our algorithm.

The original image Bridge pier in Figure 7(d) is a side-scan sonar image of the Huai De bridge pier across Changzhou city section of China Grand Canal. We choose the parameter to get the optimal enhanced image for PFBE. The proposed algorithm is the only approach which can simultaneously enhance the contrast, sharpen the edges, and reduce noise. This can be illustrated by the visual results shown in Figures 11(a)11(d) and the mapping curves shown in Figure 12(d).

The corresponding edge detection results of the enhanced images are shown in Figures 8(e)8(h), 9(e)9(h), 10(e)10(h), and 11(e)11(h), respectively. The over enhancement and amplified noise provided by HE give rise to many false edge points, which can be seen in Figures 8(e), 9(e), 10(e) and 11(e). Owing to the insufficient contrast enhancement and edge details blurring produced by AIE-NSCT and PFBE, the detected edges are incomplete (e.g., the loss of weak edges in the middle of Sand shown in Figures 8(f)-8(g), the edge loss of the plane tail and wings in the shadow area of Plane shown in Figures 9(f)-9(g), the loss of texture details in bottom right of River landform in Figures 10(f)-10(g), and the incomplete outer contour of Bridge pier in Figures 11(f)-11(g)). Because the proposed method can significantly reduce noise and strengthen edges, it obtains more accurate, clean, and complete edges, which can be clearly observed in Figures 8(h), 9(h), 10(h), and 11(h). These edge detection results further demonstrate that the proposed method has a significant advantage of being able to suppress noise while preserving edges.

4.2. Quantitative Assessment

To acquire quantitative evaluation of enhancement results, the image contrast measure called the measure of enhancement by entropy or EME using entropy is proposed in [27]:where an image is split into blocks of size , and , are the maximum and minimum values of the pixels in each block , respectively. is a small constant equal to 0.0001 to avoid dividing by 0.

The EME by entropy (EMEE), which is of the entropy formula form XlogX, is actually measuring the entropy, or information, in the contrast of the image [28]. The EMEE value should increase by a significant magnitude when the contrast of an image is enhanced noticeably. We use the EMEE to compare the enhancement performance of different algorithms as listed in Table 1. Because HE overenhances the images of Sand and Plane, the maximum and minimum values of each block simultaneously become large, resulting in small EMEE values. Forthe images of River landform and Bridge pier, HE amplifies noise, resulting in large EMEE values. It is observed that the proposed algorithm offers the largest value of EMEE for each test image, which indicates that the enhanced images of our algorithm have the highest contrast. Both subjective and quantitative assessments have shown that the proposed algorithm outperforms other algorithms in enhancing image contrast, improving edge sharpness, and suppressing noise.

4.3. Comparison of Running Time

All the algorithms are implemented under MATLAB R2011b environment on a PC with 3 GHz Pentium(R) Dual-core CPU E5700 and 2 GB RAM. The running times of the four methods are given in Table 2. The test images are Sand, Plane, River landform, and Bridge pier, respectively with 500 × 512 pixels, 256 × 256 pixels, 500 × 500 pixels, and 300 × 300 pixels. For all the methods, the running time is proportional to the size of the image. The larger the size of the image is, the longer the running time is. The results indicate that our algorithm consumes much less time than AIE-NSCT because of the lower computational complexity of the curvelet transform. Meanwhile, compared with HE, our algorithm produces much better enhanced images though it costs a little more time. Compared with PFBE, our algorithm is also faster. It should be noticed that an extra time is needed for PFBE to select the optimal parameter manually, and the time is not included in the running time of PFBE in Table 2 because it is difficult to determine.

5. Conclusion

In this study, a new automatic side-scan sonar image enhancement algorithm in curvelet transform domain is proposed. We present an adaptive multichannel enhancement structure based on the HVS, combining the nonlinear mapping scheme with the curvelet transform. The proposed nonlinear mapping scheme is well designed to achieve the following goals: in the high frequency subbands, amplifying the coefficients of weak edges, preserving the coefficients of strong edges, and inhibiting noise coefficients, and in the low frequency subband, adjusting the dynamic range adequately. The nonlinear mapping is adaptive without any parameter tuning and is consistent with the nonlinear logarithmic property of the HVS. Therefore, the proposed algorithm can automatically achieve noise suppressing, edge sharpening, and contrast enhancement for side-scan sonar images. The proposed algorithm is tested on real sonar images and is compared with some popular enhancement algorithms. Experiment results demonstrate that the proposed algorithm outperforms the existing enhancement algorithms in terms of subjective visual evaluation and objective quantitative evaluation measure of EMEE. Moreover, compared with HE, the proposed algorithm can enhance the image much better with only a bit more time consumption. Compared with NSCT-based enhancement algorithm, our algorithm not only produces better results but also consumes much less time. Compared with PFBE, which is a nonadaptive curvelet-based enhancement algorithm, our algorithm can achieve better enhancement results without adjusting parameters manually. Therefore, the proposed approach can be easily and effectively used for sonar image enhancement.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (no. 60972101 and no. 41306089) and the Natural Science Foundation of Jiangsu Province (no. BK20130240).