Abstract

To obtain accurately located and continuous edges, a Gaussian mixture model and local direction modulus nonmaxima suppression are applied to the high frequency subbands of the nonsubsampled Contourlet transform (NSCT). The distribution of the NSCT high frequency subband coefficients has a “high spike, long tail” non-Gaussian statistical characteristic. A Gaussian mixture model (GMM) is used to distinguish the linear singular signal from the nonlinear singular signal in the high frequency subbands. Local direction modulus nonmaxima suppression is then used to refine the linear singular signal. An appropriate threshold separates edge pixels from nonedge pixels to produce a binary image. The experimental results demonstrate that the proposed method captures more continuous edges in multiple directions and locates edges accurately, which makes the detected edges convenient for image recognition.

1. Introduction

Image edge detection plays a very important role in image processing [1]. The results of edge detection directly affect the performance of image understanding, analysis, and recognition. The classical gradient-based edge detectors, such as the Sobel and Prewitt operators, are concise and easy to understand. Because these algorithms perform an averaging operation, they lose some detail and blur the edges, so their localization accuracy is not high. Canny edge detection usually gives good performance. The Canny algorithm uses two different thresholds to detect strong edges and weak edges, respectively. Weak edges are kept when they are connected to strong edges and are ignored when they are not. Because the Canny method uses a Gaussian filter, some weak edges are smoothed away. The detected edges are also cluttered, so it is difficult to identify the main contours.

With the introduction of multiscale analysis, image processing methods based on wavelets have obtained good results. The wavelet transform is known for its time-frequency localization, multiscale, and multiresolution properties. It is a very effective tool for image edge detection [2], as in the wavelet modulus maxima edge detection method. However, the basis functions of the wavelet transform are isotropic, so it cannot capture much directional information in images [3]. In recent years, multiscale geometric analysis has developed steadily, and new methods based on new multiscale transforms, such as the curvelet and Contourlet transforms [4], have been proposed. The Contourlet transform was proposed to address the lack of geometrical structure in the separable two-dimensional wavelet transform. Research results show that image edge detection based on the new multiscale transforms performs better than that based on the wavelet transform. The Contourlet transform is a true 2D image representation built from a directional filter and a Laplacian pyramid; it can effectively capture contours in an image, represents images better than the wavelet transform, and is anisotropic. Because of its downsampling and upsampling steps, however, the Contourlet transform is shift-variant and tends to produce Gibbs phenomena around singularities. In 2006, the nonsubsampled Contourlet transform (NSCT) [5] was proposed by Cunha et al. The NSCT performs well in image denoising and enhancement applications. It is composed of a nonsubsampled pyramid filter and a nonsubsampled directional filter, so it is shift-invariant [6].

In this paper, an edge detection method based on the nonsubsampled Contourlet transform and the Gaussian mixture model is proposed. Probabilistic and statistical analysis is applied to the NSCT coefficients. The Gaussian mixture model is used to automatically distinguish the linear singular signal from the nonlinear singular signal in the high frequency subbands. Local direction modulus nonmaxima suppression is used to refine the linear singular signal. An appropriate threshold separates edge pixels from nonedge pixels to produce a binary image. We compare the proposed method with the classical Canny operator, the Sobel operator, the wavelet method [2], and the method based on the cellular neural network (CNN) [7]. The proposed method captures more continuous edges in multiple directions and locates edges accurately.

2. Modeling NSCT High Frequency Coefficients

2.1. Nonsubsampled Contourlet Transform

NSCT consists of the nonsubsampled pyramid filter banks (NSPFB) and the nonsubsampled directional filter banks (NSDFB) [6]. These two independent parts are both shift-invariant. NSPFB is used for multiscale decomposition. NSDFB uses singular points of the same direction to synthesize the NSCT coefficients. NSDFB is shown in Figure 1(a). Frequency resolution is shown in Figure 1(b).

The distribution of the NSCT high frequency subband coefficients has a “high spike, long tail” non-Gaussian statistical characteristic, as shown in Figure 2(a). The high frequency subband coefficients consist of a few “big” coefficients and many “small” coefficients. The “small” coefficients contain little information, and the “big” coefficients contain the main information. The other high frequency directional subbands have a similar distribution. To exploit these characteristics and distinguish the linear singular signal from the nonlinear singular signal, a Gaussian mixture model is used to model the subbands, as shown in Figure 2(b): one Gaussian density function with a big variance and one Gaussian density function with a small variance together approximate the distribution. The “big” coefficients represent the linear singular signal, which corresponds to the image edges.

2.2. Parameter Estimation of Gaussian Mixture Model

We model each high frequency directional subband with a Gaussian mixture model. Each coefficient belongs to a given decomposition scale and high frequency directional subband. The subband is treated as a matrix of coefficients indexed by row and column, and the total number of coefficients in the subband is the sample size. Assume that the samples in a subband are independent of each other and obey the Gaussian distribution [8]. The coefficients are divided into two classes: one class is the “big” state; the other is the “small” state.

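The density function of each coefficient is a mixture of two Gaussian components, selected by a state variable that takes the value 0 or 1: 0 means the “big” state and 1 means the “small” state. Each component has a class mean and a class covariance, both indexed by the decomposition scale and the filter direction.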

Expectation maximization algorithm can be used to calculate the optimal parameters of GMM by E-Step and M-Step.

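E-Step: calculate the probability that each coefficient is generated by each component. Formula (1) involves the probability of the coefficient under each component and the weighting factor of that component.

M-Step: calculate the weighting factors, class means, and class covariances iteratively.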
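The E-Step and M-Step above can be illustrated with a minimal sketch of expectation maximization for a two-component, one-dimensional Gaussian mixture. It is not the authors' implementation: the function names, the initialization (a wide and a narrow component around the sample mean), and the stopping tolerance are all assumptions made for illustration, and the standard textbook update rules stand in for formulas (1)~(7), which are not reproduced here.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_two_state_gmm(samples, n_iter=200, tol=1e-8):
    """Fit a two-component 1-D GMM by EM.

    Returns (weights, means, variances). Index 0 is the large-variance
    ("big") state and index 1 is the small-variance ("small") state.
    """
    n = len(samples)
    mean0 = sum(samples) / n
    var0 = max(sum((x - mean0) ** 2 for x in samples) / n, 1e-12)
    # Assumed initialization: same mean, one wide and one narrow component.
    weights = [0.5, 0.5]
    means = [mean0, mean0]
    variances = [4.0 * var0, 0.25 * var0]
    prev_ll = -math.inf
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each sample.
        resp = []
        ll = 0.0
        for x in samples:
            p = [weights[m] * gaussian_pdf(x, means[m], variances[m]) for m in (0, 1)]
            total = p[0] + p[1] + 1e-300  # guard against log(0)
            ll += math.log(total)
            resp.append((p[0] / total, p[1] / total))
        # M-step: re-estimate weights, means, and variances.
        for m in (0, 1):
            nm = sum(r[m] for r in resp) + 1e-12
            weights[m] = nm / n
            means[m] = sum(r[m] * x for r, x in zip(resp, samples)) / nm
            var_m = sum(r[m] * (x - means[m]) ** 2 for r, x in zip(resp, samples)) / nm
            variances[m] = max(var_m, 1e-12)
        # Stop when the log-likelihood no longer changes.
        if abs(ll - prev_ll) < tol:
            break
        prev_ll = ll
    # Order the components so that index 0 has the larger variance.
    if variances[0] < variances[1]:
        weights.reverse()
        means.reverse()
        variances.reverse()
    return weights, means, variances
```

In this sketch a subband would be flattened into a list of coefficients before fitting; the returned parameters then describe the “big” and “small” states of that subband.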

2.3. Algorithm 1: Data Screening Algorithm by Gaussian Mixture Model

See Algorithm 1.

Input: high-frequency directional sub-bands
(1) Employ (1)~(3) to calculate the probability that each coefficient is generated by each component.
(2) Employ (4)~(7) to calculate the weighting factors, class means, and class covariances.
(3) Repeat steps (1) and (2) until the value of the likelihood function converges.
(4) Calculate the probability that each coefficient is generated by each component according to the converged parameters. Let pro0 be the probability that the coefficient is generated by the Gaussian model with the “big” variance; let pro1 be the probability that it is generated by the Gaussian model with the “small” variance.
(5) If pro0 < pro1, set the coefficient to 0.
Output: new high-frequency directional sub-bands
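
The final screening step of Algorithm 1 can be sketched as follows, assuming the mixture parameters have already been estimated. The function name `screen_subband` and the list-of-rows data layout are hypothetical, and weighting each component density by its mixing weight when forming pro0 and pro1 is an assumption, since the original formulas are not reproduced here.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def screen_subband(subband, weights, means, variances):
    """Zero the coefficients that are more likely to come from the "small" state.

    subband: list of rows (lists of floats).
    weights, means, variances: two-element sequences; index 0 is the
    large-variance ("big") component, index 1 the small-variance one.
    """
    screened = []
    for row in subband:
        new_row = []
        for c in row:
            # Assumed form of pro0 / pro1: weighted component densities.
            pro0 = weights[0] * gaussian_pdf(c, means[0], variances[0])
            pro1 = weights[1] * gaussian_pdf(c, means[1], variances[1])
            new_row.append(0.0 if pro0 < pro1 else c)
        screened.append(new_row)
    return screened
```

Coefficients near zero are dominated by the narrow component and are removed, while large-magnitude coefficients survive as candidates for the linear singular signal.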

2.4. Local Direction Modulus Nonmaxima Suppression

After executing Algorithm 1, we use local direction modulus nonmaxima suppression to refine the linear singular signal. The high frequency directional subbands already contain the directional information, so there is no need to calculate the direction of the modulus; it is enough to compare the modulus of a pixel with the moduli of its two neighboring pixels along the gradient direction [9]. If the pixel is a local modulus maximum, it is retained.

For example, there are eight directions in some new high frequency directional subbands. So there are eight sectors, which are , , , , , , , and . Mod () is the modulus of . is the equivalent gradient of . Figure 3 is the directional subband equivalent gradient.

2.5. Algorithm 2: Local Modulus Nonmaxima Suppression Algorithm

See Algorithm 2.

Input: new high-frequency directional sub-bands
If
 If
  ;
 End
End
ElseIf
 If
  ;
 End
End
ElseIf
 If
  ;
 End
End
Else
 If
  ;
 End
End
Output: (new high-frequency directional sub-bands )
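
Because the branch conditions of Algorithm 2 are not legible above, the following is only a generic sketch of modulus nonmaxima suppression along one direction, not a reconstruction of the authors' sector tests. The function name `suppress_nonmaxima`, the `(dr, dc)` offset that stands in for the equivalent gradient direction of a subband, and the treatment of out-of-range neighbors as zero are all assumptions.

```python
def suppress_nonmaxima(subband, offset):
    """Keep a coefficient only if its modulus is a local maximum along `offset`.

    subband: list of rows of floats.
    offset: (dr, dc) step; the two compared neighbors lie at +offset and -offset.
    Neighbors outside the subband are treated as having modulus 0.
    """
    rows, cols = len(subband), len(subband[0])
    dr, dc = offset

    def modulus(r, c):
        if 0 <= r < rows and 0 <= c < cols:
            return abs(subband[r][c])
        return 0.0

    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            m = modulus(r, c)
            # Retain the coefficient only when it is not smaller than both neighbors.
            if m > 0.0 and m >= modulus(r + dr, c + dc) and m >= modulus(r - dr, c - dc):
                out[r][c] = subband[r][c]
    return out
```

Under these assumptions, each directional subband would be processed with the offset that matches its own sector, for example `(0, 1)` for a horizontal comparison.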

2.6. Our Proposed Edge Detection Algorithm

The proposed edge detection algorithm is summarized as follows:

Input: original image.

Step 1. Decompose the original image into high frequency directional subbands and low frequency subband by NSCT.

Step 2. Let low frequency subband coefficients be 0.

Step 3. Employ Algorithm 1 to process the high frequency directional subbands. The resulting subbands contain the linear singular signal.

Step 4. Employ Algorithm 2 to process the subbands obtained in Step 3. The resulting subbands contain the refined linear singular signal.

Step 5. Reconstruct the image using the high frequency coefficients processed in Step 4 and the low frequency coefficients processed in Step 2.

Step 6. The inverse NSCT yields a data matrix that contains two types of values: those with a big modulus and those with a small modulus. An appropriate threshold can easily be set to separate them. Apply the threshold to distinguish edge pixels from nonedge pixels and obtain a binary image.

Output: edge detection result.
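
The thresholding in Step 6 can be sketched as below. The text does not specify how the threshold is chosen, so the helper `threshold_from_fraction`, which takes a fraction of the largest modulus, is purely a hypothetical choice for illustration; `binarize` simply marks entries whose modulus exceeds the threshold.

```python
def threshold_from_fraction(matrix, fraction):
    """Hypothetical threshold rule: a fraction of the largest modulus in the matrix."""
    largest = max(abs(v) for row in matrix for v in row)
    return fraction * largest


def binarize(matrix, threshold):
    """Return a 0/1 edge map: 1 where the modulus exceeds the threshold, else 0."""
    return [[1 if abs(v) > threshold else 0 for v in row] for row in matrix]
```

For example, `binarize(m, threshold_from_fraction(m, 0.5))` would keep only the entries whose modulus is more than half of the maximum; any other rule that separates the big-modulus values from the small ones could be substituted.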

3. Experimental Results

To quantify the performance of the proposed method, Pratt’s Figure of Merit (FOM) [10] is used to compare the edge detection methods. Because FOM requires the ideal edges of the test image to be known, only synthetic images can be used. The FOM is computed as follows:

$$\mathrm{FOM} = \frac{1}{\max(N_I, N_D)} \sum_{k=1}^{N_D} \frac{1}{1 + \alpha d_k^2},$$

where $N_I$ and $N_D$ are the numbers of ideal and detected edge pixels, respectively, the parameter $\alpha$ is a constant, and $d_k$ is the vertical distance between the $k$th actual edge pixel and the nearest ideal edge pixel. The FOM measures the accuracy of localization: the larger the FOM, the better the quality of the edge detection. To discuss the localization accuracy objectively, false edge pixels are not counted when computing the FOM.
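
As an illustration of how the FOM can be evaluated on binary edge maps, the sketch below implements the standard form of Pratt’s Figure of Merit. The function name `pratt_fom`, the default scaling constant of 1/9 (a commonly used value, not stated in the text), and the use of Euclidean distance to the nearest ideal edge pixel are assumptions. Every detected pixel is counted here, so the exclusion of false edge pixels mentioned above is not modeled.

```python
def pratt_fom(ideal, detected, alpha=1.0 / 9.0):
    """Pratt's Figure of Merit between two binary edge maps (lists of 0/1 rows).

    alpha is the scaling constant; 1/9 is an assumed, commonly used default.
    """
    ideal_pts = [(r, c) for r, row in enumerate(ideal) for c, v in enumerate(row) if v]
    det_pts = [(r, c) for r, row in enumerate(detected) for c, v in enumerate(row) if v]
    if not ideal_pts or not det_pts:
        return 0.0
    total = 0.0
    for r, c in det_pts:
        # Squared Euclidean distance to the nearest ideal edge pixel (brute force).
        d2 = min((r - ri) ** 2 + (c - ci) ** 2 for ri, ci in ideal_pts)
        total += 1.0 / (1.0 + alpha * d2)
    return total / max(len(ideal_pts), len(det_pts))
```

A detected map identical to the ideal map scores 1, and the score decreases as detected pixels move away from the ideal edge.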

The proposed method, the CNN method, the Canny and Sobel operators, the NSCT method, and the wavelet method are used to detect the edges of the synthetic image. The NSCT method detects edges without using GMM. Figure 4 shows the results detected by the different methods, and Table 1 shows the corresponding FOM values. The results demonstrate that the proposed method localizes edges more accurately than the Canny operator, the Sobel operator, the wavelet method, the NSCT method, and the method based on CNN. Because the NSCT method does not use GMM to distinguish the linear singular signal from the nonlinear singular signal in the high frequency subbands, the edges it detects are interrupted by the nonlinear singular signal, so its localization accuracy is very low. The Canny operator, the Sobel operator, and the CNN method are sensitive to noise, so their localization accuracy is lower than that of the proposed method.

Three standard images, which are Barbara (size ), Lenna (size ), and boat (size ), are used to test in the experiments. We select “9-7” pyramid decomposition and “pkva” directional filter bank for NSCT decomposition. Decomposition level is 3 and each level has eight high frequency directions. Experimental results are shown in Figures 5, 6, and 7.

The linear singular edges of the table cloth detected in Figure 5(b) are clearer than those in Figures 5(c), 5(d), 5(e), and 5(f). The texture edges of the tassels in Figure 6(b) are more continuous than those in Figures 6(c), 6(d), 6(e), and 6(f). The detected contour edges of the hull in Figure 7(b) are more complete than those in Figures 7(c), 7(d), and 7(f). In Figures 5(c), 6(c), 7(c), 5(d), 6(d), and 7(d), the edges are too cluttered for the main contours to be distinguished. Because the wavelet has just two directions, it cannot capture more directional details. Because the NSCT method does not use GMM to distinguish the linear singular signal from the nonlinear singular signal in the high frequency subbands, the edges it detects are interrupted by the nonlinear singular signal, and the main contours cannot be distinguished in Figures 5(g), 6(g), and 7(g) either. Because the NSCT has good directionality, anisotropy, and decorrelation, the proposed method can capture more directional details, and because it uses the GMM, it can capture the main contours. The main contours are clearly visible in Figures 5(b), 6(b), and 7(b).

To evaluate the algorithm, a quantitative analysis is adopted [11]. In Table 2, CEN denotes the number of continuous edge pixels in the edge image, and TEN represents the number of all edge pixels in the edge image. Edge continuity is measured by the ratio of CEN to TEN, that is, CEN divided by TEN: the larger the ratio, the more continuous the image edges.

Table 2 shows that the edges obtained by the proposed method are more continuous than those obtained by the other methods. Because the wavelet has just two directions, it cannot capture more directional details. Although the NSCT method captures more edges than the Canny and Sobel operators, the pixels it detects contain a large amount of the nonlinear singular signal. The proposed method captures more details than the other methods.

MSE (Mean Squared Error) is a standard measure of the accuracy of localization: the greater the MSE, the lower the accuracy of localization, and the smaller the MSE, the higher the accuracy. The MSE is computed as follows:
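
A minimal sketch of a mean squared error computation is given here for illustration. Which two images are compared is an assumption: the sketch takes a reference edge map and a detected edge map of the same size, and the function name `mse` is hypothetical.

```python
def mse(reference, test):
    """Mean squared error between two equally sized matrices (lists of rows)."""
    rows, cols = len(reference), len(reference[0])
    if len(test) != rows or any(len(row) != cols for row in test):
        raise ValueError("both matrices must have the same shape")
    total = 0.0
    for r in range(rows):
        for c in range(cols):
            diff = reference[r][c] - test[r][c]
            total += diff * diff
    return total / (rows * cols)
```

For binary edge maps this value is simply the fraction of pixels on which the two maps disagree.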

Table 3 shows the MSE results of the different methods. The proposed method has a lower MSE than the other methods, which means that it localizes edges with high accuracy. Because the NSCT method does not use GMM to distinguish the linear singular signal from the nonlinear singular signal in the high frequency subbands, the edges it detects are interrupted by the nonlinear singular signal, so its localization accuracy is low.

The results show that, compared with traditional edge detection methods, the proposed method has better edge locating ability and keeps more edge details.

4. Conclusions

We have presented an edge detection method based on the nonsubsampled Contourlet transform and the Gaussian mixture model. It captures more directional information and more continuous edges, and it overcomes the limited directionality of the wavelet transform. The main contours in the image are clearly expressed, and the edges are located effectively. The thick edges detected by the proposed method express the main edges in the images, which is convenient for image recognition. The proposed method runs slowly, however, and its speed needs to be improved in future work.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Grant no. 61440044, no. 61561001, no. 61102008, no. 61163017, no. 61261043, and no. 61462002), the Ningxia Science Foundation of China (Grant no. NZ13097), and the Foundations of Research Projects of State Ethnic Affairs Commission of P. R. China (Grant no. 14BFZ003).