Research on a Pulmonary Nodule Segmentation Method Combining Fast Self-Adaptive FCM and Classification
The key problem in computer-aided diagnosis (CAD) of lung cancer is to segment pathologically changed tissues quickly and accurately. As pulmonary nodules are a potential manifestation of lung cancer, we propose a fast and self-adaptive pulmonary nodule segmentation method based on a combination of FCM clustering and classification learning. The enhanced spatial function considers contributions to fuzzy membership from both the grayscale similarity between central pixels and single neighboring pixels and the spatial similarity between central pixels and their neighborhoods, and it effectively improves the convergence rate and self-adaptivity of the algorithm. Experimental results show that the proposed method achieves more accurate segmentation of vascular adhesion, pleural adhesion, and ground glass opacity (GGO) pulmonary nodules than other typical algorithms.
In recent years, morbidity and mortality rates of lung diseases in China have been growing annually as a result of environmental degradation induced by air pollution and of increases in smoking and passive smoking. According to data provided by the National Cancer Institute, Ministry of Health, the mortality rate of lung cancer in China increased by 465% and its morbidity rate increased by 26.9% in the past 30 years. Furthermore, lung cancer has replaced liver cancer as the leading cause of death due to malignant tumors, accounting for 22.7% of all such deaths (morbidity and mortality due to lung cancer increase drastically, http://health.sohu.com/20121127/n358789664.shtml). Therefore, research on computer-aided diagnosis of lung cancer has become increasingly important.
Currently, as far as the characterization of lung diseases is concerned, CT imaging is the best means of diagnosis among the various medical imaging types. Since pulmonary nodules are the most commonly seen sign in the early stage of lung cancer, automatic extraction and assisted diagnosis of pulmonary nodules by means of spiral CT play a significant role in the detection of early lung cancer, effectively reducing both the errors of visual image interpretation and the workload of image reading. More importantly, they improve the repeatability of diagnosis and the consistency of image interpretation to some extent by providing doctors with accurate quantitative analysis through techniques such as image segmentation and quantification of characteristics.
In this process, a series of characteristics of pulmonary nodules, such as size, form, density, and patterns of enhancement and growth, need to be analyzed comprehensively in order to provide a basis for distinguishing benign from malignant tumors. For instance, whether pulmonary nodules have clear edges and the form of those edges (smooth, lobulated, spiculated protuberance, or burr) are key indicators of benignity and malignancy. Correct segmentation of pulmonary nodules is therefore essential.
In recent years, many pulmonary nodule segmentation methods have been proposed, which can be categorized as threshold methods [5, 6], morphological methods, deformable model methods, clustering methods [9–11], and image segmentation methods [12, 13]. Nevertheless, many contemporary methods need human intervention, and most of them perform well only for isolated solid pulmonary nodules. In practice, pulmonary nodules often occur as vascular adhesion, pleural adhesion, ground glass opacity (GGO), and cavitary nodules, as demonstrated in Figure 1. These types of nodules have a higher probability of malignancy than isolated ones. Because these nodules are attached to surrounding tissues, have fuzzy edges, and have gray scales similar to their surroundings, fast, correct, and automatic segmentation of them remains a difficult problem.
The fuzzy c-means (FCM) algorithm is a popular algorithm that has been widely used in image segmentation; it applies fuzziness to the membership judgment of pixels, which is consistent with human perception and convenient to implement. However, the conventional FCM algorithm does not consider any spatial information or neighborhood correlation, which makes it perform poorly on low-contrast, inhomogeneous, and noisy images.
To overcome these problems, many researchers have introduced spatial information by modifying the objective function or by altering the distance measure between pixels and cluster centers. Fergus et al. proposed a modified FCM algorithm (SFCMpq), which exploits the fact that adjacent pixels have similar characteristic values and therefore a high probability of belonging to the same class. Chuang et al. proposed a weighted FCM integrating spatial neighborhood information. Ahmed et al. proposed the FCM_S algorithm, in which the objective function is modified by introducing a neighborhood term that compensates for intensity inhomogeneity and allows the labeling of a pixel to be influenced by the labels of its immediate neighborhood. However, the execution efficiency of FCM_S is very low, since it needs to compute a series of neighborhood labelings during each iteration. Two improved variants of FCM_S, FCM_S1 and FCM_S2, were proposed to simplify the computation of parameters, and Chen and Zhang then extended them to the corresponding kernelized versions, KFCM_S1 and KFCM_S2, by kernel function substitution. The enhanced FCM (EnFCM) algorithm was proposed to speed up the clustering process by operating on the gray level histogram instead of on pixels; it uses a linearly weighted sum image formed from the original image and each pixel’s local neighborhood average gray level. The computational time of EnFCM is therefore greatly reduced, since the number of gray levels in an image is much smaller than the number of pixels. The fast generalized FCM (FGFCM) algorithm introduces spatial information that combines the intensity of the local pixel neighborhood with the number of gray levels in the image. The quality of its segmentation result is enhanced, and its computational time is similarly small.
Krinidis and Chatzis proposed a robust fuzzy local information c-means clustering (FLICM) algorithm that incorporates local spatial and gray level information; it is free of empirically adjusted parameters and enhances the clustering performance.
Although the modified FCM algorithms above have improved on the performance of conventional FCM and can segment different types of images to some extent, they still have the following disadvantages: (1) they require a large number of iterations to converge; (2) they perform poorly in segmenting CT images of lungs with complicated features, for example, where blood vessels have grayscales similar to those of pulmonary nodules or where GGO pulmonary nodules have low contrast with the background.
To address these issues, this paper presents a pulmonary nodule segmentation method combining fast self-adaptive FCM and classification. The improved spatial function calculates the cluster centers adaptively from both the grayscale similarity and the spatial similarity of each pixel's neighborhood and updates the fuzzy membership degree of the center pixel during each iteration, so that the cluster centers are obtained at a faster rate. Afterward, a classifier is constructed from the clustering results according to the labeled information.
2. Modified Fast Self-Adaptive FCM Segmentation
In addition to lung tissues, a typical CT image of a chest section includes other organs, bones, and the trachea. To increase the accuracy of CAD detection of lesion tissues and the speed of image processing, the first task is to remove these irrelevant items and extract the pulmonary parenchyma by means of global threshold binarization, detection of the trunk boundary, removal of background noise, and inpainting of the lung areas. Thereafter, nodules in the lungs, as well as blood vessels and bronchi that have characteristics similar to those of pulmonary nodules, can be analyzed. CT images of pulmonary nodules are generally bright in the center and dark in the peripheral area, so that points mapped into the joint characteristic space of brightness and greyscale are dense in the center and scattered at the periphery. Considering the diversity of pulmonary nodule density distributions, the weak edges of peripheral tissues, and the robustness of the algorithm to noise, this paper proposes the following modification of the traditional FCM algorithm: a new spatial function considers the contributions to fuzzy membership from both the grayscale similarity between central pixels and single neighboring pixels and the spatial similarity between central pixels and their neighborhoods. The modified algorithm effectively improves the convergence rate and the adaptivity to the spatial characteristics of pulmonary nodule edges.
2.1. The Traditional FCM Algorithm
As a partition-based clustering algorithm, the traditional FCM algorithm divides the pixels into $c$ fuzzy groups, searches for the clustering center of each group, and updates the clustering centers by updating the fuzzy membership of the pixels relative to each cluster. The update is implemented by minimizing the objective function, which is defined as

$$J_m = \sum_{i=1}^{c} \sum_{j=1}^{N} u_{ij}^{m} \, \lVert x_j - v_i \rVert^{2}, \tag{1}$$

where $c$ is the number of clusters, $N$ is the number of pixels in the image, $x_j$ is the greyscale of the $j$th pixel, $u_{ij}$ is the fuzzy membership of the $j$th pixel with respect to the $i$th cluster, $m$ is the weight exponent acting on the fuzzy membership, and $v_i$ is the center of the $i$th cluster. The constraint for the objective function minimization is

$$\sum_{i=1}^{c} u_{ij} = 1, \quad u_{ij} \in [0, 1], \quad j = 1, \dots, N. \tag{2}$$
This algorithm minimizes the objective function by assigning higher fuzzy membership to pixels whose greyscales are close to those of specific clustering centers and assigning lower fuzzy membership to pixels whose greyscales are distinct from those of specific clustering centers.
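The alternating minimization described above can be sketched in plain Python. This is a minimal illustration of textbook FCM on a flat list of greyscale values, not the authors' implementation; it assumes the standard update rules $u_{ij} = 1 / \sum_{k} (d_{ij}/d_{kj})^{2/(m-1)}$ and $v_i = \sum_j u_{ij}^m x_j / \sum_j u_{ij}^m$, and the function name `fcm`, the random initialization, and the stopping test on the largest membership change are choices made for this sketch.

```python
import random

def fcm(values, c=2, m=2.0, max_iter=100, eps=1e-5, seed=0):
    """Standard fuzzy c-means on a flat list of greyscale values.

    Returns (centers, u), where u[i][j] is the membership of value j
    in cluster i. Requires m > 1.
    """
    rng = random.Random(seed)
    n = len(values)
    # Random initial memberships, normalised so each pixel's memberships sum to 1.
    u = [[rng.random() for _ in range(n)] for _ in range(c)]
    for j in range(n):
        col = sum(u[i][j] for i in range(c))
        for i in range(c):
            u[i][j] /= col

    centers = [0.0] * c
    for _ in range(max_iter):
        # Centre update: membership-weighted mean of the greyscales.
        for i in range(c):
            w = [u[i][j] ** m for j in range(n)]
            centers[i] = sum(wj * x for wj, x in zip(w, values)) / sum(w)

        # Membership update from the distances to the current centres.
        change = 0.0
        for j, x in enumerate(values):
            d = [abs(x - v) for v in centers]
            if min(d) == 0.0:
                # The value coincides with a centre: crisp assignment.
                hits = [i for i in range(c) if d[i] == 0.0]
                new = [1.0 / len(hits) if i in hits else 0.0 for i in range(c)]
            else:
                inv = [di ** (-2.0 / (m - 1.0)) for di in d]
                total = sum(inv)
                new = [v / total for v in inv]
            for i in range(c):
                change = max(change, abs(new[i] - u[i][j]))
                u[i][j] = new[i]
        if change < eps:
            break
    return centers, u
```

Because each membership depends only on the greyscale distance to the centers, two pixels with the same greyscale always receive the same membership regardless of where they lie in the image, which is the limitation discussed next.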
Nonetheless, the traditional FCM algorithm calculates the fuzzy membership of each pixel based only on the distance between the pixel and the clustering centers. As a result, it is not suitable for pulmonary nodule images, in which the characteristics of forms and distributions are complex and diverse, because it is difficult to obtain smooth segmentation edges and closed segmentation regions. In addition, the execution efficiency of the FCM algorithm is low since it needs many iterations to converge. Existing modified FCM algorithms, such as FCM_S, FCM_S1, FCM_S2, FGFCM, and EnFCM, take into account information from pixels in the neighborhoods of central pixels and introduce a parameter that controls the intensity of the neighborhood information. This parameter is important because its selection directly influences the performance of the algorithm. However, this parameter cannot vary self-adaptively; that is, its value remains unchanged during the whole segmentation of an image. Obviously, such a fixed value does not reflect the characteristics of the neighborhood of each pixel.
We claim that the value of this parameter should be chosen based on (1) the similarity of local greyscales, that is, it should depend on the difference between the greyscales of the central pixel and its neighboring pixels in order to suppress noise and speed up convergence, and (2) local spatial similarity, that is, it should also depend on the size of the neighborhood and the difference between the neighborhood average greyscale and the greyscale of the central pixel, so that the adaptivity of the algorithm is enhanced.
2.2. 2D Vector Representation of the Spatial Relationship between a Pixel and Its Neighborhood
For an extracted pulmonary parenchyma image with $N$ pixels and greyscales in a fixed interval, let $x_j$ ($j = 1, \dots, N$) denote the greyscale at pixel $j$, let $N_j$ denote the neighborhood of pixel $j$, and let $k \in N_j$ be a pixel in that neighborhood. Denote the average greyscale over the neighborhood centered at pixel $j$ as $\bar{x}_j$, where the neighborhood size parameter is determined based on image resolution and noise. The 2-dimensional vector $(x_j, \bar{x}_j)$ represents the greyscale at pixel $j$ and that of its neighborhood; it takes different values across the lung parenchyma. The algorithm segments the image by sampling this 2D vector instead of the traditional greyscale alone.
To analyze the statistical rules of the correspondence between 2D vectors and various tissues in pulmonary parenchyma, a 2D greyscale matrix is defined and constructed, in which the abscissa represents the greyscales at pixels and the ordinate represents the average in their neighborhoods. The frequency of the vector value is where means the number of times that the vector value occurs.
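The construction of the 2D greyscale matrix can be sketched as follows. This is an illustrative sketch under stated assumptions: the neighborhood is taken as a square window of hypothetical radius `r` that includes the central pixel and is clipped at the image border, the neighborhood average is rounded to an integer level, and the frequency of a vector value is taken as its count divided by the total number of pixels. None of these choices is confirmed by the text above.

```python
from collections import Counter

def neighborhood_mean(img, r, y, x):
    """Mean greyscale over the (2r+1) x (2r+1) window centred at (y, x),
    clipped at the image border."""
    h, w = len(img), len(img[0])
    vals = [img[yy][xx]
            for yy in range(max(0, y - r), min(h, y + r + 1))
            for xx in range(max(0, x - r), min(w, x + r + 1))]
    return sum(vals) / len(vals)

def joint_histogram(img, r=1):
    """Relative frequency of each (greyscale, rounded neighborhood mean) pair."""
    h, w = len(img), len(img[0])
    counts = Counter()
    for y in range(h):
        for x in range(w):
            mean_level = round(neighborhood_mean(img, r, y, x))
            counts[(img[y][x], mean_level)] += 1
    n = h * w
    return {pair: k / n for pair, k in counts.items()}
```

For a uniform image every pixel maps to the same point on the diagonal, whereas an isolated bright pixel produces a low-frequency point far from the diagonal, which matches the qualitative reading of the 2D distribution given in the text.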
Figure 2(b) shows the distribution of greyscales and neighborhood averages of the pixels of the pulmonary parenchyma shown in Figure 2(a), for which the neighborhood size parameter is chosen as 2. Statistical analysis of the 2D vector distribution over all pixels shows the following. Since pulmonary parenchyma is the background of the image, the greyscale values at its pixels are close to their neighborhood averages, which is indicated by the points around the diagonal in region 1 of Figure 2(b). Greyscale values at pixels inside tracheas, blood vessels, and pulmonary nodules are also close to their neighborhood averages, indicated by the points around the diagonal in region 3 of Figure 2(b). The number of pixels at the edges of nodules, tracheas, and large blood vessels is relatively large, and the greyscales in these areas differ significantly from the neighborhood averages, as shown by the points densely distributed in region 4. In addition, discrete points with a low frequency of occurrence and a large difference between local greyscale values and neighborhood averages correspond to pixels at the edges of tiny blood vessels and lung walls.
From above, for a pixel, the size of its neighborhood, the difference between local greyscale and neighborhood average, and its frequency can provide local pattern features of the neighborhood around the central pixel for the algorithm updating fuzzy membership.
2.3. FCM Algorithm Based on Weighted Spatial Information and Greyscale Information
To improve the traditional FCM algorithm, this paper introduces a new spatial function which includes the spatial features and greyscale information of the neighborhood around the central pixel. The spatial function is defined as where represents the probability that the pixel determined from spatial information of the neighborhood belongs the th kind and is the membership that the neighboring pixels belong to the th kind, which is calculated by (3). Obviously, is maximum if all pixels in the neighborhood of the pixel belong to the th kind. Otherwise, is minimum if all pixels in the neighborhood of the pixel do not belong to the th kind.
A polynomial represents the global scale factor determined by the difference between greyscales at the pixel and neighboring pixels ; that is, the spatial function depends on the fuzzy membership of neighboring pixels , and more importantly contributions to the spatial function from different neighboring pixels around should be distinct. The less the greyscales at and are different, the closer their relation is and the greater the corresponding weight coefficient should be. Otherwise, the weight coefficient should be smaller. Thus, is defined aswhere is equivalent to the parameter in other classical algorithms. But is able to adjust the difference between greyscales at central pixels and their neighborhoods according to the degree of importance of the 2D vector in local pattern space. Considerwhich shows that is variable and enhances the self-adaptivity of pixels to their neighborhoods.
It can be seen that formula (7) combines the greyscales at central pixels with those of the pixels in their neighborhoods and dynamically adjusts the brightness around central pixels depending on the sizes of the neighborhoods and the mean square deviation within them. This protects the pixels in smooth areas, magnifies the differences at edges, and eliminates noise and the edges of tiny suspect objects in the fuzzy membership matrix.
The spatial function is plugged into the following formula in order to update the fuzzy membership of the pixel : where parameters and control the relative significance of the two functions.
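How a spatial function can correct a pixel's membership may be illustrated with a small sketch. It assumes that the update takes the multiplicative form used in the spatial FCM of Chuang et al., in which the new membership is proportional to $u_{ij}^{p} h_{ij}^{q}$ and is renormalized over the clusters; this form is an assumption suggested by the two exponents that control the relative significance of the two functions. The spatial function here is a plain unweighted sum of neighbor memberships, used only as a stand-in for the weighted spatial function of this paper, and the names `spatial_update`, `neighbors`, `p`, and `q` are hypothetical.

```python
def spatial_update(u, neighbors, p=1.0, q=1.0):
    """Spatially corrected memberships.

    u[i][j]: membership of pixel j in cluster i.
    neighbors[j]: indices of the pixels in the neighborhood of pixel j.
    """
    c, n = len(u), len(u[0])
    new_u = [[0.0] * n for _ in range(c)]
    for j in range(n):
        # Stand-in spatial function: sum of neighbor memberships per cluster.
        h = [sum(u[i][k] for k in neighbors[j]) for i in range(c)]
        scores = [(u[i][j] ** p) * (h[i] ** q) for i in range(c)]
        total = sum(scores)
        for i in range(c):
            # Keep the old membership if every score is zero.
            new_u[i][j] = scores[i] / total if total > 0 else u[i][j]
    return new_u
```

In a three-pixel row where the middle pixel initially disagrees with both neighbors, one pass of this update is enough to pull its membership toward the cluster of its neighbors, which is the qualitative behavior the text attributes to the spatial function.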
Based on the above descriptions, the self-adaptive FCM algorithm for training is as follows:
Algorithm 1 (fast self-adaptive FCM).
Consider the following.
Input. Extract CT image of pulmonary parenchyma.
Output. Segment pulmonary nodules.
Initializations. Choose the amount of clustering centers , the maximum number of iterations , fuzzy weight exponent , size parameter of neighborhoods , parameters that control the relative significance of the two functions and , and termination threshold ; set counter for iteration ; initialize randomly a clustering prototype matrix and normalize this matrix so that it satisfies constraint (2); finally, obtain clustering centers using formulas (3) and (4).
Step 1. Calculate and plot the 2D histogram for the pulmonary parenchyma image and obtain frequencies of occurrence of all 2D vectors.
Step 2. Calculate fuzzy membership of pixel and assemble the partition matrix :
If , such that , then and ( for ).
Step 3. Update fuzzy memberships according to formulae (7)–(9) and obtain new fuzzy memberships .
Step 4. Update clustering prototype modes ; that is, update clustering centers using new memberships :
Step 5. If or the maximum count of iteration is reached, enter Step 6; otherwise, return to Step 3.
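Step 6. Defuzzify using the maximum membership method. The cluster $C_j$ that pixel $j$ belongs to is given by

$$C_j = \arg\max_{i} \, u_{ij}, \quad i = 1, \dots, c,$$

where $u_{ij}$ is the fuzzy membership of pixel $j$ in cluster $i$ and $c$ is the number of clusters.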
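The final defuzzification step amounts to taking, for each pixel, the cluster with the largest membership. A minimal sketch, with the function name `defuzzify` and the tie-breaking rule (lowest cluster index wins) chosen for illustration:

```python
def defuzzify(u):
    """Assign each pixel to the cluster with the largest membership.

    u[i][j] is the membership of pixel j in cluster i; ties go to the
    lowest cluster index.
    """
    c, n = len(u), len(u[0])
    return [max(range(c), key=lambda i: u[i][j]) for j in range(n)]
```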
In the algorithm, the fuzzy weight exponent $m$ is a parameter introduced by Bezdek to generalize fuzzy clustering objective functions; it plays a role in suppressing noise and smoothing membership functions. Bezdek gave an empirical range for $m$, and the most suitable physical interpretation was later obtained for $m = 2$. Thus $m$ is chosen as 2 in this paper.
Now, let us analyze the number of iterations and the running time of the algorithm. Since each iteration loops over every pixel, the time complexity of the algorithm depends closely on the number of pixels and the number of iterations, that is, $O(NcT)$, where $N$ is the number of pixels, $c$ is the number of clusters, and $T$ is the number of iterations. The speed of the algorithm in this paper is significantly improved because it needs far fewer iterations than the traditional FCM and FCM_S algorithms. By comparison, the EnFCM and FGFCM algorithms transform pixel-based segmentation into segmentation based on greyscale levels (0–255); because the number of greyscale levels is much smaller than the number of pixels, those algorithms also run much faster.
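The saving obtained by working on greyscale levels rather than pixels can be illustrated with the center update alone. The sketch below is not EnFCM or FGFCM themselves (which cluster a filtered image); it only shows that when the membership depends on the grey level, a weighted center computed from the histogram equals the one computed pixel by pixel while summing over at most 256 terms for an 8-bit image. The mapping `u_of_level` (grey level to membership in one cluster) is a hypothetical input.

```python
from collections import Counter

def center_pixelwise(pixels, u_of_level, m=2.0):
    """Weighted centre computed by visiting every pixel (cost grows with N)."""
    num = sum((u_of_level[g] ** m) * g for g in pixels)
    den = sum(u_of_level[g] ** m for g in pixels)
    return num / den

def center_histogram(pixels, u_of_level, m=2.0):
    """Same centre computed from the grey-level histogram: once the
    histogram is built, each update sums over the distinct levels only."""
    hist = Counter(pixels)
    num = sum(n * (u_of_level[g] ** m) * g for g, n in hist.items())
    den = sum(n * (u_of_level[g] ** m) for g, n in hist.items())
    return num / den
```

For a 512 × 512 slice this replaces 262,144 terms per cluster and iteration with at most 256, after a single pass to build the histogram.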
Figure 3 shows the fuzzy memberships of pixels in pulmonary parenchyma, at edges, and in pulmonary nodules obtained by applying the above algorithm to an image of a blood vessel adhesion pulmonary nodule. The yellow rectangle in Figure 3(a) marks the area to be analyzed. Figure 3(b) shows the distribution of greyscales at the pixels in the rectangular area in the form of level sets. Figure 3(c) gives the fuzzy memberships of the pixels in the area of interest relative to the foreground cluster, calculated by using the new spatial function. To make the results easier to read, the values shown in the figure are the actual values multiplied by 100 and rounded. Black arrows point in the directions in which the fuzzy memberships evolve. Pixels in shaded areas represent the segmented boundaries based on the membership threshold.
Let us illustrate the effectiveness of the new spatial function using the pulmonary parenchyma shown in Figure 4. The positions that the arrows point to can be regarded as noise. The frequency with which the corresponding 2D vector occurs is extremely small, and its greyscale differs significantly from that of most neighboring pixels. Thereafter, in the initial stage, is relatively small while is large, which lead to the result that given by the algorithm through the spatial function (6) is large and the new fuzzy memberships by substituting into formula (9) is also relatively large. After each iteration, the probability that this pixel enters the $i$th cluster increases until the termination of the algorithm. Thus, this pixel is finally categorized into the $i$th cluster, to which its neighboring pixels belong, although its greyscale is distinct from the center of that cluster. The opposite holds for the pleural adhesion nodule shown in Figure 4(b), in which the area of the pulmonary nodule extends until it connects to the pleura, which shows higher greyscale levels. Though the greyscale levels at the connection are low, they change gradually, so the difference between the greyscales of central pixels and their neighborhoods is relatively small. The factor in formula (8) adjusts self-adaptively the brightness around pixels in the connection area by increasing the weight of , which also increases the spatial function . After one iteration, the fuzzy membership changes significantly.

(1) The central pixel is a point of noise and pixels in its neighborhood are homogeneous in greyscale. Let us take a point of tiny blood vessel noise shown in Figure 4(a) as an example, in which the size parameter of neighborhood . Greyscales and initial fuzzy memberships of the pixels in this area are listed in Table 1(a). Tables 2(a), 2(b), and 2(c) show the memberships and clustering centers of the pixels in this area after 1, 4, and 7 iterations.
In Table 2, the upper and lower parts show the results given by formula (1) and by Step 3 of Algorithm 1, respectively. By comparing the memberships and clustering centers before and after updating, it can be seen that the clustering centers after each iteration are hardly influenced by the central pixel, the speed of convergence is improved significantly, and the fuzzy memberships of the pixels in the neighborhood tend to be consistent. Thus the tiny blood vessel noise is correctly categorized into the background cluster.

(2) The central pixel is not a point of noise but is homogeneous in greyscale with the pixels in its neighborhood. Table 3 shows the memberships and clustering centers of the pixels in the neighborhood of the example pixel shown in Table 1(b) after 1, 4, and 7 iterations. Since the central pixel of the neighborhood is not noise, only the greyscales at the diagonal pixels are relatively dark, and the greyscales in the neighborhood show smooth transitions. Comparing the memberships and clustering centers before and after updating, as listed in Tables 3(a), 3(b), and 3(c), shows that after one iteration of the proposed algorithm the memberships of the pixels in the neighborhood are lower than those computed by Step 2 of Algorithm 1, and that the membership values tend to be consistent after several iterations. The central pixel is correctly categorized into the foreground cluster.
The above two examples provide theoretical explanations about the robustness of the algorithm. From the point of view of localized methods, formula (6) fully combines the local spatial information and greyscales of the image, which enhances the robustness of the algorithm to noise or outliers and the self-adaptivity to homogenous neighborhoods. In the meantime, it can change the fuzzy memberships of central pixels significantly and improve the executive efficiency of the algorithm.
3. Experimental Results and Analysis
This paper uses experimental data from the whole-lung image series provided by the Lung Image Database Consortium (LIDC), USA (https://public.cancerimagingarchive.net/ncia/login.jsf), and from the lung CT image database of Qianfoshan Hospital, Shandong Province. Table 4 describes the two sets of experimental data, the 2D CT images selected for the experiments, and information about the pulmonary nodules. The experimental images contain labels of pulmonary nodules provided by clinical experts.
To verify the effectiveness of the newly proposed algorithm, it is compared with five existing typical algorithms (FCM, FCM_S, EnFCM, FGFCM, and FLICM) in terms of segmentation quality, operational efficiency, and error rate. In the whole experiment, parameters involved in the above 5 algorithms are as follows: , , , , and ; that is, local neighborhoods are in size.
3.1. Effect of Pulmonary Nodule Segmentations
Isolated, blood vessel adhesion, pleural adhesion, and ground glass opacity pulmonary nodules are segmented in order to compare the performance of the above algorithms in removing noise and in handling tiny tissues, objects with fuzzy edges, and objects with similar greyscales.
Figure 5 shows the image of an isolated pulmonary nodule, the image of pulmonary parenchyma with salt and pepper noise, and locally zoomed-in segmentation results obtained by the above six algorithms and by manual segmentation. Analysis of the segmentation results shown in Figures 5(c)–5(h) shows that FCM and FGFCM are influenced by noise so severely that they cannot separate the objects from the background. A large amount of noise remains in the segmented background, and even tiny blood vessels are segmented as objects. FCM_S and EnFCM perform better than these two algorithms, although their segmentations of objects and tiny blood vessels are still far from ideal. The algorithm proposed in this paper removes the influence of noise satisfactorily and successfully separates the objects from the background.
Figure 5: (a) original image; (b) pulmonary parenchyma with noise; (h) our method; (i) manual segmentation.
Figure 6(a) shows the CT image of a blood vessel adhesion pulmonary nodule. The segmentation results of this nodule by the above six algorithms are shown in Figures 6(c)–6(h). Among all these algorithms, FCM yields the result that differs the most from manual segmentation because it extracts only the brightest region in the center of the nodule and loses all information about the nodule boundary. The segmentations provided by FCM_S, EnFCM, FGFCM, and FLICM show boundary leakage at the connections between nodule and blood vessels, and all four of these algorithms make some wrong segmentations of the left lung wall connected to blood vessels. The method proposed in this paper overcomes these problems and performs well in segmenting blood vessel adhesion pulmonary nodules.
Figure 6: (a) original image; (b) pulmonary parenchyma; (h) our method; (i) manual segmentation.
Figure 7(a) shows the CT image of a pleura adhesion pulmonary nodule. The segmentation results of this nodule by the above six algorithms are shown for comparison in Figures 7(c)–7(h). The shape of the nodule edges segmented by our method is the closest to that provided by the experts’ manual segmentation. The connection between nodule and pleura is handled especially finely, as indicated by the fact that the extended area of the nodule and the background area of pulmonary parenchyma are distinguished.
Figure 7: (a) original image; (b) pulmonary parenchyma; (h) our method; (i) manual segmentation.
Figure 8(a) shows the CT image of a ground glass opacity pulmonary nodule, which is indicated by a red arrow. The segmentation results of this nodule by the above six algorithms are shown for comparison in Figures 8(c)–8(h). FCM hardly segments the pulmonary nodule at all. FCM_S, EnFCM, and FGFCM perform less than satisfactorily on the nodule and on tiny blood vessels. The segmentation by FLICM is good, and our method gives a segmentation that is closer to the edges expected by experts.
Figure 8: (a) original image; (b) pulmonary parenchyma; (h) our method; (i) manual segmentation.
Generally, blood vessels have grayscales similar to those of pulmonary nodules; GGO pulmonary nodules have low contrast with the background; and grayscales at the connections between pleura and pulmonary nodules change smoothly, but their contrast with the background is also very low. The traditional FCM algorithm segments images by clustering similar data points in the characteristic space and does not include interactions between neighboring pixels, which yields the worst performance in segmenting CT images of lungs with complicated features. The other four typical algorithms improve on it by introducing neighborhood information, alternative distance measures, and membership corrections, which removes the influence of image noise. However, each has some shortcomings in how it accounts for the spatial and grayscale similarities of neighboring pixels, so these algorithms can fall into local extrema and are sensitive to initial values.
To illustrate more results of segmentations of pulmonary nodules, Figure 9 demonstrates several typical pulmonary nodules, including isolated, adhesion, and GGO nodules. Column 1 shows locally zoomed-in images of original pulmonary parenchyma; column 2 shows segmentation results given by using the neighborhood size parameter , that is, using neighborhoods; column 3 shows segmentation results given by using the neighborhood size parameter , that is, using neighborhoods; column 4 shows manual segmentation results provided by experts for reference.
Analysis of experimental results demonstrates the following: to some extent, the segmentation obtained by our method is close to the manual references, especially at nodule edges where burrs are obvious. Images in the same group can be referred to each other and those in different groups cannot be compared because different CT images are distinct in doses, resolutions, and quality, especially grayscale, and in the process of implementation the algorithm adjusts the parameter self-adaptively according to the grayscale features of the current image. Neighborhood size influences the segmentation significantly. Nodule edges which resulted from are relatively fine, especially where nodules are adhered to blood vessels or lung walls, as shown in the 5th and 6th images; however oversegmentations and wrong segmentations to tiny blood vessels are easy to occur, as shown in the 4th and 5th images. Comparatively, neighborhoods when are larger and the difference between grayscales at edges and the neighborhood averages increases. Parts of segmentation results are close to manual segmentations, like the 2nd and 6th images. However, undersegmentations occur sometimes, as shown in the 5th and 8th images, and it is also possible to segment nodules that are tightly adhered, as the 3rd image.
3.2. Executive Efficiencies of Algorithms
In order to further demonstrate the performance of the algorithm proposed by this paper, Table 5 shows average numbers of iterations (maximum number of iterations ) and running time (seconds) to obtain segmentation results in Figures 5–8 by the above five typical algorithms and our algorithm.
Table 5 indicates that when the FCM algorithm is employed to segment images, many iterations are needed to satisfy the convergence threshold, and the program runs correspondingly longer. The other algorithms (FCM_S, EnFCM, FGFCM, and FLICM) converge with fewer iterations and are more efficient. The algorithm proposed in this paper needs the fewest iterations and has the shortest running time, which is consistent with the observation in Section 2.3 that fuzzy memberships change considerably after one iteration. In the meantime, prior knowledge plays a role in supervising unlabeled or sparsely labeled data, which speeds up the convergence of the objective function while improving the clustering.
3.3. Error Rate of Segmentation
Figure 10 compares the boundaries of all types of pulmonary nodules obtained by our algorithm with those obtained by the experts' manual segmentations. Figures 10(a), 10(b), 10(c), and 10(d) correspond to the four types of pulmonary nodules in the test samples, which contain different numbers of each type. In each plot of Figure 10, the ordinate represents the number of pulmonary nodules and the abscissa represents the rate of wrong segmentation (%). A positive rate means that the algorithm undersegments relative to the manual segmentation, and a negative rate means that it oversegments relative to the manual segmentation.
(a) 162 solid nodules
(b) 97 vascular adhesion nodules
(c) 58 pleural adhesion nodules
(d) 46 GGO nodules
Figure 10 shows that the algorithm performs worst for GGO nodules. The proportion of GGO nodules whose rates of wrong segmentation lie in the interval is less than 70%. This proportion reaches 85% for each of the other three types of nodules. It is also noticeable that the rates of wrong segmentation for some nodules approach −200% or 100%.
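The sign convention used in Figure 10 can be illustrated with a small sketch. The text here does not give the formula for the rate of wrong segmentation, so the area-based definition below is an assumption chosen only because it matches the stated signs (positive for undersegmentation, negative for oversegmentation). The function names and the interval bounds passed to `proportion_within` are hypothetical.

```python
def signed_error_rate(algorithm_area, manual_area):
    """Signed rate of wrong segmentation in percent (assumed definition).

    Positive: the algorithm marks less area than the expert (undersegmentation).
    Negative: the algorithm marks more area than the expert (oversegmentation).
    """
    if manual_area <= 0:
        raise ValueError("manual_area must be positive")
    return 100.0 * (manual_area - algorithm_area) / manual_area


def proportion_within(rates, low, high):
    """Fraction of nodules whose signed rate lies in [low, high]."""
    if not rates:
        raise ValueError("rates must not be empty")
    return sum(1 for r in rates if low <= r <= high) / len(rates)


if __name__ == "__main__":
    # Areas in pixels; the values are made up for illustration.
    print(signed_error_rate(0, 100))    # 100.0: nothing segmented
    print(signed_error_rate(300, 100))  # -200.0: three times the manual area
    # Hypothetical interval of [-10, 10] percent.
    print(proportion_within([-5.0, 3.0, 12.0, -30.0], -10.0, 10.0))  # 0.5
```

Under this assumed definition the rate cannot exceed 100%, which is reached when the algorithm marks no pixels at all, while a rate of −200% corresponds to a segmented area three times the manual one.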
In this paper, a fast and self-adaptive FCM pulmonary nodule segmentation method combining clustering and classification learning is proposed in order to obtain satisfactory segmentations of vascular adhesion, pleural adhesion, and GGO pulmonary nodules. The new method improves the traditional FCM algorithm according to the features of these nodules. It updates the fuzzy memberships of central pixels based on the grayscale and spatial similarities of their neighborhoods, so that clustering centers are obtained quickly. The method addresses problems of traditional segmentation methods such as reliance on the contrast between objects and backgrounds, difficulty in capturing the weak edges of pulmonary nodules and attached tissues, and slow convergence of the objective function. In addition, manually labeled data provide auxiliary supervision for segmenting pulmonary nodules that lack classification information. Experimental results indicate that the proposed algorithm segments vascular adhesion, pleural adhesion, and GGO pulmonary nodules quickly and accurately and outperforms traditional FCM, FCM_S, EnFCM, FGFCM, and FLICM in segmentation quality, execution efficiency, and error rate.
However, the new method still has some limitations. For example, it does not work well for tiny nodules (diameter < 1 cm), and its segmentations of GGO nodules and tightly adhered nodules are unstable. In future research, we will address these problems by extracting more characteristics of pulmonary nodules and by applying deep learning methods. We will also analyze the symptoms of benign and malignant nodules comprehensively in order to provide practical aided diagnosis for the early detection and screening of lung cancer.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
J. Li, J. Dang, and Y. Wang, “Medical image segmentation algorithm based on quantum clonal evolution and two-dimensional tsallis entropy,” Journal of Computer-Aided Design and Computer Graphics, vol. 26, no. 3, pp. 465–471, 2014.
K. Murphy, B. van Ginneken, A. M. R. Schilham, B. J. de Hoop, H. A. Gietema, and M. Prokop, “A large-scale evaluation of automatic pulmonary nodule detection in chest CT using local image features and k-nearest-neighbour classification,” Medical Image Analysis, vol. 13, no. 5, pp. 757–770, 2009.
J. C. Bezdek, Pattern Recognition with Fuzzy Objective Function Algorithms, Plenum Press, New York, NY, USA, 1981.
M. A. Balafar, A. R. Ramli, S. Mashohor, and A. Farzan, “Compare different spatial based fuzzy-c_mean (FCM) extensions for MRI image segmentation,” in Proceedings of the 2nd International Conference on Computer and Automation Engineering (ICCAE '10), vol. 5, pp. 609–611, IEEE, Singapore, February 2010.
L. Szilagyi, Z. Benyo, S. M. Szilagyi, and H. S. Adam, “MR brain image segmentation using an enhanced fuzzy C-means algorithm,” in Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 724–726, September 2003.