Abstract

Accurate classification of Pap smear images becomes the challenging task in medical image processing. This can be improved in two ways. One way is by selecting suitable well defined specific features and the other is by selecting the best classifier. This paper presents a nominated texture based cervical cancer (NTCC) classification system which classifies the Pap smear images into any one of the seven classes. This can be achieved by extracting well defined texture features and selecting best classifier. Seven sets of texture features (24 features) are extracted which include relative size of nucleus and cytoplasm, dynamic range and first four moments of intensities of nucleus and cytoplasm, relative displacement of nucleus within the cytoplasm, gray level cooccurrence matrix, local binary pattern histogram, tamura features, and edge orientation histogram. Few types of support vector machine (SVM) and neural network (NN) classifiers are used for the classification. The performance of the NTCC algorithm is tested and compared to other algorithms on public image database of Herlev University Hospital, Denmark, with 917 Pap smear images. The output of SVM is found to be best for the most of the classes and better results for the remaining classes.

1. Introduction

Cervical cancer is one of the most common cancers affecting women worldwide and the most common in developing countries [1]. It can be cured, if it is detected in early stages and identify in which stage it belongs and further proper treatment has given in time. At the same time the occurrence and the death rate remains even high in the developing and under developed regions of the world. It is reported that annually 132,000 new cases were diagnosed and 74,000 deaths in India, which is nearly one-third of the global cancer deaths [2]. Screening of cervical cancer can be done by Pap test, which is believed to be the gold standard forever. Due to the subjective disparity of different cytologists, the screening results show more of inconsistencies [3]. The test output shows more of false positive and false negative results, which make the reliability of the screening process a question mark [4]. Also in manual cervical screening process, hundreds of images are analyzed daily; the classification of cells become tough and the possibility of human errors become high.

Many automatic and semiautomatic methods have been proposed in various times to detect various stages of cervical cancer. Many of these methods were not supported in achieving the objectives of providing measured variables which could eliminate the interpretation errors and interobserver discrepancy [5]. Pap smear images are rich in various features like color, shape, and texture. The process of accurate extraction of unique visual features from these images would very well help in developing an automated screening device. This can be achieved only through texture feature than others since all the cellular changes are observed only using these features. Since the texture parameters are the simple mathematical representations like smooth, rough, and grainy, then the analysis becomes easier [6]. Analyzing all the above issues, two important challenges are considered. First, selection of unique texture features suitable for classification. Second, selection of the most efficient and scalable classifier improves the accuracy more.

Plissiti et al. [7] have developed the fully automated method to detect the nucleus in Pap smear images. Using morphological analysis, the nuclei centroids are detected and by applying distance dependent rule and classification algorithms on resulted centroids, the undesirable artifacts were removed from the cell.

By considering nucleus as the most informative region of the cell, Sobrevilla et al. [8] have been presented an algorithm for automatic nuclei detection of cytology cell. This algorithm combines color, cytopathologists knowledge, and fuzzy systems which show high performance and more computational speed. Harandi et al. [9] have developed a system for the detection of cytoplasm and nucleus from ThinPrep images. The geometric active contours were used as the segmentation tool. In this method, localization of cell objects were done in low resolution and boundary detection of cytoplasm and nucleus were done in high resolution. Bergmeir et al. [10] have developed an algorithm used to detect cell nuclei and cytoplasm. This algorithm used the combination of voting scheme and prior knowledge to locate the cell nuclei and elastic segmentation to determine the shape of nucleus. The noise is removed with mean-shift and median filters, and edges were extracted with canny edge detection algorithm.

Most of the segmentation methods discussed in this literature focused on nucleus and cytoplasm extraction, which requires higher contrast around the boundaries of nucleus. The heavily stained cervical smears, overlapping of cell images and blurred images to overexposing or underexposing of light in microscope even cause difficulties in segmentation [11].

The automatic classification of Pap smear images focuses on marking of single cells into any one of binary classes (normal and abnormal) or multiple classes (based on severity). Multiple classification of Pap smear images became popular when CIN based classification on cervical cytology images was proposed. Holmquist et al. [12] developed a binary classification method to distinguish between normal and abnormal cells. The dual wavelength method was used for the automatic isolation of nucleus from cytoplasm. The classification procedures were done based on the extraction of density-oriented, shape-oriented, and texture-oriented parameters. Chou and Shapiro [13] have proposed a method used hierarchical multiple classifier scheme. This method used graph-theoretic clustering algorithm to group the training data, component classifiers as the inputs to a super-classifier, and subclass labeling is used to improve the classification accuracy. Marinakis et al. [14] proposed a metaheuristic approach to classify the Pap smear cells. Uniquely described twenty features are extracted from each cell image and classified as normal and abnormal type. The genetic algorithm is used to find the best possible performing subset selection.

Most of the medical image classification methods supervised learning algorithms which challenges to find out the connection between the independent and dependent variables. Support vector machine (SVM) and neural network (NN) are the most promising methods of this category. Support vector machine (SVM) is a supervised classification method introduced by Cortes and Vapnik [15] in 1992. SVM plays vital role in cytology image analysis including yeast cells on suspension in bioreactors [16], cells in culture using microscopy [17], and cells on sections of brain tumors [18]. NN classifiers are based on statistical probabilities in making classification decisions [19]. NN uses the training set which contains inputs, outputs, and the learning rules.

Chen et al. [20] have developed an algorithm for segmenting nucleus and cytoplasm counters. This system classifies the Pap smear cells into anyone of four different types of classes using SVM. Two experiments were conducted to validate the classification performance which showed the best performance outputs. Mat-Isa et al. [21] have proposed an automatic cervical cancer diagnostic system based on hierarchical hybrid multilayered perceptron network. In this method, region-growing-based algorithm is used for the feature extraction procedure and classification is done by NN.

The above literatures clearly show that none of the single texture feature is suitable in improving classification accuracy. The necessity of scalable and cost effective algorithm is required to improve the efficiency of the classification system. Our classification method NTCC classifies the Pap smear cells into anyone of seven stages, based on the work done by researchers on the public database of Herlev University Hospital, Denmark [22, 23]. The detailed description of Pap smear cells is shown in Table 1. By carefully extracting 24 features, the images are classified into any one of seven classes, namely, superficial squamous, intermediate squamous, columnar, mild dysplasia, moderate dysplasia, severe dysplasia, and carcinoma in situ. Already many researchers have contributed various classification algorithms based on this public database [14, 24].

2. Nominated Texture Based Cervical Cancer (NTCC) Classification System

The NTCC classification system consists of texture feature extractor, SVM trainer, SVM classifier, and the database to store the trained and stored database. The architecture of the proposed classification system is depicted in Figure 1. The system involves the following steps: preparation of Pap smear images and preprocessing, segmentation of nucleus and cytoplasm, extraction of texture features, and classification.

2.1. Preparation and Preprocessing of Pap Smear Images

The cytology images are acquired through a powerful microscope by the skilled cytotechnicians. All images were captured with a resolution of μm/pixel from the public database of cervical cancer, Herlev University Hospital, Denmark [23]. The purpose of the preprocessing step is to suppress the unwanted noise found in cervical image samples and enhances them for further processing. In general the nucleus region of the cervical cytology cell has larger distribution of darker pixel than cytoplasm. In this step the input images are first inverted, and then the image binarization followed by morphological closing operation with structuring element of five has been performed. The rough segmentation of cell nucleus can be done through morphological filling operation.

2.2. Feature Extraction and Selection

Feature selection chooses the optimum subset of features from the enormous set of potentially useful features which may be available in a given problem domain. By selecting the precise number of features, it is able to reduce the storage space and computational time, which will certainly improve the performance [25, 26]. Also too many dimensions in the feature space may drastically increase the computational complexity and deteriorate the discriminating power of the feature set due to the distortion and noise [27].

In cervical cancer classification system seven sets of features are extracted. They are relative size of nucleus and cytoplasm, dynamic range and first four moments of intensities of nucleus and cytoplasm, relative displacement of nucleus within the cytoplasm, gray level cooccurrence matrix features, local binary pattern histogram, tamura features, and edge orientation histogram which includes the total features of 24.

The size of nucleus and cytoplasm plays a vital role in classifying the cervical cell type: where Nucleusarea is the proportion of number of pixels of nucleus (N) and Cytoplasmarea is the proportion of number of pixels of cytoplasm (C).

Dynamic range of an image and the first four moments of the intensities of nucleus and cytoplasm provides four dissimilar statistical moments. They are calculated by individual values of image pixels and not on the cooccurrence of neighboring pixel values. Dynamic range (dr) of an image is the difference between the intensity values of the brightest and the darkest pixel in the image. The four moments are mean (), variance , skewness , and Kurtosis as follows [28]:

The movements position of nucleus inside the cytoplasm help in classifying stages. This can be achieved by extracting the relative displacement of nucleus within the cytoplasm: where Cytocentroid is the centroid of cytoplasm and Nuclcentroid is the centroid of nucleus.

Relative displacement can be calculated by

Haralick’s gray-level cooccurrence matrices (GLCMs) have been used very successfully in texture classification [29]. Out of 14 features outlined, we considered first 11 texture features suitable for our experiment.(a)Angular second moment: (b)Contrast: (c)Correlation: (d)Sum of squares: (e)Inverse difference moment: (f)Sum average: (g)Sum variance: (h)Sum entropy: (i)Entropy: (j)Difference variance: (k)Difference entropy:

Local binary pattern (LBP) [30] transforms an image into an array or integer labels. It is computed by comparing a given pixel with its neighbors: where is variant to changes of the mean gray value of the image.

Tamura’s texture features like coarseness, contrast, and directionality [31] are extracted which are purely based on human visual perception are extracted for our experiment.(a)Coarseness: (b)Contrast: (c)Directionality:

Edge orientation histogram (EOH) aims to build a histogram with the directions of gradients of the edges:

2.3. Classification Using SVM

The proposed classification system classify the Pap smear images into one of seven classes, namely, superficial squamous, intermediate squamous, columnar, mild dysplasia, moderate dysplasia, severe dysplasia, and carcinoma in situ. The manual classification of the entire dataset has been already done by the experts. Figure 2 shows sample single cell images in which first row represents three classes of normal image type and the second row represents four classes of malignant type. In the proposed classification system, we used SVM algorithm to classify the Pap smear images. This SVM is based on the work done by Vapnik et al. which is implemented as “LibSVM” [15, 32].

3. Experimental Results and Discussion

In the present work, features of Pap smear images such as relative size of nucleus and cytoplasm, dynamic range and first four moments of intensities of nucleus and cytoplasm, relative displacement of nucleus within the cytoplasm, gray level cooccurrence matrix features, local binary pattern histogram, tamura features, and edge orientation histogram are extracted. By the combinations of extracted features, the performance of various classification algorithms are analyzed and compared.

The step by step procedures of the cervical cancer classification method is illustrated as follows. Table 2 demonstrates the examples of preprocessing steps done in Pap smear using this method. The color images are converted into gray scales and further the segmentation of nucleus and cytoplasm is done in this stage. Table 3 describes the various features set extracted from the cytology images. Tables 4 and 5 provide the performances measures (precision and recall) for all categories with different combination of features set using SVM classifier. The different classifiers used in this experiment are depicted in Table 7. The classification and diagnostic performance (precision) of the SVM classifiers and its comparison with the NN classifiers are summarized in Table 6. The performance metric and ROC curve for all the classifiers are shown in Figure 3. Area under curve for various classifiers is shown in Table 8. The result of 10-fold cross-validation (confusion matrix) is depicted in Table 9.

In our work, the cell samples are collected from the public database which is free of sampling errors. In order to extract the precise features of the cell nucleus and cytoplasm, the segmentation of cell nucleus from cytoplasm relics a demanding issue. Segmentation is done through morphological filling operations, which is found to be better than the other methods in the detection of nucleus. In this work, the 23 unique features were selected and classified into seven sets, including relative size of nuclei and cytoplasm, dynamic range and first four moments of intensities of nuclei and cytoplasm, relative displacement of Nucleus within the cytoplasm, gray level cooccurrence matrix features, local binary pattern histogram, tamura features, and edge orientation histogram. The result shows that, by using the combinational feature set, it is able to classify all seven types of Pap smear images. It shows that all the dysplasia cells and carcinoma in situ classes have higher nuclear proportion and irregular nucleus and these findings are compatible with the human findings.

The output of SVM shows that the best precision for normal squamous (97.38%), intermediate squamous (93.89%), mild dysplasia (87.33%), severe dysplasia (58.52%), and carcinoma in situ (84.72%) is achieved through the combination of (F1, F2, F3, F4, F5, F6, and F7) feature set. With the single feature set F7, the accuracy rate of 89.35% is achieved in columnar type. Likewise the accuracy of 84.10% for moderate dysplasia is achieved through the combination of F4 and F6 feature set. These observations that show no single feature set produce the best results for all classes. Some feature set shows the predominance results for some classes and as an average, the best overall classification performances were obtained through the combination of all the seven feature sets. Again the recall values of SVM classifiers show that the better performance is obtained through combinational feature set.

The very low precision rate of 58.52% is only observed in severe dysplasia class by any combinations. Most of these image types do not follow the classification rules and it even shows the very poor segmentation results. The need of separate set of classification methods and unique feature selection procedures will improve the performance.

The classification performances of various classifiers are shown in Table 6. The first three classifiers are of SVM based kernel types and the fourth one is the SVM based multilayer perceptron. The other three are neural network based, where the first two use single layer neural network with 10 and 30 nodes and the third uses two hidden layers of (10,10) nodes. The results show that the better performance is achieved through linear kernel SVM classifier than any other classifier. In addition, the performance of the classifiers builds using SVM, outperformance of the NN classifier.

In order to evaluate 917 image instances, 10-fold cross-validation has been done, which hold out a percentage of their data set in each fold to properly evaluate the classifier. The results are shown as confusion matrix (Table 9) with the entire set of features. Out of 917 images, 71/74 normal squamous, 65/70 intermediate squamous, 85/98 columnar, 158/182 mild dysplasia, 121/146 moderate dysplasia, 157/197 severe dysplasia, and 137/150 carcinoma in situ are found to be correct. Except severe dysplasia and carcinoma in citu categories, this method provides compromising outputs.

4. Conclusion

In this paper, an improved method for classifying Pap smear images using selected texture image feature is proposed. The study reveals that the method not only helps in classification but also helps in selecting the features which are most suitable for all types of classes. These results show that there is no unique set of feature suitable for all classes. In this classification method, it is concluded from the analysis that the linear kernel SVM classifier out performs than any other classifier.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.