Wireless capsule endoscopy (WCE) is a new noninvasive instrument which allows direct observation of the gastrointestinal tract to diagnose its relative diseases. Because of the large number of images obtained from the capsule endoscopy per patient, doctors need too much time to investigate all of them. So, it would be worthwhile to design a system for detecting diseases automatically. In this paper, a new method is presented for automatic detection of tumors in the WCE images. This method will utilize the advantages of the discrete wavelet transform (DWT) and singular value decomposition (SVD) algorithms to extract features from different color channels of the WCE images. Therefore, the extracted features are invariant to rotation and can describe multiresolution characteristics of the WCE images. In order to classify the WCE images, the support vector machine (SVM) method is applied to a data set which includes 400 normal and 400 tumor WCE images. The experimental results show proper performance of the proposed algorithm for detection and isolation of the tumor images which, in the best way, shows 94%, 93%, and 93.5% of sensitivity, specificity, and accuracy in the RGB color space, respectively.

1. Introduction

Gastrointestinal (GI) tract diseases are so common all over the world. Therefore, the possibility of direct observation of the GI tract is beneficial for treatment of these patients. None of the traditional methods like endoscopy and colonoscopy can reach the small intestine which is the mean body of the GI tract. This limitation occurs due to the complex structure of the small intestine. Wireless capsule endoscopy is a new instrument which helps doctors achieve this purpose and alleviates the pain caused by the traditional endoscopy for the patients. As a result, it is widely used in hospitals. WCE video data identification is done by physicians, which takes a long time due to the large number of images. The average number of WCE images in each experiment is about 50,000 which takes around two hours to be reviewed and analyzed [1, 2]. Furthermore, some abnormalities appear just in one or two frames of the video which might be missed by physicians. On the other hand, variation of abnormalities in size, color, and distribution makes their detection too difficult by naked eyes. All these problems motivate researchers to develop other reliable systems to reduce the great burden of physicians. However, it should be considered that this goal is very challenging because the appearance of varied diseases is so different and the same diseases have lots of variations in color, shape, and size [3, 4].

Tumor is one of the GI tract diseases which greatly threaten human health [5]. In case of early detection, this abnormality can be prevented and cured easily; therefore, many studies have been done to propose a computer aided detection (CAD) system to identify tumors in the WCE images. Karkanis et al. utilized color wavelet covariance features and a linear discriminant analysis classifier to detect tumors in the colonoscopy videos. This feature is based on the covariance of the second-order textural measures of the different color bands [6]. Moreover, Li used the wavelet transform and uniform local binary pattern (ULBP) as the texture features for detection of tumors in the WCE images [7, 8]. Martins et al. utilized the advantage of high directional sensitivity of the discrete curvelet transform and the Gaussian mixture model as a classifier to detect tumors in the WCE images [9].

In this work, a new algorithm is proposed which exploits color based texture features and SVM to detect tumor in the WCE images. The proposed method integrates advantages of DWT and SVD to characterize the normal and tumor WCE images. DWT is used for multiresolution image analysis as a preprocessing step. In the following step, the SVD based features which are invariant against rotation are extracted. Finally, SVM which has a better ability to classify high dimensional feature vectors is used to classify the WCE images. The experimental results from the presented data set show that this new scheme achieves an encouraging performance in tumor detection.

This paper is organized as follows. The new wavelet based SVD color texture feature extraction for the WCE images is presented in Section 2. In Section 3, SVM, which was used to classify our data set, is briefly introduced. Section 4 presents the experimental results in detail, including the detection evaluation. Finally, the conclusion will be given in Section 5.

2. Texture Features

Due to development in the image analysis methods, the normal and abnormal regions in the WCE images can be distinguished properly. In the present work, we attempt to design a CAD system to recognize tumor in the WCE images. Tumor and normal WCE images have different color and texture features as shown in Figures 1 and 2. These different appearances motivate us to select color texture feature as an image analyzer. This texture is reddish compared to the normal ones [10] due to the increment of blood capillary density in the tumor. Also, the inflation and inflammation surface of tumors can isolate them from the normal textures [11]. The most important problems in this identification are the time varying illumination environment and the great variety in tumor appearances which can be observed in Figure 1 [3]. Therefore, the CAD system should be powerful enough to overcome these problems.

To have a local analysis, the input WCE images are divided into nine patches according to Figure 3. Since the size of the input image is 256 × 256 pixels, the number of 128 × 128 overlapping patches is set to three in both directions. For better image analysis, DWT is applied to each patch to have multiresolution and zooming ability. In addition, the SVD based features which are invariant to rotation are extracted from the created subpatch. Finally, SVM is used to classify the WCE images because of its better ability to classify high dimensional feature vectors. The methodology of the proposed CAD system is shown in Figure 4.

2.1. Discrete Wavelet Transform

Wavelet transform is a powerful tool for multiresolution image analysis [12]. Also, zooming ability and local characterization of the wavelet transform provide a much efficient analysis of the WCE images which contain different scale information. DWT can decompose an image into different subbands with both time information and frequency information that are implemented with a separable filter-bank [13]. To create decomposed subimages that are shown in Figure 5, an image is convoluted with a low pass filter (L) and a high pass filter (H). The LL subimage, which is called approximation component, is obtained from low pass filtering in both directions. This subimage is very similar to the original image and is used to generate second-level decomposition. The other three subimages are called detailed components. HL is derived from low pass filtering along the vertical direction and high pass filtering along the horizontal direction. The remaining LH and HH subimages can be explained similarly [8, 14]. In this study, we only apply the two-level DWT to each color channel of the WCE images.

2.2. Singular Value Decomposition

Singular value decomposition is a tool for exploratory data analysis which dissociates the data matrix as follows [15]: where is an orthonormal matrix whose columns are called the left singular vectors or the gene coefficient vectors of . Also, is an orthonormal matrix whose columns are called the right singular vectors or the expression level vectors of . Matrix is an diagonal matrix which is represented as

The diagonal elements of matrix are called the singular values of which are used to create the feature vectors [16]. If the input image size is 256 × 256, the singular value size will be 1 × 256.

In the created subimages, instead of the normal part, there are more texture details in tumors; therefore, the corresponding singular values are bigger than the other parts. This texture difference is used to identify tumors in the WCE images. As mentioned before, textural features are better analyzed in detailed subimages [17], so we use , , and () as the bases for the textural feature analysis. Since the WCE images are color ones, each color channel is processed independently. To decrease the feature vector dimension, we compare the accuracy of three color channels in each level of DWT and select the biggest one to extract the SVD based features. Accordingly, the feature vector dimension for each WCE image using the proposed method is , as illustrated in Figure 6.

In this work, RGB (red, green, and blue) and HSV (hue, saturation, and value) color spaces are used to find the best color space for our feature extraction method. The RGB model is the most convenient color space in image and video processing applications because of the high correlation among the RGB channels [18]. On the other hand, the HSV color model separates color information into chromaticity and intensity, and it is invariant to illumination direction and illumination intensity [19]. Also, the HSV color space is very close to the perceived colors by the human being [6]. These properties have made the HSV model less sensitive to illumination variation which is a common problem of the WCE images as the battery weakens over time.

3. Support Vector Machine

Support vector machine as an optimal marginal classifier is developed by Vapnik [20]. SVM has been successfully employed in lots of applications providing satisfying results compared to the other methods [7]. The SVM algorithm processes data by minimum classification structural error and good ability to tackle high dimensional problems. As previously mentioned, the SVM classifier, with strong generalization capability, is able to overcome the problem of having great variations in tumor appearances in the WCE images [21].

SVM uses a training set to build a binary classifier which separates the positive and negative samples. The training set is composed of input vectors of and the corresponding target values of for (). The aim is to find a function which correctly classifies the test data as a positive or negative sample.

The SVM algorithm finds a hyperplane with maximized separation margin which is defined by the following equation:where is a nonlinear mapping from the input space to the bigger dimension feature space which has less overlap between the two classes and is a scalar which can be estimated from the Karush-Kuhn-Tucker complementary condition. The maximized margin hyperplane is calculated by the following equation [20]: where is the Lagrange multiplier that can be calculated by maximization of where is a kernel function which is implemented by the linear, polynomial, sigmoid, and radial basis functions (RBF) [22]. The test input vector is applied to the SVM function and it returns +1 for positive samples and −1 for negative ones with regard to the equation below:

4. Results

For performance evaluation of the proposed algorithm, we use an experimental data set which contains 400 tumor images from 6 pathological cases with different tumor appearances and 400 normal images from 6 normal pathological cases. These images are obtained from PillCam Company (http://www.givenimaging.com/). The resolution of these images is 256 × 256 pixels. In addition, to obtain reliable results, some WCE images with a few bubbles and feces, similar to the ones in Figure 7, have been added to our data set.

To extract the performance of the proposed method, each image is manually labeled as a positive sample (tumor image) or a negative one (normal image) to create the ground truth. In this case, 300 normal samples and 300 tumor samples are used as the training set, while the remaining 100 normal samples and 100 tumor ones are used as the test set. This procedure is repeated four times in order to prevent overfitting of the classification results by exchanging the training and testing images. At last, the average recognition rates are used to demonstrate the performance of the proposed method. Success of the WCE image identification is measured due to its sensitivity, specificity, and accuracy which can be explained as follows [68]:

Haar wavelet is utilized in this work for implementing DWT because of its superior discriminating power which is illustrated in [23]. Also, to achieve the optimal classification performance, RBF kernel function is used for implementation of SVM.

The performance of the proposed two-level wavelet based SVD features for automatic tumor detection in the RGB color channels is shown in Tables 1, 2, and 3, independently. According to the results, the utmost accuracy is obtained in the first and second levels of DWT from the R and G color channels, respectively, which are used to extract features in the proposed method. The efficiency of the red channel is due to the fact that tumor has more blood vessels compared to the normal textures. Also, the best recognition rate of the proposed method in the first and second levels of DWT in the HSV color channels is obtained from the value channel. These are shown in Tables 4, 5, and 6. Finally, to calculate the performance of the proposed method, the selective color channel is utilized for the first and second levels of DWT to extract the corresponding singular values to create the feature vector. The performance of the proposed algorithm in the RGB and HSV color spaces is shown in Table 7.

To evaluate the performance of the proposed method, it is compared with the wavelet based ULBP which is the latest similar feature proposed by Li and Meng [7] to be used for tumor detection. SVM is used as a classifier for this feature. Tables 8 and 9 show the recognition rates obtained from implanting the wavelet based ULBP in the RGB and HSV color spaces with the current data set.

Finally, the extracted results from the proposed algorithm and two-level wavelet based ULBP in the RGB and HSV color models are compared in Figure 8. The results indicate that the presented SVD based algorithm has a better performance in automatic tumor detection in the WCE images in terms of sensitivity, specificity, and accuracy compared to the two-level wavelet based ULBP algorithm in both color spaces. The highest accuracy obtained from the proposed algorithm is 93.5% in the RGB color model while the best recognition rate extracted from employing the two-level DWT + ULBP is 88.5% in the same color model. High accuracy obtained by the proposed CAD system demonstrates that the system can overcome the problem of the great variety of tumor appearances in the time varying illumination environments. Better recognition rate of the proposed method for tumor detection in the WCE images is due to the fact that SVD is a better tool to analyze data from different appearance textures. On the other hand, local analysis of the WCE images can help in improving the performance of the proposed method.

5. Conclusions

A new CAD system has been proposed in this paper for automatic tumor detection in the WCE images. This method locally investigated the input WCE images to extract features. The advantage of the proposed system is based on a new feature extraction method with multiresolution and rotation invariant analysis ability which exploits wavelet based singular value decomposition. In addition, support vector machine with high dimensional classification ability is used to distinguish between the normal and tumor samples in the WCE images. The experimental results from the presented data set which contains 800 WCE images show that this scheme is more powerful in tumor detection. The best recognition rates of 94%, 93%, and 93.5% are achieved in terms of sensitivity, specificity, and accuracy using the two-level selective wavelet based SVD feature extractor in the RGB color space.

Competing Interests

The authors declare that they have no competing interests.