Application of Image Processing Techniques in Molecular Imaging of CancerView this Special Issue
Research Article | Open Access
Head and Neck Cancer Tumor Segmentation Using Support Vector Machine in Dynamic Contrast-Enhanced MRI
Objective. We aimed to propose an automatic method based on Support Vector Machine (SVM) and Dynamic Contrast-Enhanced Magnetic Resonance Imaging (DCE-MRI) to segment the tumor lesions of head and neck cancer (HNC). Materials and Methods. 120 DCE-MRI samples were collected. Five curve features and two principal components of the normalized time-intensity curve (TIC) in 80 samples were calculated as the dataset in training three SVM classifiers. The other 40 samples were used as the testing dataset. The area overlap measure (AOM) and the corresponding ratio (CR) and percent match (PM) were calculated to evaluate the segmentation performance. The training and testing procedure was repeated for 10 times, and the average performance was calculated and compared with similar studies. Results. Our method has achieved higher accuracy compared to the previous results in literature in HNC segmentation. The average AOM with the testing dataset was 0.76 ± 0.08, and the mean CR and PM were 79 ± 9% and 86 ± 8%, respectively. Conclusion. With improved segmentation performance, our proposed method is of potential in clinical practice for HNC.
Head and neck cancer (HNC) is an aggressive cancer at the head and neck region with high incidence in southern China especially in Hong Kong and Guangdong . Medical imaging has been very important in the diagnosis and treatment of HNC. Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is an imaging method in which T1-weighted MRI scans are acquired dynamically after injection of MRI contrast agent, providing information about the characteristics of the physiological procedure. DCE-MRI tracks the diffusion of the contrast agent (a paramagnetic substance, normally Gadolinium-based) over time into the tissue by repeated imaging to reflect hemodynamic information such as the formation and permeability of microvascular in living tumor . The DCE-MRI image stores the time-intensity curve (TIC), which is different among tissues, like cancer, normal soft tissue, bone, and so on. Compared with the traditional MRI images and CT images, the differences in DCE-MRI images among tissues are more characteristic .
The diagnosis and treatment of HNC require accurate tumor lesion segmentation. Regarded as the ground truth, artificial segmentation operated by experienced radiologists is nonetheless time-consuming, and the accuracy is limited by the experience of radiologists. In recent years, automatic segmentation has attracted much attention. Machine learning algorithms have been applied in the segmentation of HNC, such as supervised learning, unsupervised learning, semisupervised learning, and enhanced learning. These automatic segmentation methods may reduce the subjectivity and improve the quality in the segmentation tasks.
Among these methods, Support Vector Machine (SVM), a supervised learning algorithm, has showed great superiority with small sample size of data . In this study we aimed to develop an automatic segmentation method for HNC based on DCE-MRI by using SVM.
2. Materials and Methods
2.1. DCE-MRI Data
In our study, all subjects were recruited from The First Affiliated Hospital, Sun Yat-sen University. DCE-MRI was performed on a 3.0-T system (Magnetom Trio, Siemens) with field of view (FOV) of 22 × 22 × 6 cm (AP × RL × FH), a flip angle of 15°, and scanning time of 6 minute 47 seconds with 65 dynamic scans, 5.9 seconds per scan. The contrast agent gadodiamide Gd-DTPA (Omniscan; Nycomed, Oslo, Norway) was injected intravenously as a bolus into the blood at around the 8th dynamic acquisition using a power injector system (Spectris; Medrad, Indianola, Pennsylvania), immediately followed by a 25-mL saline flush at a rate of 3.5 mL per second. The dose of Gd-DTPA was 0.1 mmol/(kg body weight) for each patient. The reconstructed DCE-MRI images were a 4D matrix (144 × 144 × 20 × 65) with 20 slices.
One hundred and twenty samples of DCE-MRI images containing the HNC tumor lesions were used as our database. Each sample was the DCE-MRI time series of a slice and thus was a 144 × 144 × 65 matrix. Eighty samples were selected randomly as the training dataset while the remaining 40 samples were the testing dataset to verify the accuracy of segmentation.
2.2. Feature Extraction
Before extracting the features from the TIC in the DCE-MRI images, we performed the normalization as where denotes the final normalized TIC, denotes the original TIC, and denotes the average intensity in the first eight scans (before the injection of contrast agent) of .
In several studies some features had already been extracted from DCE-MRI images and successfully applied to classify the tumors from the surrounding tissue [8, 9]. In our study, with the normalized TIC (), the same TIC features were calculated. The maximum intensity was calculated asThe time of reaching the maximum intensity, namely, time to peak, was calculated asThe onset time was defined as the time to reach 10% of the maximum signal intensity after the 8th time point:The wash-in rate was defined as the mean gradient between the two time points of and the maximum intensity:The wash-out rate was defined as the mean gradient between and the 65th time point:
Besides, we also used Principal Component Analysis (PCA)  in this study to extract the principal components of the TIC. We chose the first two components (the eigenvector with the two highest eigenvalues) from PCA results and then multiplied them by the original data to produce two features. These two new features were used in the segmentation tasks.
2.3. SVM Training and Testing
2.3.1. SVM Training
For the training dataset of 80 samples, we firstly carefully drew some rectangular regions of interest (ROIs) for 4 regions, namely, the tumor lesions, the vessels, the normal tissue, and the cavity. This was done by an experienced radiologist (Dr. Wei Deng, 12 years’ experience in Radiology) in ImageJ (National Institutes of Health, Bethesda, MD) and double-checked by another experienced radiologist with 14 years’ experience who were blind to our study. We then calculated the mean TIC curve for the four regions, respectively, in the 80 samples aswhere denotes the mean TIC in this ROI, denotes the TIC of a voxel, and denotes the total number of voxels. Thus, with 80 samples, we obtained 80 × 4 average TICs. We then calculated the 7 features (5 TIC characteristics, and 2 by PCA) for all the 320 TICs. We labeled these features with their corresponding type (tumor, vessel, normal tissue, and cavity). These features and labels formed our training dataset.
For SVM training, we used the MATLAB toolbox libsvm 3.17 (http://www.csie.ntu.edu.tw/~cjlin/libsvm/). After normalized across different samples, the training dataset was used to train three SVM classifiers. We tried and compared between the five curve features and the two PCA features and selected the PCA features in training the SVM classifier for classifying between cavity and the other three tissues, the 5 TIC features in classifying the normal tissue and blood vessels from the other tissues. The radial basis function (RBF) kernel was used in libsvm. The parameters of and in libsvm 3.17 were selected by cross-validation and the grid-search technique.
2.3.2. SVM Testing
Before segmentation, a rectangular ROI was roughly drawn in each of the 40 testing samples. We then applied the three trained classifiers to these ROIs for voxel-by-voxel classification. First, the voxels of vessels were classified by the first classifier. Then the voxels in cavity were also classified by the second classifier. Finally, the voxels in normal tissues were also classified. We removed all the voxels classified above, and thus the tumor lesions were ultimately segmented.
To evaluate the segmentation performance of our method, we compared the automated segmentation results with the ground truth and calculated the area overlap measure (AOM) as where is the segmentation results and is the ground truth. Again, the ground truth for the tumor lesions in these 40 testing samples was manually drawn by an experienced radiologist (Dr. Wei Deng) and double-checked by another experienced radiologist with 14 years’ experience who was blind to our study.
To evaluate the superiority of our proposed method to other studies, the corresponding ratio (CR) and percent match (PM) were also calculated aswhere true positive (TP) denotes the correctly identified tumor region, false positive (FP) denotes the tumor lesion that was incorrectly predicted as nontumor tissue, and the ground truth (GT) denotes the correct tumor region drawn by the radiologist. We repeated the above training and testing for 10 times in order to calculate the mean value of AOM, CR, and PM.
The unnormalized and normalized TICs of four different regions of one sample were shown in Figure 1. As shown, after normalization, the TICs of different regions were well distinguished between each other. Figure 2(a) shows the average original TICs of different regions in a typical training sample, and Figure 2(b) shows the two components selected by PCA.
HNC tumor segmentation by using the proposed method was successfully performed on the 40 testing samples. The mean AOM was 0.76 with standard deviation of 0.08. Figure 3 shows four typical cases of HNC lesion segmentation, including the ground truth in Figure 3(a) and the automated segmentation results in Figures 3(b)–3(e).
The comparison of segmentation performance between our method and the similar studies is summarized in Table 1. By our method, the mean CR was 79 ± 9%, and the mean PM was 86 ± 8%, which were both higher than those in the previous studies.
In this study, a SVM-based method for tumor segmentation in DCE-MRI images of HNC was proposed. Experimental results indicated that this proposed method could effectively segment HNC lesions with high accuracy. We achieved an average AOM of 0.76 ± 0.08. Compared with the SVM-based method proposed in the previous studies , the CR value of 79 ± 9% (72 ± 6%) and the PM value 86 ± 8% (79 ± 7%) in our study were both higher. Compared with other methods about HNC tumor segmentation [5–7], our method also showed higher CR and PM values.
There may be several reasons for better performance of our method. Firstly, the normalized TICs makes the data dimensionless and comparable. As shown in Figure 1, before normalization, the TICs of different tissues especially blood vessel and tumor region are similar, while, after normalization, they are well distinguished and meanwhile the differences of TIC are more obvious.
In addition, the extraction and selection of features are essential in segmentation tasks. We chose the features by using PCA and the features of TIC change for the three classifiers. On the one hand, we found that the classification performance of the PCA features in cavity was more obvious. As shown in Figure 2, by PCA, although only two principal components are shown, the differences in curve variation are still obvious and the computational expense is reduced. On the other hand, we believed that the combination of different SVM classifiers with different features improves the accuracy of segmentation. In our method with three SVM classifiers, blood vessel, cavity, and normal tissue have been classified independently and successively (as shown in Figure 3). As a supervised learning algorithm, SVM has shown a strong learning ability ; thus, with more training samples, the classification performance can be better.
Our study has several limitations. In fact, there is a thin layer of mucosa membrane around the HNC tumor. This tissue might be an obstacle while designing the algorithm based on TIC features, because the TIC is quite similar to the HNC tumor. In the future, we intend to incorporate the high-resolution MRI images for better classification between these two. Another way to improve our method may be the deep learning-based approaches, with which we may obtain more discriminative features and yield improved performance .
We successfully proposed an automatic segmentation method based on SVM for HNC. The results of this study showed that the segmentation performance was superior to previous studies. Our method, if was further verified with more data, is of potential in the clinical practice of HNC patient management.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Wei Deng, Liangping Luo, and Xiaoyi Lin are equal contributors and co-first authors.
The study was jointly funded by The National Natural Science Foundation of China (no. 81301273), Shenzhen High-Caliber Personnel Research Funding (no. 000048), and Shenzhen Municipal Scheme for Basic Research (no. JCYJ20160307114900292; no. JCYJ20160608173106220).
- W. Deng, “Analysis of the incidence and mortality of nasopharyngeal carcinoma in China between 2003 and 2007,” Tumor, vol. 32, no. 3, pp. 189–193, 2003.
- A. R. Padhani, “Dynamic contrast-enhanced MRI in clinical oncology: Current status and future directions,” Journal of Magnetic Resonance Imaging, vol. 16, no. 4, pp. 407–422, 2002.
- C. A. Cuenod and D. Balvay, “Perfusion and vascular permeability: basic concepts and measurement in DCE-CT and DCE-MRI,” Diagnostic and Interventional Imaging, vol. 94, no. 12, pp. 1187–1204, 2013.
- V. N. Vapnik, “An overview of statistical learning theory,” IEEE Transactions on Neural Networks, vol. 10, no. 5, pp. 988–999, 1999.
- K.-W. Huang, Z.-Y. Zhao, Q. Gong, J. Zha, L. Chen, and R. Yang, “Nasopharyngeal carcinoma segmentation via HMRF-EM with maximum entropy,” in Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2015, pp. 2968–2972, Milan, Italy, August 2015.
- P. Ritthipravat, C. Tatanun, T. Bhongmakapat, and L. Tuntiyatorn, “Automatic segmentation of nasopharyngeal carcinoma from CT images,” in Proceedings of the BioMedical Engineering and Informatics: New Development and the Future - 1st International Conference on BioMedical Engineering and Informatics, BMEI 2008, pp. 18–22, Sanya, China, May 2008.
- J. Zhou, K. L. Chan, P. Xu, and V. F. H. Chong, “Nasopharyngeal carcinoma lesion segmentation from MR images by support vector machine,” in Proceedings of the 2006 3rd IEEE International Symposium on Biomedical Imaging: From Nano to Macro, pp. 1364–1367, Arlington, VA, USA, April 2006.
- N. F. Haq, P. Kozlowski, E. C. Jones, S. D. Chang, S. L. Goldenberg, and M. Moradi, “A data-driven approach to prostate cancer detection from dynamic contrast enhanced MRI,” Computerized Medical Imaging and Graphics, vol. 41, pp. 37–45, 2015.
- H. Akbari, L. Macyszyn, X. Da et al., “Pattern analysis of dynamic susceptibility contrast-enhanced MR imaging demonstrates peritumoral tissue heterogeneity,” Radiology, vol. 273, no. 2, pp. 502–510, 2014.
- H. Abdi and L. J. Williams, “Principal component analysis,” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 4, pp. 433–459, 2010.
- Y. Bengio, A. Courville, and P. Vincent, “Representation learning: a review and new perspectives,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 8, pp. 1798–1828, 2013.
Copyright © 2017 Wei Deng et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.