Abstract

Detecting mismatch-repair (MMR) status is crucial for personalized treatment strategies and prognosis in rectal cancer (RC). A preoperative, noninvasive, and cost-efficient tool for predicting MMR status is critically needed. This study therefore developed and validated machine learning radiomics models for predicting MMR status directly from preoperative MRI scans. Pathologically confirmed RC cases treated by surgical resection in two hospitals were examined in this retrospective study. In total, 78 and 33 cases were included in the training and test sets, respectively, and 65 cases were enrolled as an external validation set. Radiomics features were obtained from preoperative rectal MR images comprising T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI), contrast-enhanced T1-weighted imaging (CE-T1WI), and the combined multisequence data. Four optimal features related to MMR status were selected by the least absolute shrinkage and selection operator (LASSO) method. Support vector machine (SVM) learning was adopted to establish four predictive models, i.e., ModelT2WI, ModelDWI, ModelCE-T1WI, and Modelcombination, whose diagnostic performances were determined and compared by receiver operating characteristic (ROC) curves and decision curve analysis (DCA). Modelcombination had better diagnostic performance than the other models in all datasets (all ). DCA confirmed the usefulness of the proposed model. This pilot study therefore showed that a radiomics model combining multiple sequences derived from preoperative MRI is effective in predicting MMR status in RC cases.

1. Introduction

Rectal cancer (RC) is a major gastrointestinal malignancy worldwide, with steadily increasing incidence and mortality [1–3]. Immune checkpoint inhibitors (ICIs) have become a crucial therapeutic option for improving prognosis in several solid tumors [4, 5]. Previous clinical trials have shown that microsatellite instability (MSI) and/or mismatch-repair deficiency (dMMR) are significant tissue-agnostic molecular markers for predicting the efficacy of ICIs [6–9]. Because genetic and immunohistochemical (IHC) tests for MMR deficiency are available, pembrolizumab and nivolumab, as monotherapies or combined with ipilimumab, have been approved by the US Food and Drug Administration (FDA) for treating chemoresistant MSI/dMMR metastatic colorectal cancer (mCRC) [10].

Accurate prediction and diagnosis of MMR status in patients with RC is important for treatment planning and prognostic evaluation. Although laboratory genetic testing and tissue biopsy have been applied to assess the expression of MMR proteins in RC, including MLH1, MSH2, MSH6, and PMS2, these approaches are costly, invasive, and/or time-consuming [11, 12]. More importantly, since different parts of the tumor could have distinct MMR expression levels, MR imaging of the whole tumor may capture this heterogeneity better than a needle biopsy of a single tumor component.

Radiomics, a novel noninvasive tool, has been widely used for pretreatment assessment and for predicting treatment outcome, distant metastasis, and local recurrence in RC, providing important details of tissue characteristics that are inaccessible to the human eye [13–16]. The radiomics approach was inspired by the notion that medical images contain considerable information reflecting underlying pathophysiological properties, which can be revealed through quantitative analysis of digital medical images of the whole tumor. Since not all patients undergo genomic tests, radiogenomics is valuable because most individuals undergo imaging examinations during the course of disease. Radiomics data are derived from the complete tumor rather than only a tissue sample and could provide information on gene expression or mutation to increase diagnostic, predictive, and prognostic capabilities, enabling precision therapy [17–20]. However, the prognostic and predictive value of MRI-based radiomics for evaluating MMR status preoperatively in RC still deserves further attention.

Therefore, this study focused on the radiomics features of RC, aimed at assessing the value of radiomics models derived from multiparametric MR imaging for preoperatively predicting MMR status.

2. Materials and Methods

2.1. Patients

This study was approved by the Committee on Ethics of Changhai Hospital and Ruijin Hospital Luwan Branch, Shanghai, China. Informed consent was not required because of the retrospective design.

Patients with pathologically confirmed RC who underwent rectal MRI and surgical resection at Changhai Hospital from January 2018 to December 2019 were enrolled into the training and test sets. Patients meeting the same eligibility criteria at Ruijin Hospital Luwan Branch from January to December 2020 were enrolled into the validation set (external validation cohort).

Inclusion criteria were as follows: (1) pathologically diagnosed RC examined by MRI, (2) baseline MRI exam within 2 weeks before surgical resection, (3) immunohistochemical test for MMR after surgery, and (4) single focal lesion. Exclusion criteria were as follows: (1) palliative resection; (2) previous pelvic surgery, radiation therapy, chemotherapy, or chemoradiotherapy; (3) image quality unsuitable for tumor segmentation; and (4) hereditary colorectal cancer syndrome. In total, 111 and 65 patients were eventually included in the Changhai and Ruijin cohorts, respectively (Figure 1). A random number technique () was then used to assign 70% and 30% of the cases in the Changhai cohort to the training and test sets, respectively.
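The 70%/30% random assignment can be illustrated with a minimal sketch in plain Python. The seed value, the function name `split_cohort`, and the integer case identifiers are assumptions made for illustration; the source does not preserve the random number setting that was actually used.

```python
import random

def split_cohort(case_ids, train_fraction=0.7, seed=0):
    """Randomly assign cases to training and test sets.

    `seed` is a hypothetical value, not the one used in the study.
    """
    rng = random.Random(seed)       # fixed seed makes the split reproducible
    shuffled = list(case_ids)
    rng.shuffle(shuffled)
    n_train = round(train_fraction * len(shuffled))
    return shuffled[:n_train], shuffled[n_train:]

changhai_cases = list(range(111))   # placeholder identifiers for 111 cases
train_ids, test_ids = split_cohort(changhai_cases)
print(len(train_ids), len(test_ids))  # 78 33
```

With 111 cases, rounding 70% gives 78 training cases and leaves 33 test cases, which matches the set sizes reported for the Changhai cohort.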

Baseline clinical information was collected, including age, gender, BMI, presurgical carcinoembryonic antigen (CEA) and carbohydrate antigen 19-9 (CA19-9) levels, and distant metastasis. A radiologist (G.J.) with 10 years of experience obtained the data from medical records.

2.2. Image Acquisition

After fasting for 4 hours, the patients received an enema with 20 ml of glycerin prior to MR scanning. Raceanisodamine hydrochloride was not used because of potential contraindications.

Routine rectal MRI sequences were acquired on a 1.5 T or 3.0 T MR scanner, i.e., oblique axial high-resolution T2WI without fat suppression, sagittal T2WI, axial diffusion-weighted imaging (DWI; b values = 0 and 1000 s/mm2), axial T1-weighted imaging (T1WI), and gadolinium contrast-enhanced T1WI (CE-T1WI) in the sagittal, coronal, and axial planes. CE-T1WI scans were obtained 1 min after intravenous injection of Gd-DTPA (Beilu Pharmaceutical, China) at 2 ml/s with a high-pressure syringe, followed by a saline flush (20 ml at 2 ml/s). The parameters applied for the above sequences are listed in Supplemental Table 1.

2.3. Pathological Evaluation

Mismatch-repair (MMR) status was determined on surgical specimens by immunohistochemical staining of four MMR proteins: MLH1, MSH2, MSH6, and PMS2. Deficiency in any of these proteins was defined as dMMR. Based on the National Comprehensive Cancer Network and American Joint Committee on Cancer (AJCC) TNM system (8th Edition) [21], two pathologists with more than 10 years of work experience determined the tumor’s TN stage, histological type, differentiation status, tumor deposit, lymphovascular invasion, perineural invasion, tumor budding, and circumferential resection margin (CRM) from hematoxylin and eosin- (H&E-) stained sections. In case of discrepancy, the two examiners reached a consensus through discussion.

2.4. Image Segmentation

The original DICOM images were imported into the Radcloud radiomics platform (Huiying Medical Technology, China). Because the two hospitals used different MRI systems, the images were normalized for homogeneity using the following formula:

$$f(x) = \frac{s\,(x - \mu_x)}{\sigma_x}$$

where $f(x)$ is the normalized intensity, $x$ is the original intensity, $\mu_x$ is the mean of the image intensity values, $\sigma_x$ is the standard deviation of the image intensity values, and $s$ is an optional scaling factor, which is set to 1 by default.
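The normalization described above (the optional scaling multiplied by the deviation of each intensity from the image mean, divided by the image standard deviation) can be sketched in plain Python. This is an illustration, not the Radcloud implementation: the function name, the toy intensities, and the use of the population standard deviation are assumptions.

```python
from statistics import mean, pstdev

def normalize_intensities(voxels, scale=1.0):
    """Return scale * (x - mean) / std for every voxel intensity x."""
    mu = mean(voxels)
    sigma = pstdev(voxels)          # population standard deviation (assumed)
    if sigma == 0:
        raise ValueError("constant image: standard deviation is zero")
    return [scale * (x - mu) / sigma for x in voxels]

toy_image = [100.0, 120.0, 140.0, 160.0, 180.0]   # made-up intensities
normalized = normalize_intensities(toy_image)
```

After this step every image has zero mean and unit standard deviation (for a scaling of 1), which is what makes intensities from different scanners comparable.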

Regions of interest (ROIs) in all RC cases were manually delineated slice-by-slice along the clearest solid border that best fitted the lesion area, excluding the blurry margin, on each of these three sequences (T2WI, CE-T1WI, and DWI with  s/mm2). Then, volumes of interest (VOIs) were derived from the obtained ROIs. All images were processed by 2 experienced radiologists (Z.L. and H.L., with 11 and 6 years of experience in abdominal imaging, respectively) in an independent manner, blinded to group assignment. All segmentations were checked by one senior radiologist (F.S., who had 12 years of experience in rectal MRI).

2.5. Radiomics Feature Selection

Based on the whole-tumor VOI segmentations, radiomics features were extracted with the above platform from the T2WI, DWI, CE-T1WI, and combined sequences. Four types of features were obtained: (1) first-order statistics, including peak and mean value (with variance), quantifying the distribution of voxel intensities on MR scans; (2) shape properties, including volume, lesion area, and sphericity, reflecting the 3D shape and size of the delineated area; (3) texture features, i.e., gray-level co-occurrence, run length, size zone, and neighborhood gray-tone difference matrices, quantifying the heterogeneity of a given area; and (4) higher-order statistics, encompassing first-order statistics and texture features computed after a transform [13–15], e.g., logarithm, gradient, square, and wavelet transform.

In the training cohort, inter- and intraclass correlation coefficients (ICCs) were determined to assess feature robustness. Features with inter- and intraobserver ICCs above 0.8 were further analyzed. Next, the variance threshold, select-K-best, and least absolute shrinkage and selection operator (LASSO) algorithms were employed to choose the optimal features. In addition, the synthetic minority oversampling technique (SMOTE) was used to address class imbalance in the training cohort. SMOTE is an improved sampling strategy in which each new synthetic sample of the minority class is computed from existing minority samples and their nearest neighbors, as measured by the Euclidean distance between feature vectors.
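The idea behind SMOTE can be sketched as follows: a synthetic minority sample is placed at a random position on the line segment between a minority sample and one of its nearest minority neighbors in Euclidean distance. This is a simplified illustration in plain Python, not the implementation used in the study; the function name, `k`, `n_new`, `seed`, and the toy feature vectors are all assumptions.

```python
import math
import random

def smote_sample(minority, k=2, n_new=4, seed=0):
    """Create synthetic minority samples by interpolating towards neighbours.

    k, n_new and seed are illustrative values, not the study's settings.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of `base` by Euclidean distance
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        target = rng.choice(neighbours)
        gap = rng.random()          # position along the segment base -> target
        synthetic.append([b + gap * (t - b) for b, t in zip(base, target)])
    return synthetic

dmmr_features = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy data
new_samples = smote_sample(dmmr_features)
```

Because every synthetic point is an interpolation between two real minority samples, the new samples stay inside the region already occupied by the minority class rather than being exact duplicates.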

2.6. Radiomics Model Establishment

The Radcloud platform was used for SVM learning with the “scikit-learn” package in Python (v0.24.1, https://scikit-learn.org/stable/). Based on the optimal features related to MMR status, an SVM with a linear kernel was employed to construct 4 predictive models: (1) ModelT2WI, (2) ModelDWI, (3) ModelCE-T1WI, and (4) Modelcombination, the last using optimal features extracted from the T2WI, DWI, and CE-T1WI sequences in combination. The penalty coefficient with the best performance was used to train the final SVM model. To guard against overfitting, the validation set was employed for verifying and comparing the performances of the final models. Figure 2 depicts the radiomics workflow.
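To make the role of the penalty coefficient concrete, the following minimal sketch trains a linear SVM by subgradient descent on the regularized hinge loss. It is a teaching illustration in plain Python and is not the scikit-learn solver used in the study; the function names, learning rate, epoch count, and toy data are assumptions.

```python
def train_linear_svm(X, y, C=1.0, lr=0.01, epochs=200):
    """Subgradient descent on 0.5*||w||^2 + C * sum(hinge losses).

    Labels must be +1 / -1. C plays the role of the penalty coefficient.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        grad_w = list(w)            # gradient of the 0.5*||w||^2 term
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:          # sample violates the margin
                grad_w = [g - C * yi * xj for g, xj in zip(grad_w, xi)]
                grad_b -= C * yi
        w = [wj - lr * g for wj, g in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def predict(w, b, x):
    """Class label from the sign of the linear decision function."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

X_toy = [[2.0, 2.0], [3.0, 3.0], [-2.0, -2.0], [-3.0, -3.0]]  # toy features
y_toy = [1, 1, -1, -1]                                        # toy labels
w, b = train_linear_svm(X_toy, y_toy)
predictions = [predict(w, b, x) for x in X_toy]
```

A larger C penalizes margin violations more heavily relative to the norm of the weights, so tuning it trades training fit against margin width; this is why the penalty coefficient is selected by performance before the final model is trained.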

2.7. Statistical Analysis

Continuous variables were assessed for normality by the Kolmogorov-Smirnov test, and group comparisons were performed by the t-test or the Wilcoxon test. Categorical variables were compared by the Chi-square or Fisher’s exact test. The variance threshold method was applied for selecting radiomics features (variance threshold = 0.8), removing features whose variance was below 0.8. For the select-K-best method, which uses P values to analyze the associations of radiomics features with MMR status, features with  were used. For the LASSO algorithm, an L1 regularizer was used in the cost function, and the optimal regularization parameter was derived from the minimum of the average mean square error under 5-fold cross-validation with 1000 iterations. Radiomics features with a nonzero coefficient in LASSO were retained, and a score was computed for each patient by linearly combining the retained features multiplied by their corresponding coefficients. Receiver operating characteristic (ROC) curve analysis was carried out to evaluate each model by deriving the area under the ROC curve (AUC) and determining sensitivity, specificity, and accuracy. ROC curves were compared with the DeLong test. Decision curve analysis (DCA) was applied to assess the net benefit of each model. The nomogram was analyzed with R v3.6.3. Other data were analyzed with SPSS 20.0 (SPSS, USA) and MedCalc 19.6.1.  was deemed statistically significant.
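As a reminder of what the AUC measures, it equals the probability that a randomly chosen positive case receives a higher model score than a randomly chosen negative case, with ties counted as one half. The sketch below computes it directly from that definition in plain Python; the function name and the toy scores and labels are made up for illustration and are not study data.

```python
def roc_auc(scores, labels):
    """AUC as the probability that a positive case outranks a negative one."""
    positives = [s for s, l in zip(scores, labels) if l == 1]
    negatives = [s for s, l in zip(scores, labels) if l == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5         # ties count as half a win
    return wins / (len(positives) * len(negatives))

toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]   # made-up model outputs
toy_labels = [1, 1, 1, 0, 0, 0]               # 1 = dMMR, 0 = pMMR
auc = roc_auc(toy_scores, toy_labels)
```

In this toy example eight of the nine positive-negative pairs are ranked correctly, so the AUC is 8/9. The pairwise form is quadratic in the number of cases, which is acceptable for cohorts of this size.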

3. Results

3.1. Patient Features

In total, 111 and 65 individuals were finally enrolled in the Changhai and Ruijin cohorts, respectively. Table 1 lists the patient features, which were similar in both cohorts (Supplemental Table 2). Based on MMR status determined by postsurgical pathological assessment, 20/111 individuals (18.0%) were categorized as dMMR in the Changhai cohort, versus 11/65 (16.9%) in the Ruijin cohort (). Subsequently, 78 (70.3%) and 33 (29.7%) cases in the Changhai cohort were assigned to the training and test sets, respectively.

3.2. Radiomics Features

In total, 1409 radiomics features were obtained from each of the T2WI, DWI, and CE-T1WI data. Of these, 1270/1409 (90.1%), 1232/1409 (87.4%), and 1268/1409 (90.0%) features from T2WI, DWI, and CE-T1WI, respectively, had inter- and intraobserver ICCs above 0.8 and were further examined. Next, 429, 466, and 406 features, respectively, were selected by the subsequent variance threshold and select-K-best algorithms. Eventually, two, five, and ten optimal features associated with MMR status were determined with the LASSO algorithm from T2WI, CE-T1WI, and DWI data, respectively (Supplemental Table 3). The combination of T2WI, CE-T1WI, and DWI yielded four features screened from 3770 features for predicting MMR status. Details are presented in Table 2. A heat map showed the differing distribution of the selected features between the dMMR and pMMR groups (Figure 3).

3.3. Radiomics Models

In the training population, ModelT2WI, ModelDWI, ModelCE-T1WI, and Modelcombination had diagnostic performances reflected by AUCs between 0.670 and 0.910 (Figure 4(a)). Modelcombination achieved the best diagnostic performance (; ). In the test set, the four models had diagnostic performances reflected by AUCs between 0.568 and 0.901 (Figure 4(b)). Modelcombination had the best performance (; ).

While validating the radiomics models, Modelcombination had the best diagnostic performance (AUC, 0.874; sensitivity, 90.9%; specificity, 81.5%; accuracy, 83.1%) in the validation set (Figure 4(c)). These findings indicated Modelcombination had improved discrimination performance in comparison with other models () in all datasets. Table 3 presents the detailed findings.
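The three validation-set percentages can be cross-checked against each other with a short calculation. The confusion-matrix counts below are inferred from the reported figures (65 validation cases, 11 of them dMMR) and are not stated in the text, so they should be read as an assumption; the function name is also illustrative.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Sensitivity, specificity and accuracy from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy

# Counts inferred from the reported percentages (65 cases, 11 dMMR);
# they are not given in the source.
sens, spec, acc = diagnostic_metrics(tp=10, fn=1, tn=44, fp=10)
print(f"{sens:.1%} {spec:.1%} {acc:.1%}")  # 90.9% 81.5% 83.1%
```

Under these inferred counts, 10 of 11 dMMR cases and 44 of 54 pMMR cases are classified correctly, which reproduces the reported sensitivity, specificity, and accuracy.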

3.4. Decision Curve Analysis

DCA demonstrated adequate performance for Modelcombination in distinguishing dMMR lesions from pMMR counterparts in the validation cohort. Modelcombination provided a higher net benefit than the other models across a wide range of threshold probabilities (Figure 5), suggesting that the multiparametric MRI approach had greater clinical utility than the single-sequence models.
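For readers unfamiliar with decision curves, the net benefit at a threshold probability pt is conventionally computed as TP/n − (FP/n) × pt/(1 − pt), where cases with predicted risk at or above pt are treated as positive. The sketch below evaluates this in plain Python for a toy model and for the treat-all strategy; the function name and all values are made up for illustration and are not study data.

```python
def net_benefit(probabilities, labels, threshold):
    """Net benefit of treating every case whose predicted risk >= threshold."""
    n = len(labels)
    tp = sum(1 for p, l in zip(probabilities, labels)
             if p >= threshold and l == 1)
    fp = sum(1 for p, l in zip(probabilities, labels)
             if p >= threshold and l == 0)
    # False positives are weighted by the odds of the threshold probability.
    return tp / n - (fp / n) * threshold / (1 - threshold)

toy_probs = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]    # made-up predicted risks
toy_labels = [1, 1, 1, 0, 0, 0]               # 1 = dMMR, 0 = pMMR
model_nb = net_benefit(toy_probs, toy_labels, threshold=0.5)
treat_all_nb = net_benefit([1.0] * 6, toy_labels, threshold=0.5)
```

A decision curve plots this quantity over a range of thresholds; a model is considered useful where its curve lies above both the treat-all and the treat-none (net benefit of zero) strategies.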

4. Discussion

Here, a machine learning model based on the combination of multiple MRI sequences constituted an effective, noninvasive, novel imaging approach for evaluating MMR status in RC cases, and it was tested on an external validation set acquired with different MRI scanners and scanning conditions.

High microsatellite instability (MSI) results from malfunction of the MMR system and accounts for about 3–5% of metastatic colorectal cancers (mCRCs) [22–25]. Detecting MMR status in rectal cancer is important in clinical decision making because it identifies individuals with differential treatment response and prognosis. Studies have shown that standard neoadjuvant chemoradiotherapy (nCRT) might be less effective in dMMR RC patients than in patients with proficient mismatch repair (pMMR) [24]. This might lead to a modification of clinical practice that avoids unnecessary nCRT in poor responders, and dMMR could be used as a biomarker to guide clinical immunotherapy.

Although the NCCN guideline recommends neoadjuvant chemoradiotherapy (CRT) for most patients with locally advanced RC (LARC), in clinical practice the decision to administer perioperative CRT is at the discretion of the surgeon, oncologist, and patient. If CRT is not performed before surgery, postoperative CRT can be considered. If MMR status were known before treatment, management and treatment strategies could be tailored by identifying LARC patients who would benefit from adjuvant therapy or who are unlikely to respond well to nCRT.

Universal immunohistochemical testing for evaluating MMR status is recommended. However, because this approach is relatively costly and time-consuming, some patients remain untested [26]. Therefore, broadly available, low-cost, and noninvasive methods are urgently needed to help select patients for evaluation. More importantly, since different parts of the tumor could have distinct MMR expression levels, an imaging approach covering the whole tumor may capture this heterogeneity better than a needle biopsy of a single tumor component. This study investigated a machine learning-based model for automatically predicting MMR status directly from MR images.

In comparison with routine imaging strategies, radiomics substantially improves disease diagnosis, tumor grading, and prognostic evaluation, providing comprehensive guidance for treatment planning [16–18]. A previous study demonstrated that an SVM model showed good classification performance related to pathological features in patients with RC [13]. With continuous technological progress, the concept of “radiogenomics” has recently been widely applied to tumors. By extracting multiple quantitative parameters from imaging findings, combining them with genomics data, and mining the associations between both data types, radiogenomics can retrieve quantitative image information that reflects gene expression, allowing a deeper understanding of the occurrence and development of tumors through noninvasive, conventional imaging methods [27].

Several studies have reported correlations between radiomics features and MSI status in colorectal cancer based on CT images [28–31]. Meanwhile, only a few recently published studies have developed MRI-based radiomics models for predicting MSI status preoperatively in rectal cancer [32, 33]. Although the latter reports found that radiomics models show great potential in predicting MSI status, no study with external validation has yet examined MMR prediction with multiparametric MRI-based radiomics in RC.

The most valuable aspect of the present study is the multiparametric approach, which enhances the MRI-based radiomics model by mining the complementary information provided by multiparametric MRI and accounting for tumor heterogeneity when identifying features associated with MMR status [33]. By extracting radiomics features that are hardly detectable visually from the preoperative MR scans of the segmented VOIs of whole primary tumors, we developed four predictive models based on the T2WI sequence alone, the DWI sequence alone, the CE-T1WI sequence alone, and the combination of these three sequences, respectively. ModelT2WI contained phenotypic features, while ModelDWI and ModelCE-T1WI contained heterogeneous data describing microcirculation for the entire rectal tumor. Heat map analysis revealed a correlation between MMR status and the selected features, suggesting that the chosen multiparametric features were relevant to MMR status.

The combined radiomics model achieved the best overall performance in predicting MMR status among all models in both cohorts, showing superiority over the single-sequence models. Good clinical utility was demonstrated by decision curve analysis. Combining several MRI sequences and mining the correlations among distinct radiomics features could allow a comprehensive assessment of tumor heterogeneity, which might increase predictive efficiency and help identify patients who need individualized treatment.

The second important aspect of this study is that we used a true external validation dataset, adding value to existing reports. Overfitting is a major concern with machine learning models. An external cohort helps address this concern because the developed model has had no exposure to the validation data in any form during the training phase.

However, this project is still in its infancy and has several limitations. First, the current retrospective study had a relatively small sample size and an unbalanced class distribution. This implies possible selection bias and limited generalizability of the results, although we used an external validation cohort and the SMOTE algorithm to reduce the effect of the unbalanced sample distribution. Consequently, large multicenter studies are warranted to reduce the effects of selection bias on model accuracy. Second, image segmentation was manual rather than automatic, which may introduce subjective errors and could be impractical for large samples [34, 35]. Third, a previous study developed and validated deep learning models for predicting MSI status in RC based on MRI data [36]. In future research, deep learning models with feature maps may have advantages over other approaches in visualizing heterogeneous distributions. This raises the possibility that deep learning could be used to predict which tumor area is most likely to show dMMR and thereby guide biopsy.

5. Conclusions

Overall, the established multiparametric machine learning model based on preoperative rectal MRI demonstrated good performance in predicting MMR status in RC patients. This radiomics approach could improve the current strategy for pretreatment assessment, with the advantage of being noninvasive and cost-effective, potentially helping select patients suitable for individualized therapy.

Abbreviations

RC: Rectal cancer
DWI: Diffusion-weighted imaging
MRI: Magnetic resonance imaging
T1WI: T1-weighted imaging
T2WI: T2-weighted imaging
VOIs: Volumes of interest
ROC: Receiver operating characteristic
AUC: Area under the ROC curve
SVM: Support vector machine
FOV: Field of view
BMI: Body mass index
CEA: Carcinoembryonic antigen
CA19-9: Carbohydrate antigen 19-9
dMMR: Deficient mismatch repair
pMMR: Proficient mismatch repair
DCA: Decision curve analysis
SMOTE: Synthetic minority oversampling technique
LASSO: Least absolute shrinkage and selection operator.

Data Availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Ethical Approval

The present study was approved by the Biomedicine Ethics Review Committees of Ruijin Hospital and Changhai Hospital.

Written informed consent to publish the presented information was obtained from study participants.

Disclosure

The funders developed the main idea and designed the study.

Conflicts of Interest

The authors declare that they have no competing interests.

Authors’ Contributions

JL, YL, and FS conceived the project. XM and HL acquired the data. GJ, YC, and ZL analyzed and interpreted the patient data regarding radiomics features. YX performed statistical analyses and feature extraction. GJ, YC, and XM were the major contributors in writing the manuscript. All authors read and approved the final manuscript. Guodong Jing, Yukun Chen, and Xiaolu Ma contributed equally to this work.

Acknowledgments

The study was supported by the Project of the Action Plan of Major Diseases Prevention and Treatment (2017ZX01001-S12) and the Special Project of Integrated Traditional Chinese and Western Medicine in General Hospitals of Shanghai (ZHYY-ZXYJHZX-201901).

Supplementary Materials

Supplementary 1. Supplemental Table 1: main sequences and parameters of rectal MRI.

Supplementary 2. Supplemental Table 2: characteristics of the two examined patient cohorts.

Supplementary 3. Supplemental Table 3: selected radiomics features for different sequences.