A Prognostic Model Based on Six Metabolism-Related Genes in Colorectal Cancer
An increasing number of studies have shown that abnormal metabolism processes are closely correlated with the genesis and progression of colorectal cancer (CRC). In this study, we systematically explored the prognostic value of metabolism-related genes (MRGs) for CRC patients. A total of 289 differentially expressed MRGs were screened based on The Cancer Genome Atlas (TCGA) and the Molecular Signatures Database (MSigDB), and 72 differentially expressed transcription factors (TFs) were obtained from TCGA and the Cistrome Project database. The clinical samples obtained from TCGA were randomly divided at a ratio of 7 : 3 to obtain the training group () and the test group (). After univariate and multivariate Cox regression analyses, we constructed a prognostic model based on 6 MRGs (AOC2, ENPP2, ADA, GPD1L, ACADL, and CPT2). Kaplan–Meier survival analysis of the training group, validation group, and overall samples proved that the model had statistical significance in predicting the outcomes of patients. Independent prognosis analysis suggested that this risk score might serve as an independent prognosis factor for CRC patients. Moreover, we combined the prognostic model and the clinical characteristics in a nomogram to predict the overall survival of CRC patients. Furthermore, gene set enrichment analysis (GSEA) was conducted to identify the enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways in the high- and low-risk groups, which might provide novel therapeutic targets for CRC patients. We discovered through the protein-protein interaction (PPI) network and TF-MRG regulatory network that 7 hub genes were retrieved from the PPI network and 4 kinds of differentially expressed TFs (NR3C1, MYH11, MAF, and CBX7) positively regulated 4 prognosis-associated MRGs (GSTM5, PTGIS, ENPP2, and P4HA3).
Statistically, CRC is the fourth leading malignancy worldwide regarding its incidence, occupying about 10.2% of the total tumor incidence . Moreover, nearly a half of patients die within 5 years after they are diagnosed . Most CRC cases progress from polypoid adenomas to high-grade dysplasia, then to adenoma-adenocarcinoma , and this process usually takes over 10 years [3, 4]. Currently, although various treatments have been developed for CRC, the patients’ prognosis still remains unsatisfactory, especially for patients with lymph node metastasis . For the time being, the TNM classification system is the major pathological staging method, which can hardly accurately evaluate the prognosis of CRC. With the progress of the genome-sequencing technologies and the protein function research, an increasing number of studies about the biomarkers to predict the development and prognosis of tumor have emerged. Microsatellite Instability (MSI) status and TP53 mutation status are associated with the event-free survival after neoadjuvant chemotherapy . And the high expression of PIWI-interacting RNA (piRNA) predicts poor prognosis of colorectal cancer . However, the clinical application of biomarkers is still in development.
The role of metabolic disorder in the development and therapy of malignant tumors remains a research hotspot. In the mouse model, the high cholesterol levels associate with the enhanced phosphorylation of Akt, accelerating breast cancer cell growth in experiment in vitro . Shu et al. believed that multiple glyceryl phosphatides, especially phosphatidylcholine and phosphatidylethanolamine, were negatively correlated with the risk of CRC . To inhibit the glycolytic pathway of tumor, shikonin could suppress the activity of PKM2 . Therefore, the therapy strategy targeting to the metabolism might provide novel therapeutic promise for CRC patients.
In this study, we constructed a MRG-based prognostic model to systematically evaluate the prognosis of CRC patients. Kaplan–Meier survival analysis and independent prognosis analysis demonstrated the prognostic value of the prognostic model. Next, hierarchical analysis for high- and low-risk groups in CRC patients with GSEA might provide the novel therapeutic targets for CRC patients. Moreover, the PPI network and TF-MRG network offered more reference for understanding the molecular relationship and molecular regulatory mechanisms.
2. Materials and Methods
2.1. Data Collection
The Cancer Genome Atlas (TCGA) data portal (https://portal.gdc.cancer.gov/) was used to acquire RNA sequences extracted from 482 tumor samples and 41 normal or paratumor samples and associated clinical data. The MSigDB v7.0 (c2: curated gene sets: KEGG gene sets, gene symbols) was used to obtain the MRGs. The TFs were obtained through the Cistrome Project database (http://www.cistrome.org/) .
2.2. Further Extraction of the MRG Data
The MRGs were obtained through the KEGG gene sets in MSigDB and intersected with all genes obtained in TCGA. The Wilcoxon test was utilized for differential analysis to obtain the differentially expressed MRGs according to the thresholds of and .
2.3. The Construction of the Protein-Protein Interaction (PPI) Network
In order to explore the underlying mechanisms of the interactions among the MRGs, we constructed a PPI network related to the differentially expressed MRGs with the Search Tool for the Retrieval of Interacting Genes (STRING) 11.0 (https://string-db.org). Meanwhile, Molecular Complex Detection (MCODE), the tool of the Cytoscape 3.7.2 software , was utilized to retrieve the hub genes of PPI network.
2.4. The Preliminary Validation of the Prognosis-Associated MRGs
To guarantee accuracy and objectivity, patients with missing survival time data or with a survival time of fewer than 30 days were excluded, since these patients might have died from other acute fatal diseases (heart disease and cerebral infarction), rather than CRC. Afterward, we used the “caret package” in the R software to divide patients into a training group and a validation group at a ratio of 7 : 3, and the “survival package” in the R software was employed to conduct univariate Coxregression analysis to obtain the prognosis-associated MRGs that were highly correlated with survival (). To avoid overfitting when constructing the prognostic model, the training group was subject to lasso regression analysis  and partial likelihood deviance to screen prognosis-associated MRGs, among which, the prognosis-associated MRGs with were defined as the high-risk MRGs and the prognosis-associated MRGs with were defined as the low-risk MRGs.
2.5. The Construction of the TF-MRG Network
TFs were obtained from the Cistrome Project database, and the differentially expressed TFs were extracted based on the obtained differentially expressed genes (DEGs) according to the thresholds of and . The prognosis-associated MRGs and differentially expressed TFs were subjected to Pearson correlation analysis ( and value < 0.001) to acquire the TFs and prognosis-associated MRGs used for the construction of the TF-MRG regulatory network, which was visualized with Cytoscape 3.7.2 software.
2.6. The Construction of the Prognostic Model
The “survival package” and “survimer package” in R software were employed for multivariate Cox regression analysis of the training group and validation group to obtain MRGs, among which we further uncovered their functional correlations in Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) through Database for Annotation, Visualization and Integrated Discovery (DAVID) and the risk score for the construction of the prognostic model. Patients in the training group and validation group were divided into high- and low-risk groups according to the median risk score. The survival function of R software was utilized to conduct Kaplan–Meier survival analyses on the high-risk group and low-risk group in the training group, validation group, and overall samples. The risk score and survival status curves reflected the distribution of patient risk score in the high- and low-risk groups, as well as the relationship between the risk score and the survival status. Heat maps denoted the changes in the expression of various significant prognostic MRGs in the high- and low-risk groups. The clinical characteristics obtained from TCGA, including age, gender, tumor stage, pathological T stage, pathological N stage, and pathological M stage, were subjected to independent prognosis analysis combined with the risk score of the prognostic model, to verify whether the prognostic model might serve as an independent factor to predict patient prognosis. Furthermore, the “survival ROC” in R software was adopted to plot the multi-indicator ROC curves for the training group, validation group, and overall samples, and the values of the areas under the curve (AUCs) were observed to verify the feasibility and accuracy of our prognostic model and the clinical characteristics in predicting the patients’ prognosis.
2.7. The Nomogram for the Prognostic Model and Clinical Characteristics
To more intuitively predict the patient survival time, the “rms package” in R software was used to plot the nomogram with a combination of the prognostic model and clinical characteristics (age, sex, and tumor stage). Additionally, the calibration curve and C-index were utilized to verify the consistency and integral veracity of the nomogram. The ROC curves for 1-, 3-, and 5-year patient survival were also plotted to examine the feasibility of the nomogram in predicting the patient’s survival rate in chronological order.
2.8. Gene Set Enrichment Analysis
To judge the potential relationship of each enriched KEGG pathway in the high- and low-risk groups, the overall samples were subjected to GSEA . The corresponding normalized enrichment scores (NES) in each KEGG enriched pathway were observed to judge whether this pathway was active in the high-risk group or the low-risk group. If , the pathway was active in the high-risk group; otherwise, if , the pathway was active in the low-risk group. The top 5 most active pathways in the high- and low-risk groups ( indicated statistical significance) were selected for further visualization analysis.
3.1. Data Acquisition
The main idea of the study is shown in Figure 1. We obtained RNA sequences from 481 tumor tissues and 41 paracarcinoma or nontumor tissues from TCGA-FPKM, which were then subjected to differential analysis using the “limma package” in R software. A total of 6475 DEGs (Table S1) were obtained according to the thresholds of and , including 4431 upregulated and 2044 downregulated ones (Figures 2(a) and 2(d)). Besides, the genes enriched in the metabolism-related pathways were identified from the KEGG gene sets of MSigDB, which were deemed as MRGs. Altogether, 944 MRGs related to CRC were obtained after integrated analysis of the RNA sequences obtained from TCGA-FPKM, which were subjected to differential analysis by the “limma package” in R software. In line with the thresholds of and , 289 differentially expressed MRGs (Table S2) in total, including 163 downregulated and 126 upregulated ones, were obtained (Figures 2(b) and 2(e)). Meanwhile, 318 TFs were acquired from the Cistrome Project database, and then 72 differentially expressed TFs (Table S3) were obtained through integrated analysis of the DEGs based on the thresholds of and , including 47 upregulated and 25 downregulated ones (Figures 2(c) and 2(f)).
3.2. The PPI Network
We aggregately screened out 487 interaction pairs composed of 227 differentially expressed MRGs to construct the PPI network (Figure 3(a)) with a confidence of 0.900 and filtration of isolated differentially expressed MRGs. And as shown in Figure 3(b), via MCODE of Cytoscape 3.7.2, 7 hub differentially expressed MRGs were selected from the PPI network.
3.3. The Acquisition of Prognostic-Associated MRGs
Patients conforming to the screening criteria were randomly divided at a ratio of 7 : 3 into a training group () and validation group () (Table 1). On the basis of univariate Cox regression analysis combined with MRG expression quantities and patients’ survival data, 14 MRGs significantly related to patients’ survival were obtained (). Then, the 12 prognosis-associated MRGs were acquired through lasso regression analysis and partial likelihood deviance (Figures 4(a) and 4(b)). As is shown in Table 2, 9 high-risk prognosis-associated MRGs (ACADL, ENPP2, GPX3, PTGIS, ADH1B, P4HA3, ADA, AOC2, and GSTM5) () and 3 low-risk prognosis-associated MRGs (GPD1L, PAFAH2, and CPT2) () were screened out.
3.4. The Regulatory Network between Differentially Expressed TFs and Prognostic-Associated MRGs
We carried out Pearson correlation analysis (, ) for 12 prognosis-associated MRGs and 72 DETFs and then screened 4 prognosis-associated MRGs (GSTM5, PTGIS, ENPP2, and P4HA3) and 4 differentially expressed TFs (NR3C1, MYH11, MAF, and CBX7) to construct the TF-MRG regulatory network, which was visualized using the Cytoscape 3.7.2 software for intuitive observation. Obviously, the 4 differentially expressed TFs positively regulated the 4 prognosis-associated MRGs (Figure 2(g)).
3.5. The Six-MRG Prognostic Model
To construct the MRG-based prognostic model, we carried out multivariate Cox regression analysis on the 12 prognosis-associated MRGs. Finally, 6 MRGs (AOC2, ENPP2, ADA, ACADL, GPD1L, and CPT2), together with the corresponding coefficients, were obtained (Figure 4(c)). Eventually, the risk score was obtained, which was calculated as follows: . Using the median risk score of 0.774, the patients with were classified as the high-risk group, while those with the were classified as the low-risk group. Table 3 shows the results of GO and KEGG functional enrichment analyses for the MRGs of the prognostic model. GPD1L, AOC2, ACADL and CPT2, ACADL and AOC2, and ACADL were, respectively, active in GO terms, oxidation-reduction process, fatty acid beta-oxidation, and electron carrier activity included, with statistical significance (). CPT2 and ACADL were all enriched in KEGG pathways including fatty acid degradation, fatty acid metabolism, and PPAR signaling pathway with statistical significance (). Figures 5(a), 5(e), and 5(i) show survival curves based on the prognostic model in the training group, validation group, and overall samples. In the training group, the 5-year survival rate for high-risk patients was about 47.9%, while that for low-risk patients, it was about 75%. In the validation group, the 5-year survival rate for high-risk patients was lower than 40.7%, and that in low-risk patients was about 60.9%. The 5-year survival rate for high-risk patients was about 46.7%, and that for low-risk patients was about 77.9%. The risk score plots (Figures 5(b), 5(f), and 5(j)) display the changes of risk score in high- and low-risk patients in training group, validation group, and overall samples, respectively. The survival status distribution graphs (Figures 5(c), 5(g), and 5(k)) show the changes of risk score and the patient survival distribution in the training group, validation group, and overall samples. Notably, the number of deaths increased and the survival time declined as the risk score increased in the training group, validation group, and overall samples. Heat maps (Figures 5(d), 5(h), and 5(l)) illustrate the variation trend of the expression of various prognosis-associated genes with the increase of the risk score. AOC2 showed the most obvious variation trend in the training group, validation group, and overall samples. Its expression quantity was positively correlated with the risk score, indicating that the expression quantity increased as the risk score increased. The expression of CPT2 and GPDL1 was negatively correlated with the risk score. To verify the influence of this prognostic model and various clinical characteristics on the patients’ survival, we conducted an independent prognosis analysis. Univariate Cox regression (Table 4) and multivariate Cox regression analysis (Table 5) were performed on the clinical characteristics and risk score of the training group, validation group, and overall samples. The risk score might serve as an independent factor to predict the patient prognosis in the training group (Figures 6(a) and 6(b)), validation group (Figures 6(c) and 6(d)), and overall samples (Figures 6(e) and 6(f)) (). Additionally, the multi-indicator ROC curve was plotted, and the AUCs in the training group (Figure 6(g)), validation group (Figure 6(h)), and overall samples (Figure 6(i)) were 0.706, 0.739, and 0.716, respectively, verifying that it was more feasible and accurate to use this model in predicting the outcomes of patients than the remaining clinical indicators.
3.6. The Clinical Correlation Analysis
Furthermore, we carried out clinical correlation analysis on various prognosis-associated MRGs and various clinical characteristics (age, gender, tumor stage, and pathological TNM system) so as to further explore the potential molecular regulatory relationships. was considered to denote statistical significance, while suggested high statistical significance, and indicated significant statistical significance. To more intuitively present their relationships, we used the “ggpubr package” of R software to plot boxplots. Clearly, as shown in Figure 7(a), the ADA expression level was correlated with age, and patients older than 60 years had slightly higher ADA expression level than those aged less than 60 years. As shown in Figure 7(b), the expression levels of CPT2 and GPDL1 were correlated with tumor stage, among which, the CPT2 expression level was significantly correlated with tumor stage. As the tumor stage advanced, the CPT2 expression level decreased accordingly, revealing a negative correlation. In tumor stage I and tumor stage II, the expression level of GPDL1 was consistent, but it decreased with the increase in tumor stage. As shown in Figure 7(c), the CPT2 expression level was considered to be significantly correlated with the pathological N stage. Patients with an advanced pathological N stage had lower expression than those with a lower pathological N stage, and the GPDL1 expression level displayed the same trend, which was related to the pathological N stage. The expression of ACADL and AOC2 showed significant correlations with pathological N stage, but their expression levels were low, with insignificant variation trends. For the pathological M stage (Figure 7(d)), the CPT2 expression level in patients with an advanced pathological M stage was lower. GPDL1 expression exhibited a similar trend, and the difference was statistically significant. The expression of the MRGs showed no obvious statistical significance with gender or pathological T stage.
3.7. The Four-Signature Nomogram
We integrated the prognostic model and clinical characteristics to predict patients’ survival with a nomogram (Figure 8(a)). The age, gender, tumor stage, and risk score of the prognostic model were used as the elements for rating various risk factors of the patients, and the scores were added to obtain the total score, thus obtaining the corresponding predicted survival rate. Meanwhile, the 1-year, 3-year, and 5-year survival calibration curves (Figures 8(b)–8(d)) and C-index (0.806) indicated an ideal fitting and excellent accuracy of the nomogram. In the ROC curves (Figure 8(e)), the AUCs of the 1-, 3-, and 5-year survival rates were 0.717, 0.715, and 0.740, respectively, suggesting that this model relatively accurately predicted the survival rate for over 70% of the patients.
3.8. Gene Set Enrichment Analysis
To further explore the biological functions of the MRGs, we carried out GSEA on high- and low-risk groups, finding that 83 KEGG enriched pathways were active in the high-risk group, while 95 were active in the low-risk group. In CRC, various metabolism-related pathways were mainly enriched in the low-risk group ( in total, including 18 with statistical significance at , while they were rarely enriched in the high-risk group). Furthermore, 89 statistically significant KEGG enrichment pathways () were screened, among which, 49 were active in the high-risk group, while 40 were active in the low-risk group. The top 5 pathways with the highest NES in the high- and low-risk groups (Table 6) were selected for visualization analysis (Figure 9), intuitively demonstrating that the expression of genes enriched in KEGG pathways active in the high-risk group located above the x-axis were apparently higher than those in the low-risk group. Genes enriched in KEGG pathways that were active in the low-risk group and located below the x-axis also exhibited a similar trend.
In total, a 6-MRG- (AOC2, ENPP2, ADA, GPD1L, ACADL, and CPT2) based prognostic model was constructed based on the CRC patient clinical characteristics and expression quantity of MRGs. The exploration on AOC2 is relatively limited, and the AOC2-like enzyme activity is detected in eye tissues . ENPP2 is a gene that encodes autotaxin, which has been verified to be related to the growth and metastasis of melanoma tumor and stage I nonsmall cell lung cancer (NSCLC) [16, 17]. And Zhao et al. revealed that autotaxin protein encoded by ENPP2 catalyzes the production of LPC into lysophosphatidic acid (LPA), and such lipid molecular metabolic reaction may be associated with the genesis and development of CRC . Some studies indicated that ADA, participating in encoding an enzyme involved in purine metabolism, is downregulated in lymphocytes of advanced stage lung cancer . Kelly et al. suggested that GPDL1 negatively regulated HIF-1α protein expression in tumor cells, while suppressing miR-210 induced the high expression of GPDL1, which might become a new target in tumor treatment . ACADL can encode an enzyme that participates in fatty acid and branched chain amino-acid metabolism. Hill et al. discovered that ACADL methylation might associate with the poor prognosis for breast cancer . Regarding research on CPT2, Fujiwara et al. discovered that, in obesity- and nonalcoholic steatohepatitis-driven hepatocellular carcinoma, the downregulation of CPT2 accelerated tumor progression . In clinical correlation analysis, the CPT2 expression quantities were significantly correlated with stage and pathological N stage, and its expression quantities gradually decreased as the stage and pathological N stage advanced, thus possibly meaning the reduced expression quantities of CPT2 in advanced CRC. Meanwhile, GO functional annotation suggested that GPD1L, AOC2, ACADL and CPT2, ACADL and AOC2, and ACADL were respectively active in oxidation-reduction process, fatty acid beta-oxidation, and electron carrier activity. Oxidation-reduction process was linked to the prognosis of hepatocellular carcinoma  and clear cell renal cell carcinoma . The critical role of fatty acid beta-oxidation was also proven in the progression of cancer. It has been revealed that fatty acid beta-oxidation promotes proliferation of lymphatic endothelial cells by providing acetyl-CoA and regulates the differentiation of lymphatic endothelial cells with the epigenetic control of CPT1 . And Wang et al. elucidated that JAK/STAT3-dependent fatty acid beta-oxidation is associated with breast cancer chemoresistance . Xu et al. revealed that DNA methylation-driven genes in prostate adenocarcinoma were active in electron carrier activity . What is more, KEGG pathway enrichment analysis uncovered that CPT2 and ACADL were both enriched in fatty acid degradation, fatty acid metabolism, and PPAR signaling pathway, which were all directly or indirectly involved in the process of lipid metabolism related to the progression of malignant tumors [28–31]. No doubt that the functional relationship among MRGs of the prognostic model provided compelling evidence for the role of metabolism in the progression of cancer from the molecular level. Upon the prognostic model constructed, Kaplan–Meier survival analysis for patients classified into high- and low-risk groups according to the median risk score in the training and validation groups verified the prognostic value of the prognostic model. The independent prognosis analysis validated that the risk score acquired had favorable statistical significance in predicting the patient prognostic outcomes. Additionally, the nomogram showed favorable accuracy in predicting the 1-, 3-, and 5-year survival rates of patients, contributing to systemically planning patient’s follow-up.
We further explored the underlying multiple molecular relationships based on differentially expressed MRGs, among which, PPI network was constructed. From the PPI network, we identified 7 hub genes, namely, ATIC, IMPDH1, ENTPD8, AMPD2, GMPR, ENTPD3, and AMPD1. Ruan et al. found that the high expression quantity of IMPDH1 was related to the poor prognosis of malignant tumors, and the interaction between IMPDH1 and YB-1 was associated to the tumor metastasis, which might be a novel therapeutic target . An et al. revealed that ENTPD8, the related gene of metabolite cytidine, was low expressed in pancreatic cancer . And AMPD2 was identified as a potential biomarker for predicting the poor prognosis of undifferentiated pleomorphic sarcoma functional genomics identifies . GMPR was found that it could downregulate GTP-bound Rho-GTPases and inhibited the further development of melanoma . Feldbrugge et al. elucidated that the enzyme expressed by ENTPD3 prevented colon against inflammation and purinergic signaling regulated by ENTPD3 dominated neuroimmune interactions related to Crohn’s disease . AMPD1 could be regarded as the biomarker to predict the survival of breast cancer . There is no doubt that our studies provided theoretical support for the interaction about MRGs in colorectal cancer. Besides, about the regulatory relationship of TFs on the prognosis-associated MRGs, we found that TFs (NR3C1, MYH11, MAF, and CBX7) positively regulated MRGs (ENPP2, PTGIS, GSTM5, and P4HA3). Previous study indicates that the point mutation of MYH11 and the reduced expression quantity of CBX7 are related to the poor prognosis for CRC [38, 39]. NR3C1 positively regulates the 4 prognosis-associated genes. Schlossmacher et al. indicated that glucocorticoid receptor encoded by NR3C1 promoted cell apoptosis through downregulating the expression of antiapoptotic proteins or inducing the expression of proapoptotic proteins . However, there is no research on the regulatory role of NR3C1 in CRC, which may provide a new therapeutic target for metabolic treatment. Currently, targeted metabonomics analysis or nontargeted metabonomics analysis or the combination of both is employed to investigate the effect of differentially expressed metabolite or specific metabolite on the disease prognosis [41–43], among which, NMR spectroscopy is the representative of nontargeted metabonomics analysis method. Previously, Moolenaar et al. obtained the abnormally elevating N,N-dimethylglycine (DMG) induced by the congenital deficiency of enzyme dimethylglycine dehydrogenase (DMGDH) through 13C NMR spectroscopy and gas chromatography-mass spectrometry, which resulted in body odor . In traditional metabonomics, the patient body fluid is collected to obtain the metabolic components to study the patient disease phenotype, but it is frequently dependent on the limited metabolic phenotype markers and is restricted by the quantities of sample metabolic components. Comparatively, our prognosis model utilized the high-throughput sequencing results to evaluate the DEGs and obtain their expression, and it was obtained based on the patient risk score acquired from the model algorithm, together with the patient clinical characteristics. Previously, the gene expression features are utilized to evaluate patient prognosis. O’Connell et al. constructed the multigene algorithms to quantify the prognosis for stage II/III CRC patients who received surgical treatment or combined with postoperative fluorouracil (FU) and leucovorin (LV) . Agesen combined patients, populations, and Affymetrix exon-level microarrays to display a 13 gene-based classifier for predicting the prognosis of stage II CRC through COX regression analysis . These studies have quantified the prognosis evaluation at the molecular level. Additionally, this MRG-based prognosis model expanded the gene biological functions. We conducted GSEA on high-risk group and low-risk group, finding numerous KEGG enriched pathways, most of which were related to metabolism. A vast majority of these pathways were enriched in the low-risk group, including propanoate metabolism pathway and fatty acid metabolism pathway. In recent years, research on the influence of lipid metabolic pathway on CRC development has always been a hotspot. Kazlauskas suggested that LPA played an important role in stimulating tumor angiogenesis , thus regulating tumor metastasis, while angiogenesis in tumor tissues usually promotes tumor growth and metastasis . Yeh et al. employed the microarray-bioinformatics analysis methods to reveal that the activation of fatty acid pathway promoted CRC genesis and development at gene level . Recently, Wang et al. discovered that the activation of the CPT1A-mediated fatty acid oxidation pathways suppressed anoikis to accelerate CRC development and metastasis . And for propanoate metabolism pathway, Perroud et al. illustrated that 31 proteins in the propanoate metabolism pathway were associated with the genesis of clear cell RCC (ccRCC) . However, how the propanoate metabolism pathway affects the genesis and development of colorectal cancer is still being probed. Moreover, compared with the high-risk group, it was feasible to apply the metabolic therapy in the low-risk CRC group. Such a result indirectly verified the feasibility to treat early CRC with metabolic-targeted therapy.
In conclusion, we constructed a prognostic model based on six MRGs to predict the prognosis of CRC by using the bioinformatics method. Univariate and multivariate Cox regression analysis for the training group, validation group, and overall samples verified the prognostic value of the prognostic model. Moreover, the TF-MRG network and PPI network revealed novel molecular regulatory targets about metabolism in CRC. GSEA for biological functions based on the prognostic model not only provided fresh sights about the therapeutic target but also facilitated the individualized treatment for CRC patients.
We analyzed the publicly available datasets in this study. All data were extracted from TCGA, MSigDB, and Cistrome Project.
Consent for participation from all patients was obtained through The Cancer Genome Atlas Project.
Conflicts of Interest
The authors declared that they had no conflict of interests.
Xu YC and Sun YL designed the study, Zhang Y carried out the data collection, Guo YC and Yang ZH conducted the data analysis, Sun YL drafted the manuscript, and Xu YC revised the manuscript. All authors read and approved the final manuscript.
Table S1: the 6475 DEGs. Table S2: the 289 differentially expressed MRGs. Table S3: the 72 differentially expressed TFs. (Supplementary materials)
F. Bray, J. Ferlay, I. Soerjomataram, R. L. Siegel, L. A. Torre, and A. Jemal, “Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries,” CA: a Cancer Journal for Clinicians, vol. 68, no. 6, pp. 394–424, 2018.View at: Publisher Site | Google Scholar
J. L. Westra, M. Schaapveld, H. Hollema et al., “Determination of TP53 mutation is more relevant than microsatellite instability status for the prediction of disease-free survival in adjuvant-treated stage III colon cancer patients,” Journal of Clinical Oncology, vol. 23, no. 24, pp. 5635–5643, 2005.View at: Publisher Site | Google Scholar
A. Subramanian, P. Tamayo, V. K. Mootha et al., “Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles,” Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 43, pp. 15545–15550, 2005.View at: Publisher Site | Google Scholar
T. J. Kelly, A. L. Souza, C. B. Clish, and P. Puigserver, “A hypoxia-induced positive feedback loop promotes hypoxia-inducible factor 1alpha stability through miR-210 suppression of glycerol-3-phosphate dehydrogenase 1-like,” Molecular and Cellular Biology, vol. 31, no. 13, pp. 2696–2706, 2011.View at: Publisher Site | Google Scholar
L. Feldbrügge, A. C. Moss, E. U. Yee et al., “Expression of ecto-nucleoside triphosphate diphosphohydrolases-2 and -3 in the enteric nervous system affects inflammation in experimental colitis and Crohn’s disease,” Journal of Crohn's & Colitis, vol. 11, no. 9, pp. 1113–1123, 2017.View at: Publisher Site | Google Scholar
X. Wang, D. Wang, Y. Wang, P. Zhang, Z. Zhou, and W. Zhu, “A combined non-targeted and targeted metabolomics approach to study the stereoselective metabolism of benalaxyl enantiomers in mouse hepatic microsomes,” Environmental Pollution, vol. 212, pp. 358–365, 2016.View at: Publisher Site | Google Scholar
M. J. O'Connell, I. Lavery, G. Yothers et al., “Relationship between tumor gene expression and recurrence in four independent studies of patients with stage II/III colon cancer treated with surgery alone or surgery plus adjuvant fluorouracil plus leucovorin,” Journal of Clinical Oncology, vol. 28, no. 25, pp. 3937–3944, 2010.View at: Publisher Site | Google Scholar
C.-S. Yeh, J.-Y. Wang, T.-L. Cheng, C.-H. Juan, C.-H. Wu, and S.-R. Lin, “Fatty acid metabolism pathway play an important role in carcinogenesis of human colorectal cancers by microarray-bioinformatics analysis,” Cancer letters., vol. 233, no. 2, pp. 297–308, 2006.View at: Publisher Site | Google Scholar