Significance of TP53 Mutational Status-Associated Signature in the Progression and Prognosis of Endometrial Carcinoma
Background. TP53 mutations are associated with poor outcome for patients with endometrial carcinoma (EC). However, to date, there have been no studies focused on the construction of TP53 mutational status-associated signature in EC. In this study, we aim to conduct a TP53 mutation-associated prognostic gene signature for EC. Methods. Hence, we explored the mutational landscape of TP53 in patients with EC based on the simple nucleotide variation data downloaded from The Cancer Genome Atlas (TCGA) database. Differential expression analysis and least absolute shrinkage and selection operator (LASSO)–Cox analysis was used to establish TP53 mutation-associated prognostic gene signature. The overall survival rate between the high-risk and low-risk groups was compared by the Kaplan–Meier (K-M) method. Results. We found that the TP53 mutation was associated with poor outcome, older age, lower BMI, and higher grade and stage of EC in patients. A TP53 mutational status-associated signature was established based on transcriptome profiling data. Moreover, the patients in TCGA database were categorized into high- and low-risk groups. Kaplan–Meier (K-M) analysis indicated that the patients in the high-risk group have poor survival outcome. Furthermore, receiver operating characteristic (ROC) curves confirmed the robust prognostic prediction efficiency of the TP53 mutational status-associated signature. Finally, the prognostic ability was successfully verified in the other two datasets from cBioPortal database as well as in 60 clinical specimens. Univariate (hazard ratio , , ) and multivariate (hazard ratio , , ) Cox regression analyses indicated that the TP53 mutational status-associated signature could be used as an independent prognostic factor for EC patients. Conclusion. In summary, our research constructed a powerful TP53 mutational status-associated signature that could be a potential novel prognostic biomarker and therapeutic target for EC.
Endometrial carcinoma (EC), which originates from the endometrial epithelium, is the second most common malignancy of the female reproductive system . Although surgery, chemotherapy, and immunotherapy for EC have led to some improvements in the clinical outcome , patient mortality rate is still high . Therefore, it is of practical clinical significance to further explore the pathogenesis of EC at the molecular level and to evaluate and predict the survival rate of patients by studying the prognostic signature of EC [4, 5].
Tumor protein P53 (TP53), located on the short arm of chromosome 17 (17p13.1) [6, 7], has a broad spectrum of mutations in human cancers, including allelic loss, deletion, insertion, and point mutations . Chromosomal deletion of TP53 gene is associated with the occurrence, chemotherapy resistance, and poor prognosis of many tumors . Notably, about 70–80% of mutations in the TP53 gene are missense mutations caused by the substitution of a single nucleotide, which consequently changes the corresponding amino acid residues. This change, especially the change of arginine residues, can significantly affect TP53 gene activity . Moreover, the TP53 protein is inactivated in more than half of tumors . A mutant TP53 protein not only loses its tumor suppressor function but may also acquire a functional expression similar to an oncogene, promoting the occurrence and development of cancer [12, 13]. Therefore, TP53 could potentially be a novel biomarker of tumor prognosis and an effective therapeutic target.
Previous studies have confirmed the prognostic value of TP53 mutations in EC [14–16]. However, so far, no studies have focused on the construction of a TP53 mutational status-associated signature in EC. Hence, this study is aimed at constructing a TP53 mutational status-associated signature based on The Cancer Genome Atlas (TCGA) database.
2. Materials and Methods
2.1. Data Acquisition
A dataset with 529 patients with EC including simple nucleotide variation, transcriptome profiling datasets, and clinical data was acquired from TCGA and was utilized as the training dataset (https://portal.gdc.cancer.gov/). The ucec_tcga_pan_can_atlas_2018 and ucec_tcga_pub datasets, which contained the information of 527 and 331 patients with EC, respectively, along with their transcriptome profiling datasets and corresponding clinical information, were downloaded from the cBioPortal database (http://www.cbioportal.org/study/summary?id=ucec_tcga) and used as validation datasets. The deadline for the dataset was June 2020. Inclusion criteria for sample screening included (1) primary endometrial cancer confirmed by pathology without any preoperative radiotherapy or chemotherapy and (2) prognostic information complete without deletion. Finally, this study has been performed according to the REMARK Guidelines. The baseline information of the EC patients is shown in Table 1.
2.2. Specimen Collection
A total of 60 patients with EC that were admitted to the Obstetrics and Gynecology Department of the Shengjing Hospital of China Medical University from January 2016 to December 2016 were selected as the research objects. The ages of patients ranged from 26 to 76 years, and the average age was years. In terms of FIGO stage, 26 cases were at stage I, 12 were at stage II, 14 cases were at stage III, and 8 cases were at stage IV. In terms of pathological grades, there were 21, 14, and 25 cases at G1, G2, and G3, respectively. This study was approved by the ethics committee of Shengjing Hospital of China Medical University, and informed consent was obtained from all patients and healthy participants. In addition, all methods were performed in accordance with the relevant guidelines and regulations.
2.3. Identification of Differentially Expressed Genes
The “edgeR” package in R was used to screen differentially expressed genes (DEGs) between patients with TP53 mutation or not. Inclusion criteria was and . The cut-off value was 0.029. The “ggplot2” package in R was used to draw the volcano map, and the “ComplexHeatmap” package was used to draw the heat map in order to show the differential expression in patients with TP53 mutation or not.
2.4. Construction and Validation of the TP53 Mutational Status-Associated Signature
The “Survival” package in R was performed to obtain the DEGs associated with prognostic value according to univariate Cox regression analysis. DEGs with significant prognostic value () were screened to establish the TP53 mutational status-associated signature using LASSO–multivariate Cox analysis. The risk score for each patient was calculated using the following formula: . The median value of risk score was used to classify the patients into high- and low-risk groups, and the K-M and log-rank method was used to compare the overall survival outcome between the two groups. A receiver operating characteristic (ROC) curve was plotted to evaluate the prognostic ability of the TP53 mutational status-associated signature at different time endpoints using the “Survival” and “timeROC” in R software. In addition, to evaluate the predictive performance of the TP53 mutational status-associated signature, we verified our analysis results using the ucec_tcga_pan_can_atlas_2018 and ucec_tcga_pub datasets from the cBioPortal database, as well as in 60 clinical specimens.
2.5. RT-qPCR Analysis
Trizol was used to extract the total RNA of EC samples (Takara, Japan), and then, complimentary DNA (cDNA) was synthesized using the PrimeScript RT kit (Takara, Japan), following the manufacturer’s instructions. The cDNA was amplified using SYBR Premix Ex Taq kit (Takara, Japan), and mRNA hydrolevel was detected using an ABI Prism 7000 fluorescence quantitative PCR assay (Applied Biosystems, Waltham, MA, USA). GAPDH was used as the internal reference, and mRNA expression in the TP53 mutational status-associated signature was calculated by the 2-ΔΔCT method. The sequences of primers used for RT-qPCR are displayed in Supplementary Table 1. Then, we established a TP53 mutational status-associated signature as per the method used for the training dataset based on the expression level of mRNAs. The risk scores of 60 clinical specimens were obtained, and specimens were classified into high- and low-risk groups.
2.6. Establishment of the Nomogram Model Based on the TP53-Associated Signature
To maximize clinical decision-making, the “rms” package in the R software was used to conduct a nomogram based on the expression level of genes in the TP53 mutational status-associated signature. After the clinicians input the expression values of each gene in the TP53 mutational status-associated signature for a specific EC patient into the nomogram, the corresponding score values in the score scale were obtained. Then, the resulting score values were added into the total score scale. Finally, a vertical line was drawn on the survival scale to estimate the survival rates at 1, 3, and 5 years. Calibration plots were used to evaluate calibrating ability. The closer the predicted value is to the actual value, the better the nomogram can be corrected. Decision curve analysis (DCA) curves were constructed to evaluate the efficacy of the histogram for prognosis prediction in different sample sets .
2.7. Statistical Analysis
Univariate and multivariate Cox regression analyses were used to evaluate the independence of TP53 mutational status-associated signature for EC. The total number of mutations in the DNA of cancer cells (TMB) of the patients with EC in TCGA database was calculated by Perl software. The Wilcoxon rank-sum test was used for comparative analysis between two groups. All statistical analyses were performed using the R language (version 3.6.2). Bilateral test was considered statistically significant.
3.1. TP53 Mutational Status in EC
Based on the TP53 mutation data from TCGA dataset, we found that the mutation frequency of the TP53 was 37%. As expected, K-M analysis confirmed that the patients with TP53 mutation had poor survival outcome (Figure 1(a), ). Moreover, Figures 1(b)–1(e) reveal that the TP53 mutation was related to poor outcome, older age, lower BMI, and higher grade and stage of EC in patients ().
3.2. Construction of the TP53 Mutational Status-Associated Signature
A total of 1058 DEGs were identified based on and screening standard (Supplementary Table 2; Figures 2(a) and 2(b)). Univariate Cox regression preliminarily screened 50 DEGs associated with the prognosis of patients with EC (Supplementary Table 3, ), and then, LASSO regression analysis identified 17 key DEGs (Figures 2(c) and 2(d)). Of these, nine were further screened by forward stepwise method, and then, a TP53 mutational status-associated signature was constructed. Moreover, multivariate Cox regression analysis was used to calculate the risk score of each EC patient, with the following formula: (Table 2). The difference analysis between TP53 mutation and TP53 wild-type group revealed that ERBB2, GLOD5, KCNK6, MAL, OR2W3, STAC, and ZNF829 were expressed at higher levels in the TP53 mutation group, while MUCL1 and RBP2 were expressed at higher levels in the TP53 wild-type group (Figures 3(a)–3(i)).
3.3. Evaluation of the TP53 Mutational Status-Associated Signature
We then evaluated and validated the prognostic efficacy of the TP53 mutational status-associated signature in both training and validation datasets. The risk scores and survival status of each patient with EC have been determined (Figures 4(a), 4(d), 4(g), and 4(j)). K-M analysis showed that patients in the low-risk group have longer survival time than those in the high-risk group (Figures 4(b), 4(e), 4(h), and 4(k), ). Moreover, ROC analysis showed that the overall survival rates of EC patients at 1, 3, and 5 years in TCGA dataset were 0.775, 0.762, and 0.738, respectively (Figure 4(c)). In the ucec_tcga_pan_can_atlas_2018 dataset, the overall survival rates at 1, 3, and 5 years were 0.764, 0.831, and 0.858, respectively (Figure 4(f)), whereas in the ucec_tcga_pub dataset, the overall survival rates at 1, 3, and 5 years were 0.886, 0.878, and 0.890, respectively (Figure 4(i)). Figure 4(l) reveals that the 1-, 3-, and 5-year overall survival rates (AUC) in clinical specimens were 0.925, 0.851, and 0.826, respectively.
3.4. Independent Prognostic Value of the TP53 Mutational Status-Associated Signature
With overall survival as the dependent variable, the risk score is calculated by the TP53 mutational status-associated signature, age, BMI, pathological stage, and grade in TCGA dataset. Univariate (hazard ratio , , ) and multivariate (hazard ratio , , ) Cox regression analyses indicate that the TP53 mutational status-associated signature has significant prognostic value, which could be used as an independent prognostic factor for EC patients (Figures 5(a) and 5(b)). We then investigated correlations between mutational status and the new risk score and clinicopathological variables in TCGA dataset and found that patients with older age, higher EC grade and stage, dead event, and TP53 Mut were more distributed in the high-risk group (Figures 5(c)–5(h), Table 3, ).
3.5. Establishment of the Nomogram Model Based on the TP53-Associated Signature
We successfully constructed a nomogram based on the expression levels of the above nine DEGs. After the clinicians input the expression values of nine genes for a specific EC patient into the nomogram, the corresponding score values in the score scale were obtained, and the resulting score values were added into the total score scale. Finally, a vertical line was drawn on the survival scale to estimate the survival rates at 1, 3, and 5 years (Figure 6(a)). Calibration curves showed that the predicted survival rates of patients with EC were in good agreement with the actual survival rates at 1, 3, and 5 years (Figures 6(b)–6(d)). Moreover, DCA results showed that the nomogram had high net income (Figure 6(e)).
3.6. Mutational Landscape Associated with the TP53 Mutational Status-Associated Signature
TMB is defined as the number of tumor-specific mutations per million coding region bases . Figures 7(a)–7(c) reveal that the patients with TP53 wild type and those in the low-risk group had higher TMB values. Moreover, the Sankey diagram showed the relationship between risk score, TP53 mutational status, TMB, and survival status (Figure 7(d)). Finally, we investigated the mutational landscape associated with the TP53 mutational status-associated prognostic signature and found that PTEN had higher mutation frequency in the high-risk group, while TP53, PPP2R1A, PIK3CA, and MUC16 had low mutation frequencies (Figure 7(e)).
EC is the most common type of cancer in the female reproductive system . In recent years, the understanding of EC has deepened, and some achievements have been made in the treatment and prognostic assessment of EC. However, there has still not been a breakthrough in treatment strategies, and individualized treatment of EC still faces great challenges. Previous studies have reported that the TP53 mutation is associated with poor outcome of patients with EC, which was confirmed in our research [20, 21]. However, to date, there are still no relevant studies on the development of a TP53 mutational status-associated signature. In our study, a TP53 mutational status-associated signature with powerful predictive potential in TCGA dataset was constructed and verified its potential using two datasets from the cBioPortal database, as well as in 60 clinical specimens, indicating that this could be a novel prognostic biomarker and therapeutic target for EC.
This TP53 mutational status-associated signature was constructed using LASSO–Cox analyses of identified key DEGs, which included ERBB2, GLOD5, KCNK6, MAL, MUCL1, OR2W3, RBP2, STAC, and ZNF829. To explore how these genes are involved in the development of EC, we reviewed the previous studies.
Erb-B2 receptor tyrosine kinase 2 (ERBB2), also known as HER2, is a member of the ERBB family . ERBB2, as a proto-oncogene, has been confirmed to be upregulated in EC tissues and is related to poor prognosis . Several targeted therapies for ERBB2, such as trastuzumab, pertuzumab, and lapatinib, have been used in the clinical setting . Potassium channel subfamily K member 6 (KCNK6) is the background potassium channel belonging to the potassium channel family of double pore domain. KCNK6 is upregulated in thyroid carcinoma and breast cancer and is related to the proliferation, invasion, and migration of breast tumor cells [25, 26]. Myelin and lymphocyte protein (MAL) encodes T lymphocyte maturation-related proteins and plays a role in T cell differentiation. Downregulated MAL, as a tumor suppressor gene, was associated with a variety of human epithelial malignancies . A study revealed that the MAL can be used for the early diagnosis of EC . Mucin-like 1 (MUCL1), also known as SBEM, is a breast-specific gene that is associated with the occurrence, progression, prognosis, and chemotherapy response of breast cancer . OR2W3, which belongs to the ORS gene family, has been revealed to be related to the progression of breast cancer . Retinoblastoma-binding protein 2 (RBP2) belongs to the JARID protein family and is responsible for histone demethylase (HDM) activity. As a chromatin-modifying enzyme, it has been shown to be involved in the development and progression of a variety of cancers . Src homology three (SH3) and cysteine-rich domain (STAC) encodes a cysteine-rich protein containing the SH3 domain, which is mainly expressed in neurons and may be involved in neuron-specific signal transduction . So far, no relevant studies have been found on the GLOD5 and ZNF829 genes. Although most of these genes have not been previously reported in EC, they have been found to play an important role in the development of other tumors [30, 33–35].
To evaluate and validate the prognostic value of the TP53 mutational status-associated signature in both the training and validation datasets, as well as in clinical specimens, an ROC curve at 1, 3, and 5 years was plotted. We found that the mean of AUC value was more than 0.80, indicating that the TP53 mutational status-associated signature has a powerful prognostic ability.
In summary, we conducted and validated a TP53 mutational status-associated signature with robust predictive potential. To our knowledge, this is the first study to do so. The TP53 mutational status-associated signature could potentially be used as a novel prognostic biomarker and therapeutic target for EC.
|TCGA:||The Cancer Genome Atlas|
|LASSO:||Least absolute shrinkage and selection operator|
|ROC:||Receiver operating characteristic|
|ERBB2:||Erb-B2 receptor tyrosine kinase 2|
|MAL:||Myelin and lymphocyte protein|
|RBP2:||Retinoblastoma-binding protein 2.|
All data generated or analyzed during this study are included in this published article.
This study was approved by the ethics committee of the Shengjing Hospital of China Medical University. In addition, all methods were executed in accordance with relevant guidelines and regulations.
Informed consent was obtained from all patients.
This manuscript was submitted as a preprint in the link https://www.researchsquare.com/article/rs-820719/v1 A .
Conflicts of Interest
The authors declare that they have no competing interests.
Ying Chen, Wancheng Zhao, Fangfang Bi, Xue Pan, Lili Yin, and Chengzhi Zhao conceived and designed the study. Ying Chen, Wancheng Zhao, Fangfang Bi, Lili Yin, and Chengzhi Zhao developed the methodology. Ying Chen, Wancheng Zhao, Lili Yin, and Chengzhi Zhao analyzed and interpreted the data. Ying Chen, Xue Pan, Lili Yin, and Chengzhi Zhao wrote, reviewed, and/or revised the manuscript.
We wish to thank the authors who provided TCGA and cBioPortal public datasets.
Supplementary 1. Supplementary Table 1: the sequences of primers used for RT-qPCR.
Supplementary 2. Supplementary Table 2: a total of 1058 differentially expressed genes (DEGs), including 525 upregulated genes and 533 downregulated genes, were identified based on and screening standard.
Supplementary 3. Supplementary Table 3: univariate Cox regression preliminarily screened 50 DEGs with the prognosis value.
V. C. A. Caponio, G. Troiano, I. Adipietro et al., “Computational analysis of TP53 mutational landscape unveils key prognostic signatures and distinct pathobiological pathways in head and neck squamous cell cancer,” British Journal of Cancer, vol. 123, no. 8, pp. 1302–1314, 2020.View at: Publisher Site | Google Scholar
C. Reijnen, H. V. N. Küsters-Vandevelde, M. J. L. Ligtenberg et al., “Molecular profiling identifies synchronous endometrial and ovarian cancers as metastatic endometrial cancer with favorable clinical outcome,” International Journal of Cancer, vol. 147, no. 2, pp. 478–489, 2020.View at: Publisher Site | Google Scholar
V. D. Li, K. H. Li, and J. T. Li, “TP53 mutations as potential prognostic markers for specific cancers: analysis of data from The Cancer Genome Atlas and the International Agency for Research on Cancer TP53 Database,” Journal of Cancer Research and Clinical Oncology, vol. 145, no. 3, pp. 625–636, 2019.View at: Publisher Site | Google Scholar
C. C. Chini, J. M. Espindola-Netto, G. Mondal et al., “SIRT1-activating compounds (STAC) negatively regulate pancreatic cancer cell growth and viability through a SIRT1 lysosomal-dependent pathway,” Clinical Cancer Research, vol. 22, no. 10, pp. 2496–2507, 2016.View at: Publisher Site | Google Scholar