Abstract

Background. TP53 mutation is a common mutation gene in uterine corpus endometrial carcinoma (UCEC), and the TP53 signaling pathway plays an essential role in the tumorigenesis, progression, and immune infiltration in UCEC. We aimed to discover TP53 pathway-related lncRNAs in UCEC. Materials and methods. 528 UCEC patients with 587 transcriptional profiles were enrolled in this study. We first investigated the differential status of TP53 signaling pathway between tumor and normal tissues by GSEA analysis, then identified TP53 pathway-related lncRNAs, accordingly establishing a nine TP53 pathway related to the lncRNA signature in the training set and verified this signature in the test set. Besides, the interaction network was constructed; the immune infiltration, drug response to cisplatin and paclitaxel, and mutation atlas were investigated. Finally, we performed a subgroup analysis to check the universality of this signature. Results. A nine TP53 pathway-related lncRNA prognostic signature was constructed and verified superior accuracy in predicting the overall survival of UCEC patients. Besides, high-risk patients showed a poor prognosis, but they were more sensitive to the cisplatin and paclitaxel. Notably, M2 macrophages were higher infiltrated in high-risk patients, and TP53 showed a significantly higher mutation in high-risk patients than low-risk patients. Conclusions. We constructed and verified a nine TP53 pathway-related lncRNA prognostic signature in UCEC, which also contributes to the decision-making of the chemotherapy.

1. Introduction

TP53 is one of the most significant genes in nearly all tumor types that are mutated in more than 50% of human cancers [1, 2]. It encodes p53 tumor suppressor protein, which mainly contributes to protecting the DNA integrity of the cells [3, 4]. As a protein encoded by the tumor suppressor gene, p53 protein can limit the proliferation of abnormal cells. It can limit tumor development through cell stress sensors, which can respond to DNA damage, hypoxia, oncogene expression, nutritional deprivation and ribosome dysfunction, and limit cell proliferation under these adverse conditions [5, 6]. In addition, p53 can respond to DNA damage and promote apoptosis or cellular senescence [7]. Under the condition of low-level stress, p53 can trigger protective survival-promoting responses, such as temporary cell cycle arrest, DNA repair, or production of antioxidant proteins, to maintain the integrity of the genome and repair the vitality of damaged cells [4]. Therefore, the TP53 signaling pathway plays a very important role in the normal life activities of cells [2]. In recent years, researchers are also committed to finding new potential regulatory genes related to the TP53 signaling pathway, especially long-chain noncoding RNA [810].

Long noncoding RNA (lncRNA) refers to noncoding RNA with a length of more than 400 nucleotides, including long intergenic no-coding RNA (Linc RNA) and natural antisense transcripts (NAT) [11, 12]. Although long noncoding RNAs do not play the role of coding proteins, they can regulate the expression and function of related genes at the RNA level through corresponding regulatory mechanisms [13]. Besides, in recent years, more and more studies focused on the prognostic value of long noncoding RNA, and many accurate lncRNA-based prediction models have been created and tested [1416].

Uterine corpus endometrial carcinoma (UCEC) is a common gynaecological tumor, which occupies a major position in gynaecological tumor cases in developed countries [17], and it is still the second most common tumor in developing countries. It has been reported that there are an estimated 382000 new cases and 89900 deaths worldwide in 2018 [18]. Although surgical treatment provides a good prognosis for patients with early UCEC, the 5-year OS of patients with recurrent or metastatic UCEC is significantly lower [19]. Accurate prediction of individual prognosis is helpful to early identify patients with poor prognosis and individualized accurate treatment. Therefore, in this study, we devoted ourselves to identifying lncRNAs associated with the TP53 signaling pathway in UCEC and constructed prediction models based on these lncRNAs. At the same time, we further studied the potential relationship between this prediction model and immune microenvironment and drug sensitivity.

2. Materials and Methods

2.1. Patients and Row Data

528 UCEC patients with corresponding clinicopathological characteristics and sequencing data of 552 UCEC tumor samples and 35 normal samples were downloaded from the TCGA database (https://portal.gdc.cancer.gov/). Besides, as TP53 was one of the most mutated genes in UCEC, the mutation profiles of UCEC were also retrieved from the TCGA database. “KEGG TP53 signaling pathway” was searched and downloaded from MsigDb (https://www.gsea-msigdb.org/gsea/msigdb/).

2.2. TP53 Pathway in UCEC and Identification of TP53 Pathway-Related lncRNAs

First, we wondered whether the TP53 pathway is differentially activated between tumor samples and normal samples, and we performed gene set enrichment analysis (GSEA). Then we extracted the expression of genes in the TP53 signaling pathway, applied the Spearman correlation test between this expression atlas and lncRNA expression atlas. and adjusted value <0.05 were set as the threshold to define TP53 pathway-related lncRNAs [20].

2.3. Randomization and Signature Construction

We randomly divided all UCEC patients into training sets and test set at a ratio of 1 : 1, then performed univariate cox regression, LASSO regression, and multivariate cox regression in proper order for the final prognostic signature. Then, there established a risk formula according to the signature: where represents the th TP53 pathway-related lncRNAs; represents that there a total of TP53 pathway-related lncRNAs in the prognostic signature; represents the expression value of (in FPKM format); represents the coefficient of . Then, all samples obtained a corresponding risk score, and we divided all samples into high-/low-risk groups according to the medium risk score in the training set.

2.4. Survival Analysis, ROC Curves, and Subgroup Analysis

We plot the survival curves by Kaplan-Meier methods in the training set, test set, and all samples. The log-rank test was used to investigate the survival differences between high-risk and low-risk groups. Notably, the ROC curves in the training set, test set, and all samples were assembled. Besides, the area under curves (AUC) at 1, 3, and 5 years was calculated to emphasize the prediction capability of the prognostic signature. Besides, we divided all patients into several subgroups according to their clinicopathological characteristics and performed survival analysis in each subgroup to explore whether this signature was accurate in all subgroups.

2.5. Interaction Network and Sankey Plot

We were interested in the interaction network between the TP53 pathway and these lncRNAs. Thus, we visualized the interaction network by Cytoscape (version 4.0.8) and then plot the Sankey plot to show the relationship and risk type of these lncRNAs.

2.6. Immune Infiltration, Drug Response, and Mutation Atlas

The CIBERSORT algorithm was carried out to emphasize the immune infiltration of each patient [21], and then we performed Wilcoxon tests to discover the differences in immune infiltration between high-risk and low-risk patients. Besides, R package “pRRophetic” was used to predict the drug response to cisplatin and paclitaxel [22]. Then, the drug response was also compared between high-risk and low-risk groups. Finally, as TP53 is one of the most mutated genes in UCEC, we sorted the mutation profiles according to the risk score and explored the differences in their mutation atlas.

3. Results

3.1. Characteristics of Patients Enrolled in This Study

A total of 528 patients with complete clinical information and transcriptional data were enrolled in this study and randomly split into a training set () and test set (). Their clinicopathological characteristics were shown in Table 1, and it seems there were no baseline differences between the training set and test set. Notably, the races of patients with UCEC were consist of American Indian or Alaska Native, Asian, Black or African American, Native Hawaiian or other pacific islander, and White. Among of them, the White was the major race. The longest follow-up duration in this cohort were 18.79 years. And the pathological G-stage refers to the FIGO 1971 staging system. Besides, 16 of 528 UCEC patients received certain radiotherapy, and the radiotherapy history of the left 512 patients was unknown.

3.2. GSEA Analysis and Signature Construction

Firstly, we sorted the transcriptome matrix according to the tissue type that there was tumor tissue to normal tissue from left to the right. Then we input the “KEGG TP53 signaling pathway” as enriched pathway and carried out the GSEA analysis. Interestingly, we found TP53 signaling pathway was significantly enriched in the tumor samples (Figure 1(a), , normalized value =0.008), which indicated TP53 pathway was highly activated and played an essential role in UCEC. Following this, we performed Spearman correlation test between the expression value of genes in the TP53 signaling pathway and expression value of all the lncRNAs. Initially, we discovered 775 TP53 pathway-related lncRNAs with and adjusted value <0.05. Finally, we constructed a nine TP53 pathway-related lncRNA signature by univariate Cox regression, LASSO regression (Figures 1(b) and 1(c)), and multivariate Cox regression (Figure 1(d)) in order. The detailed coefficient of each lncRNA in this signature was showed in Table 2.

3.3. Survival Analysis and ROC Curves

This nine TP53-related lncRNAs signature showed great capability in the log-rank test in the training set (Figure 2(a), ), test set (Figure 2(b), ), and all samples (Figure 2(c), ). Besides, this signature also showed stable AUC in predicting survival in training set (Figure 2(d), 1-year , 3-year , and 5-year ), test set (Figure 2(e), 1-year , 3-year , and 5-year ), and all samples (Figure 2(f), 1-year , 3-year , and 5-year ). The distribution of risk score and survival status in each group was also shown in Figures 2(g)2(i).

3.4. Interaction Network and Sankey Plot

We visualized the interaction network containing 10 mRNAs in the TP53 signaling pathway and these nine TP53-related lncRNAs as shown in Figure 3(a). Also, the Spearman correlation test was conducted between these 19 genes, and we found all the correlation was positively correlated (Figure 3(b)). The Sankey plot showed four protective lncRNAs and five risky lncRNAs in this signature (Figure 3(c)).

3.5. Immune Infiltration and Drug Response

The immune infiltration atlas of all patients between high-risk and low-risk groups was summarized in Figure 4(a). Here, we focused on the significant differential infiltrated immune cells containing M2 macrophages and activated dendritic cells (DC) that both activated DC and M2 macrophages were significantly higher infiltrated in high-risk patients than low-risk patients (Figures 4(b) and 4(c)). Besides, in the comparison of IC50 to the chemotherapy, the drug responses to both cisplatin and paclitaxel were significantly different between high-risk and low-risk patients that high-risk patients showed a more sensitive response to both cisplatin and paclitaxel (Figures 4(d) and 4(e)).

3.6. Mutation Atlas and Subgroup Analysis

Mutation atlas, including the 20 highest mutated genes in high-risk patients and low-risk patients, was shown in Figure 5(a) and Figure 5(b). The significant differential mutated genes were explored between high-risk and low-risk patients by test as shown in Table 3. Notably, TP53 mutation ranks the first in these differential mutation genes that showed a mutation proportion of 49% in high-risk patients and only 23% in low-risk patients (). Besides, this signature showed a great performance in all subgroups (Figures 5(c)5(g)). In both patients with (Figure 5(c), , ) and patients with (Figure 5(d), , p =0.002) subgroups, this risk score is associated with the overall survival that patients with high-risk showed a poor OS. Besides, in all the subgroups of G-stage, including G1 subgroup (Figure 5(e), , ), G2 subgroup (Figure 5(f), , ), and G3 subgroup (Figure 5(g), , ), this risk score is quite effective. All these results showed the universality and practicability of our prognostic signature.

4. Discussion

P53 is a tumor suppressor protein and transcription factor, which can regulate cell division, prevent DNA mutated or damaged cells from dividing and induce abnormal apoptosis of these DNA damaged cells by upregulating transcriptional apoptosis signals to prevent the formation of the tumor [23, 24]. It can respond to cell stress or DNA damage and activate different transcriptional targets. It is worth noting that p53 can coordinate various responses, including cell cycle arrest, DNA repair, metabolic changes, antioxidation, antiangiogenesis, autophagy, cellular senescence, and apoptosis [7]. Recent studies have also shown that TP53 is a significant iron death-inducing factor [25]. Therefore, the TP53 signaling pathway is crucial in the occurrence and development of tumors. It is necessary to study the new functions and new regulatory factors of TP53.

In this study, we first confirmed that the TP53 signaling pathway is highly activated in UCEC, then identified 775 TP53 signaling pathway-related lncRNAs, constructed a prognostic signature in the training set, and verified it in the test set. This prognostic signature shows high prediction accuracy in the training set, test set, all samples, and subgroups. More importantly, even though the prognosis of high-risk patients is worse, we found that they are more sensitive to cisplatin, which means that the patient differentiation of our prediction model can be used to assist clinical decision-making. Finally, we found significant differences in the infiltration of M2 macrophages between high-risk and low-risk patients, and the mutation rate of TP53 was higher in high-risk patients. In a word, we devote to identifying TP53 pathway-related lncRNAs in UCEC and established a nine TP53 pathway-related lncRNA signature. This signature has three main functions, and they are risk stratification, medication guidance, and mutation prediction. Higher risk score results in a poor prognosis but better response to the chemotherapy to cisplatin and paclitaxel, and a higher probability with mutant type of TP53.

The relationship between TP53 mutation and the tumor microenvironment has always been a controversial issue [26, 27]. Many studies have shown that TP53 mutation plays a negative role in antitumor immunity [26]. However, many studies have reported that TP53 mutation can promote antitumor immunity [28, 29]. In our study, high-risk patients were associated with significantly higher levels of M2 macrophage infiltration and activated DC cell infiltration. Although activated DC cells seem to present more tumor antigens or tumor neoantigens, M2 macrophages with a higher proportion of infiltration shape an anti-inflammatory and tumor-promoting immune microenvironment [30, 31], which also explains why patients with high-risk have a worse prognosis. Another interesting finding is the difference in sensitivity to cisplatin between patients with high- and low-risk. Traditional cytotoxic drugs usually play an antitumor role by blocking cell proliferation or inducing tumor cell apoptosis [32]. One of the most used chemotherapy regimen for endometrial cancer is the TC regimen: paclitaxel plus carboplatin [33]. Our study shows that high-risk patients are more sensitive to cisplatin and paclitaxel, which may be related to the difference in gene mutation spectrum in high-risk and low-risk patients.

Among these lncRNAs, CFAP58-DT has been reported associated with genome instability [34]. This is interesting and consistent with our study that TP53 is associated with genome instability, and we identified CFAP58-DT as a TP53 pathway-related lncRNA in UCEC. Also, Jin et al. identified AC092794.1 as an immune-related lncRNA in lung adenocarcinoma [35]. Moreover, we found a significant difference in immune infiltration between high-risk and low-risk patients in the present study. All these verified the reliability of our results. However, there existed some limitations in this study. On the one hand, though we have examined this prognostic signature in the test set, it still needs a large sample, multicenter independent cohort for validation. On the other hand, we need to conduct more experiments in the laboratory to further verify the regulation or coexpression of these lncRNAs on the TP53 signaling pathway. Besides, there lacks detail pathological information of patients, which is critical for our comprehension of the association between TP53 related pathway, this signature, and the pathological stages and grades. And here, we predicted the individual response to the chemotherapy by the “Prophetic” algorithm, which can only predict the “IC50” of each patient to the chemotherapy, which might be more meaningful for clinicians if it can predict the response classifications such as complete response (CR), partial response (PR), and stable disease (SD).

In summary, we verified the significant role of TP53 signaling pathway in UCEC and constructed a critical TP53-related mRNA-lncRNA interaction network, which may provide potential therapeutic targets and biomarkers of companion diagnosis. More importantly, we identified a nine TP53 pathway-related lncRNA signature in UCEC and verified this prognostic signature in the test set. Besides, though high-risk patients showed a poor prognosis, they were more sensitive to the cisplatin and paclitaxel. The differential immune infiltration and mutation atlas associated with this signature need further investigation in the future.

Abbreviations

UCEC:Uterine corpus endometrial carcinoma
lncRNA:Long noncoding RNA
Linc RNA:Long intergenic noncoding RNA
NAT:Natural antisense transcripts
LASSO:Least absolute shrinkage and selection operator.

Data Availability

Source data of this study were available from corresponding author (email: [email protected]).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

WY, QXL, LLS, and HD were responsible for the design, analysis and interpretation of data, drafting of the manuscript, and critical revision of the manuscript; HD managed the statistical analysis and project administration; WY and QXL were in charge of writing the original draft of the manuscript; all authors participated in writing the review and editing of the manuscript. Ying Wang and Xiaoling Qu contributed equally to this work and designated as co-first authors.

Acknowledgments

The work was supported by the grant project of the National Natural Science Foundation of China (No. 81600544).