Computational and Mathematical Methods in Medicine
Volume 2022 (2022), Article ID 3268386, 16 pages
https://doi.org/10.1155/2022/3268386
Molecular Analysis of Prognosis and Immune Infiltration of Ovarian Cancer Based on Homeobox D Genes
Correspondence should be addressed to Buze Chen and Zhengxiang Han
Received 15 May 2022; Revised 29 August 2022; Accepted 2 September 2022; Published 29 September 2022
Academic Editor: Hongxun Tao
Copyright © 2022 Buze Chen et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
Background. Homeobox D (HOXD) genes were associated with cancer pathogenesis. However, the role of HOXD genes in ovarian cancer (OC) and the possible mechanisms involved are unclear. In this study, we analyzed the function and regulatory mechanisms and functions of HOXD genes in OC based on comprehensive bioinformatics analysis. Methods. Expression of HOXD1/3/4/8/9/10/11/12/13 mRNA was analyzed between OC tissue and normal tissue using ONCOMINE, GEO, and TCGA databases. The relationship between HOXD expression and clinical stage was studied by GEPIA. The Kaplan-Meier plotter was used to analyze prognosis. cBioPortal was used to analyze the mutation and coexpression of HOXDs. GO and KEGG analyses were performed by the DAVID software to predict the function of HOXD coexpression genes. Immune infiltration analysis was used to evaluate the relationship between the expression of HOXD genes and 24 immune infiltrating cells. Results. The expression of HOXD3/4/8/9/10/11 was significantly lower in OC tissues than in normal ovarian tissues, while the expression of HOXD1/12/13 was significantly higher in OC tissues. The expression of HOXD genes was associated with FIGO stage, primary therapy outcome, tumor status, anatomic neoplasm subdivision, and age. The expression levels of HOXD1/3/4/8/9/10 correlated with tumor stage. HOXD1/8/9 could be served as ideal biomarkers to distinguish OC from normal tissue. Low HOXD9 expression was associated with shorter overall survival (OS) (HR: 0.75; 95% CI: 0.58–0.98; ) and progression-free survival (PFS) (HR: 0.69; 95% CI: 0.54–0.87; ). The HOXD coexpression genes were associated with pathways including cell cycle, TGF-beta signaling pathway, cellular senescence, and Hippo signaling pathway. HOXD genes were significantly associated with immune infiltration. Conclusion. The expression of HOXD genes is associated with clinical characteristics. HOXD9 is a new biomarker of prognosis in OC, and HOXD1/4/8/9/10 may be potential therapeutic targets. The members of the HOXD genes may be the response to immunotherapy for OC.
1. Introduction
Ovarian cancer (OC) is one of the most common gynaecological tumors, ranking fourth in incidence and third in mortality worldwide [1, 2]. In China, OC has the second highest mortality rate among gynaecological tumors and is on the rise, while the incidence is declining [3]. It is difficult to detect at an early stage, and most patients are diagnosed at an advanced stage [4]. Despite advances in the treatment of OC with chemotherapy, radiotherapy, surgery, and targeted therapies, the 5-year OS rate for patients with advanced OC is around 30% [5, 6]. Therefore, there is a need to explore the genetic signature of prognostic prediction associated with the underlying mechanisms of OC progression.
Homeobox genes are regulatory genes that share a common 180-183 bp sequence and encode a 61-amino acid structural domain known as the homeodomain. This homeodomain is a DNA binding domain that functions as a transcription factor [7]. In humans, the HOX genes are divided into four clusters (HOXA, HOXB, HOXC, and HOXD) on different chromosomes [8]. HOXDs contain 9 members, including HOXD1, HOXD3, HOXD4, HOXD8, HOXD9, HOXD10, HOXD11, HOXD12, and HOXD13. HOXD1 inhibited cell proliferation, cell cycle, and TGF-β signaling in kidney renal clear cell carcinoma (KIRC) [9]. By identifying the YY1-HOXD3-ITGA2 regulatory axis as a potential therapeutic target for hepatocellular carcinoma (HCC) treatment, a new and complete pathway for HCC treatment is offered [10]. Increased expression of HOXD3 was an independent and important predictor of poor prognosis in breast cancer (BRCA) patients [11]. Overexpression of HOXD4 is significantly associated with poorer prognosis in patients with gastric cancer (GC), suggesting the potential of HOXD4 as a novel clinical predictive biomarker and drug target [12]. HOXD8 may be associated with cisplatin resistance and metastasis in advanced OC [13]. Downregulation of miR-142-5p induced resistance to gefitinib in lung cancer PC9 cells through upregulation of HOXD8 [14]. In summary, some members of HOXDs are closely associated with clinical features and drug resistance of tumors, and their expression levels can be used as predictors of tumor prognosis, metastasis, and response to chemotherapy and targeted therapy. HOXD genes play a role in the pathogenesis of pediatric low-grade gliomas [15]. However, the role of HOXDs in OC is unclear. Studying the prognostic value of HOXDs for patients with OC may help to improve the prediction of clinical prognosis in OC and inform personalized treatment.
In this study, we used a comprehensive bioinformatics analysis to analyze the potential of HOXDs in OC as a predictor of OC prognosis, possible regulatory mechanisms, and relationship with immune infiltration. We hope that our study will be useful for the prognosis of biomarker and treatment of OC.
2. Materials and Methods
2.1. cBioPortal Analysis
The cBio Cancer Genomics Portal (cBioPortal) (http://cbioportal.org) was applied to study mutations in HOXD genes in OC [16]. Queries for visualization and analysis were performed by entering (1) cancer type: ovarian cancer; (2) 3 selected studies: ovarian serous cystadenocarcinoma (TCGA, Nature 2011), ovarian serous cystadenocarcinoma (TCGA, PanCancer Atlas), and ovarian serous cystadenocarcinoma (TCGA, Firehose Legacy); (3) molecular profile: mutations, structural variants, and copy number alterations; (4) selection of patients/case sets: all samples (1365); and (5) input genes: HOXD1 (ENSG00000128645), HOXD3 (ENSG00000128652), HOXD4 (ENSG00000170166), HOXD8 (ENSG00000175879), HOXD9 (ENSG00000128709), HOXD10 (ENSG00000128710), HOXD11 (ENSG00000128713), HOXD12 (ENSG00000170178), and HOXD13 (ENSG00000128714). After submission of queries, accessions were made including origin studies, mutation profiles, mutation number, overall survival (OS) status, OS (months), disease-free status, and disease-free period (months) tracks.
2.2. Differential Expression of HOXDs
ONCOMINE (https://www.oncomine.org/resource/login.html) was used to analyze the levels of HOXD mRNAs in OC tissues and normal tissues [17]. Screening criteria are as follows: , , and top 10% of gene rank [18].
The analysis was carried out according to the reference [19, 20]. Software: R (version 3.6.3). R package: mainly ggplot2 (for visualization). Molecules: HOXD1/3/4/8/9/10/11/12/13. Data: UCSC XENA (https://xenabrowser.net/datapages/) RNAseq data in TPM (transcripts per million reads) format for TCGA and GTEx processed uniformly by the Toil process [21]. Extracted TCGA (https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga) OC and corresponding normal tissue data in GTEx. Data filtering: none. Data transformation: RNAseq data in TPM format and log2 transformed for sample-to-sample expression comparisons. Significance markers: ns, ; , ; and , .
2.3. Correlation Heat Map
Correlation between every two genes of HOXDs was assessed using a Pearson’s correlation coefficient [16]. Software: R (version 3.6.3). R package: mainly ggplot2 (version 3.3.3). Data: RNAseq data in level 3 HTSeq-FPKM format from the TCGA OC project. Data conversion: RNAseq data in FPKM (fragments per kilobase per million) format were converted to TPM format and log2 transformed. Data filtering: remove control/normal (not all items have control/normal).
2.4. The Relationship between HOXDs and Clinical Characteristics of OC
Software: R (version 3.6.3). R package: basic R package. Molecules: HOXD1/3/4/8/9/10/11/12/13. The grouping condition is the median. Data were obtained from the TCGA OC project for RNAseq data in level 3 HTSeq-FPKM format. RNAseq data in FPKM format were converted to TPM format and then log2 transformed.
Expression and correlation analyses of HOXDs were carried out on the GEPIA website (http://gepia.cancer-pku.cn/) [22]. The expression of HOXDs at different clinical stages was generated online.
2.5. The Relationship between HOXDs and Prognosis of OC
Using the Kaplan-Meier method, the analysis was carried out according to the reference [18, 23]. Software: R (version 3.6.3). R package: survminer package (for visualization) and survival package (for statistical analysis of survival data). Molecules: HOXD1/3/4/8/9/10/11/12/13. Subgroups: 0-50 vs. 50-100. Prognosis type: OS and progression-free survival (PFS). OS is defined as the time from the beginning to death from any cause. PFS is defined as the time from initiation to the onset of arbitrary tumor progression or the onset of death. Data: RNAseq data and clinical data in level 3 HTSeq-FPKM format from the TCGA OC project. Data filtering: retain data with clinical information. Data conversion: RNAseq data in FPKM format were converted to TPM format and analyzed by grouping them according to molecular expression. Additional data: prognostic data from the reference [24].
The survival curves of HOXD12 were plotted using the online Kaplan-Meier plotter database [25].
2.6. Univariate and Multivariate Cox Regression Analysis
Software: R (version 3.6.3). R package: survivor package (version 3.2-10). Statistical methods: Cox regression module. Prognosis type: OS and PFS. Included variables: HOXD1/3/4/8/9/10/11/12/13. Data: RNAseq data in level 3 HTSeq-FPKM format from TCGA OC project. Data conversion: RNAseq data in FPKM format were converted to TPM format and log2 transformed. Supplementary data: prognostic data from the reference [24]. Data filtering: remove control/normal (not all items have control/normal) + keep clinical information.
2.7. ROC Curve Analysis
The analysis was carried out according to the reference [18]. Software: R (version 3.6.3). R packages: mainly the pROC package (for analysis) || ggplot2 package. Molecules: HOXD1/3/4/8/9/10/11/12/13. Clinical variables: tumor vs. normal. Data: UCSC XENA RNAseq data in TPM format for TCGA and GTEx processed uniformly by the Toil process [21]. Extracted OC for TCGA and corresponding normal tissue data in GTEx. Data filtering: none. Data transformation: RNAseq data in TPM format and log2 transformed for analysis.
2.8. Correlation Analysis for Genes Associated with HOXDs in OC
cBioPortal was also used to analyze the relationship between the mutation of HOXDs and survival in OC. Coexpression levels were calculated according to the online instructions of “Similar Genes” part of GEPIA2 (http://gepia2.cancer-pku.cn/index.html#example#e3). The first 100 coexpressed genes were kept separately for each gene, and finally, all coexpressed genes were summarized. To further validate the accuracy of the ONCOMINE and TCGA databases, OC samples from the GEO database were downloaded for analysis [26]. 10 ovarian cancer tumor tissues and 10 normal ovarian tissues contained in GSE29450 were used for differential gene expression analysis.
2.9. GO and KEGG Analyses
DAVID database was used to do Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses for the coexpression genes of HOXDs, including BP (biological process), MF (molecular function), CC (cellular component), and pathway analysis [23, 27].
2.10. Correlation between the Expression of HOXD Genes in OC and Immune Cells
The analysis was carried out according to the reference [20]. Software: R (version 3.6.3). R package: GSVA package (version 1.34.0) [28]. Immunoinfiltration algorithm: ssGSEA (built-in algorithm of the GSVA package). Molecules: HOXD1/3/4/8/9/10/11/12/13. Immune cells: aDC (activated DC); B cells; CD8 T cells; cytotoxic cells; DC; eosinophils; iDC (immature DC); macrophages; mast cells; neutrophils; NK CD56bright cells; NK CD56dim cells; NK cells; pDC (plasmacytoid DC); T cells; T helper cells; Tcm (T central memory); Tem (T effector memory); Tfh (T follicular helper); Tgd (T gamma delta); Th1 cells; Th17 cells; Th2 cells; and Treg. Data: RNAseq data in level 3 HTSeq-FPKM format from TCGA OC project. Data conversion: RNAseq data in FPKM format were converted to TPM format and log2 transformed. Data filtering: control/normal removed (not all items have control/normal). Other data: markers for 24 immune cells were obtained from the reference [29].
2.11. Statistical Analysis
The methodology of our analysis follows the previous literature [18]. The expression of HOXDs between OC tissue and normal ovarian tissue was analyzed using the Wilcoxon rank sum test. were considered statistically significant.
3. Results
3.1. HOXD Gene Alterations and mRNA Expression in OC
The cBioPortal online tool was used to analyze the gene expression of HOXD genes in OC patients. Alterations in the HOXD genes in OC ranged from 4% to 5% (Figure 1). The structural variation data, mutation data, and CNA (copy number alteration) data from the 3 studies are depicted in Figure 2.