Background. Docking protein 5 (DOK5) is a member of the docking protein group of membrane proteins and is an adapter protein involved in signal transduction. Nevertheless, the role of DOK5 expression in the prognosis of gastric cancer (GC) remains unclear. Methods. In this study, clinical prognostic parameters and survival data related to DOK5, in patients with GC, were analyzed using bioinformatics analysis comprising Oncomine and TIMER, UALCAN database, Kaplan-Meier plotter, GEPIA, GSEA, DAVID, and cBioPortal websites. Results. In our study, GC contained various DOK5 expressions, which forecasted poor survival outcomes. Moreover, our research showed that high DOK5 could predict high-level infiltration of several GC immune cells, as evidenced by M1, TAM, M2, B cell, and T cell failure. Hence, DOK5 might become a new gastric cancer biomarker and therapeutic target. In the following analysis, in order to explore the prognostic value of DOK5 in GC, more clinical trials are needed to validate our results. Conclusions. Through multiple database verifications, DOK5 was found to be part of the pathogenic genes for GC. Thus, it can change the formation and progression of tumors by acting on human immunity.

1. Introduction

Gastric cancer (GC) is globally significant. It is the 5th most diagnosed cancer and 3rd major cause of cancer-related deaths. It is twice as common in men than in women [1]. GC is deemed a high-mortality cancer due to delayed diagnosis, as no certain clinical symptom appears during the early stage [2]. Hence, exploring certain sensitive biomarkers is highly important for early diagnosis as well as the prognostic evaluation of patients with GC.

Docking protein 5 (DOK5), which was first reported in 2001, is a member of a subgroup of the DOK family that has been expressed using c-Ret in several neuronal tissues. The receptor, tyrosine kinase c-Ret, had been explored as an oncogene, which also has been mutated in patients with multiple endocrine neoplasia and familial medullary thyroid cancer syndromes. DOK5 enhances c-Ret-dependent activation of mitogen-activated protein kinase [3]. Favre et al. had shown that DOK5 are expressed in T cells and their expression is regulated upon T cell activation [4]. Pothlichet et al. suggest that DOK5 upregulation might also be associated with metastasis, in human melanoma [5].

The above findings indicate that DOK5 plays a key role in the invasion, progression, and the metastasis of cancer. In this study, we systematically assessed DOK5 expression in a variety of tumor forms involving GC, as well as its association with prognosis. We also assessed its status regarding distinct tumor-infiltrating immune cells, using the Oncomine database, TIMER databases, GEPIA, Kaplan-Meier plotter, and UALCAN database. Our results clarified the significant role of DOK5 in the prognosis of GC and offer a potential mechanism by which DOK5 expression might monitor tumor immunity—regulation of the infiltration of immune cells in GC.

2. Methods and Materials

2.1. Oncomine

The DOK5 GC and normal tissues’ mRNA expression levels were checked using the Oncomine database [website address (https://www.oncomine.org/resource/login.html)]. We selected ; two-fold change in the study and top 10% gene rank had been utilized for the threshold. Wang’s studies were used to assess the differential expression levels of GC genes.

2.2. GEPIA

Gene Expression Profiling Interactive Analysis (GEPIA) (http://gepia.cancer-pku.cn/) is a modern web-based tool containing data on gene expression in normal tissues and tumors, shared from TCGA (The Cancer Genome Atlas), as well as the Genotype-Tissue Expression project; thus, it implements a standard processing pipeline [6]. It gives optional functions like differential expression analysis in tumors as well as normal tissues. We could also illustrate DOK5 expression in GC, as well as normal tissues.

2.3. TIMER

The Tumor Immune Estimation Resource (TIMER) platform has been used to assess the tumor-infiltrating immune cells of 32 cancer types in a comprehensive way. It utilized 10,000+ samples from TCGA platform (https://cistrome.shinyapps.io/timer/). TIMER assesses a mass of tumor-infiltrating immune cells using the statistical analysis of the gene expression profiles [7]. We examined the link shared by the DOK5 gene expression level along with the abundance of infiltrating immune cells, comprising CD8+ T cells, CD4+ T cells, neutrophils, B cells, and macrophages and dendritic cells based on the expression regarding marker genes in various cancers involving GC. Those marker genes utilized the analysis of the tumor-infiltrating immune cells involving B cells, T cells, monocytes, TAMs, M2 and M1 macrophages, natural killer (NK) cells, neutrophils, dendritic cells (DCs), T-helper 17 (Th17) cells, T-helper (Th) cells, exhausted T cells, and follicular helper T (Tfh) cells, along with Tregs that had been based on data taken from past researches. DOK5 gene was on the -axis, and related marker genes were on the -axis.

2.4. Kaplan-Meier Plotter

The Kaplan-Meier plotter platform (https://kmplot.com/analysis/) integrates information from TCGA, EGA, and GEO databases and translates the impact of target genes on patients with cancer. To evaluate the impact of DOK5 on the prognosis of patients with GC, Kaplan-Meier plotter was used, using various pathological parameters.


The UALCAN platform (http://ualcan.path.uab.edu) makes use of RNA-seq as well as the clinical data of 31 different cancer categories through TCGA [8]. It is capable of analyzing the tumor and normal sample’s relative gene expression, at varying tumor stages, tumor grades, and other clinicopathological features.

2.6. Functional Enrichment Analyses of Gastric Cancer

We ran Kyoto Encyclopedia of Genes and Genomes (KEGG) as well as Gene Ontology (GO) functional enrichment assessment on DOK5. The Database for Annotation, Visualization and Integrated Discovery (DAVID, https://david.ncifcrf.gov/) was utilized for the identification of enriched pathways as well as terms of GO and KEGG.

2.7. Gene Set Enrichment Analysis

We used the Perl software to compile the expression dataset file along with the phenotype data file of the target gene for the single gene enrichment analysis. We downloaded and installed the GSEA software (http://software.broadinstitute.org/ gsea) and ran it in a Java8 environment. The target gene was enriched by KEGG pathway analysis, and the path for analysis was obtained through the c2.cp.kegg.v7.2.symbols.gmt dataset in the MsigDB database. Using weighted enrichment analysis technology and random combination enrichment detection a thousand times, we calculated the value of FDR and through GSEA. We then visualized the outcomes using R (plyr, ggplot2, grid, grid Extra package) software. Cut-off criteria include and >500, nominal value < 0.05, and .

2.8. Genetic Alteration Analysis

As we logged onto the cBioPortal website (https://www.cbioportal.org/), we opted for the TCGA Pan Cancer Atlas Studies from the quick selection section and went into DOK5 in order to determine the DOK5 genetic change characteristics. We observed the change frequency, mutation type, and CNA (copy number change) results of all TCGA tumors in the Cancer Type Summary module.

2.9. Clinical Specimens

This study used 24 postoperative tissue samples of patients with GC treated in Changzhou No. 2 People’s Hospital from 2019 to 2021. Further, during the operation, the adjacent tissues were collected and stored at 80°C immediately.

2.10. Quantitative Real-Time Polymerase Chain Reaction (qRT-PCR) Analysis

The tissues had been preprocessed to extract total RNA. Through the utilization of a PrimeScript RT reagent kit (TaKaRa, Dalian, China), the cDNA was synthesized. Quantitative PCR was carried out with a 7500 real-time PCR system (ABI, Waltham, MA, USA). The PCR primers were synthesized and purchased by Sangon Biotechnology Company (Shanghai, China). DOK5: forward: GGTGAAGGGCTGTTTATCTTTC, reverse: TTTTTCACACTCTGTAGCAAGC; GAPDH: forward: CATGTTCCAATATGATTCCAC, reverse: CCTGGAAGATGGTGATG. GAPDH served as an internal control, and fold change was calculated using the 2ΔΔCT technique.

3. Results

3.1. DOK5 mRNA Expression Levels in Different Types of Human Cancers

In order to determine the difference in the expression of DOK5 in tumors and normal tissues, we used the Oncomine database to analyze the DOK5 mRNA levels in normal tissues of different tumors and multiple cancer types. Analysis shows that in contrast to the normal tissues, DOK5 was better expressed in GC, leukemia, lymphoma, and pancreatic cancer tissues (Figure 1(a)). The in-depth outcomes of DOK5 expression in varying cancer types have been illustrated in Supplementary Table 1.

For more assessment of the DOK5 expression in human cancers, we made use of RNA-seq data using several malignant tumors found in TCGA for identifying the DOK5 expression. In all TCGA tumors, the difference in expression of DOK5 between the tumor and the adjacent normal tissues has been illustrated in Figure 1(b). The expression of DOK5 was significantly reduced in bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), head and neck cancer (HNSC), kidney chromophobe (KICH), kidney renal clear cell carcinoma (KIRC), prostate adenocarcinoma (PRAD), skin cutaneous melanoma (SKCM), thyroid carcinoma (THCA), and uterine corpus endometrial carcinoma (UCEC), compared to adjacent normal tissues. However, DOK5 expression was significantly increased in cholangiocarcinoma (CHOL), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), and stomach adenocarcinoma (STAD), compared to adjacent normal tissues.

3.2. Genetic Change Analysis Data of DOK5

In different tumor samples in the TCGA cohort, we observed the genetic changes of DOK5. The highest alteration frequency of DOK5 (>6%) appeared for patients with colorectal tumors, with “amplification” as the primary type. We also observed that the genetic alteration status of DOK5 was mainly amplified in GC (>4%), which probably explains the changes of DOK5 in GC tissues at the gene level and gives the foundation for further study (Figure 2).

3.3. Effects of DOK5 on the Prognosis of Different Types of Human Cancers

In order to study whether the expression of DOK5 is related to the prognosis of cancer patients, we used GEPIA and Kaplan-Meier plotter to evaluate the effect of DOK5 expression on survival. Using the data for STAD, LIHC, and LUAD from TCGA in the GEPIA database, we assessed the correlation between differential expressions of DOK5 and clinical outcomes. Based on results from 381 patients with GC, poorer prognoses in terms of OS and DFS () were associated with higher mRNA expression levels for DOK5 (Figures 3(a) and 3(b)). However, in liver cancer, different results have emerged. Based on results from 364 patients with liver cancer, poorer prognoses in terms of OS and DFS () were associated with lower mRNA expression levels for DOK5 (Figures 3(c) and 3(d)).

In order to further study the prognostic potential of DOK5 in different cancers, the Kaplan-Meier plotter database was used to evaluate the prognostic value of DOK5 based on Affymetrix microarray. It is worth noting that the poor prognosis in GC (OS: , , ; PFS: , , ; PPS: , , ) was shown to correlate with higher DOK5 expression (Figures 3(e)3(g)). Poor prognosis is associated with low DOK5 expression in liver cancer (OS: , , ; PFS: , , ; RFS: , , ; Figure 3(h)3(j)). Sawant et al.’s studies have also shown that high DOK5 expression is associated with poor prognosis in liver cancer [9]. The expression of DOK5 was also correlated with the patients’ survival in the lung cancer (OS: , , ; PFS: , to 1.24, ; PPS: , to 1.11, ; Figures 3(k)3(m)). Conversely, DOK5 expression was not related with PFS and PPS in lung cancer (Figures 3(l) and 3(m)). These results suggest that DOK5 expression is of prognostic significance in GC, liver cancer, and lung cancer.

3.4. Expression and Clinical Features of DOK5 in Patients with Gastric Cancer

We analyzed the influence of DOK5 expression on different types of clinical patients using the Kaplan-Meier plotter database (Table 1). High DOK5 expression correlated with both poorer OS and PPS in stage 3 patients (OS: , ; PFS: , ), stage T2 patients (OS: , ; PFS: , ), stage M0 patients (OS: , ; PFS: , ), intestinal patients (OS: , ; PFS: , ), diffuse patients (OS: , ; PFS: , ), and HER2-negative patients (OS: , ; PFS: , ). It is worth noting that among patients with lymph node metastasis, patients with high DOK5 expression have a poorer prognosis (OS: stage N1, , ; stage N2, , ; stage N3, , ; stage N1+2+3, , . PFS: stage N1, , ; stage N2, , ; stage N1+2+3, , ). However, in stages 1, 2, T3, T4, N0, M1, and HER2-positive patients, DOK5 expression was not related to OS and PFS. The above data shows that according to the clinical characteristics of patients with GC, DOK5 expression is related to patients with GC lymph node metastasis.

3.5. DOK5 Expression in Patients with STAD

In the UALCAN database, we further analyzed various clinicopathological characteristics of TCGA-STAD specimens and it was found that compared with normal tissues, the expression level of DOK5 mRNA was higher in STAD tissues (Figure 4(a)). The expression of DOK5 in patients with STAD of different stages (2-4) was significantly higher than that of normal controls (Figure 4(b)). In addition, in the assessment made on the basis of race, the expression of DOK5 in patients with STAD was essentially high in contrast to that in the control group (specifically in Caucasians and Asians (Figure 4(c)) and sex (Figure 4(d))). The expression of DOK5 in patients having different grades (1, 3) of STAD was significantly more than that of the normal controls (Figure 4(e)). In the end, in patients with lymph node metastasis, DOK5 expression level is also higher (Figure 4(f)). Hence, the expression level of DOK5 is expected to be a potential diagnostic indicator for tumor staging in patients with GC.

3.6. DOK5 Expression Is Associated with Immune Cell Infiltration in GC

The number and active state of tumor-infiltrating lymphocytes can determine the survival time of some patients with cancer [10]. Therefore, we made use of the TIMER database for the identification of the relationship shared by the DOK5 expression and infiltrating immune cells in 32 cancers, including GC. The results showed that in 32 tumor types, DOK5 expression was crucially related to CD4+ T cells, CD8+ T cells, neutrophils, macrophages, and dendritic cells (Figure 5(a)). Furthermore, we also explored the link shared by SCNA (somatic copy number alteration) of DOK5 gene and the level of GC tumor invasion. It is worth noting that the results show that the CNA of DOK5 is significantly related to the infiltration level of CD8+ T cells, B cells, neutrophils, macrophages, and dendritic cells (Figure 5(b)).

3.7. Correlation between Immune Marker Sets and DOK5 Expression

In exploring the relationship shared by DOK5 as well as various immune infiltrating cells, using GEPIA and TIMER databases, we paid attention to the relationship shared by various immune cell marker sets and DOK5 in GC tissues. In STAD, we examined the link shared by DOK5 expression and varying immune cells, like CD8+ T cells, B cells, T cells (general), TAMs, M1 macrophages, M2 macrophages, monocytes, neutrophils, NK cells, and DCs (Table 2 and Figures 6(a)6(h)). We examined T cells with different functions, involving Th1 cells, Th2 cells, Tfh cells, Th17 cells, Treg cells, and exhausted T cells. The results showed that in STAD, the expression level of DOK5 is related to most immune marker sets of different T cells and various immune cells (Table 2).

It is worth noting that in GC, we explored the levels of expressions of most marker groups of monocytes; TAMs and M2 macrophage held a significant correlation with DOK5 expression (Table 2). We showed that CD86 and CSF1R of monocytes; IRF5 and PTGS2 of M1 macrophages; CD19 and CD79A of B cell; CD163, VSIG4, and MS4A4A of M2 macrophages; CD8A and CD8B of CD8+ T cell; CCL2, CD68, and IL10 of TAM; PDCD1 and HAVCR2 of T cell exhaustion; and CD3D, CD3E, and CD2 of T cell (general) are crucially linked to DOK5 expression in GC (; Figures 6(a)6(h)). We further examined the link shared by DOK5 expression with monocytes markers and TAM markers in the GEPIA database. Correlation outcomes of DOK5 with monocyte markers and TAM markers matched the TIMER (Table 3). The above outcomes suggest that in GC, DOK5 may be related to the regulation of macrophage polarization.

DOK5 expression was positively correlated with dendritic cell infiltration in GC; for example, HLA-DPB1, HLA-DRA, HLA-DQB1, NRP1, CD1C, and ITGAX are also related to the expression of DOK5. The above outcomes also disclosed the proximate link shared by DOK5 and dendritic cell infiltration. In terms of Treg cells, DOK5 is positively correlated with FOXP3, CCR8, and TGFB1 in GC. In tumors, dendritic cells can promote metastasis through the reduction of cytotoxicity of CD8+ T cells and increasing Treg cells [11]. Moreover, we explored a strong link shared by DOK5 and T cell exhaustion and Treg, for instance, CCR8, FOXP3, STAT5B, TGFB1, PDCD1, LAG3, CTLA4, and HAVCR2 (Table 2). In Treg cells, FOXP3 can inhibit cytotoxic T cells from attacking tumor cells [12]. It is worth noting that HAVCR2, as a key gene regulating T cell exhaustion, is strongly correlated with the high expression of DOK5, highlighting that DOK5 holds an essential part in T cell exhaustion. In summary, the above results further indicate that DOK5 is specifically linked to immune infiltrating cells in GC, pointing towards that DOK5 has a crucial immune role in the microenvironment of GC.

3.8. Gene Set Enrichment Analysis of DOK5

According to TCGA information, the ability to search for DOK5 and its related symbol transmission is realized using GSEA. As per NES, nominal value, and FDR value, fundamentally advanced flagging pathways had been elected. In this study, 19 signaling measures were differentially enhanced in the profoundly communicated phenotypes of DOK5. We discovered that most of these pathways are immune-related and involve cell adhesion molecules (CAMS), gap junction, complement and coagulation cascades, ECM receptor interaction, cytokine-cytokine receptor interaction, hedgehog signaling pathway, and leukocyte transendothelial migration (Figure 7(a)).

3.9. Functional Enrichment Analysis of DOK5 Gene

To understand the biological properties of DOK5 completely, we carried out GO and KEGG analyses. On the basis of the outcomes of DAVID’s research, we explored biologically enriched genes that are positively linked to the DOK5 expression levels. In GO analysis, the two biological processes contained by genes that are positively associated with DOK5 expression are as follows: immune response and the inflammatory response. Nine cellular components have been included in these coexpressed genes: cytoplasm, cytosol, nucleoplasm, extracellular exosome, membrane, extracellular space, protein complex, cell-cell adherents’ junction, and melanosome. Moreover, these coexpressed genes have three main molecular functions: sequence-specific DNA binding, identical protein binding and protein kinase activity, and transcription factor activity. Genes positively correlated with DOK5 expression in KEGG pathway analysis were as follows: cytokine-cytokine receptor interaction, transcriptional misregulation in cancer, TNF signaling pathway, malaria, and Chagas disease (American trypanosomiasis) (Figures 7(b)7(e)).

3.10. qRT-PCR Experiments Show That DOK5 Expression Is Upregulated in Gastric Cancer

In order to further confirm that the expression level of DOK5 in GC tissues is higher than that in adjacent tissues, we used qRT-PCR technology to reveal the expression of DOK5 at the transcription level. The results showed that compared with adjacent nontumor tissues, the expression level of DOK5 mRNA in GC tissues was significantly increased (Figure 7(f)).

4. Discussion

GC is a tumor of the digestive system with high morbidity and mortality worldwide [13]. At present, surgery can be said to be the most efficient treatment for early GC, and 90% of patients will have a good prognosis [14]. However, resulting from the lack of early diagnosis, many patients are diagnosed at advanced stages, for example, about 65% of patients with stage 3 and stage 4 tumors, and nearly 85% of patients have lymph node metastasis [15]. In addition, treatment resistance often appears during the treatment of advanced GC. Immunomodulation has been applied to various types of cancer in preclinical models and has achieved good results [16]. It is of great significance for exploring an efficient method for the early diagnosis, as well as treatment of GC.

In this study, the Oncomine database and the TCGA database analyses showed that DOK5 expression was higher in GC than in normal tissues; these complied with the results of related research reports [17]. It has also been studied in breast cancer, liver cancer, and colorectal cancer, and DOK5 gene expression in cancer tissues is higher than that in normal tissues [18]. The receptor, tyrosine kinase c-Ret, has been found to be an oncogenic mutation in patients with multiple endocrine tumors and cancer syndromes with familial medullary thyroid carcinoma, and DOK5 can be directly associated with Y1062 of c-Ret, thereby enhancing the effect of c-Ret [3]. Our research shows that DOK5 is likely to be an oncogene that holds a crucial part in the development and occurrence of cancer. Moreover, in patients with stomach cancer, the DOK5 expression level is also significantly different in different pathological stages, tumor differentiation, and T stages. Therefore, we further studied the link shared by the expression of DOK5 and clinicopathological boundaries and found that the DOK5 expression level had been associated with lymph node metastasis and T stage. Moreover, for patients with lymph node metastasis, patients with high DOK5 expression have a poor prognosis, while patients with low DOK5 expression have a better prognosis. The pathological, T, N, and M stages are linked to the prognosis of GC patients. Our research shows that DOK5 is related to the survival and prognosis of GC patients; therefore, this suggests that DOK5 may be a specific marker of gastric cancer. Nineteen signal pathways were enriched by GSEA to analyze the signal pathway of DOK5 in GC. Melanoma, basal cell carcinoma, and hematopoietic cell lineage all proved that DOK5 influences the progression as well as the occurrence of tumors. In addition, we use the research method of Dr. Sun and his colleagues as a reference to conduct correlation analysis on DOK5 [1921].

Recently, the role of the immune system in the development, as well as occurrence of cancer, was given more attention [22, 23]. Exploring the tumor microenvironment is a new hot research field for tumor diagnosis and treatment [24]. Focal adhesion was found to affect cell migration [25, 26]. Studies have shown that focal adhesion is linked to several biological pathways like cell differentiation, cell proliferation, and cell survival [27]. Simultaneously, focal adhesions are also related to the invasion of cancer cells [28]. Studies have shown that in the tumor microenvironment, ECM receptor interaction plays a significant part in tumor metastasis and recurrence [29]. Tumor angiogenesis and tumor local invasion along with distant metastasis are closely associated to the CAM pathway [30]. Studies have shown that the expression of the CAM pathway can help identify the prognosis of tumors and is expected to turn into a new target for GC treatment. Moreover, the cytokine-cytokine receptor interaction signaling pathway plays a crucial role in cancer pathogenesis [31]. There are also research results showing that the restoration of gap junction will affect the growth of tumor cells and the differentiation of tumor tissues [32]. Therefore, the role of gap junctions can affect the efficacy of antitumor drugs, thereby providing new targets for tumor treatment [33]. There are related research reports on transcription regulation errors playing very important roles in the development of tumors [34]. Tumor necrosis factor (TNF) has a main part in the development of inflammation, cell proliferation, and cell death [35]. For a long time in the past, the TNF family signaling pathway has been a double-edged sword in the process of tumor occurrence and clearance [36]. In the pathogenesis of oral squamous cell carcinoma, TNF-α can regulate EMT through the MAPK signaling pathway to promote cancer cell invasion and metastasis, and DOK5 also involved in MAPK (mitogen-activated protein kinase) signal pathway activation [37]. Therefore, we have reason to speculate that DOK5 may also participate in tumor progression through the MAPK pathway. Therefore, the above results explain the high expression of DOK5 and the high level of immune cell infiltration can reduce the survival rate of patients with GC. These results may bring new methods for the immunotherapy of patients with GC.

Without doubt, after the above results and demonstrations, we have reason to believe that DOK5 is linked to the GC development as well as occurrence. The high expression of DOK5 further impairs the prognosis of patients with GC of by participating in immune-related mechanisms. However, our research also has certain limitations. Our data comes from different databases, and the information in each database may have some differences. Fortunately, through mutual verification of the different databases, we finally got our results.

5. Conclusions

Using multiple database verifications, we found that DOK5 is an oncogene of GC. Hence, DOK5 can reduce the prognostic effect of GC in patients through immune response. DOK5 is expected to become a new target for GC treatment and provide a new direction for GC treatment.

Data Availability

All data were acquired from public databases, including TCGA, Oncomine, GEPIA, Kaplan-Meier plotter, cBioPortal, TIMER, and UALCAN database.

Ethical Approval

The Ethics Committee of the Affiliated Changzhou No. 2 People’s Hospital of Nanjing Medical University approved all collections (No. [2020] KY125-01).

All patients provided written informed consent.

Conflicts of Interest

The authors declare that there is no conflict of interest regarding the publication of this article.

Authors’ Contributions

Fengyong Luo drafted the manuscript. Zhihuai Wang and Shuai Chen provided the design idea of this study. Zhenbo Luo, Gaochao Wang, Haojun Yang, and Liming Tang processed the data and supplemented the ideas. All authors read and approved the final manuscript. Fengyong Luo and Zhihuai Wang contributed equally to this work.


The authors are very grateful to Prof. Liming Tang for supervising the work. This work was supported by the Natural Science Foundation of Jiangsu Province, China (Grant Number BK20181155) and the Changzhou High-Level Medical Talents Training Project (Grant Number RC201602).

Supplementary Materials

Supplementary Table 1: DOK5 expression in cancers versus normal tissue in Oncomine database. (Supplementary Materials)