Abstract

Purpose. The competing endogenous RNA (ceRNA) network regulatory has been investigated in the occurrence and development of many diseases. This research aimed at identifying the key RNAs of ceRNA network in pterygium and exploring the underlying molecular mechanism. Methods. Differentially expressed long noncoding RNAs (lncRNAs), microRNAs (miRNAs), and mRNAs were obtained from the Gene Expression Omnibus (GEO) database and analyzed with the R programming language. LncRNA and miRNA expressions were extracted and pooled by the GEO database and compared with those in published literature. The lncRNA-miRNA-mRNA network was constructed of selected lncRNAs, miRNAs, and mRNAs. Metascape was used to perform Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses on mRNAs of the ceRNA network and to perform Protein-Protein Interaction (PPI) Network analysis on the String website to find candidate hub genes. The Comparative Toxicogenomic Database (CTD) was used to find hub genes closely related to pterygium. The differential expressions of hub genes were verified using the reverse transcription-real-time fluorescent quantitative PCR (RT-qPCR). Result. There were 8 lncRNAs, 12 miRNAs, and 94 mRNAs filtered to construct the primary ceRNA network. A key lncRNA LIN00472 ranking the top 1 node degree was selected to reconstruct the LIN00472 network. The GO and KEGG pathway enrichment showed the mRNAs in ceRNA networks mainly involved in homophilic cell adhesion via plasma membrane adhesion molecules, developmental growth, regulation of neuron projection development, cell maturation, synapse assembly, central nervous system neuron differentiation, and PID FOXM1 PATHWAY. According to the Protein-Protein Interaction Network (PPI) analysis on mRNAs in LINC00472 network, 10 candidate hub genes were identified according to node degree ranking. Using the CTD database, we identified 8 hub genes closely related to pterygium; RT-qPCR verified 6 of them were highly expressed in pterygium. Conclusion. Our research found LINC00472 might regulate 8 hub miRNAs (miR-29b-3p, miR-183-5p, miR-138-5p, miR-211-5p, miR-221-3p, miR-218-5p, miR-642a-5p, miR-5000-3p) and 6 hub genes (CDH2, MYC, CCNB1, RELN, ERBB4, RB1) in the ceRNA network through mainly PID FOXM1 PATHWAY and play an important role in the development of pterygium.

1. Introduction

Pterygium is a common ocular surface disease characterized by a triangular-shaped growth consisting of fibrotic subconjunctival connective tissue and hypertrophy of the overlying conjunctival epithelium [1]. The population affected by pterygium reached 200 million globally, and the prevalence in China was 108.65 million [2, 3]. Pterygium has long been considered as a chronic degenerative condition; however, after abnormal expression of the p53 protein was found in the epithelium [4], pterygium is now considered ultraviolet-related uncontrolled cell proliferation, similar to tumors [5]. The development of pterygium is a complicated process involving cell proliferation, migration, inflammatory infiltrates, fibrosis, angiogenesis, and extracellular matrix breakdown [2]. However, the mechanism of pterygium formation and development is still not completely understood. Studies have demonstrated that noncoding RNAs served crucial roles in numerous diseases [68]. They are classified into two classes: small noncoding RNAs (microRNAs (miRNAs), small interfering RNAs, and transfer RNAs) and long noncoding RNAs (lncRNAs). Salmena et al. proposed the competing endogenous RNA (ceRNA) hypothesis, wherein lncRNAs harboring miRNA response elements competed with one another to bind to a common miRNA and thereby acted as molecular “sponges” and depressed the target genes of the miRNAs [9]. The ceRNA network has been demonstrated in numerous diseases, particularly in cancer. By establishing the lncRNA-miRNA-mRNA network, Wang et al. identified functional genes in heart failure and provided further insights into the important roles of the ceRNA network in heart failure [10]. Yang et al. discovered that lncRNA RP11-169F17.1 and RP11-669N7.2 could become novel biomarkers of stomach adenocarcinoma assessed by the construction of the ceRNA network [11]. Zhu et al. reported that 4 lncRNAs might act as potential therapeutic targets or candidate prognostic biomarkers in clear cell kidney carcinoma by reconstruction and comprehensive analysis of the ceRNA regulatory network [12].

However, there is only 1 research on constructing ceRNA networks for pterygium [13]. The hub genes of the network and their roles in the development of pterygium have not been fully understood. Therefore, we collected the lncRNA, miRNA, and mRNA datasets and published literature; filtered out differentially expressed long noncoding RNAs (DELs), microRNAs (DEMis), and mRNAs (DEMs) to constructed ceRNA networks; and identified hub genes closely related to pterygium, based on the functional and pathway analysis on mRNAs in the ceRNA network.

2. Materials and Methods

2.1. Data Collection

The Gene Expression Omnibus (GEO) is an international, public functional genomics data repository for high-throughput microarray and next-generation sequences [14]. The published pterygium gene expression profiles (GSE83627, GSE21346, GSE51995, and GSE2513) were downloaded [1519]. The GSE83627 lncRNA data set was collected using GPL14550 platforms (Agilent-028004 SurePrint G3 Human GE 8 × 60 K Microarray, Agilent Technologies, Inc.) and included samples from 4 pterygium and 4 healthy conjunctiva tissues. The miRNA expression data of GSE21346 were based on the GPL7723 platforms (miRCURYLNA microRNA Array, v.11.0-hsa, mmu & rno, Exiqon A/S) and consisted of samples from 3 pterygium and 3 healthy conjunctiva tissues. The mRNA expression data of GSE51995 were from 4 pterygium samples and 4 healthy conjunctiva tissues and were based on GPL14550 platforms (Agilent-028004 SurePrint G3 Human GE 8 × 60 K Microarray, Agilent Technologies, Inc.). Another mRNA expression data of GSE2513 were from 8 pterygium samples and 4 healthy conjunctiva tissues and were based on GPL96 platforms (Affymetrix Human Genome U133A Array, Affymetrix). In order to make the datasets more abundant, we executed a systemic search in PubMed, CNKI, and Web of Science and found 2 microarray studies on DELs and 7 studies on DEMis, including 4 microarray assays studies and 3 experimental verification studies.

2.2. Data Preprocessing

All downloaded data sets were standardized through the limma package of R 3.6.3, and the standardized expression matrix was used for the differential expression analysis. The DELs, DEMis, and DEMs between pterygium samples and healthy control conjunctiva tissues were selected according to and values <0.05 [20]. Sequentially, DELs and DEMis were pooled based on the microarray data in the searched studies, respectively, with the Draw Venn Diagram website (http://bioinformatics.psb.ugent.be/webtools/Venn/). And the pooled results were used as candidate DELs and DEMis for the following analysis. The volcano plots and the heatmaps of DELs, DEMis, and DEMs expressions were set up using the ggpolt 2 package and pheatmap package in R 3.6.3 separately.

2.3. Prediction of Target lncRNAs and mRNAs of miRNAs

The lncRNA targets of the candidate miRNAs were predicted using DIANA-LncBase v3 (https://diana.e-ce.uth.gr/lncbasev3/interactions), which identifies miRNA and lncRNA interactions derived from manually curated publications and the analysis of 153 AGO CLIP-Seq libraries [21], and starbase 2.0 (http://starbase.sysu.edu.cn/) [22]. The predicted DELs could be selected for ceRNA network construction. Data for miRNA and mRNA interactions were downloaded from miRDB (http://www.mirdb.org/), TargetScanHuman7.1 (http://www.targetscan.org/), and miWalk2.0 (http://zmf.umm.uni-heidelberg.de/apps/zmf/mirwalk2/miRretsys-self.html), and DEMs which appeared in all three databases were used for the following analysis.

2.4. Construction of the lncRNA-miRNA-mRNA Network

Based on the associations between lncRNAs-miRNAs and miRNAs-mRNAs, the ceRNA network was constructed. First, the upregulated and downregulated RNAs (lncRNAs, miRNAs, and mRNAs) were assigned with values <0.05; then, the lncRNA-miRNA-mRNA network was visualized by using the Cytoscape 3.7.2 software [23], and all node degrees of the ceRNA network were calculated using the CytoHubba plugin [24] of Cytoscape 3.7.2. The significant lncRNA was selected according to the node degree ranking, and a new ceRNA network was reconstructed centered on the top lncRNA.

2.5. Functional Analysis on mRNAs in ceRNA Network for Discovering Hub Genes

To analyze the function of mRNAs in ceRNA networks, GO and KEGG analyses were performed by Metascape (http://metascape.org/) [25]. The protein-protein interaction (PPI) network of mRNAs in the ceRNA network was constructed by the STRING database with the confidence as the cutoff level, providing the critical assessment and integration of protein-protein interactions of genes, including direct (physical) as well as indirect (function) associations [26, 27]. Then, we used the CytoHubba plugin of Cytoscape 3.7.2 to analyze the PPI network and to choose the candidate hub genes with top node degrees. Sequentially, we identified hub genes closely related to pterygium using the Comparative Toxicogenomic Database (CTD), which provides interactions between diseases-genes and diseases-drugs.

2.6. Validation of Hub Genes
2.6.1. Patient Sample Preparation

There were 20 primary pterygium specimens obtained in Zhongnan Hospital of Wuhan University during pterygium surgery. The control conjunctival tissues were derived from healthy conjunctival tissues on the temporal side of the surgical eye of the same patient, with a size of . There were 9 males aged 40-84 () years old and 11 females aged 40-79 () years old. All participants are of Han nationality and have no blood relationship. This study was reviewed and approved by the Ethics Committee of Zhongnan Hospital of Wuhan University, and the written informed consent of the patients has been obtained before the study. Patients with recurrent pterygium, history of severe eye trauma, eye infection, cataract, glaucoma, acute dacryocystitis, and severe mental illness were excluded.

2.6.2. Tissue RNA Extraction and RT-qPCR

All specimens were collected during the operation and immediately placed in RNA protective agent (RNA later™, SIGMA, USA), stored at -20°C for usage. RNA was extracted from 80 mg specimen tissue using TRIZOL reagent. The relative expression of hub genes was measured using a reverse kit (HiScript® III RT SuperMix for qPCR (+gDNA wiper)) and ChamQ™ Universal SYBR® qPCR Master Mix (Vazyme company, Nanjing, China) on CFX96 fluorescence qPCR instrument (Bio-Rad, USA). And the PCR reaction was carried as follows: predenaturation at 95°C for 30 seconds, denaturation at 95°C for 3-10 seconds, and annealing extension at 60°C for 10-30 seconds, for 40 cycles. Using GAPDH as internal references, the relative expression of genes was calculated by the 2-△Ct method. The primer sequences are listed in Supplementary Table 1.

3. Results

3.1. Filtered DELs, DEMis, and DEMs

There are four recruited pterygium datasets from the GEO search, including the lncRNA data set (GSE83627), miRNA data set (GSE21346), and mRNA data sets (GSE51995 and GSE2513). In Figure 1, the expression of DELs, DEMis, and DEMs is shown by volcano plots and heat maps (GSE83627 (Figure 1(e)), GSE21346 (Figure 1(f)), GSE51995 (Figure 1(g)), GSE2513 (Figure 1(h)). After the limma package differential analysis, 214 upregulated lncRNAs were selected (Figure 1(a)), 4 upregulated miRNAs (miR-138-5p, miR-184, miR-642a-5p, miR-1298-5p) were found (Figure 1(b)), and there are 360 downregulated and 537 upregulated mRNAs in GSE51995 (Figure 1(c)) and GSE2513 (Figure 1(d)) datasets. Using online search keywords (lncRNA and pterygium) in PubMed, CNKI, and Web of Science, we found two publications about lncRNA microarray data of pterygium [28, 29]. There were 10 upregulated lncRNAs and 10 downregulated lncRNAs in Liu’s study and 10 upregulated lncRNAs and 7 downregulated lncRNAs in Zheng’s research (Supplementary Table 2). Then, we used the Venn diagram to make an intersection analysis with lncRNAs in the two publications and previously selected DELs from datasets (Figure 2(a)). So, we merged the lncRNAs from two studies with DELs; then, finally, 251 lncRNAs were obtained to be DELs in the following study, of which 17 are downregulated lncRNAs and 234 upregulated lncRNAs. Moreover, there were seven references including 4 microarray articles and 3 experimental validation (Supplementary Table 2). In the 4 microarray articles, Engelsvold et al. [30] found 14 upregulated and 12 downregulated miRNAs; Cui et al. [31] found 10 differently expressed miRNAs, including 1 upregulated and 9 downregulated; Lee et al. [32] found 4 upregulated miRNAs (miR-143-3p, miR-181a-2-3p, miR-377-5p, miR-411-5p); and Lan et al. found 2 upregulated miRNAs in pterygium (miR-138-5p, miRPlus-E1233) [33]. Three experimental validation studies confirmed that miR-218-5p was downregulated, and miR-182-5p, miR-183-5p, miR-184, and miR-221-3p were upregulated [3436]. The Venn diagram was used to remove the duplicate miRNAs and combine the reported miRNA with 4 DEMis filtered in GSE21346 (Figure 2(b)). Then, totally, 46 differentially expressed miRNAs were obtained, among which 21 miRNAs were downregulated and 25 miRNAs were upregulated. Eventually, a total of 360 downregulated mRNAs and 537 upregulated mRNAs of two datasets in GEO were filtered. In a word, we got 251 lncRNAs, 46 miRNAs, and 897 mRNAs as candidate RNAs for subsequent study.

3.2. miRNA Predicted lncRNA and mRNA Targets

Firstly, the associations between the DELs and DEMis were assessed to reveal the 12 miRNAs targeting 8 lncRNAs. Subsequently, 94 mRNAs targeted by 12 miRNAs were identified. Finally, the 8 coexpressed lncRNAs, 12 coexpressed miRNAs, and 94 coexpressed mRNAs were selected to construct the primary ceRNA network (Table 1).

3.3. lncRNA-miRNA-mRNA Networks

The lncRNA-miRNA-mRNA ceRNA network was constructed by Cytoscape 3.7.2 (Figure 3(a)), which consisted of above 8 lncRNA nodes, 12 miRNA nodes, 94 mRNA nodes, and 136 edges. According to the node degree ranking, we selected the highest node degree RNA (LINC00472) as the center to construct the LINC00472 network (Figure 3(b)). The new ceRNA network centered by LINC00472 was constructed of 1 lncRNA nodes, 8 miRNA nodes, 84 mRNA nodes, and 106 edges (Figure 4(a)). This suggested that LINC00472 might be a key lncRNA in pterygium functioning through reacting with the 8 miRNAs (miR-29b-3p, miR-183-5p, miR-138-5p, miR-211-5p, miR-221-3p, miR-218-5p, miR-642a-5p, miR-5000-3p). Among them, miRNA-221-3p [34], miR-183-5p [35], and miR-218-5p [36] have been reported to have important function in pterygium. The other 5 miRNAs have been reported to play an important role in cancers. For example, Drummond et al. found miR-29b-3p was involved in a novel mechanism wherein cigarette smoke promoted accelerated cardiac and renal tissue injury in chronic kidney diseases [37]. And miR-138-5p can inhibit the malignant progression of prostate cancer [38] and lung cancer growth, through the miR-138-5p/FOXC1 pathway [39]. miR-211-5p has been confirmed to be related to a variety of cancers, such as cervical cancer [40], the nonsmall cell lung cancer cell [41], breast cancer [42], oral squamous cell carcinoma [43], and papillary thyroid cancer [44]. miR-642a-5p and miR-5000-3p were found have important effects on colon cancer [45, 46]. As we know, pterygium is considered as uncontrolled cell proliferation and is similar to tumor [5], which supports that these miRNAs could also play an important role in the development of pterygium.

3.4. Functional Analysis of mRNAs Related to Pterygium

The 94 mRNAs in the primary ceRNA network was investigated by the GO and KEGG pathway enrichment analysis with Metascape. Metascape results were dominated by functional categories, including regulation of homophilic cell adhesion via plasma membrane adhesion molecules, developmental growth, regulation of neuron projection development, adherens junction interactions, cell maturation, synapse assembly, central nervous system neuron differentiation, positive regulation of mRNA metabolic process, negative regulation of cell-substrate adhesion, tissue morphogenesis, and blood vessel morphogenesis. Two pathways were enriched, PID FOXM1 PATHWAY and PID ATF2 PATHWAY. The occurrence of pterygium is closely related to the epithelial-mesenchymal transitions (EMT) and angiogenesis under the tissue [47]. FOXM1 is a member of the forkhead box (FOX) transcription factor family. Previous reports have demonstrated that FOXM1 is oncogenic, in breast cancer [48], hepatocellular carcinoma [49], prostate cancer [50], lung cancer [51], and colorectal cancer [52], and plays important roles in angiogenesis, invasion, and metastasis [53]. It may indicate the mRNAs in the ceRNA network are the biogenesis of the transformation of conjunctival cells into fibroblasts and the formation of subconjunctival neovascularization according to these biological processes and pathways, especially by regulating cell maturation, tissue morphogenesis, blood vessel morphogenesis, and PID FOXM1 PATHWAY (Figure 3(c)).

In order to explore the role of LINC00472 in pterygium, we performed GO and KEGG enrichment analysis again on the 84 mRNAs in the LINC00472 ceRNA network. We found that these genes were enriched in the same biology as the primary ceRNA network, but also in the cell part morphogenesis, regulation of synaptic transmission, glutamatergic, cellular response to lipid, negative regulation of cell differentiation, nervous system development, temperature homeostasis, NABA CORE MATRISOME, regulation of behavior, and response to growth factor. In addition, the mRNAs in both networks are enriched in PID FOXM1 PATHWAY, indicating that LINC00472 is likely to regulate the growth of pterygium through PID FOXM1 PATHWAY (Figure 4(b)).

3.5. PPI Network Construction and Analysis of Hub Genes

The PPI network of mRNAs in the LINC00472 network was obtained by using the STRING database, including 36 nodes and 48 edges, and the nodes contain 24 downregulated genes and 12 upregulated genes (Figure 5(a)). The top 10 genes evaluated by connectivity degree in the PPI network were present in CytoHubba, and the result showed that CDH2 was the most significant genes with a connectivity degree of 10, followed by MYC, CCNB1, PCDHA10, RELN, CCNA2, ERBB4, CDH11, PCDHA, and RB1 (Figure 5(b)). We defined the 10 genes as candidate hub genes in the PPI network. It means LINC00472 may regulate pterygium through these 10 key genes.

3.6. Identified Hub Genes Closely Related to Pterygium

We searched for genes related to pterygium in the CTD and found 6163 mRNAs, these genes are associated with pterygium or its descendants. A gene has either a curated association to the disease (M marker/mechanism and/or T therapeutic) or an inferred association via a curated chemical interaction. Then, we found 8 overlapped genes in the database, CCNB1, CDH11, MYC, CCNA2, ERBB4, RELN, RB1, and CDH2 (Figure 6), as selected hub genes. It implied that the 8 genes in the LINC00472 network may play a key role in pterygium.

3.7. Hub Genes Were Verified

Through RT-qPCR, we used 20 groups of pterygium and control conjunctival tissues to verify the above 8 hub genes. There are 6 genes that are highly expressed in pterygium, MYC, CDH2, CCNB1, RELN, RB1, and ERBB4. Compared with the control conjunctiva, only the expression of CDH11 and CCNA2 showed no significant differences with pterygium (Figure 7). The results indicate that the prediction of the hub genes is reasonably reliable and could be used to discover more hub genes in pterygium.

4. Discussion

Pterygium is a common disease in ophthalmology, and its occurrence is related to the proliferation of subconjunctival fibrous tissue and the formation of new blood vessels [15]. The treatment of pterygium is mainly surgical treatment, but the high postoperative recurrence rate is still a problem that is currently difficult to solve [54]. Therefore, effective drug therapy combined with the operation to reduce the recurrence rate of pterygium has become a research hot spot. Recently, with the development of high-throughput genomic platforms and bioinformatic analysis, more and more noncoding RNAs, especially miRNAs and lncRNAs, have been discovered. They serve an important role in the regulation of multiple biological processes, including the development of pterygium [55, 56].

In this study, we mined pterygium-associated RNA expression datasets from the GEO to construct the ceRNA network. Through the analysis of the 4 data sets and 9 publications about lncRNAs and miRNAs, we obtained 251 DELs, 46 DEMis, and 897 DEMs. By the link of miRNA, the primary ceRNA network of pterygium was constructed with 8 lncRNA nodes, 12 miRNA nodes, 94 mRNA nodes, and 136 edges. Since LINC00472 has the highest node degree value of all lncRNAs in the ceRNA network (Figure 3(b)), we took LINC00472 as the center and built the LINC00472 network. And we found genes in the primary ceRNA network mainly enriched in homophilic cell adhesion via plasma membrane adhesion molecules, developmental growth, regulation of neuron projection development, adherens junctions interactions, cell maturation, synapse assembly, central nervous system neuron differentiation, positive regulation of mRNA metabolic process, negative regulation of cell-substrate adhesion, tissue morphogenesis, blood vessel morphogenesis, and PID FOXM1 PATHWAY, while genes in the LINC00472 network enriched in homophilic cell adhesion via plasma membrane adhesion molecules, developmental growth, cell part morphogenesis, regulation of synaptic transmission, glutamatergic, cellular response to lipid, negative regulation of cell differentiation, nervous system development, and temperature homeostasis. In addition, the genes in both networks enriched in PID FOXM1 PAYHWAY, which represents cell proliferation [57], angiogenesis, invasion, and metastasis [53]. This indicated that LINC00472 might play an important role in pterygium through FOXM1 PATHWAY. After the PPI network analysis on mRNAs in the LINC00472 network was visualized by Cytoscape 3.7.2, 10 candidate hub genes were found out. We searched the 10 hub genes in the CTD; 8 hub genes were identified to be involved in pterygium. They are CCNB1, CDH11, MYC, CCNA2, ERBB4, RELN, RB1, and CDH2, and 6 of them were confirmed highly expressed in pterygium by wet experiments (CCNB1, MYC, ERBB4, RELN, RB1, and CDH2). We presume that the LINC00472 network might function in pterygium via the FOXM1 PATHWAY, especially through the regulation of CCNB1, MYC, ERBB4, RELN, RB1, and CDH2 in the ceRNA network. And the data mining based on the public data and publications in the present study is useful and reliable for digging emerging therapy targets of pterygium.

5. Conclusion

The present study increased the understanding of the ceRNA-associated regulatory mechanism in the pterygium and identified LINC00472 as a key lncRNA in the whole ceRNA network, which might be highly related to pterygium via the FOXM1 PATHWAY in the LINC00472 network, especially by regulating its related 6 hub genes (CCNB1, MYC, ERBB4, RELN, RB1, and CDH2).

Data Availability

The microarray data (GSE83627, GSE21346, GSE51995, GSE2513) are obtained from the GEO database.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Authors’ Contributions

M. Yan designed the experiments. F. Zheng, S. He, C. Lu, C. Qiao, and Y. Xu wrote the manuscript. M. Yan and F. Zheng conducted the expression analysis. Y. Xu, C. Qiao, C. Lu, and S. He carried out the study and collected important background information. S. Dong and X. Wu helped to do some supplements. All authors read and approved the final manuscript. Yuting Xu and Chen Qiao contributed equally to this work.

Acknowledgments

This work was supported by the Grant of Nation Natural Science Foundation of China (grant no. 81770898) and the research project of Wuhan Health and Family Planning Commission (grant no. WX18Q46).

Supplementary Materials

Supplementary Table 1: Primers used for RT-qPCR of hub genes. Supplementary Table 2: Differentially expressed lncRNA and miRNAs in pterygium from 9 publications [2836]. (Supplementary Materials)