Abstract

mir-548 is a large, poorly conserved, primate-specific miRNA gene family. The 69 human mir-548 genes are located on almost all human chromosomes, and this widespread distribution suggests an evolutionary origin from transposable elements. A high level of nucleotide divergence was detected among these human miRNA genes, arising mainly from divergence among multicopy pre-miRNAs and among homologous miRNA genes. The products of mir-548, miR-548-5p and miR-548-3p, showed inconsistent evolutionary patterns, which partly contributed to the larger genetic distances between pre-miRNAs. “Seed shifting” events were detected among miR-548 sequences because of their varied 5′ ends. These events shifted the seed sequences and the target mRNAs, and even generated new target mRNAs. Additionally, miRNA:miRNA interaction was found within the gene family. The potential interaction between miRNAs may contribute to dynamic miRNA expression profiles through complementary binding that forms a miRNA:miRNA duplex with 5′-/3′-overhangs. The gene family has important roles in multiple biological processes, including signaling pathways and some cancers. Its potentially abundant roles and functional implications may in turn have shaped this large and poorly conserved gene family, whose genetic variation is based on transposable elements. The evolutionary pattern of this primate-specific gene family might contribute to dynamic expression profiles and regulatory networks.

1. Introduction

MicroRNAs (miRNAs), a distinct class of ~22 nt single-stranded noncoding endogenous RNAs, play pivotal roles in negatively regulating gene expression by targeting mRNAs, and they influence multiple biological processes in plants and animals, including cell growth, differentiation, and apoptosis [1–3]. The primary transcript (pri-miRNA), which may encode multiple miRNAs, is generated by RNA polymerase II in the nucleus. For example, miRNA members of a gene cluster are cotranscribed as a single polycistronic transcript to coregulate several categories of genes simultaneously [4–6]. Subsequently, the pri-miRNA is converted by Drosha into a pre-miRNA with a hairpin structure [7–9]. The pre-miRNA is then translocated to the cytoplasm via exportin-5 [7], where the miRNA:miRNA* duplex is released from the hairpin structure by Dicer [10, 11]. The mature miRNA is loaded into RISC (RNA-induced silencing complex) to mediate mRNA targeting [12, 13]. Although miRNA* is typically degraded as an inactive sequence, accumulating reports indicate that miRNA* can also contribute to gene network regulation as a potentially active miRNA [14–17]. Posttranscriptional silencing of target genes by miRNAs occurs either by specific cleavage of homologous mRNAs or by specific inhibition of protein synthesis [18]. Some miRNAs share a high level of sequence similarity and form a miRNA gene family, and may even coregulate complex biological processes.

These small noncoding RNAs are evolutionarily well conserved across large phylogenetic distances [19], and they have been the subject of evolutionary, genetic, and phylogenetic analyses [20–22]. Compared with other components of complex gene regulatory networks, the small RNA sequences show a different evolutionary pattern [20, 23]. Despite their well-conserved sequences, miRNAs show evolutionarily stable “seed shifting” events across different animal species, especially toward the 3′ ends [22]. The diverse biological roles of these small noncoding regulatory RNAs have been investigated recently, but the origin and evolution of miRNAs remain largely obscure. Several studies suggest that miRNAs may originate from repetitive elements, especially transposable elements (TEs) [24–32]. For example, a large human gene family, hsa-mir-548, is derived from Made1 transposable elements and has important roles in multiple biological processes [28]. However, as more members of the gene family have been identified, a systematic analysis of its evolutionary and functional relationships is needed, especially with regard to the potential biological roles of its members in complex regulatory networks. Here, based on the known members of the mir-548 gene family, we sought to characterize the evolutionary pattern and functional implications of this large gene family.

2. Results

According to the annotation in the miRBase database (Release 18.0), mir-548 is a large primate-specific gene family. It comprised 100 miRNA members distributed across primates: Homo sapiens (hsa, 69), Pongo pygmaeus (ppy, 2), Pan troglodytes (ptr, 20), and Macaca mulatta (mml, 9). Except for hsa-mir-548, the miRNA genes in the other primates were mainly predicted by software based on sequence similarity. Therefore, we mainly analyzed and discussed human mir-548.

As a large human gene family, hsa-mir-548 was located on almost all human chromosomes, particularly chromosomes 6, 8, and X (Figure 1). The family had 69 members, and a high level of nucleotide divergence was detected between these homologous miRNA genes. This divergence derived mainly from the large number of miRNA gene members and from multicopy pre-miRNAs. Some miRNAs, such as hsa-miR-548f and hsa-miR-548h, each had five multicopy pre-miRNAs located on different chromosomes. Although these multicopy precursors could yield the same miRNA sequence, other regions, including the nucleotides adjacent to the miRNA sequence, showed inconsistent nucleotide substitution patterns (see Figure S1 in Supplementary Materials available online at http://dx.doi.org/10.1155/2012/679563). Similarly, the mature miRNAs showed a high level of nucleotide divergence, including nucleotide substitutions and insertions/deletions (Figures 2 and 3). Compared with other large miRNA gene families, such as the hsa-let-7 gene family [16], the hsa-mir-548 gene family showed a higher level of nucleotide divergence and was poorly conserved (Figures 2 and 3). “Seed shifting” events were detected among miR-548 sequences because of their varied 5′ ends (Figure 2). These events produced a variety of “seed sequences,” which in turn influenced the prediction of target mRNAs.
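As a minimal illustration of how a shifted 5′ end changes the seed, the following Python sketch compares two aligned mature sequences. It assumes the seed spans nucleotides 2–8 of the mature miRNA (a common convention that is not stated in this paper). The sequences and function names are hypothetical toy examples, not real miR-548 members or the authors' procedure.

```python
def five_prime_offset(aligned: str) -> int:
    """Number of leading gap characters, i.e. how far the 5' end is shifted
    relative to the first column of the alignment."""
    return len(aligned) - len(aligned.lstrip("-"))


def seed(aligned: str, start: int = 2, end: int = 8) -> str:
    """Seed of the mature miRNA (1-based positions start..end, inclusive).
    Assumption: seed = nucleotides 2-8; gaps are removed before slicing."""
    mature = aligned.replace("-", "")
    return mature[start - 1:end]


# Hypothetical toy alignment: member_b starts one nucleotide downstream.
member_a = "CAAAAGUAAUUGCGAGUUUUAC-"
member_b = "-AAAAGUAAUUGCGAGUUUUACC"

for name, aligned in (("member_a", member_a), ("member_b", member_b)):
    print(name, five_prime_offset(aligned), seed(aligned))
```

In this toy case a one-nucleotide shift of the 5′ end changes the seed from AAAAGUA to AAAGUAA, which is why homologous members can be assigned different predicted targets.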

Based on all hsa-mir-548 sequences, the NJ tree showed that some miRNA genes, such as hsa-mir-548e, hsa-mir-548ao, hsa-mir-548i-4, and hsa-mir-548m, had larger genetic distances from the other members (Figure S2). Multicopy pre-miRNAs of a specific miRNA could be placed in different clusters in the phylogenetic tree (Figure S2) and in the phylogenetic network reconstructed with the neighbor-net method (data not shown). Although both arms of hsa-mir-548 could yield mature miRNAs (miR-548-5p and miR-548-3p), hsa-miR-548-5p was more conserved than hsa-miR-548-3p. The nucleotide diversity of miR-548-5p was 0.19 ± 0.01, while that of miR-548-3p was 0.20 ± 0.02. The average numbers of nucleotide differences for miR-548-5p and miR-548-3p were 3.81 and 4.30, respectively. A higher frequency of nucleotide substitutions and insertions/deletions was detected among miR-548-3p sequences (Figure 3). The phylogenetic networks of the two arms showed different evolutionary patterns: the hsa-miR-548-3p network was more complex, with more median vectors (Figure 4). Functional enrichment analysis of the predicted target mRNAs of hsa-miR-548 showed that the gene family played important roles in multiple biological processes, including various human diseases (Table 1). For example, its members were involved in regulation of the actin cytoskeleton, the MAPK signaling pathway, ubiquitin-mediated proteolysis, colorectal cancer, glioma, and non-small cell lung cancer (Table 1).

Interestingly, we found that some miRNA members were natural (endogenous) sense/antisense miRNA genes (Table 2). Hsa-mir-548aa and hsa-mir-548d were located on the sense and antisense strands of the same genomic region, and their products could bind to each other by complementarity to form a miRNA:miRNA duplex with 5′-/3′-overhangs. This duplex resembled the typical miRNA:miRNA* duplex, although the miRNA-miRNA interaction did not involve a loop structure (Figure S3). Strikingly, because of multicopy pre-miRNAs, such miRNA pairs were detected on both chromosome 8 and chromosome 17 (Table 2). The miRNA-miRNA interaction was not restricted to sense/antisense miRNAs. It was also detected between other miRNAs, such as hsa-miR-548h-4-3p and hsa-miR-548c-5p/hsa-miR-548o-2-5p/hsa-miR-548am-5p, although these miRNA pairs were located in different genomic regions or even on different chromosomes (Table 2). The functional interaction networks of these miRNA pairs were reconstructed from the top 20 predicted target mRNAs. Although the paired miRNAs could interact with each other, they might also coregulate biological processes (Figure S4).

3. Discussion

According to the available data, the mir-548 gene family is found mainly in primates. The members of this family are poorly conserved in sequence, and high levels of nucleotide divergence exist among some members, and even among multicopy pre-miRNAs (Figure S1). Among the 69 hsa-mir-548 genes, a given miRNA can have one to five multicopy pre-miRNAs, although the nucleotides flanking the miRNA sequence vary between copies (Figure S1). The phylogenetic tree based on the hsa-mir-548 population shows that some members have larger genetic distances than the others (Figure S2). Although this gene family is located on almost all human chromosomes, a distribution bias can be found, particularly toward chromosomes 6, 8, and X (Figure 1). Given that this large and poorly conserved gene family originated from transposable elements (TEs) [28, 32], such a ubiquitous distribution probably reflects its evolutionary origin. The evolutionary process also leads to diversity among miRNA sequences through nucleotide substitution, insertion, or deletion (Figures 2 and 3). At the same time, this dynamic evolution further strengthens the functions of the poorly conserved gene family, including its versatile biological roles in complex regulatory networks.

Although homologous miRNA sequences are found among different members, “seed shifting” events can be detected (Figure 2). Alteration of the 5′ ends of miRNA sequences leads to a variety of seed sequences and even generates new target mRNAs. The variety of seed sequences implies diverse roles in multiple biological processes, including the regulation of important signaling pathways and human tumorigenesis (Table 1). Indeed, accumulated evidence shows that multiple isomiRs with varied sequences can be generated from a given miRNA locus because of alternative and imprecise Drosha and Dicer cleavage during pre-miRNA processing [33–38]. This also contributes to the complexity of the regulatory network, especially in the case of isomiRs with novel 5′ ends and seed sequences. Taken together, the “seed shifting” events that occur among different members of the miR-548 gene family and among multiple isomiRs of a given miR-548 locus derive, respectively, from the dynamic evolutionary process and from the miRNA processing mechanism, and both strengthen the biological roles of miRNAs in gene regulatory networks. Furthermore, this functional variety may be an adaptation to complex biological processes and may also indicate the evolutionary trend. Interestingly, in addition to the seed shifting events in miR-548, a high level of nucleotide divergence can also be detected, especially among miR-548-3p sequences (Figures 3 and 4). Although both arms of hsa-mir-548 yield mature miRNA sequences (miR-548-5p and miR-548-3p), they show inconsistent mutational profiles and evolutionary patterns (Figures 3 and 4). The nucleotide divergence among the miRNA sequences leads to varied target mRNAs and biological roles (Table 1). These evolutionary trends and patterns are mainly driven by functional selection pressure from the complex cellular environment and are based on repetitive and transposable elements.

Within this large gene family, we also found that some members can potentially interact and form a miRNA:miRNA duplex (Table 2 and Figure S3). The miRNA-miRNA interaction has only recently been identified in the miRNA world, and it restricts the transcription process [11, 39–42]. The paired miRNAs, such as hsa-mir-548aa and hsa-mir-548d, may be sense/antisense miRNAs in the same genomic region, or may have no positional relationship (Table 2). Indeed, the miRNA-miRNA interaction is more prevalent and complex than previously thought. More miRNA members will likely be discovered and identified, including active miRNA* sequences that serve as gene network regulators [14–17]. This growing body of miRNA data should reveal more potential miRNA pairs that can bind to each other by complementarity and form a miRNA:miRNA duplex. In addition, if the analysis is made less stringent and permits some mismatches and G:U pairing, interactions among miRNAs can still be found that have the typical structure of a miRNA:miRNA* duplex. The functional interaction of miRNA pairs can be predicted through their target mRNAs, although the paired miRNAs may also interact directly by forming a miRNA:miRNA duplex and restricting each other's enrichment levels (Figure S4). The miRNA-miRNA interaction in the miR-548 gene family may be the result of inverted repeat transposons during evolutionary history, and these miRNA pairs might have more abundant potential regulatory roles and contribute to dynamic expression profiles and multiple biological processes.

4. Materials and Methods

All miRNA members of the mir-548 gene family (http://www.mirbase.org/cgi-bin/mirna_summary.pl?fam=MIPF0000317), including their annotations and their miRNA/miRNA* and pre-miRNA sequences from different animal species, were obtained from the miRBase database (Release 18.0, http://www.mirbase.org/) [43]. The miRNA and pre-miRNA sequences were aligned with Clustal X 2.0 [44]. Phylogenetic trees of pre-miRNAs were reconstructed with the Neighbor-Joining (NJ) method in MEGA 5.0 [45] using 1,000 bootstrap replicates. Nucleotide diversity and the average number of nucleotide differences of the miR-548-5p and miR-548-3p populations were estimated with DnaSP version 5 [46]. The percentage of nucleotide substitutions and insertions/deletions at each position was estimated for miR-548-5p and miR-548-3p without considering gaps/missing sites in the terminal regions. The most abundant nucleotide at each position was selected as the reference nucleotide, which helps estimate the substitution trend more precisely.
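For readers unfamiliar with the two summary statistics, the following Python sketch shows the textbook definitions: the average number of pairwise nucleotide differences (k) and nucleotide diversity (π = k / L, where L is the number of compared sites). It is not DnaSP's implementation. The complete deletion of gapped columns, the function names, and the toy sequences are all assumptions made for illustration.

```python
from itertools import combinations


def ungapped_columns(seqs):
    """Alignment columns that contain no gap in any sequence
    (assumption: complete deletion of gapped sites)."""
    return [col for col in zip(*seqs) if "-" not in col]


def average_differences(seqs):
    """k: mean number of differing sites over all sequence pairs."""
    if len(seqs) < 2:
        raise ValueError("need at least two aligned sequences")
    n_pairs = len(seqs) * (len(seqs) - 1) // 2
    total = sum(
        1
        for col in ungapped_columns(seqs)
        for x, y in combinations(col, 2)
        if x != y
    )
    return total / n_pairs


def nucleotide_diversity(seqs):
    """pi = k / L, with L the number of ungapped columns."""
    n_sites = len(ungapped_columns(seqs))
    if n_sites == 0:
        raise ValueError("no ungapped sites to compare")
    return average_differences(seqs) / n_sites


# Hypothetical toy alignment (not real miR-548 sequences).
toy = ["AAAAGUAC", "AAAAGUAC", "AAACGUAC", "AAAAGUA-"]
print(average_differences(toy), nucleotide_diversity(toy))
```

Under these definitions the two statistics are linked by the number of compared sites, so values of k near 4 and π near 0.2, as reported for miR-548-5p and miR-548-3p, are consistent with roughly 20 compared positions.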

Further, a phylogenetic network of pre-miRNA members was reconstructed in SplitsTree 4.10 [47] using the Neighbor-Net method [48] with the Jukes-Cantor model. Based on this network, we attempted to reconstruct the evolutionary history and discover potential evolutionary patterns. All gaps/missing data were excluded from the phylogenetic tree analysis. To infer ancestral miRNA members and understand the origin of the miRNAs, we also reconstructed an evolutionary network [49] with Network 4.6.1.0 (http://www.fluxus-engineering.com/) based on the mature miRNA sequences. Because of their varied 5′/3′ ends, miR-548 sequences (including miR-548-5p and miR-548-3p) were processed according to their pre-miRNAs based on the core sequences. Some miRNAs that deviated substantially from the core miRNA sequences were removed from the analysis.
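The Jukes-Cantor model converts the observed proportion of differing sites, p, into an estimated number of substitutions per site, d = −(3/4) ln(1 − 4p/3). The Python sketch below computes this distance for one pair of aligned sequences. It is a generic illustration rather than the SplitsTree implementation, and the pairwise skipping of gapped sites and the function names are assumptions.

```python
import math


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites, ignoring columns with a gap in either sequence."""
    sites = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not sites:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in sites) / len(sites)


def jukes_cantor(a: str, b: str) -> float:
    """Jukes-Cantor distance d = -(3/4) * ln(1 - 4p/3)."""
    p = p_distance(a, b)
    if p >= 0.75:
        raise ValueError("sequences too divergent for the Jukes-Cantor correction")
    return -0.75 * math.log(1 - 4 * p / 3)


# Hypothetical toy pair with one difference in ten sites (p = 0.1).
print(jukes_cantor("AAAAAAAAAA", "AAAAAAAAAC"))
```

The corrected distance is always at least as large as p, because it accounts for multiple substitutions at the same site. A matrix of such distances is the kind of input a Neighbor-Net or NJ reconstruction works from.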

Given the large number of miRNA members in the gene family, we also searched for interactions between miRNAs using the miRNA and pre-miRNA sequences. If a miRNA sequence could be accurately mapped to another pre-miRNA, a potential miRNA:miRNA interaction could be detected. In general, the paired miRNAs might be located on the sense/antisense strands of the same genomic region or in different genomic regions. As in a miRNA:miRNA* duplex, they could form a miRNA:miRNA duplex with 5′-/3′-overhangs.
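One simple way to screen for such pairs is to slide the reverse complement of one miRNA along the other and look for a long, perfectly paired stretch, recording the unpaired ends as overhangs. The Python sketch below does this under stated assumptions: only Watson-Crick pairs are allowed (no G:U pairs or mismatches), the pairing is ungapped, and the 16-nt threshold, function names, and toy sequences are hypothetical choices for illustration, not the authors' criteria.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Reverse complement of an RNA sequence written 5'->3'."""
    return rna.translate(COMPLEMENT)[::-1]


def find_duplex(mir_a: str, mir_b: str, min_paired: int = 16):
    """Return the longest perfect, ungapped pairing between mir_a and mir_b
    with its overhang lengths, or None if fewer than min_paired bases pair."""
    rc_b = reverse_complement(mir_b)
    best = None
    for offset in range(-len(rc_b) + 1, len(mir_a)):
        start = max(0, offset)
        end = min(len(mir_a), offset + len(rc_b))
        paired = end - start
        if paired < min_paired:
            continue
        if mir_a[start:end] != rc_b[start - offset:end - offset]:
            continue
        if best is None or paired > best["paired"]:
            best = {
                "paired": paired,
                "a_5p_overhang": max(0, offset),
                "a_3p_overhang": max(0, len(mir_a) - (offset + len(rc_b))),
                "b_5p_overhang": max(0, offset + len(rc_b) - len(mir_a)),
                "b_3p_overhang": max(0, -offset),
            }
    return best


# Hypothetical toy pair built to give 2-nt 3' overhangs on both strands.
mir_a = "CAAAAGUAAUUGCGAGUUUUAC"
mir_b = reverse_complement("GG" + mir_a[:20])
print(find_duplex(mir_a, mir_b))
```

A less stringent screen that tolerates mismatches or G:U pairs would replace the exact string comparison with a per-position scoring step.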

We integrated the target mRNAs predicted by PicTar [50], TargetScan [51], and miRanda [52]. These genes were then queried for gene ontology (GO) enrichment using the CapitalBio Molecule Annotation System V4.0 (MAS, http://bioinfo.capitalbio.com/mas3/). We also constructed functional interaction networks using the Cytoscape v2.8.2 platform [53].
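The text does not specify how the three prediction sets were combined (for example, as a union or an intersection). The Python sketch below shows one generic way to integrate them: count how many tools predict each gene and keep those at or above a chosen support level. The function name, the `min_support` parameter, and the gene identifiers are hypothetical placeholders.

```python
from collections import Counter


def integrate_targets(predictions, min_support=1):
    """Map each gene to the number of tools predicting it, keeping genes
    supported by at least min_support tools.

    predictions: dict of tool name -> set of predicted target genes.
    min_support=1 gives the union; min_support=len(predictions) the intersection.
    """
    support = Counter(gene for genes in predictions.values() for gene in genes)
    return {gene: n for gene, n in sorted(support.items()) if n >= min_support}


# Hypothetical placeholder predictions, not real tool output.
toy_predictions = {
    "PicTar": {"GENE_A", "GENE_B"},
    "TargetScan": {"GENE_B", "GENE_C"},
    "miRanda": {"GENE_A", "GENE_B"},
}
print(integrate_targets(toy_predictions, min_support=2))
```

Ranking genes by support count is also one possible way to pick a short list, such as a top 20, for network reconstruction, although the ranking criterion used in the paper is not stated.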

Authors’ Contribution

T. Liang, L. Guo, and C. Liu contributed equally to this work.

Acknowledgments

This work was supported by a research grant from the National Natural Science Foundation of China (2012104GZ30055), the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (12KJB360001), and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).

Supplementary Materials

Figure S1: Multiple alignment of multicopy pre-miRNAs of hsa-miR-548f and their NJ tree.

Figure S2: A phylogenetic tree based on all pre-miRNAs of the miR-548 gene family, reconstructed using the NJ method.

Figure S3: The phenomenon of miRNA-miRNA interaction by forming a miRNA:miRNA duplex.

Figure S4: A functional interaction network of miRNA pairs.
