Sequence and Structure Analysis of Biological Molecules Based on Computational MethodsView this Special Issue
Research Article | Open Access
Evolutionary and Expression Analysis of miR-#-5p and miR-#-3p at the miRNAs/isomiRs Levels
We mainly discussed miR-#-5p and miR-#-3p under three aspects: (1) primary evolutionary analysis of human miRNAs; (2) evolutionary analysis of miRNAs from different arms across the typical 10 vertebrates; (3) expression pattern analysis of miRNAs at the miRNA/isomiR levels using public small RNA sequencing datasets. We found that no bias can be detected between the numbers of 5p-miRNA and 3p-miRNA, while miRNAs from miR-#-5p and miR-#-3p show variable nucleotide compositions. IsomiR expression profiles from the two arms are always stable, but isomiR expressions in diseased samples are prone to show larger degree of dispersion. miR-#-5p and miR-#-3p have relative independent evolution/expression patterns and datasets of target mRNAs, which might also contribute to the phenomena of arm selection and/or arm switching. Simultaneously, miRNA/isomiR expression profiles may be regulated via arm selection and/or arm switching, and the dynamic miRNAome and isomiRome will adapt to functional and/or evolutionary pressures. A comprehensive analysis and further experimental study at the miRNA/isomiR levels are quite necessary for miRNA study.
MicroRNAs (miRNAs) have been widely studied as a class of well-conserved negative regulatory molecules. They play an important role in biological processes by regulating gene expression at the posttranscriptional level [1, 2]. As endogenous small noncoding RNAs (ncRNAs) (~22 nt), miRNAs are generated from the cleavage of primary miRNAs (pri-miRNAs) and precursor miRNAs (pre-miRNAs) by Drosha and Dicer cleavage [3–5]. miRNA may be generated from 5p or 3p arm of pre-miRNA, and the selection is believed to be influenced by hydrogen-bonding selection . Based on the typical miRNA genesis, one arm can produce abundant active mature miRNAs, while another arm can produce rare and inactive (miRNA star, also ever named passenger strand). However, increasing evidence indicates that both arms can generate mature miRNAs under specific developmental stages or species [7–13]. Indeed, many pre-miRNAs have been reported to yield two kinds of mature miRNAs, although the two products, miR-#-5p and miR-#-3p, may vary in expression levels. The term given to this dynamic selection and expression is “arm switching” [8, 14]. Evolutionary analysis demonstrates that both miR-#-5p and miR-#-3p are conserved, although the nondominant miRNA sequences are not well-conserved with dominant miRNA sequences . Increasing reports indicate that the nondominantly expressed miRNA sequences may act as potential regulatory molecules with unexpectedly abundant expression levels [16–18].
Although the typical miRNA is annotated and studied as a single sequence, accumulating evidence suggests that multiple sequences with varied 5′ and/or 3′ ends or varied lengths have been detected from the miRNA locus. The annotated or canonical miRNA is only one specific member of the multiple sequences. These multiple sequences are termed miRNA variants, also named isomiRs [19–23]. The miRNA isoforms are mainly derived from imprecise cleavage by Drosha/Dicer and 3′ addition events through miRNA processing and maturation processes. RNA editing and single nucleotide polymorphisms (SNPs) also contribute to the generation of these multiple isomiRs . The occurrence of multiple isomiRs is quite common, and each miRNA locus can be associated with these various miRNA isoforms [9, 19, 21, 23–30]. Despite the fact that both miR-#-5p and miR-#-3p are generated from the pre-miRNA and can form miRNA:miRNA duplex through nucleotide complementary base pairing, the two miRNA loci may yield various isomiR expression profiles and patterns .
This study aimed to explore the potential evolutionary and expression divergences and relationships between miRNAs from different arms of different/same pre-miRNAs. First, we characterized the origins and nucleotide compositions of all the annotated human miRNAs. Second, we performed evolutionary analysis on the common miRNAs among 10 typical vertebrates and then analyzed the nondominant miRNAs based on the pre-miRNAs. Finally, the expression analysis was performed in samples from female patients using published small RNA sequencing datasets. Because gender difference can affect isomiR expression profiles , and common variation affects various diseases and medically relevant characteristics in a sex-dependent manner , we selected female patients to analyze miRNA/isomiR expression profiles to avoid potential effects from gender difference. miRNA expression patterns were mainly estimated at the miRNA/isomiR levels, especially between homologous miRNAs and between miR-#-5p and miR-#-3p. This study provides insights on the arm selection and/or arm switching in miRNAs from the evolutionary and expression angles, which would partly be informative to understanding the dynamic miRNAome and isomiRome and to characterizing miRNA and isomiR expression profiles. Study from the isomiR level may be a necessary way to understand miRNA, especially for those isomiRs from ever termed passenger stand, which will contribute to further explore miRNA biogenesis and function.
2. Materials and Methods
2.1. Source Data and Primary Analysis
According to the evolutionary taxa and numbers of known miRNA genes, 10 vertebrate species were selected: Petromyzon marinus (pma, Agnathostomata), Danio rerio (dre, Pisces), Xenopus tropicalis (xtr, Amphibia), Anolis carolinensis (aca, Lepidosauria), Gallus gallus (gga, Aves), Equus caballus (eca, Mammalia, Laurasiatheria), Bos taurus (bta, Mammalia, Ruminantia), Monodelphis domestica (mdo, Mammalia, Metatheria), Mus musculus (mmu, Mammalia, Rodentia), and Homo sapiens (hsa, Mammalia, Primates, Hominidae). All the pre-miRNAs and miRNAs were retrieved from the miRBase database (Release 20.0, http://www.mirbase.org/) .
Location information of miRNA on pre-miRNAs was obtained according to the annotations in the miRBase database. Specifically, miRNA generated from 5p arm of pre-miRNA was named miR-#-5p (# indicated the detailed miRNA name, such as miR-100), and miRNA generated from 3p arm of pre-miRNA was named miR-#-3p. If there is no existing annotation, the detailed location distributions were determined using self-developed scripts. Many miRNAs may be generated from multicopy pre-miRNAs, and herein we only presented the detailed isomiR expression profiles based on location of the first pre-miRNA. In the study, miR-#-5p and miR-#-3p were defined as miRNA pairs generated from the 5p and 3p arm of pre-miRNA, respectively, and 5p-miRNA and 3p-miRNA were defined as the miRNAs generated from 5p or 3p arm of different pre-miRNAs.
2.2. Evolutionary Analysis of miRNAs in Ten Test Vertebrates
Known annotated miRNAs from ten vertebrates were comprehensively surveyed for common miRNA members using self-developed scripts. These miRNAs were further classified based on the unit of miRNA gene family because many miRNAs could belong to the same gene family based on homologous sequences with high sequence similarity. Those pre-miRNAs that were not comprehensively annotated (miR-#-5p or miR-#-3p was not simultaneously annotated based on limited studies), unannotated miRNA sequences, were predicted and obtained from consensus sequences using pre-miRNAs and known human miRNAs. The main reasons were as follows: (1) human miRNAs have been widely studied, and most miR-#-5p and miR-#-3p are reported and annotated; (2) most miRNAs are phylogenetically well-conserved across different animal species, and well-conserved consensus sequences are easily obtained using sequence alignment analysis; (3) although the miR-#-5p and miR-#-3p show different levels of evolutionary divergence, both of them are conserved; (4) according to the known miRNA sequences and pre-miRNAs, the detailed miR-#-5p and miR-#-3p sequences can be collected. The shared miRNAs were aligned using Clustal X 2.0 multiple sequence alignment . Nucleotide divergence was analyzed using MEGA 5.10 software  and DnaSP 5.10.01 software . Simultaneously, nucleotide diversity (π), haplotype diversity (Hd), and average number of nucleotide differences for the miRNAs from different animal species were calculated using DnaSP software as special miRNA populations . Evolutionary patterns were estimated based on nucleotide divergence across the ten animal species using percentage of nucleotide substitutions (transition and transversion) and insertions/deletions in each position. The reference nucleotide was denoted as human miRNA. Based on the potential length difference between miRNAs in different species, we only analyzed the core sequences and not the terminus nucleotides with deficiency (these nucleotides were mostly derived from length differences). Nucleotide divergence patterns were further estimated between 5p-miRNA and 3p-miRNA and between miR-#-5p and miR-#-3p.
In order to track the evolutionary history of pre-miRNAs and miRNAs from the different arms, especially between homologous miRNAs, phylogenetic trees of pre-miRNAs were reconstructed using the neighbor-net method  in SplitsTree 4.10 , and networks of miRNAs were defined based on Jukes-Cantor model and Network 126.96.36.199 (http://www.fluxus-engineering.com/) using the median-joining (MJ) method. Also, the free energies of some pre-miRNAs were estimated through the RNAfold WebServer (http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi) [41, 42].
2.3. Analysis of the miRNA/isomiR Expression Levels Using Public Sequencing Datasets
In order to understand the expression patterns of miR-#-5p and miR-#-3p pairs, we analyzed them at the miRNA/isomiR levels using small RNA sequencing datasets generated by The Cancer Genome Atlas (TCGA) pilot project established by the NCI and NHGRI. Information about TCGA and the investigators and institutions constituting the TCGA research network can be found at http://cancergenome.nih.gov/. Available small RNA sequencing datasets associated with the three kinds of women’s diseases including breast cancer (BRCA), ovarian serous cystadenocarcinoma (OV), uterine corpus endometrial carcinoma (UCEC), and their respective control samples were selected to investigate miRNA expression patterns at the miRNA/isomiR levels (see Table in Supplementary Material available online at http://dx.doi.org/10.1155/2015/168358). We also conducted expression analysis in the three kinds of women’s diseases dataset of some miRNAs (especially homologous miRNAs) identified from our evolutionary analysis. All of these high-throughput sequencing datasets were generated on Illumina HiSeq sequencing platform.
Reads per million (RPM) were used to estimate the relative expression levels, and relative expression rate (percentage) in the miRNA locus was used to assess the isomiR expression patterns across different samples. In order to track relative expression levels of miRNA/isomiR and reduce potential sequencing errors/mapping procedures, only those abundant miRNAs/isomiRs were selected to perform the analysis using larger sample sizes. The abundant expression and larger sample sizes could reduce error. Further, functional analysis was performed between miR-#-5p and miR-#-3p and between canonical miRNA sequences and their 5′ isomiRs (with the novel 5′ ends and seed sequences). According to the seed sequences, target mRNAs were predicted and obtained from TargetScan program (http://www.targetscan.org/).
2.4. Statistical Analysis
Data were evaluated using paired -test (length distributions between miR-#-5p and miR-#-3p), Student’s -test (length distributions between 5p-miRNA and 3p-miRNA), Chi-square test (nucleotide compositions between different miRNAs from 5p or 3p), Wilcoxon signed-rank test (nucleotide divergence pattern between miR-#-5p and miR-#-3p), and Spearman correlation test (nucleotide divergence between miR-#-5p and miR-#-3p and homologous miRNAs). Differences were considered statistically significant if the value is less than 0.05. All tests were two-tailed and conducted using Stata software (Version 11.0).
3.1. Primary Analysis of Human miR-#-5p/miR-#-3p and 5p-miRNA/3p-miRNA
There were 2,578 annotated human mature miRNAs in the miRBase database (Release 20.0). A total of 1,291 miRNAs were characterized from the 5p arms of pre-miRNAs, while the others were characterized from the 3p arms. Of these, 849 pairs were identified as miR-#-5p and miR-#-3p from the same pre-miRNAs. Both 5p-miRNA and 3p-miRNA or miR-#-5p and miR-#-3p had different length distributions (5p-miRNA, , 3p-miRNA, , , ; miR-#-5p, , miR-#-3p, , , , Figure 1(a)). 5p-miRNA and 3p-miRNA showed different nucleotide compositions (, , Figure 1(b) and Table 1). Guanine (G) was more predominant in 5p-miRNA (more than 32.82%) than in 3p-miRNA (24.77%). The predominant nucleotide in 3p-miRNA was cytosine (C) (27.19%), which was present at 19.76% in 5p-miRNA (Figure 1(b)). The presence of G, including double (GG), triple (GGG), and fourfold (GGGG) nucleotides, showed larger divergence between miRNAs from different arms (Figure 1(b) and Table 1). Similarly, the nucleotide composition was varied between miR-#-5p and miR-#-3p (Figure 1(b) and Table 1). Significant differences in the continuous nucleotide compositions could be detected between 5p-miR and 3p-miRNA and between miR-#-5p and miR-#-3p (Table 1). Compared to the total nucleotide compositions, nucleotides in each position along miRNA also showed significant difference between 5p-miR and 3p-miRNA and between miR-#-5p and miR-#-3p (, , Figure 1(c); , , Figure 1(d)), although the nucleotides 2–8, termed “seed sequences” of the miRNAs, did not display nucleotide bias.
|The percentage is estimated based on frequency in all the 5p- or 3p-miRNAs, all the miR-#-5p or miR-#-3p. aA significant difference in the triple repetitive nucleotides can be detected between 5p-miRNA and 3p-miRNA ( = 14.82, ), ba significant difference in the four repetitive nucleotides can be detected between 5p-miRNA and 3p-miRNA ( = 26.71, ), ca significant difference in the double repetitive nucleotides can be detected between miR-#-5p and miR-#-3p ( = 13.21, ), da significant difference in the triple repetitive nucleotides can be detected between miR-#-5p and miR-#-3p ( = 26.89, ), and ea significant difference in the four repetitive nucleotides can be detected between miR-#-5p and miR-#-3p ( = 52.17, ).|
3.2. Evolutionary Patterns of miR-#-5p/miR-#-3p and 5p-miRNA/3p-miRNA across Species
There were 31 miRNAs gene families (contain 43 miRNA members) shared by the 10 test animal species (Table ). They may be composed of two or more members with high sequence similarity, but these members were not always shared by the 10 species. The common miRNA might have different number of pre-miRNAs (also termed multicopy pre-miRNAs) in different species and even have different number of homologous miRNAs (Figure ).
Although miRNAs are regarded as phylogenetically well-conserved small ncRNAs, different miRNAs, including homologous miRNAs, may show various evolutionary patterns (Figure ) [15, 38]. Analysis of miR-#-5p and miR-#-3p revealed diverse variations in nucleotide composition (Figure ). Compared to the dominant miRNAs, another strands showed higher levels of nucleotide diversity, haplotype diversity, and average number of nucleotide difference (Table 2). For example, let-7a-5p was highly conserved across the ten species, but let-7a-3p was associated with variation in the nucleotides. Generally, the dominant miRNAs were well-conserved, especially in the “seed sequences” (nucleotides 2–8), while nondominant miRNAs might display more variation in nucleotide composition (Figures and ). Although both of them were reported as functional miRNAs existing at abundant levels in one or more species, 55.81% of miR-#-5p and miR-#-3p showed different levels of nucleotide divergence (Figure 2(a) and Table ). The scatter plot analysis of the shared 43 miRNA genes revealed that both miR-#-5p and miR-#-3p were conserved (Figure 2(a)), with most sites showing minimal variation in nucleotide composition. Herein, 20 dominant miRNAs were identified as 5p-miRNA from 5p arm, and others (23 miRNAs) were identified as 3p-miRNA from 3p arm. We also analyzed the functional regions (seed sequences) of miRNAs, and only 4 pairs (9.30%) indicated difference (Table ). The difference in average percentages from all the miR-#-5p and miR-#-3p was not significant (, ), and similar result could be detected based on the dominant miRNA (, ). Furthermore, although homologous miRNAs displayed close sequence, functional, and evolutionary relationships, no significant correlations were detected between most of homologous miRNAs (Figure 2(b) and Table ).
|These parameters are estimated according to Figure S1.|
Phylogenetic trees and networks were reconstructed using pre-miRNAs and miRNAs from Figure , respectively (Figure 3). The phylogenetic tree of let-7a was split into three clusters, and each cluster contained pre-miRNAs from different animal species (Figure 3(a)). Compared to the tree of the single miRNA gene of let-7a, the phylogenetic tree of homologous mir-30b, mir-30c, and mir-30d could be split (Figure 3(b)). mir-30d showed larger genetic distance with mir-30b and mir-30c. The pma-mir-30b and pma-mir-30c were clustered with mir-30d, which indicates that these should be members of pma-mir-30d (Figure and Figure 3). The evolutionary networks of miR-#-5p and miR-#-3p showed various patterns (Figures 3(c) and 3(d)). Different types of sequences (termed miRNA haplotypes) were classified with different frequencies. For example, let-7a-5p was highly conserved across the ten animal species, and only one specific sequence was identified. However, let-7a-3p was associated with high nucleotide variation and showed a complex evolutionary network (Figure and Figure 3(c)). Compared to let-7, both evolutionary networks of miR-30-5p and miR-30-3p showed clear module networks based on miRNA members (Figure 3(d)).
3.3. Expression Analysis of miR-#-5p/miR-#-3p at the miRNA/isomiR Levels
We analyzed available miRNA datasets of 2,144 patients or volunteers with women’s diseases (BRCA, OV, or UCEC) and their relevant controls (Table ). Following evolutionary analysis, several miRNAs were selected to perform expression analysis using these sequencing datasets. Generally, in the miRNA locus, only several isomiRs were dominantly expressed (Figure 4 and Tables , , and ). Homologous miRNAs were likely to show similar isomiR expression pattern, such as miR-30a and miR-30e (Figure 4). Dominant miRNAs and their multiple isomiRs were present at abundant expression levels, while most of nondominant strands were not abundant. Abundantly expressed isomiRs were always near the most dominant isomiR sequence. Specifically, their 5′ or 3′ ends either were the same or differ at 1-2 nucleotides (Figure 4 and Tables , , and ). The standard deviation (SD) of the average percentage of each isomiR showed diverse distributions (Figure 5 and Figures , , and ). Different miRNAs showed different types of isomiRs with diverse expression distribution and SD (Figures 4 and 5 and Figures , , and ). Abundantly expressed isomiRs were likely to be detected larger SD (Figure 4 and Figures and ), and similar SD distributions could be found between diseased and normal samples (Figure 5 and Figure ). Generally, at the isomiR level, the average percentages of samples from disease patients would be involved in larger divergence than control samples, and similar results can be detected based on all miRNAs (Figure 5 and Figure ).
3.4. Functional Analysis of miR-#-5p/miR-#-3p at the miRNA/isomiR Levels
Although miR-#-5p and miR-#-3p had different sequences and seed sequences, some common targets could be detected (Figure ). These miRNA pairs could bind different regions in UTR (untranslated regions) of target mRNAs, although the phenomenon was rare (larger amounts of specific targets could be detected). The common targets were more popular between the canonical miRNA sequences and their 5′ isomiRs, despite the fact that “seed shifting” could be detected between them (Figures and ). There were about half of target mRNAs of 5′ isomiRs that were shared by the canonical miRNA sequences, although these 5′ isomiRs were involved in novel seed sequences via “seed shifting” events.
4.1. Evolutionary Divergence between miRNAs from Different Arms
miRNAs have been widely regarded as a class of crucial negative regulatory molecules with important biological roles, especially for their roles in tumorigenesis. Based on the current annotated human miRNAs, similar numbers of 5p-miR and 3p-miR show well-conserved sequences across different species, although they are involved in inconsistent length distributions and nucleotide compositions, including multiple repetitive nucleotides (Figures 1(a)–1(c), Figure 2, and Table 1). This difference may be influenced by larger sample sizes. Simultaneously, mirtrons have been reported as alternative precursors for miRNA biogenesis in vertebrates , which may lead to the difference of nucleotide compositions because of nucleotide biases in mirtrons. There are 849 pairs that are identified as miR-#-5p and miR-#-3p, and significant difference in length distributions and nucleotide compositions is detected between the two arms (Figures 1(b)–1(d), Table 1, and Table ). Evolutionary analysis shows that both dominant and nondominant miRNAs are conserved, although the nondominant miRNA is associated with more nucleotide variation across homologous miRNAs and different species . Phylogenetic relationship shows that these multicopy pre-miRNAs are located in different clusters (Figure 3), which suggests the similar distributions of miRNA genes across different species. The well-conserved sequence contributes to stable miRNA-mRNA regulatory network, and simultaneously, the evolutionary process is also controlled by functional pressures. The two arms of pre-miRNA showed various evolutionary patterns via different levels of nucleotide substitutions and insertions/deletions (Figure , Figure 2, and Table ), which may influence stem-loop structure of pre-miRNA (Table ). However, both of the two arms are always well-conserved in the functional region, termed the “seed sequences” (Figure 3(a) and Table ). These results suggest that both products from the two arms are regulatory molecules, although they always have various expression levels.
Homologous and clustered miRNAs are commonly found in miRNAs . No significant relationships between these homologous miRNAs can be detected (Figure 2(b) and Table ). These findings indicate relatively rapid evolutionary patterns between homologous miRNAs, especially between the less well-conserved nondominant strands (Figure 2(b)). Despite the possibility that these miRNAs have evolved from the common ancient miRNA gene, varied nucleotides in miRNAs, especially in the “seed sequences,” will generate novel miRNAs with novel candidate target mRNAs. Simultaneously, coevolution of miRNA and target mRNAs also contributes to the varied miRNAs across different species . Taken together, homologous miRNAs may provide a method to generate novel miRNA genes via duplication events, and multicopy pre-miRNAs are probably transitional products. The driving force should be mainly derived from functional and evolutionary pressures, which largely contributes to the dynamic miRNAome, and enriches the potential relationships between different miRNAs.
4.2. Expression and Function between miRNAs from Different Arms
Similar to our previous studies [21, 46, 47], we found that only several isomiRs (always 1–3) are dominantly expressed, and others have lower expression rate (Figure 4 and Tables , , and ). The interesting distributions are consistent in different individuals, including samples from patients with disease and healthy controls. The similar distributions suggest that isomiR expression patterns are always stable across different samples [21, 26]. The characteristics of these dominant isomiRs provide the possibility of imprecise cleavage of Drosha and Dicer through pre-miRNA processing and miRNA maturation processes. Indeed, due to the smaller size of miRNA sequence (~22 nt), degradation of hairpins may also be one factor that contributes to rare isomiRs . Although the distribution of isomiR expression is similar across different samples, no significant correlations can be found between isomiR expression profiles of miR-#-5p and miR-#-3p (Figure 4). Simultaneously, various standard values of deviation can be found (Figure 5 and Figures , , and ). Compared to control samples, samples from patients with disease may be involved in larger expression divergence across different samples (Figure 5). This suggests that a more flexible expression of isomiRs can be detected across different samples from patients with disease compared to control samples. Functional analysis showed that some common target mRNAs between miR-#-5p and miR-#-3p can be detected, although they have no different sequences and most target mRNAs are specific (Figure ). Simultaneously, more shared target mRNAs are obtained between the canonical miRNA and 5′ isomiRs despite being with “seed shifting” events (Figures and ). The interesting results imply that multiple isomiRs may coordinately contribute to the specific biological processes by binding different regions in UTR. Moreover, 3′ addition events (isomiRs with additional nontemplate nucleotides in 3′ ends) are quite common in isomiRome, while no further analysis is performed in the present study based on the previous TCGA datasets. The phenomenon of 3′ additions may have versatile biological roles, including affecting target selection or miRNA stability [22, 24, 26, 49]. Collectively, analyzing multiple isomiRs and their expression patterns is the first step towards a systematic understanding of the miRNA world, including the genesis and regulatory roles of miRNAs.
miRNAs are likely to be members of miRNA gene families/clusters sharing high sequence similarity or close location distribution. These homologous/clustered miRNAs may have evolved from ancestor genes via part or tandem historic duplication events [15, 50–52]. Previous study reported that homologous miRNAs are likely to show similar isomiR expression patterns , and our results are consistent with this observation (Figure 4 and Table ). The similarity in the expression patterns implies that the pre-miRNA processing and miRNA maturation processes should be derived from the ancestral gene, which may contribute to the potential interactions in the regulatory network . Moreover, we found that deregulated miRNAs are likely to have different types of isomiRs (miR-30a, miR-30e, and miR-10b, Figure 4 and Tables , , and ). These deregulated miRNAs have been reported in breast cancer [53, 54], and the moderate expression patterns can be detected. No enough evidence indicates that miRNA with moderate isomiR expression is likely to be abnormally expressed and contributes to abnormal biological roles. More studies, especially for experimental validation, are needed to further study the small noncoding RNAs at the isomiR level.
4.3. Selection of 5p and 3p or Switching between the Two Arms in miRNAome/isomiRome
The phenomenon of arm selection shows that miRNAs may be derived from different arms, and the arm switching phenomenon suggests that the two arms may also show dynamic expression patterns. miRNAs from the two arms (they can form miRNA:miRNA duplex) always show different evolutionary patterns and also have various expression levels and isomiR expression patterns. Most of pre-miRNAs only produce one dominant and one rare miRNAs in specific samples, although the expression rate of the two miRNAs may be changed in other samples (arm switching phenomenon). Indeed, the two arms of many pre-miRNAs are conserved (especially in “seed sequences”), providing the possibility to be regulatory molecules, and the arm switching phenomenon further enriches the dynamic miRNAome by controlling miRNA expression profiles to adapt to functional and/or evolutionary needs. Expression and evolution patterns in miR-#-5p and miR-#-3p are relatively independent, and they are prone to regulate different targets. Based on the phenomena of arm selection or arm switching, the dynamic miRNAome also represents the multiple and dynamic isomiRome at the isomiR level. These isomiRs provide more information towards further understanding of miRNAs, in that isomiR expression patterns may indicate the characteristics of pre-miRNA processing and miRNA maturation processes. Thus it is worth exploring the biological roles of miRNAs at the isomiR level and the origin of miRNAs (5p or 3p) and related miRNAs based on miRNA gene family/cluster. Taken together, the arm selection and/or arm switching may be an important method to regulate miRNAome and isomiRome, and the dynamic miRNA and isomiR expression profiles will adapt to functional and/or evolutionary pressures.
|SNP:||Single nucleotide polymorphism|
|TCGA:||The Cancer Genome Atlas|
|OV:||Ovarian serous cystadenocarcinoma|
|UCEC:||Uterine corpus endometrial carcinoma|
Conflict of Interests
The authors declare no potential conflict of interests with respect to the authorship and/or publication of this paper.
This work was supported by the National Natural Science Foundation of China (nos. 61301251, 81473070, and 81373102), the Research Fund for the Doctoral Program of Higher Education of China (20133234120009), the National Natural Science Foundation of Jiangsu (no. BK20130885), the Natural Science Foundation of the Jiangsu Higher Education Institutions (nos. 12KJB310003 and 13KJB330003), Shandong Provincial Key Laboratory of Functional Macromolecular Biophysics, and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).
Figure S1: showed that examples of nucleotide divergence between different miRNAs, including 5p-miRNA and 3p-miRNA, and miR-#-5p and miR-#-3p.
Figure S2-S4: showed box plots of miRNAs between different samples using standard deviation (SD), and Figure S5 presented examples of functional analysis.
Table S1: listed selected small RNA sequencing datasets from the TCGA database.
Table S2: showed the common miRNAs in the ten animal species.
Table S3: presented Wilcoxon signed-rank test of miR-#-5p and miR-#-3p.
Table S4: showed spearman correlation coefficient between homologous miRNAs.
Table S5: presented the free energies of some pre-miRNAs.
Table S6-S8: showed isomiR expression distributions of let-7a-5p, homologous miR-30a and miR-30e, miR-10b and miR-21 across all samples.
- D. P. Bartel, “MicroRNAs: target recognition and regulatory functions,” Cell, vol. 136, no. 2, pp. 215–233, 2009.
- D. P. Bartel, “MicroRNAs: genomics, biogenesis, mechanism, and function,” Cell, vol. 116, no. 2, pp. 281–297, 2004.
- Y. Lee, C. Ahn, J. Han et al., “The nuclear RNase III Drosha initiates microRNA processing,” Nature, vol. 425, no. 6956, pp. 415–419, 2003.
- J. Han, Y. Lee, K.-H. Yeom, Y.-K. Kim, H. Jin, and V. N. Kim, “The Drosha-DGCR8 complex in primary microRNA processing,” Genes & Development, vol. 18, no. 24, pp. 3016–3027, 2004.
- J. Han, Y. Lee, K.-H. Yeom et al., “Molecular basis for the recognition of primary microRNAs by the Drosha-DGCR8 complex,” Cell, vol. 125, no. 5, pp. 887–901, 2006.
- S. Griffiths-Jones, R. J. Grocock, S. van Dongen, A. Bateman, and A. J. Enright, “miRBase: microRNA sequences, targets and gene nomenclature,” Nucleic Acids Research, vol. 34, pp. D140–D144, 2006.
- S. C. Li, W. C. Chan, M. R. Ho et al., “Discovery and characterization of medaka miRNA genes by next generation sequencing platform,” BMC Genomics, vol. 11, no. 4, article S8, 2010.
- S. Griffiths-Jones, J. H. L. Hui, A. Marco, and M. Ronshaugen, “MicroRNA evolution by arm switching,” EMBO Reports, vol. 12, no. 2, pp. 172–177, 2011.
- N. Cloonan, S. Wani, Q. Xu et al., “MicroRNAs and their isomiRs function cooperatively to target common biological pathways,” Genome Biology, vol. 12, no. 12, article R126, 2011.
- A. Marco, J. H. L. Hui, M. Ronshaugen, and S. Griffiths-Jones, “Functional shifts in insect microRNA evolution,” Genome Biology and Evolution, vol. 2, no. 1, pp. 686–696, 2010.
- S.-C. Li, Y.-L. Liao, M.-R. Ho, K.-W. Tsai, C.-H. Lai, and W.-C. Lin, “miRNA arm selection and isomiR distribution in gastric cancer,” BMC Genomics, vol. 13, supplement 1, article S13, 2012.
- S.-C. Li, Y.-L. Liao, W.-C. Chan et al., “Interrogation of rabbit miRNAs and their isomiRs,” Genomics, vol. 98, no. 6, pp. 453–459, 2011.
- W. C. Cheng, I. F. Chung, T. S. Huang et al., “YM500: a small RNA sequencing (smRNA-seq) database for microRNA research,” Nucleic Acids Research, vol. 41, no. 1, pp. D285–D294, 2013.
- S.-C. Li, K.-W. Tsai, H.-W. Pan, Y.-M. Jeng, M.-R. Ho, and W.-H. Li, “MicroRNA 3' end nucleotide modification patterns and arm selection preference in liver tissues,” BMC Systems Biology, vol. 6, no. 2, article S14, 2012.
- L. Guo and Z. Lu, “The fate of miRNA* strand through evolutionary analysis: implication for degradation as merely carrier strand or potential regulatory molecule?” PLoS ONE, vol. 5, no. 6, Article ID e11387, 2010.
- K. Okamura, M. D. Phillips, D. M. Tyler, H. Duan, Y.-T. Chou, and E. C. Lai, “The regulatory activity of microRNA star species has substantial influence on microRNA and 3' UTR evolution,” Nature Structural and Molecular Biology, vol. 15, no. 4, pp. 354–363, 2008.
- K. Okamura, A. Ishizuka, H. Siomi, and M. C. Siomi, “Distinct roles for argonaute proteins in small RNA-directed RNA cleavage pathways,” Genes and Development, vol. 18, no. 14, pp. 1655–1666, 2004.
- G. Jagadeeswaran, Y. Zheng, N. Sumathipala et al., “Deep sequencing of small RNA libraries reveals dynamic regulation of conserved and novel microRNAs and microRNA-stars during silkworm development,” BMC Genomics, vol. 11, no. 1, article 52, 2010.
- P. Landgraf, M. Rusu, R. Sheridan et al., “A mammalian microRNA expression atlas based on small RNA library sequencing,” Cell, vol. 129, no. 7, pp. 1401–1414, 2007.
- R. D. Morin, G. Aksay, E. Dolgosheina et al., “Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa,” Genome Research, vol. 18, no. 4, pp. 571–584, 2008.
- L. Guo, Q. Yang, J. Lu et al., “A comprehensive survey of miRNA repertoire and 3′ addition events in the placentas of patients with pre-eclampsia from high-throughput sequencing,” PLoS ONE, vol. 6, no. 6, Article ID e21072, 2011.
- C. T. Neilsen, G. J. Goodall, and C. P. Bracken, “IsomiRs—the overlooked repertoire in the dynamic microRNAome,” Trends in Genetics, vol. 28, no. 11, pp. 544–549, 2012.
- L. W. Lee, S. Zhang, A. Etheridge et al., “Complexity of the microRNA repertoire revealed by next-generation sequencing,” RNA, vol. 16, no. 11, pp. 2170–2180, 2010.
- H. A. Ebhardt, H. H. Tsang, D. C. Dai, Y. Liu, B. Bostan, and R. P. Fahlman, “Meta-analysis of small RNA-sequencing errors reveals ubiquitous post-transcriptional RNA modifications,” Nucleic Acids Research, vol. 37, no. 8, pp. 2461–2470, 2009.
- F. Kuchenbauer, R. D. Morin, B. Argiropoulos et al., “In-depth characterization of the microRNA transcriptome in a leukemia progression model,” Genome Research, vol. 18, no. 11, pp. 1787–1797, 2008.
- A. M. Burroughs, Y. Ando, M. J. L. De Hoon et al., “A comprehensive survey of 3′ animal miRNA modification events and a possible role for 3′ adenylation in modulating miRNA targeting effectiveness,” Genome Research, vol. 20, no. 10, pp. 1398–1410, 2010.
- A. F. Fernandez, C. Rosales, P. Lopez-Nieva et al., “The dynamic DNA methylomes of double-stranded DNA viruses associated with human cancer,” Genome Research, vol. 19, no. 3, pp. 438–451, 2009.
- S. Lu, Y. H. Sun, and V. L. Chiang, “Adenylation of plant miRNAs,” Nucleic Acids Research, vol. 37, no. 6, pp. 1878–1885, 2009.
- C. Shao, Q. Wu, J. Qiu et al., “Identification of novel microRNA-like-coding sites on the long-stem microRNA precursors in Arabidopsis,” Gene, vol. 527, no. 2, pp. 477–483, 2013.
- J. Zhang, S. Zhang, S. Li et al., “A genome-wide survey of microRNA truncation and 3′ nucleotide addition events in larch (Larix leptolepis),” Planta, vol. 237, no. 4, pp. 1047–1056, 2013.
- L. Guo, H. Zhang, Y. Zhao, S. Yang, and F. Chen, “Selected isomiR expression profiles via arm switching?” Gene, vol. 533, no. 1, pp. 149–155, 2014.
- P. Loher, E. R. Londin, and I. Rigoutsos, “IsomiR expression profiles in human lymphoblastoid cell lines exhibit population and gender dependencies,” Oncotarget, vol. 30, no. 5, pp. 8790–8802, 2014.
- W. P. Gilks, J. K. Abbott, and E. H. Morrow, “Sex differences in disease genetics: evidence, evolution, and detection,” Trends in Genetics, vol. 30, pp. 453–463, 2014.
- A. Kozomara and S. Griffiths-Jones, “MiRBase: integrating microRNA annotation and deep-sequencing data,” Nucleic Acids Research, vol. 39, no. 1, pp. D152–D157, 2011.
- M. A. Larkin, G. Blackshields, N. P. Brown et al., “Clustal W and Clustal X version 2.0,” Bioinformatics, vol. 23, no. 21, pp. 2947–2948, 2007.
- K. Tamura, D. Peterson, N. Peterson, G. Stecher, M. Nei, and S. Kumar, “MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods,” Molecular Biology and Evolution, vol. 28, no. 10, pp. 2731–2739, 2011.
- P. Librado and J. Rozas, “DnaSP v5: a software for comprehensive analysis of DNA polymorphism data,” Bioinformatics, vol. 25, no. 11, pp. 1451–1452, 2009.
- L. Guo, B. Sun, F. Sang, W. Wang, and Z. Lu, “Haplotype distribution and evolutionary pattern of miR-17 and miR-124 families based on population analysis,” PLoS ONE, vol. 4, no. 11, Article ID e7944, 2009.
- D. Bryant and V. Moulton, “Neighbor-Net: an Agglomerative Method for the Construction of Phylogenetic Networks,” Molecular Biology and Evolution, vol. 21, no. 2, pp. 255–265, 2004.
- D. H. Huson, “SplitsTree: analyzing and visualizing evolutionary data,” Bioinformatics, vol. 14, no. 1, pp. 68–73, 1998.
- I. L. Hofacker, “Vienna RNA secondary structure server,” Nucleic Acids Research, vol. 31, no. 13, pp. 3429–3431, 2003.
- R. Lorenz, S. H. Bernhart, C. Höner Zu Siederdissen et al., “ViennaRNA Package 2.0,” Algorithms for Molecular Biology, vol. 6, no. 1, article 26, 2011.
- E. Berezikov, W.-J. Chung, J. Willis, E. Cuppen, and E. C. Lai, “Mammalian mirtron genes,” Molecular Cell, vol. 28, no. 2, pp. 328–336, 2007.
- L. Guo, Y. Zhao, H. Zhang, S. Yang, and F. Chen, “Integrated evolutionary analysis of human miRNA gene clusters and families implicates evolutionary relationships,” Gene, vol. 534, no. 1, pp. 24–32, 2014.
- S. Lehnert, P. Van Loo, P. J. Thilakarathne, P. Marynen, G. Verbeke, and F. C. Schuit, “Evidence for co-evolution between human MicroRNAs and Alu-repeats,” PLoS ONE, vol. 4, no. 2, Article ID e4456, 2009.
- L. Guo, F. Chen, and Z. Lu, “Multiple isomiRs and diversity of miRNA sequences unveil evolutionary roles and functional relationships across animals,” in MicroRNA and Non-Coding RNA: Technology, Developments and Applications, pp. 127–144, 2013.
- L. Guo, H. Li, T. Liang et al., “Consistent isomiR expression patterns and 3′ addition events in miRNA gene clusters and families implicate functional and evolutionary relationships,” Molecular Biology Reports, vol. 39, no. 6, pp. 6699–6706, 2012.
- M. R. Friedländer, W. Chen, C. Adamidi et al., “Discovering microRNAs from deep sequencing data using miRDeep,” Nature Biotechnology, vol. 26, no. 4, pp. 407–415, 2008.
- S. L. Fernandez-Valverde, R. J. Taft, and J. S. Mattick, “Dynamic isomiR regulation in Drosophila development,” RNA, vol. 16, no. 10, pp. 1881–1888, 2010.
- J. Hertel, M. Lindemeyer, K. Missal et al., “The expansion of the metazoan microRNA repertoire,” BMC Genomics, vol. 7, article 25, 2006.
- L. F. Sempere, C. N. Cole, M. A. Mcpeek, and K. J. Peterson, “The phylogenetic distribution of metazoan microRNAs: insights into evolutionary complexity and constraint,” Journal of Experimental Zoology Part B: Molecular and Developmental Evolution, vol. 306, no. 6, pp. 575–588, 2006.
- A. Grimson, M. Srivastava, B. Fahey et al., “Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals,” Nature, vol. 455, no. 7217, pp. 1193–1197, 2008.
- F. Yu, H. Deng, H. Yao, Q. Liu, F. Su, and E. Song, “Mir-30 reduction maintains self-renewal and inhibits apoptosis in breast tumor-initiating cells,” Oncogene, vol. 29, no. 29, pp. 4194–4204, 2010.
- L. Ma, J. Teruya-Feldstein, and R. A. Weinberg, “Tumour invasion and metastasis initiated by microRNA-10b in breast cancer,” Nature, vol. 449, no. 7163, pp. 682–688, 2007.
Copyright © 2015 Li Guo et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.