Special Issue: Molecular Evolutionary Routes that Lead to Innovations
Review Article | Open Access
Lu Chen, Jaime M. Tovar-Corona, Araxi O. Urrutia, "Alternative Splicing: A Potential Source of Functional Innovation in the Eukaryotic Genome", International Journal of Evolutionary Biology, vol. 2012, Article ID 596274, 10 pages, 2012. https://doi.org/10.1155/2012/596274
Alternative Splicing: A Potential Source of Functional Innovation in the Eukaryotic Genome
Alternative splicing (AS) is a common posttranscriptional process in eukaryotic organisms, by which multiple distinct functional transcripts are produced from a single gene. The release of the human genome draft revealed a much smaller number of genes than anticipated. Because of its potential role in expanding protein diversity, interest in alternative splicing has been increasing over the last decade. Although recent studies have shown that 94% of human multiexon genes undergo AS, the evolution of AS, and thus its potential role in functional innovation in eukaryotic genomes, remains largely unexplored. Here we review available evidence regarding the evolution of AS prevalence and functional role. In addition, we stress the need to correct for the strong effect of transcript coverage on AS detection and set out a strategy to ultimately elucidate the extent of the role of AS in functional innovation on a genomic scale.
The first draft of the human genome sequence [1, 2] was unveiled in February 2001 and, surprisingly, was shown to contain ~23,000 genes, only a fraction of the number of genes originally predicted. To put this into perspective, there are ~20,000 genes in the genome of the nematode C. elegans. The lack of an association between gene number and organismal complexity has resulted in increased interest in alternative splicing (AS), which has been proposed to be a major factor in expanding the regulatory and functional complexity, protein diversity, and organismal complexity of higher eukaryotes [4–6]. However, despite the best efforts of many research groups, we still understand very little about the actual role played by AS in the evolution of functional innovation (here understood as the appearance of novel functional transcripts) underpinning the increased organismal complexity observed.
Alternative splicing is a posttranscriptional process in eukaryotic organisms by which multiple distinct transcripts are produced from a single gene. Previous studies using high-throughput sequencing technology have reported that up to 92–94% of human multiexon genes undergo AS [7, 8], often in a tissue- or developmental stage-specific manner [7, 9]. With the development and constant improvement of whole genome transcription profiling and bioinformatics algorithms, the ubiquity of AS in the mammalian genome began to become clear. The concept of one gene-one protein gave way as evidence mounted for a high incidence of AS in nonhuman species [7, 8], such as fruit fly, Arabidopsis, and other eukaryotes. Despite the advances in our understanding and characterisation of AS, several questions remain unanswered. First, the large difference in transcript coverage between species has hampered direct comparisons of the prevalence of alternative splicing in different species. Secondly, even if comparable AS estimates between species could be obtained, it is unclear to what extent any changes in AS prevalence along evolution have contributed to overall protein diversity or rather reflect splicing noise. Finally, we understand very little about how AS has evolved through time and how this is related to functional parameters of genes. Here we review how alternative splicing is regulated and recent progress in our understanding of the evolution of alternative splicing.
2. Alternative Splicing and Its Regulation
In 1977, Chow et al. [12–15] reported that 5′ and 3′ terminal sequences of several adenovirus 2 (Ad2) mRNAs varied, implying a new mechanism for the generation of several distinct mRNAs. Following this study, alternative splicing was also found in the gene encoding the thyroid hormone calcitonin in mammalian cells. Subsequent studies revealed that many other genes were also able to generate more than one transcript by cutting out different sections from their coding regions (reviewed in [4, 16]).
Depending on the location of the exonic segments that are cut out, or on whether introns are left in, splicing events can be classified into four basic types (Figure 1). These four major modes of splicing are (1) exon skipping, (2) intron retention, (3) alternative 5′ splice site (5′ss), and (4) alternative 3′ splice site (3′ss) [22, 23]. In addition, mutually exclusive exons, alternative initiation, and alternative polyadenylation provide further mechanisms for generating various transcript isoforms. Moreover, different types of alternative splicing can occur in a combinatorial manner, and one exon may be subject to more than one AS mode, for example, 5′ss and 3′ss at the same time (Figure 1). The prevalence of each type of AS has been found to vary between different taxa. Several studies have shown that exon skipping is common in metazoan genomes, whereas intron retention is the most common type of AS among plants and fungi.
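The four basic modes can be made concrete by comparing the exon coordinates of two isoforms of the same gene. The following Python sketch is only an illustration under simplifying assumptions: exons are given as inclusive (start, end) genomic coordinates, one isoform is arbitrarily treated as the reference, and the function name classify_events and the event labels are hypothetical rather than taken from any published tool.

```python
from typing import List, Set, Tuple

Exon = Tuple[int, int]  # (start, end), inclusive genomic coordinates, start <= end


def overlaps(a: Exon, b: Exon) -> bool:
    """True if two exons share at least one base."""
    return a[0] <= b[1] and b[0] <= a[1]


def classify_events(ref: List[Exon], alt: List[Exon], strand: str = "+") -> Set[str]:
    """Label the basic AS modes that distinguish isoform `alt` from isoform `ref`."""
    ref, alt = sorted(ref), sorted(alt)
    events: Set[str] = set()

    # Exon skipping: an internal reference exon lies entirely inside an intron of alt.
    alt_introns = [(alt[i][1] + 1, alt[i + 1][0] - 1) for i in range(len(alt) - 1)]
    for exon in ref[1:-1]:
        if any(s <= exon[0] and exon[1] <= e for s, e in alt_introns):
            events.add("exon_skipping")

    # Intron retention: one alt exon reaches into both exons flanking a reference intron.
    for left, right in zip(ref, ref[1:]):
        if any(a[0] <= left[1] and a[1] >= right[0] for a in alt):
            events.add("intron_retention")

    # Alternative splice sites: exons that overlap one-to-one but differ at a boundary.
    for i, r in enumerate(ref):
        partners = [a for a in alt if overlaps(r, a)]
        if len(partners) != 1:
            continue
        a = partners[0]
        if sum(overlaps(a, x) for x in ref) != 1:
            continue
        j = alt.index(a)
        # Transcript ends are not splice sites, so terminal boundaries are ignored.
        end_differs = r[1] != a[1] and i < len(ref) - 1 and j < len(alt) - 1
        start_differs = r[0] != a[0] and i > 0 and j > 0
        # On the + strand the exon end is the 5'ss (donor) and the exon start is
        # the 3'ss (acceptor); on the - strand the roles are swapped.
        if end_differs:
            events.add("alt_5ss" if strand == "+" else "alt_3ss")
        if start_differs:
            events.add("alt_3ss" if strand == "+" else "alt_5ss")
    return events
```

Because the function returns a set, an isoform pair that differs by several modes at once (the combinatorial case mentioned above) yields several labels. Mutually exclusive exons, alternative initiation, and alternative polyadenylation are deliberately left out of this sketch.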
Alternative splicing is tightly regulated by cis elements as well as by trans-acting factors that bind to these cis elements. Trans-acting factors, mainly RNA-binding proteins, modulate the activity of the spliceosome and cis elements such as exonic splicing enhancers (ESEs), exonic splicing silencers (ESSs), intronic splicing enhancers (ISEs), and intronic splicing silencers (ISSs). The canonical mechanism of AS suggests that serine/arginine-rich (SR) proteins typically bind to ESEs, whereas heterogeneous nuclear ribonucleoproteins (hnRNPs) tend to bind to ESSs or ISSs. Given the crucial roles of these regulators in the splicing machinery, cis and trans-acting mutations that disrupt the splicing code are known to cause disease (reviewed in [28–30]). It has been estimated that 15–60% of disease-causing mutations act by affecting the splicing pattern of genes. Moreover, AS has also been shown to be regulated without the involvement of auxiliary splicing factors, and AS may also be combined with other posttranscriptional events such as the use of multiple internal translation initiation sites, RNA editing, mRNA decay, and the binding of microRNAs and other noncoding RNAs [33, 34], suggesting the existence of additional noncanonical mechanisms of AS that are yet to be identified.
Recently, a direct role of histone modifications in alternative splicing has been reported, in which a histone modification (H3-K27me3) affects the splicing outcome by influencing the recruitment of splicing regulators via a chromatin-binding protein in a number of human genes such as FGFR2, TPM2, TPM1, and PKM2. Moreover, it has been reported that CTCF-promoted RNA polymerase II pausing links DNA methylation to splicing, providing the first evidence of developmental regulation of splicing outcome through heritable epigenetic marks. Additionally, non-coding RNAs have also emerged as key determinants of alternative splicing patterns. These findings reveal an additional epigenetic layer in the regulation of transcription and alternative splicing. Genomewide genetic and epigenetic studies have therefore been proposed in at least 100 specific blood cell types. These studies will provide high quality reference epigenomes (using DNA methylation and histone mark assays) with detailed genetic and transcriptome data (whole genome sequencing, RNA-Seq, and miRNA-Seq), offering an opportunity to assess the genomewide influence of epigenetic factors in the regulation of AS in specific blood cell types. We expect that the rise of comparative epigenetics will provide a different perspective on the evolution of the transcriptome.
3. Identification of Alternative Splicing Events
Alternative splicing is difficult to estimate from genomic parameters alone. A number of regulatory motifs for AS have been uncovered, but the presence of known alternative splicing motifs does not guarantee that a gene is actually alternatively spliced. Thus, alternative splicing patterns are generally assessed by examining transcript data. For any gene of interest, alternative splicing events can be identified by using reverse transcription polymerase chain reaction (RT-PCR) conducted on a complementary DNA (cDNA) library. Over the last decade, as high-throughput transcriptome technologies have improved, it has become possible to assess alternative splicing patterns on a genomewide scale. Three main sources of transcriptome data have been used to assess splicing patterns: expressed sequence tags (ESTs), splice-junction microarrays, and RNA sequencing (RNA-Seq).
The first wave of genomewide transcriptome analysis consisted of large-scale direct sequencing of cDNAs and ESTs, which allowed alternative splicing events to be identified by aligning cDNA/EST sequences to the reference genome. ESTs are unedited, randomly selected single-pass sequence reads, 200–800 nucleotides in length, derived from cDNA libraries. Currently, there are eight million ESTs for human, including about one million sequences from cancer tissues, and about 71 million ESTs for around 2000 species in dbEST. However, ESTs are based on low-throughput Sanger sequencing and are aggregated over a wide range of tissues, developmental states, and diseases using widely different levels of sensitivity.
More recently, splice-junction microarrays and RNA-Seq have been increasingly used to quantitatively analyse alternative splicing events. Splicing microarrays target specific exons or exon-exon junctions with oligonucleotide probes. The fluorescent intensities of individual probes reflect the relative usage of alternatively spliced exons in different tissues and cell lines. High-density splice-junction microarrays are a cost-effective way to assay previously known exons and AS events with a low false positive rate. The disadvantage is that they require prior knowledge of existing AS variants and gene structures. More importantly, unlike RNA-Seq and ESTs, microarrays do not provide additional sequence information.
RNA-Seq has emerged as a powerful technology for transcriptome analysis due to its ability to produce millions of short sequence reads [45–47]. RNA-Seq experiments provide in-depth information on the transcriptional landscape. The ever-increasing accumulation of high-throughput data will continue to provide ever richer opportunities to investigate further aspects of AS such as low-frequency AS events as well as tissue-specific and/or development-specific AS events [7, 8, 47–49]. Earlier datasets consist of read sequences of 50 bp or less, limiting the information about combinations of AS events in a single transcript, but it is likely that read length will continue to increase over the next decade. With the increasing capacity of next-generation sequencing (RNA-Seq), the study of alternative splicing is likely to undergo a revolution. The higher depth of sequencing of transcriptomes in human and other species has increased our understanding of the occurrence of AS events and of AS expression patterns in different tissues [7, 51] and developmental stages.
Transcript assembly from sequence-based technologies, such as ESTs and RNA-Seq, can use either an align-then-assemble or an assemble-then-align strategy, depending on the quality of the reference genome and sequence data. An algorithm can then be employed to detect AS events by comparing different transcripts. However, detecting AS isoforms, as opposed to single AS events, is still challenging because short sequences provide little information about the combination of exons. Several applications have been developed for transcript assembly and AS isoform detection; the different strategies and a comparison of these applications have been reviewed previously.
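One common way to express the relative usage of a cassette exon from junction-spanning reads is a "percent spliced in" ratio. The sketch below is an assumption-laden illustration rather than a method described in this review: the function name percent_spliced_in, the halving of inclusion counts (an included exon is supported by two junctions, a skipping event by one), and the minimum read threshold are all choices made here for the example.

```python
from typing import Optional


def percent_spliced_in(inclusion_reads: int,
                       exclusion_reads: int,
                       min_reads: int = 10) -> Optional[float]:
    """Estimate the fraction of transcripts that include a cassette exon.

    inclusion_reads: reads spanning either junction that joins the exon to a neighbour.
    exclusion_reads: reads spanning the junction that skips the exon.
    Returns None when too few reads support the event (threshold is an assumption).
    """
    if inclusion_reads < 0 or exclusion_reads < 0:
        raise ValueError("read counts must be non-negative")
    if inclusion_reads + exclusion_reads < min_reads:
        return None
    # Inclusion is supported by two junctions and skipping by one,
    # so inclusion counts are halved to make the two comparable.
    inclusion = inclusion_reads / 2
    return inclusion / (inclusion + exclusion_reads)
```

For example, 40 inclusion reads and 20 exclusion reads give a ratio of 0.5, while an event supported by only five reads in total is reported as None rather than as an unreliable estimate.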
4. Prevalence of Alternative Splicing across Eukaryotic Genomes
Initial whole genome analyses suggested that 5–30% of human genes were alternatively spliced (reviewed in [6, 16]). EST-based AS databases identify AS events in 40–60% of human genes [5, 52, 53]; however, this number has been revised upwards repeatedly, with the latest estimates showing that up to 94% of human multiexon genes produce more than one transcript through alternative splicing [7, 8, 16]. Understanding how alternative splicing has changed over time could provide insights into how alternative splicing has affected transcript and protein diversity and phenotype evolution. In fungi, AS is thought to be rare due to the low number of exons in yeast. In plants, it has been estimated from EST data that around 20% of genes undergo AS; a recent study using RNA-Seq, however, suggests that at least 42% of intron-containing genes in Arabidopsis are alternatively spliced. We expect that significantly higher percentages of AS occurrence will be discovered in various eukaryotes as in-depth transcriptome studies using next-generation sequencing such as RNA-Seq proceed. A few studies have attempted to compare AS prevalence among different taxa, with animals generally reported to have a higher AS incidence than plants, and vertebrates a higher AS incidence than invertebrates. However, these studies are either based on limited data or fail to correct for differences in transcript coverage.
There are a number of databases that provide AS data for multiple species [5, 52–54]. However, these existing resources are primarily focused on animal species and have poor coverage for protist, fungal, and plant genomes, thus making it difficult to compare divergent taxa. Most importantly, none of these resources take into account the well-documented effects of differential transcript coverage across genes within and between species, which greatly influence AS detection rates [6, 24, 55, 56]. Random sampling has been used and shown to minimize the bias of transcript coverage (Figure 2). We expect that similar strategies will be employed in future comparative AS data resources.
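The idea behind random sampling can be sketched as follows: every gene is reduced to the same number of randomly drawn transcripts before AS events are counted, so that heavily sequenced genes do not appear more alternatively spliced simply because more transcripts were available. The Python below is a minimal illustration under assumptions made here, not the exact published procedure; the function name sampled_as_events, the default sample size of 10, and the number of repeated draws are hypothetical.

```python
import random
from statistics import mean
from typing import List, Optional, Set


def sampled_as_events(transcript_events: List[Set[str]],
                      sample_size: int = 10,
                      n_draws: int = 100,
                      seed: int = 0) -> Optional[float]:
    """Coverage-normalised AS index for one gene.

    transcript_events: one set per transcript, holding identifiers of the
    AS events that transcript supports (empty set if none).
    Returns the mean number of distinct events seen in `sample_size`
    randomly drawn transcripts, or None if the gene has too few transcripts.
    """
    if len(transcript_events) < sample_size:
        return None  # gene cannot be compared at this coverage level
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    counts = []
    for _ in range(n_draws):
        sample = rng.sample(transcript_events, sample_size)  # without replacement
        counts.append(len(set().union(*sample)))
    return mean(counts)
```

Genes or species can then be compared on the sampled index instead of the raw event count. Genes below the chosen coverage are excluded rather than extrapolated, which trades the number of comparable genes against the sample size.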
5. Is Alternative Splicing Functional or Mostly Just Noise?
If an increase in AS levels in vertebrate species compared to invertebrates is confirmed, it will still be hard, given the limitations of current proteomics resources, to assess the extent to which alternatively spliced transcripts are translated into an expanded proteome. The evolution of many phenotypes that we most associate with human beings, such as longer lifespan, encephalization, or even increased complexity, has been accompanied by sharp reductions in effective population size, possibly explaining the proliferation of a variety of genomic features in more complex organisms. Therefore, it is possible that increased AS through evolution results from aberrant splicing and does not play any functional role [59–61]. If alternative splicing has increased along the phylogenetic tree and it is indeed functional, we can expect the following.
(A) Transcripts should have a low incidence of premature stop codons, which would render them vulnerable to nonsense-mediated decay. Between 4% and 35% of AS transcripts in human and mouse have been found to contain a premature termination codon [62, 63]. These transcripts have been found to be enriched in nonconserved exons likely to cause frame shifts. It is unknown whether the proportion of AS transcripts containing a premature stop codon has changed along the phylogenetic tree.
(B) It has been proposed that most low copy number alternative isoforms produced in human cells are likely to be nonfunctional [65, 66]. A recent study has shown that although cancer-specific alternative splicing variants can be found, these events are mostly found as single-copy events and are thus unlikely to contribute to the core cancer transcriptome.
(C) Conservation of alternative splicing events along evolution can be taken as an indicator of their functional role. Conservation levels of AS have been studied in many species. Estimates range from 11% to 67% between human and mouse [68–70]. Notably, major AS forms tend to have higher conservation levels than minor forms. On the other hand, conservation varies among AS types; for example, exon skipping between C. elegans and C. briggsae has shown a conservation level of more than 81%, compared to 28% for intron retention [71, 72].
(D) The presence of identifiable functional domains in AS areas may also be an indicator of functional relevance for AS transcripts. To the best of our knowledge there are no reports of the prevalence of functional domains in AS areas in model species. To examine the presence of functional domains in AS transcripts, we compiled a set of 267,996 AS events obtained from the analysis of 8,315,254 ESTs from normal human tissues. Using InterProScan, which contains 14 applications for the prediction of protein domains, we found that about 50% of AS areas in human contain known functional components (Figure 3), suggesting a possible functional role for AS. The extent of the variation in the prevalence of functional domains among AS areas between species remains to be explored but would provide additional insights into the evolution of AS.
Taken together, the above observations suggest that although some alternative splicing events are indeed conserved throughout evolution, a significant proportion are not, and some may result from noisy transcript splicing that does not contribute to the protein pool. However, until further studies using comparable AS indexes are carried out, it will be impossible to estimate the extent to which increases in AS levels along the phylogenetic tree have affected the pool of functional transcripts.
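Expectation (A) can be checked computationally for a given transcript. The sketch below flags a stop codon as premature using a widely used heuristic that is not stated in this review: a stop codon lying roughly 50 nucleotides or more upstream of the last exon-exon junction is assumed to make the transcript a candidate for nonsense-mediated decay. The function names, the coordinate convention (positions counted from the first base of the start codon), and the 50-nucleotide threshold are assumptions made for this illustration.

```python
from typing import List, Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_position(mrna_from_start: str) -> Optional[int]:
    """0-based index of the first in-frame stop codon, or None if there is none.

    The sequence must begin at the first base of the start codon.
    """
    seq = mrna_from_start.upper()
    for i in range(0, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return i
    return None


def has_premature_stop(mrna_from_start: str,
                       exon_junctions: List[int],
                       min_distance: int = 50) -> bool:
    """Heuristic check for a premature termination codon.

    exon_junctions: 0-based positions, in the same coordinates as the
    sequence, of the first base downstream of each exon-exon junction.
    Returns True if the first in-frame stop codon ends at least
    `min_distance` nucleotides upstream of the last junction.
    """
    stop = first_stop_position(mrna_from_start)
    if stop is None or not exon_junctions:
        return False
    last_junction = max(exon_junctions)
    return last_junction - (stop + 3) >= min_distance
```

Applying such a check to the AS transcripts of several species, with the same threshold throughout, would be one way to ask whether the proportion of transcripts carrying a premature stop codon has changed along the phylogenetic tree.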
6. Alternative Splicing and Gene Duplication
Gene duplication (GD) is considered a prime source of functional innovation in the genome. Newly duplicated genes can diverge in function, and gene duplication is thought to be key in driving the evolution of developmental and morphological complexity in vertebrates. Alternative splicing, as a prevalent mechanism that also increases protein diversity, has been proposed as a potential player in the evolution of eukaryotes [4, 6]. By examining the relationship between gene duplication and alternative splicing we can better understand the extent to which both mechanisms are equivalent means for protein diversification. Several studies have reported a negative correlation between AS and gene family size in human and mouse [6, 65, 75, 76] and worm [71, 77] (Table 1). It is tempting to conclude that AS and GD are interchangeable and that there is a universal negative correlation from worm to human. However, the relationship between the two variables is marginal at best, and it is not consistent when singleton genes, which have a lower AS level than multigene families, are included [76, 78, 79]. Jin et al. suggested that singletons are under stronger evolutionary constraint than duplicates, which hampers their gain of AS isoforms. Consistent with this hypothesis, Lin et al. found that singletons differ from multigene families in several aspects, suggesting that they have followed different evolutionary paths. Even if we focus on multigene families only, a negative correlation between AS and gene family size may be a byproduct of the covariance of AS and gene family size with other factors. For example, gene age and biased duplication have been proposed as explanations. This study has cast doubt on the relationship between AS and GD, and it may indeed provide support for the suggestion that AS and GD have little or no equivalence concerning effects on protein sequence, structure, and function.
As most studies have examined a small number of model species, it is difficult to assess the extent of the link between AS and GD. In addition, the snapshot approach of comparing gene family size (GFS) and AS in a single genome might hide the true relationship between AS and GFS.
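A relationship of this kind is usually summarised with a rank correlation between per-gene AS level and gene family size. The following self-contained Python sketch computes a Spearman coefficient with average ranks for ties; it is an illustration only, the function names are hypothetical, and the studies cited above may have used different statistics or corrections.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (vx * vy) ** 0.5


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    return pearson(average_ranks(x), average_ranks(y))
```

Given the caveats above, it would be natural to compute the coefficient twice, once with singleton genes included and once for multigene families only, and to use a coverage-corrected AS measure as the first variable.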
7. Alternative Splicing’s Contribution to Functional Innovation
Alternative splicing has been hailed as the missing source of information in the genome accounting for the evolution of higher complexity despite the near static gene number in metazoans over the last 800 million years. Wegmann et al. found that breadth of gene expression is positively correlated with the number of new transcript isoforms and proposed that an increase in gene expression breadth is essential for acquiring new transcript isoforms, which could be maintained by a new form of balancing selection. Moreover, experimental and bioinformatics analyses have shown that AS can generate a variety of functional mRNAs and protein products, displaying distinct stability properties, subcellular localization, and function, as well as distinct roles at specific stages of cell differentiation, sex differentiation [83, 84], and development.
Single-gene studies have provided examples where alternative splicing can lead to functional innovation before any events of gene duplication have taken place. One such example is that of Troponin I (TnI), which plays a key role in muscle contraction. In the vertebrate genome, TnI exists in three copies, each expressed in a different muscle type (fast skeletal, slow skeletal, and cardiac). In Ciona, one of the closest relatives of vertebrates, TnI is present as a single gene. Interestingly, however, the Ciona gene produces three distinct alternatively spliced isoforms, each found to resemble the expression profile of one of the vertebrate genes, suggesting that the specialisation of the TnI proteins to function in each muscle type preceded gene duplication events. This pattern of alternative splice variants in ancestrally single genes resembling expression profiles of genes later duplicated has also been found in synapsin-2 genes in tetrapods and MITF genes in teleost fish species [87, 88]. These examples suggest that alternative splicing can be a mechanism for functional innovation preceding events of gene duplication through one of three possible paths (Figure 4):
(a) Splicing signal degeneration
(b) Exonization of non-coding DNA or transposons
(c) Exon duplication and specialization of isoforms
Genes may also gain further alternative splicing and regulation after duplication, in step with the increasing complexity of the organ systems after the divergence of protochordates and vertebrates. A comparison of the Pax transcription factor genes in vertebrates and amphioxus found at least 52 reported alternative splicing events in vertebrates compared to 23 events in amphioxus. Furthermore, vertebrate Pax genes have maintained most of their ancestral functions and also expanded their expression. Novel alternative splicing of Pax genes has been shown to modify the functional domain content (e.g., DNA binding) and transactivation capacities of the resulting protein products. For example, a novel alternative transcript of Pax3 can transactivate a cMET reporter construct in mouse. These additional isoforms of Pax3 have been proposed to play a functional role in the acquisition of new roles at the neural plate in vertebrates. Similarly, vertebrate-specific AS events of exon 5a in Pax4 and Pax6 have been linked to functional roles in the development of the vertebrate eye [89, 92]. Therefore, it is reasonable to hypothesise that, besides gene duplication, alternative splicing plays important roles in acquiring novel functions contributing to the complexity of the organ systems after the divergence of protochordates and vertebrates. The potential role in functional innovation of the increasing prevalence of AS in vertebrates remains to be explored in more gene families and at the genomewide level, which will further our understanding of how AS contributes to functional innovation.
Here we have reviewed evidence from genomewide studies, as well as possible avenues for future comparative studies, regarding the potential of alternative splicing as a source of functional innovation during the evolution of the eukaryotic genome. While it is now clear that AS is prevalent in the human genome, obstacles still remain in assessing how alternative splicing has evolved through time. The main obstacle is that, while most other genomic features can be directly measured or estimated from genomic sequences alone, no accurate estimates of alternative splicing can be obtained from genomic sequence analysis. The reliance on the availability of transcript sequences to measure AS, together with the strong bias introduced by unequal transcript coverage, has hampered the genomewide assessment of AS in all but a few model species and makes any direct comparison between species difficult. This has slowed down the study of how alternative splicing has evolved over time, how AS is regulated, and how it may relate to other genomic features and, most crucially, to phenotype. The ever-increasing transcript profiling of many more species, combined with the use of comparable index estimates, will make it possible to address a number of questions regarding the evolution of AS and its implications for the evolution of transcript diversity and functional innovation.
Conflict of Interests
The authors declare no conflict of interests.
The authors wish to thank Humberto Gutierrez for comments on earlier versions of this paper. This work was funded by UK-China Scholarship for Excellence and University of Bath Research Studentship to L. Chen, a CONACyT Scholarship to J. M. Tovar-Corona, and a Royal Society Dorothy Hodgkin Research Fellowship, Royal Society Research Grant, and a Royal Society Research Grant for Fellows to A. O. Urrutia.
- E. S. Lander, L. M. Linton, B. Birren et al., “Initial sequencing and analysis of the human genome,” Nature, vol. 409, no. 6822, pp. 860–921, 2001.
- J. C. Venter, M. D. Adams, E. W. Myers et al., “The sequence of the human genome,” Science, vol. 291, no. 5507, pp. 1304–1351, 2001.
- H. R. Crollius, O. Jaillon, A. Bernot et al., “Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence,” Nature Genetics, vol. 25, no. 2, pp. 235–238, 2000.
- B. R. Graveley, “Alternative splicing: increasing diversity in the proteomic world,” Trends in Genetics, vol. 17, no. 2, pp. 100–107, 2001.
- N. Kim, A. V. Alekseyenko, M. Roy, and C. Lee, “The ASAP II database: analysis and comparative genomics of alternative splicing in 15 animal species,” Nucleic Acids Research, vol. 35, no. 1, pp. D93–D98, 2007.
- T. W. Nilsen and B. R. Graveley, “Expansion of the eukaryotic proteome by alternative splicing,” Nature, vol. 463, no. 7280, pp. 457–463, 2010.
- E. T. Wang, R. Sandberg, S. J. Luo et al., “Alternative isoform regulation in human tissue transcriptomes,” Nature, vol. 456, no. 7221, pp. 470–476, 2008.
- Q. Pan, O. Shai, L. J. Lee, B. J. Frey, and B. J. Blencowe, “Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing,” Nature Genetics, vol. 40, pp. 1413–1415, 2008.
- S. Stamm, S. Ben-Ari, I. Rafalska et al., “Function of alternative splicing,” Gene, vol. 344, pp. 1–20, 2005.
- B. R. Graveley, A. N. Brooks, J. W. Carlson et al., “The developmental transcriptome of Drosophila melanogaster,” Nature, vol. 471, no. 7339, pp. 473–479, 2011.
- S. A. Filichkin, H. D. Priest, S. A. Givan et al., “Genome-wide mapping of alternative splicing in Arabidopsis thaliana,” Genome Research, vol. 20, no. 1, pp. 45–58, 2010.
- L. T. Chow, R. E. Gelinas, T. R. Broker, and R. J. Roberts, “An amazing sequence arrangement at the 5′ ends of adenovirus 2 messenger RNA,” Cell, vol. 12, no. 1, pp. 1–8, 1977.
- S. M. Berget, C. Moore, and P. A. Sharp, “Spliced segments at the 5′ terminus of adenovirus 2 late mRNA,” Proceedings of the National Academy of Sciences of the United States of America, vol. 74, no. 8, pp. 3171–3175, 1977.
- F. W. Alt, A. L. M. Bothwell, M. Knapp et al., “Synthesis of secreted and membrane-bound immunoglobulin mu heavy chains is directed by mRNAs that differ at their 3′ ends,” Cell, vol. 20, no. 2, pp. 293–301, 1980.
- P. Early, J. Rogers, M. Davis et al., “Two mRNAs can be produced from a single immunoglobulin μ gene by alternative RNA processing pathways,” Cell, vol. 20, no. 2, pp. 313–319, 1980.
- I. I. Artamonova and M. S. Gelfand, “Comparative genomics and evolution of alternative splicing: the pessimists' science,” Chemical Reviews, vol. 107, no. 8, pp. 3407–3430, 2007.
- E. M. Zdobnov and R. Apweiler, “InterProScan—an integration platform for the signature-recognition methods in InterPro,” Bioinformatics, vol. 17, no. 9, pp. 847–848, 2001.
- A. Bateman, L. Coin, R. Durbin et al., “The Pfam protein families database,” Nucleic Acids Research, vol. 32, pp. D138–D141, 2004.
- J. D. Bendtsen, H. Nielsen, G. von Heijne, and S. Brunak, “Improved prediction of signal peptides: SignalP 3.0,” Journal of Molecular Biology, vol. 340, no. 4, pp. 783–795, 2004.
- A. Krogh, B. Larsson, G. von Heijne, and E. L. L. Sonnhammer, “Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes,” Journal of Molecular Biology, vol. 305, no. 3, pp. 567–580, 2001.
- K. Nakai and P. Horton, “PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization,” Trends in Biochemical Sciences, vol. 24, no. 1, pp. 34–35, 1999.
- D. B. Malko, V. J. Makeev, A. A. Mironov, and M. S. Gelfand, “Evolution of exon-intron structure and alternative splicing in fruit flies and malarial mosquito genomes,” Genome Research, vol. 16, no. 4, pp. 505–509, 2006.
- G. Ast, “How did alternative splicing evolve?” Nature Reviews Genetics, vol. 5, no. 10, pp. 773–782, 2004.
- E. Kim, A. Magen, and G. Ast, “Different levels of alternative splicing among eukaryotes,” Nucleic Acids Research, vol. 35, no. 1, pp. 125–131, 2007.
- B. B. Wang and V. Brendel, “Genomewide comparative analysis of alternative splicing in plants,” Proceedings of the National Academy of Sciences of the United States of America, vol. 103, no. 18, pp. 7175–7180, 2006.
- E. Kim, A. Goren, and G. Ast, “Alternative splicing: current perspectives,” BioEssays, vol. 30, no. 1, pp. 38–47, 2008.
- M. Chen and J. L. Manley, “Mechanisms of alternative splicing regulation: insights from molecular and genomics approaches,” Nature Reviews Molecular Cell Biology, vol. 10, no. 11, pp. 741–754, 2009.
- B. M. N. Brinkman, “Splice variants as cancer biomarkers,” Clinical Biochemistry, vol. 37, no. 7, pp. 584–594, 2004.
- J. P. Venables, “Unbalanced alternative splicing and its significance in cancer,” BioEssays, vol. 28, no. 4, pp. 378–386, 2006.
- G. S. Wang and T. A. Cooper, “Splicing in disease: disruption of the splicing code and the decoding machinery,” Nature Reviews Genetics, vol. 8, no. 10, pp. 749–761, 2007.
- N. López-Bigas, B. Audit, C. Ouzounis, G. Parra, and R. Guigó, “Are splicing mutations the most frequent cause of hereditary disease?” FEBS Letters, vol. 579, no. 9, pp. 1900–1903, 2005.
- Y. Yu, P. A. Maroney, J. A. Denker et al., “Dynamic regulation of alternative splicing by silencers that modulate 5′ splice site competition,” Cell, vol. 135, no. 7, pp. 1224–1236, 2008.
- T. A. Hughes, “Regulation of gene expression by alternative untranslated regions,” Trends in Genetics, vol. 22, no. 3, pp. 119–122, 2006.
- R. F. Luco and T. Misteli, “More than a splicing code: integrating the role of RNA, chromatin and non-coding RNA in alternative splicing regulation,” Current Opinion in Genetics & Development, vol. 21, no. 4, pp. 366–372, 2011.
- B. R. Graveley, “Alternative splicing: regulation without regulators,” Nature Structural & Molecular Biology, vol. 16, no. 1, pp. 13–15, 2009.
- R. F. Luco, Q. Pan, K. Tominaga, B. J. Blencowe, O. M. Pereira-Smith, and T. Misteli, “Regulation of alternative splicing by histone modifications,” Science, vol. 327, no. 5968, pp. 996–1000, 2010.
- S. Shukla, E. Kavak, M. Gregory et al., “CTCF-promoted RNA polymerase II pausing links DNA methylation to splicing,” Nature, vol. 479, no. 7371, pp. 74–79, 2011.
- R. F. Luco, M. Allo, I. E. Schor, A. R. Kornblihtt, and T. Misteli, “Epigenetics in alternative pre-mRNA splicing,” Cell, vol. 144, no. 1, pp. 16–26, 2011.
- D. Adams, L. Altucci, S. E. Antonarakis et al., “BLUEPRINT to decode the epigenetic signature written in blood,” Nature Biotechnology, vol. 30, no. 3, pp. 224–226, 2012.
- Y. Barash, J. A. Calarco, W. Gao et al., “Deciphering the splicing code,” Nature, vol. 465, no. 7294, pp. 53–59, 2010.
- E. W. Sayers, T. Barrett, D. A. Benson et al., “Database resources of the National Center for Biotechnology Information,” Nucleic Acids Research, vol. 37, no. 1, pp. D5–D15, 2009.
- S. H. Nagaraj, R. B. Gasser, and S. Ranganathan, “A hitchhiker's guide to expressed sequence tag (EST) analysis,” Briefings in Bioinformatics, vol. 8, no. 1, pp. 6–21, 2007.
- M. S. Boguski, T. M. J. Lowe, and C. M. Tolstoshev, “dbEST—database for ‘expressed sequence tags’,” Nature Genetics, vol. 4, no. 4, pp. 332–333, 1993.
- J. M. Johnson, J. Castle, P. Garrett-Engele et al., “Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays,” Science, vol. 302, no. 5653, pp. 2141–2144, 2003.
- Z. Wang, M. Gerstein, and M. Snyder, “RNA-Seq: a revolutionary tool for transcriptomics,” Nature Reviews Genetics, vol. 10, no. 1, pp. 57–63, 2009.
- G. Robertson, J. Schein, R. Chiu et al., “De novo assembly and analysis of RNA-seq data,” Nature Methods, vol. 7, no. 11, pp. 909–912, 2010.
- J. A. Martin and Z. Wang, “Next-generation transcriptome assembly,” Nature Reviews Genetics, vol. 12, no. 10, pp. 671–682, 2011.
- R. D. Hawkins, G. C. Hon, and B. Ren, “Next-generation genomics: an integrative approach,” Nature Reviews Genetics, vol. 11, no. 7, pp. 476–486, 2010.
- F. Ozsolak and P. M. Milos, “RNA sequencing: advances, challenges and opportunities,” Nature Reviews Genetics, vol. 12, no. 2, pp. 87–98, 2011.
- A. Mortazavi, B. A. Williams, K. McCue, L. Schaeffer, and B. Wold, “Mapping and quantifying mammalian transcriptomes by RNA-Seq,” Nature Methods, vol. 5, no. 7, pp. 621–628, 2008.
- H. J. Kang, Y. I. Kawasawa, F. Cheng et al., “Spatio-temporal transcriptome of the human brain,” Nature, vol. 478, no. 7370, pp. 483–489, 2011.
- A. Bhasi, P. Philip, V. T. Sreedharan, and P. Senapathy, “AspAlt: a tool for inter-database, inter-genomic and user-specific comparative analysis of alternative transcription and alternative splicing in 46 eukaryotes,” Genomics, vol. 94, no. 1, pp. 48–54, 2009.
- Y. Lee, B. Kim, Y. Shin et al., “ECgene: an alternative splicing database update,” Nucleic Acids Research, vol. 35, no. 1, pp. D99–D103, 2007.
- G. Koscielny, V. Le Texier, C. Gopalakrishnan et al., “ASTD: the alternative splicing and transcript diversity database,” Genomics, vol. 93, no. 3, pp. 213–220, 2009.
- D. Brett, H. Pospisil, J. Valcárcel, J. Reich, and P. Bork, “Alternative splicing and genome complexity,” Nature Genetics, vol. 30, no. 1, pp. 29–30, 2002.
- Z. Y. Kan, D. States, and W. Gish, “Selecting for functional alternative splices in ESTs,” Genome Research, vol. 12, no. 12, pp. 1837–1845, 2002.
- M. Lynch and J. S. Conery, “The origins of genome complexity,” Science, vol. 302, no. 5649, pp. 1401–1404, 2003.
- K. D. Whitney and T. Garland, “Did genetic drift drive increases in genome complexity?” PLoS Genetics, vol. 6, no. 8, 2010.
- Q. Xu and C. Lee, “Discovery of novel splice forms and functional analysis of cancer-specific alternative splicing in human expressed sequences,” Nucleic Acids Research, vol. 31, no. 19, pp. 5635–5643, 2003.
- R. I. Skotheim and M. Nees, “Alternative splicing in cancer: noise, functional, or systematic?” The International Journal of Biochemistry & Cell Biology, vol. 39, no. 7-8, pp. 1432–1449, 2007.
- E. Kim, A. Goren, and G. Ast, “Insights into the connection between cancer and alternative splicing,” Trends in Genetics, vol. 24, no. 1, pp. 7–10, 2008.
- R. E. Green, B. P. Lewis, R. T. Hillman et al., “Widespread predicted nonsense-mediated mRNA decay of alternatively-spliced transcripts of human normal and disease genes,” Bioinformatics, vol. 19, supplement 1, pp. i118–i121, 2003.
- B. P. Lewis, R. E. Green, and S. E. Brenner, “Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans,” Proceedings of the National Academy of Sciences of the United States of America, vol. 100, no. 1, pp. 189–192, 2003.
- Z. Zhang, D. Xin, P. Wang et al., “Noisy splicing, more than expression regulation, explains why some exons are subject to nonsense-mediated mRNA decay,” BMC Biology, vol. 7, article 23, 2009.
- Z. Su, J. Wang, J. Yu, X. Huang, and X. Gu, “Evolution of alternative splicing after gene duplication,” Genome Research, vol. 16, no. 2, pp. 182–189, 2006.
- J. K. Pickrell, A. A. Pai, Y. Gilad, and J. K. Pritchard, “Noisy splicing drives mRNA isoform diversity in human cells,” PLoS Genetics, vol. 6, no. 12, Article ID e1001236, 2010.
- L. Chen, J. M. Tovar-Corona, and A. O. Urrutia, “Increased levels of noisy splicing in cancers, but not for oncogene-derived transcripts,” Human Molecular Genetics, vol. 20, no. 22, pp. 4422–4429, 2011.
- T. A. Thanaraj, F. Clark, and J. Muilu, “Conservation of human alternative splice events in mouse,” Nucleic Acids Research, vol. 31, no. 10, pp. 2544–2552, 2003.
- Q. Pan, M. A. Bakowski, Q. Morris et al., “Alternative splicing of conserved exons is frequently species-specific in human and mouse,” Trends in Genetics, vol. 21, no. 2, pp. 73–77, 2005.
- J. M. Mudge, A. Frankish, J. Fernandez-Banet et al., “The origins, evolution and functional potential of alternative splicing in vertebrates,” Molecular Biology and Evolution, vol. 28, no. 10, pp. 2949–2959, 2011.
- M. Irimia, J. L. Rukov, D. Penny, J. Garcia-Fernandez, J. Vinther, and S. W. Roy, “Widespread evolutionary conservation of alternatively spliced exons in Caenorhabditis,” Molecular Biology and Evolution, vol. 25, no. 2, pp. 375–382, 2008.
- M. Irimia, J. L. Rukov, S. W. Roy, J. Vinther, and J. Garcia-Fernandez, “Quantitative regulation of alternative splicing in evolution and development,” BioEssays, vol. 31, no. 1, pp. 40–50, 2009.
- M. Long, E. Betrán, K. Thornton, and W. Wang, “The origin of new genes: glimpses from the young and old,” Nature Reviews Genetics, vol. 4, no. 11, pp. 865–875, 2003.
- P. Dehal and J. L. Boore, “Two rounds of whole genome duplication in the ancestral vertebrate,” PLoS Biology, vol. 3, no. 10, pp. 1700–1708, 2005.
- N. M. Kopelman, D. Lancet, and I. Yanai, “Alternative splicing and gene duplication are inversely correlated evolutionary mechanisms,” Nature Genetics, vol. 37, no. 6, pp. 588–589, 2005.
- L. Jin, K. Kryukov, J. C. Clemente et al., “The evolutionary relationship between gene duplication and alternative splicing,” Gene, vol. 427, no. 1-2, pp. 19–31, 2008.
- A. L. Hughes and R. Friedman, “Alternative splicing, gene duplication and connectivity in the genetic interaction network of the nematode worm Caenorhabditis elegans,” Genetica, vol. 134, no. 2, pp. 181–186, 2008.
- H. Lin, S. Ouyang, A. Egan et al., “Characterization of paralogous protein families in rice,” BMC Plant Biology, vol. 8, article 18, 2008.
- J. Roux and M. Robinson-Rechavi, “Age-dependent gain of alternative splice forms and biased duplication explain the relation between splicing and duplication,” Genome Research, vol. 21, no. 3, pp. 357–363, 2011.
- D. Talavera, C. Vogel, M. Orozco, S. A. Teichmann, and X. de la Cruz, “The (in)dependence of alternative splicing and gene duplication,” PLoS Computational Biology, vol. 3, no. 3, pp. 375–388, 2007.
- D. Wegmann, I. Dupanloup, and L. Excoffier, “Width of gene expression profile drives alternative splicing,” PLoS ONE, vol. 3, no. 10, Article ID e3587, 2008.
- E. L. Heinzen, D. Ge, K. D. Cronin et al., “Tissue-specific genetic control of splicing: implications for the study of complex traits,” PLoS Biology, vol. 6, no. 12, Article ID e1000001, pp. 2869–2879, 2008.
- B. Hartmann, R. Castelo, B. Miñana et al., “Distinct regulatory programs establish widespread sex-specific alternative splicing in Drosophila melanogaster,” RNA, vol. 17, no. 3, pp. 453–468, 2011.
- R. Blekhman, J. C. Marioni, P. Zumbo, M. Stephens, and Y. Gilad, “Sex-specific and lineage-specific alternative splicing in primates,” Genome Research, vol. 20, no. 2, pp. 180–189, 2010.
- D. W. MacLean, T. H. Meedel, and K. E. M. Hastings, “Tissue-specific alternative splicing of ascidian troponin I isoforms: redesign of a protein isoform-generating mechanism during chordate evolution,” The Journal of Biological Chemistry, vol. 272, no. 51, pp. 32115–32120, 1997.
- W. P. Yu, S. Brenner, and B. Venkatesh, “Duplication, degeneration and subfunctionalization of the nested synapsin-Timp genes in Fugu,” Trends in Genetics, vol. 19, no. 4, pp. 180–183, 2003.
- J. A. Lister, J. Close, and D. W. Raible, “Duplicate mitf genes in zebrafish: complementary expression and conservation of melanogenic potential,” Developmental Biology, vol. 237, no. 2, pp. 333–344, 2001.
- J. Altschmied, J. Delfgaauw, B. Wilde et al., “Subfunctionalization of duplicate mitf genes associated with differential degeneration of alternative exons in fish,” Genetics, vol. 161, no. 1, pp. 259–267, 2002.
- S. Short and L. Z. Holland, “The evolution of alternative splicing in the Pax family: the view from the basal chordate amphioxus,” Journal of Molecular Evolution, vol. 66, no. 6, pp. 605–620, 2008.
- L. Chen, Q. J. Zhang, W. Wang, and Y. Q. Wang, “Spatiotemporal expression of Pax genes in amphioxus: insights into Pax-related organogenesis and evolution,” Science China Life Sciences, vol. 53, no. 8, pp. 1031–1040, 2010.
- T. D. Barber, M. C. Barber, T. E. Cloutier, and T. B. Friedman, “PAX3 gene structure, alternative splicing and evolution,” Gene, vol. 237, no. 2, pp. 311–319, 1999.
- S. Singh, R. Mishra, N. A. Arango, J. M. Deng, R. R. Behringer, and G. F. Saunders, “Iris hypoplasia in mice that lack the alternatively spliced Pax6(5a) isoform,” Proceedings of the National Academy of Sciences of the United States of America, vol. 99, no. 10, pp. 6812–6815, 2002.
- L. Z. Holland and S. Short, “Alternative splicing in development and function of chordate endocrine systems: a focus on Pax genes,” Integrative and Comparative Biology, vol. 50, no. 1, pp. 22–34, 2010.
Copyright © 2012 Lu Chen et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.