- About this Journal ·
- Abstracting and Indexing ·
- Aims and Scope ·
- Annual Issues ·
- Article Processing Charges ·
- Articles in Press ·
- Author Guidelines ·
- Bibliographic Information ·
- Citations to this Journal ·
- Contact Information ·
- Editorial Board ·
- Editorial Workflow ·
- Free eTOC Alerts ·
- Publication Ethics ·
- Reviewers Acknowledgment ·
- Submit a Manuscript ·
- Subscription Information ·
- Table of Contents
International Journal of Evolutionary Biology
Volume 2012 (2012), Article ID 292068, 12 pages
Polyploidy and the Evolution of Complex Traits
1CMB, Karolinska Institute, 17177 Stockholm, Sweden
2DBB, Stockholm University, 10691 Stockholm, Sweden
3MU Informatics Institute, University of Missouri, Columbia, MO 65211, USA
4Division of Animal Sciences, University of Missouri, Columbia, MO 65211-5300, USA
Received 15 March 2012; Revised 29 May 2012; Accepted 5 June 2012
Academic Editor: Ben-Yang Liao
Copyright © 2012 Lukasz Huminiecki and Gavin C. Conant. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
We explore how whole-genome duplications (WGDs) may have given rise to complex innovations in cellular networks, innovations that could not have evolved through sequential single-gene duplications. We focus on two classical WGD events, one in bakers’ yeast and the other at the base of vertebrates (i.e., two rounds of whole-genome duplication: 2R-WGD). Two complex adaptations are discussed in detail: aerobic ethanol fermentation in yeast and the rewiring of the vertebrate developmental regulatory network through the 2R-WGD. These two examples, derived from diverged branches on the eukaryotic tree, boldly underline the evolutionary potential of WGD in facilitating major evolutionary transitions. We close by arguing that the evolutionary importance of WGD may require updating certain aspects of modern evolutionary theory, perhaps helping to synthesize a new evolutionary systems biology.
Characteristic changes in karyotype number have allowed researchers to infer polyploidy events for many decades . It was thus with a reasonably long history of research that Susumo Ohno was able to suggest that polyploidy was a vital route to evolutionary innovation . Ohno was of course a forceful proponent of a general role for duplication in evolution: writing that “[if evolution occurred only through changes allele frequencies] … from a bacterium only numerous forms of bacteria would have emerged […B]ig leaps in evolution required the creation of new gene loci with previously nonexistent functions” . What is less obvious on first reading is his distinction between the role played by WGD and that played by other, smaller scale, duplications (or SSDs). While the differences in the scales of these events are self-evident, there are at least two other features of WGD that are critical in giving rise to these differing roles. The first is that, as many authors have reported, particular functional classes of genes (e.g., transcription factors, kinases, ribosomal proteins, and cyclins) are duplicated by WGD more frequently than by SSD [3–8]. Ohno had in fact explored the most likely reason for this difference: “hub” genes with many interactions with other loci, be those interactions regulatory, protein interaction or metabolic, will tend to respond poorly to a change in copy number. As a result, they will tend to survive in duplicate after WGD but will not survive after smaller scale events [2, 5, 9–11]. This idea has now been termed the dosage balance hypothesis [12–14].
The second difference between single-gene and genome duplication is the kind of adaptations each may give rise to. Interest in gene duplication is intense in evolutionary biology circles because, as Haldane recognized , duplication is a powerful means for generating genetic material with the potential for innovation. There are many models of duplicate gene evolution : probably the most discussed are neofunctionalization [1, 2, 16], whereby one copy of a duplicate gene pair acquires a new beneficial function after the duplication, and subfunctionalization, where multifunctioned genes have their functions subdivided by duplication [17–19]. Since some of these subfunctions might themselves be novel and suffer from antagonistic pleiotropy (e.g., one subfunction cannot be optimized without detrimentally altering the other; [17, 20]) subfunctionalization can represent an important path to innovation. What genome duplication brings to this story is the potential for multigene novelties : with a duplication of the entire genome to explore, evolution has more space to innovate. In this paper, we explore the evidence for multi-gene innovations in yeast and animals resulting from their respective WGDs [8, 22–25]. We then discuss in detail two key innovations that are associated with WGD: aerobic ethanol fermentation in yeast and increased complexity in the vertebrate developmental regulatory network. In so doing, we will remind ourselves of Francois Jacob’s insight as to the mechanisms of evolution: the innovations produced are in keeping with the work of a tinkerer, not an engineer , and are contingent on their possessors’ evolutionary history .
2. WGD and Single-Gene Innovations
The existence of neutral models of duplicate gene resolution [18, 19] and apparent examples of their action after WGD  means that, before pursuing multi-gene adaptations from WGD, it is worthwhile to pause and ask whether examples of single-gene innovations due to WGD are known. We do so even though those innovations may appear no different than what might be expected from an SSD event. As a matter of fact, there are good examples from yeast. For instance, consider the S. cerevisiae WGD-produced paralogs GAL1 and GAL3: a sugar kinase and a regulator, respectively . In the non-WGD Kluyveromyces lactis, the single ortholog of these two genes possesses both functions . However, these two ohnologs  are not simply an example of neutral subfunctionalization: Hittinger and Carroll  have shown an adaptive conflict in the promoter of the K. lactis gene that was resolved by the gene duplication. In particular, it would be more “cost-effective” to have highly dynamic expression in the K. lactis GAL1 gene, with strong repression in the absence of galactose. However, because this same locus also encodes the regulatory function performed by the Gal3 protein in S. cerevisiae, such strong repression would result in insufficient expression of GAL1 to perform its regulatory function in the absence of galactose. Gene duplication allowed a decoupling of the expression levels of these two distinct functions. The WGD-produced duplication was thus exploited as the last step in the evolutionary development of a metabolic subsystem with a fine degree of transcriptional control.
3. Multigene Adaptations
The most unique potential impact of genes duplicated at WGD, however, is not in single-gene adaptations. Instead, it is the potential for correlated changes across multiple genes resulting in altered cellular networks, including signal transduction and transcriptional regulatory networks. That such changes occur is indirectly suggested by the observations that duplicates from the yeast WGD are more likely to be part of protein complexes and more likely to share protein interaction partners than SSD duplicates [10, 32]. The products of such retained duplicates are also enriched for proteins regulated by phosphorylation . Both observations are in keeping with the expectations of the dosage balance hypothesis . Similarly, we have shown an example of coherent changes in the coexpression networks of S. cerevisiae. To do so, we used an algorithm for detecting subdivided networks. This algorithm divides genes (connected by edges if they are coexpressed across multiple microarray experiments; ) into two columns, where each row consists of a pair of WGD-produced paralogs (Figure 1(a)). We then searched for the arrangement of genes that minimized the number of edges crossing between columns and compared that number to the number of such crossing edges seen in randomized networks. The relative paucity of crossing edges in the real network suggests network subfunctionalization, where groups of ohnologs are subdivided into two co-expression clusters .
3.1. WGD and the Crabtree Effect
While these global patterns of change after WGD suggest large-scale alterations, the best example of a change that can be at least provisionally tied to a phenotype is the evolution of the Crabtree effect. Baker’s yeast is somewhat unusual in its metabolism: even when oxygen is available, it prefers to only partially oxidize glucose into ethanol rather than fully oxidize it into CO2 and water (the Crabtree effect; [35, 36]). This fermentative lifestyle is odd insomuch as it is energetically less favorable than the complete conversion of sugars into carbon dioxide (e.g., respiration). However, there is a general association between whether or not a yeast species possesses the ancient WGD and the Crabtree effect .
One clue to the source of this apparent paradox can be found in a group of duplicated genes from the WGD, all involved in the early stages of glucose metabolism. These genes include two glucose sensors (SNF3 and RGT2), two glucose transporters (HXT6/HXT1), and two duplicate enzymes that catalyze the initial step of glycolysis (e.g., the hexokinases HXK1 and HXK2). Strikingly, in all three ohnolog pairs, one member acts when glucose concentrations are low and the other when they are high [38–40]. A second piece of the puzzle is due to theoretical work on resource competition among organisms inhabiting a large but ephemeral environmental resource. Such competition among cells can actually favor lineages that rapidly oxidize glucose relative to their more efficient but slower-growing competitors [41–43]. This tragedy of the commons  occurs because even though the efficient cells are able to convert more glucose into energy, they pay for this efficiency in reduced temporal growth rates, meaning that the fast, wasteful, cells can come to numerically dominate the resource patch.
Given these observations and expectations, we and others proposed that the yeast WGD had several effects on its patterns of glucose metabolism (Figure 2). First, we proposed that the increase in gene copy number produced by the WGD gave rise (after some gene losses in other parts of the genome) to an increased flux through glycolysis [37, 47, 51, 52]. Second, because oxidative phosphorylation of pyruvate is constrained by oxygen concentrations and the spatial structure of the mitochondria, the WGD-possessing cells were required to redirect some of this increased glycolytic flux to the (previously anaerobic) fermentative pathways . The result was likely to induce the sort of competitive situation between efficient and inefficient cells just described. A degree of independent confirmation to these ideas was provided by Van Hoek and Hogeweg , who were able to show computationally that similar WGD events modeled in modern S. cerevisiae could also be expected to result in over retention of glycolytic enzymes and increased glycolytic flux. Recent work in our lab also supports this contention, showing that duplicate losses immediately after WGD were biased toward genes coding for low-flux enzymes (Figure 2, unpublished data).
If the WGD was in fact a trigger for moving S. cerevisiae and its relatives down a path toward increasing Crabtree effect, we would expect it to have been followed by later evolutionary changes reinforcing this propensity. Indeed, at least two such post-WGD changes are known. First, in yeasts with the WGD, loss of cis-regulatory elements among the genes for the mitochondrial ribosomal proteins has de-coupled the expression of the cytosolic and mitochondrial ribosomal proteins . This change had an important effect: S. cerevisiae can now upregulate production of cytosolic ribosomes independently of the mitochondrial ones, an outcome that increases the efficiency of aerobic fermentation by avoiding unnecessary ribosome synthesis in the quiescent mitochondria. The second example is a post-WGD SSD event in the alcohol dehydrogenase family. The result of this event was two specialized ADH loci, one for the synthesis of ethanol and a second isoenzyme responsible for the back-conversion of ethanol to pyruvate (once glucose is exhausted Crabtree yeasts can reimport and respire the ethanol they previously produced; ). Such specialization likely would only have been beneficial in the context of a preexisting WGD-produced Crabtree adaptation.
3.2. Other Examples of Coordinated Evolution in Post-WGD Yeasts
There are at least two other cellular subsystems in S. cerevisiae that show evidence of large-scale changes after WGD, although the details are less well understood than is the case for metabolism. First, in the transcriptional regulatory network, pairs of transcription factors duplicated at WGD, while still showing detectable similarities in their targets inherited from the WGD, have diverged considerably (Figure 1(b); [45, 56]). More interestingly, the cytosolic ribosomal proteins in S. cerevisiae were highly over-retained after-WGD , representing roughly 10% of all retained duplicates, despite being less than 4% of the pre-WGD genome [57, 58]. These duplicates are extremely curious in that many of them have undergone considerable gene conversion, such that, despite their divergence at the ancient WGD, they have virtually identical protein sequences in modern bakers’ yeast [23, 58]. At first blush, this result could be explained in terms of selection for high copy number  and the dosage balance hypothesis. The story became mysterious, however, with the discovery that several of these paralogs, while nearly identical in protein sequence, have distinctly different knockout phenotypes [60–62]. In keeping with the idea of coordinated evolution among multiple paralogs, a number of these duplicated pairs show asymmetric specialization of one of the two ohnologs to expression in the developing bud of the yeast cell [61, 62]. We speculate that these ribosomal proteins will represent another example of a system-level specialization induced by the WGD. In this view, the rampant gene conversion is a result of the highly interactive nature of the ribosome. Thus, both paralogs must “fit” exactly into the complex ribosomal structure and what differs is not their protein function but their expression domain.
3.3. WGD and Evolutionary Innovations in Plants
WGD is rampant in plant genomes, particularly those of angiosperms [63, 64]. The systems and network biology of these events have recently been extensively reviewed [65–68], and we will not attempt to do justice to the subject here. However, we do note that while the complexity of plant biology makes identifying precise evolutionary trajectories quite difficult, there are several suggestive coincidences of timing between the origins of new traits and the duplication of regulatory genes involved in those traits . For example, glucosinolates are a class of secondary metabolites, the diversity of which has become expanded in the model plant Arabidopsis thaliana and its relatives. If one maps this expansion onto the phylogeny of these plants, it is curiously close to one of the Arabidopsis WGD events. Even more strikingly, several of the regulators and enzymes responsible for glucosinolate production in Arabidopsis have surviving duplications from that WGD . More generally, we have recently shown  that the pattern of post-WGD duplicate retention in the Arabidopsis metabolic network seems to be driven by two different forces: a tendency to initially retain clusters of related enzymes (as would be expected under the dosage balance hypothesis) followed by a selective regime that appears to retain duplicates for reactions of high flux (similar to situation seen in S. cerevisiae).
3.4. 2R and the Remodeling of the Vertebrate Developmental and Signal Transduction Networks
Another example of WGD-induced functional innovation at the systems level concerns the vertebrate developmental toolkit and signal transduction engines. The metazoans, because they have bodies organized into distinct tissues, are clearly characterized by significant phenotypic complexity. They seem to have appeared about 640 million years ago and may have been preceded by other multicellular lineages of uncertain relationships . On the basis of mitochondrial DNA sequence comparisons, the choanoflagellates have been identified as the closest single-celled animal relatives [72, 73] with the basal metazoan being either the placozoans [74–76] or the sponges [77, 78]. Although the role of WGD in metazoan evolution is not fully understood, several examples of WGDs among the vertebrates have been identified . These include two rounds (2R) of genome duplication at the base of vertebrates (2R-WGD; ), the fish-specific genome duplication (FSGD; [4, 79, 80]), and WGDs in the genus Xenopus .
Despite their phenotypic complexity, animals’ gene content is not vastly greater than that of other organisms [82, 83]. Part of the explanation for this relative paucity of extra genes is the nature of development, which occurs by sequential differentiation in bifurcating cell lineages rather than through entirely distinct differentiation programs for each tissue. Nonetheless, the transformation to multicellularity must have been accompanied by appearance of new genes coding for adhesion molecules, extracellular matrix proteins (such as collagen), and cell-to-cell communication. Indeed, considerable progress has been made in identifying the novel signaling pathways involved in control of development and body plan formation . In keeping with the theme of relatively little genome expansion coupled to the appearance of the metazoans, only a small fraction of the genes in the genome contribute to the development of the body plan. However, these genes make up a developmental toolkit that is strongly conserved across the eumetazoans. Transcriptional factors of particular interest are homeobox genes (Hox, ParaHox, EHGbox, and NK-like); KLF, Osr and Sp1/Egr genes, tlx, Snail, and slug zinc-finger proteins; MASH, myoD, mef, hairy, and twist helix-loop-helix transcriptional factors; T-box transcriptional factors [85–87]. These transcriptional regulators interact with the outside world through signal transduction pathways, the most important of which are those employing transforming growth factor-β (TGF-β), Wnt, Notch, Hedgehog, Toll, tyrosine kinase receptors, the nuclear hormone receptors, and the G-protein-coupled receptors. The identification of the shared toolkit of signalling pathways underlying animal development is a key discovery of modern biology. Following this work, we have recently found that the vertebrate signal transduction engine was highly modified by the 2R-WGD , suggesting that some of the complexity of vertebrates may have required the innovative capacity of WGD .
3.5. Gene Duplications in the Transforming Growth Factor-β Pathway
Our initial study, focused on the TGF-β pathway, provided early evidence of the impact of the 2R-WGD on vertebrate signaling. This signaling pathway has been long recognized as one of the most fundamental and versatile in metazoans, with central roles in development, organogenesis, stem-cell control, immunity, and cancer . After an investigation of 33 genomes, we showed that the evolution of the TGF-β pathway in animals can be best explained according to the 2R model, with additional duplications in teleost fishes . The components of the core pathway (both receptors and Smads) expanded dramatically and permanently at the base of vertebrates as a result of the 2R-WGD. In particular, four ancestral Smads (an I-Smad, a Co-Smad, and two R-Smads of the BMP and TGF-β sensu stricto channels) gave rise to the eight known Smads of the human genome, classified as two TGF-β sensu stricto (Smad2,3) and three bone-morphogenetic-protein- (BMP-) type (Smad1,5,8) receptor-activated Smads (R-Smads), one common mediator Smad (Co-Smad; Smad4), and two inhibitory Smads (I-Smads; Smad6,7).
3.6. General Expansion of Signaling Pathways after 2R
In a more general analysis, we found that the 2R-WGD affected the overwhelming majority (three quarters) of human signaling genes, with the strongest effect on developmental pathways involving receptor tyrosine kinases, Wnt and TGF-β ligands, GPCRs, and the apoptosis pathway. Unlike genes deriving from recent tandem duplications, genes retained after 2R were enriched in protein interaction domains and multifunctional signaling modules of Ras and MAP-kinase cascades. The set of human 2R-ohnologs (2ROs), corresponding to 9,958 unique Entrez Genes, is enriched in many classic signaling domains (such as tyrosine and serine/threonine kinase domains, the seven-transmembrane receptor domains of the rhodopsin and secretin families, and the Ras family domain), as well as well-known protein interaction domains, including the SH2, SH3, PTB, and PDZ domains.
PDZ domains are particularly interesting as they are abundant in vertebrate neuronal synapses, serving as scaffolds for the assembly of large neurotransmission signaling complexes . Thus, these results suggest that 2R may have provided evolutionary material for subsequent changes in vertebrate brain development. Further evidence for this contention came when we found that 2ROs are preferentially expressed in Gene Expression Atlas samples associated with brain and nervous tissue. These brain-expressed 2ROs are also enriched in Gene Ontology (GO) terms related to synaptic transmission. Studies in fly and mouse have shown that vertebrate synapses are more complex than those of invertebrates : it is thus intriguing to speculate as to a role for 2R in inducing this phenomenon.
Another potential source of vertebrate neuronal complexity is their use of apoptosis to shape brain structures and compartments. We found that the apoptosis pathway was dramatically remodeled through 2R . Figure 3 illustrates the complex topology of the human apoptosis signaling subnetwork created by 2R . It is clear that coordinated duplications of caspases resulted in a substantial evolutionary novelty. Moreover, the complexity of the evolutionary changes introduced by 2R is best appreciated by examining the conservation of regulatory interactions (directed edges in the network). To better illustrate changes in network topology induced by the 2R-WGD, we subdivided the conserved edges into those originating from a shared regulator and acting on a pair of 2ROs (conserved incoming edges—CIEs), and those originating from a paralogous pair directed towards a shared target (conserved outgoing edges—COEs). CIEs suggest a common conserved regulator, located upstream in terms of information flow. In contrast, COEs indicate evolutionary conservation of a common regulatory target, located downstream (Figure 3).
Finally, while many genes for ancient cellular functions were not retained in duplicate after 2R, the genes of the cell cycle are an exception to this rule (an interesting link to the overretention of cyclins after the yeast WGD; ). Most cyclins, including key cell cycle-regulating groups A, B, and D, underwent diversification at the base of vertebrates and are represented by between two and four vertebrate-specific paralogs derived from the 2R-WGD . Similarly, cyclin-dependent kinases, cyclin-dependent kinase inhibitors, and orthologs of the S. pombe WEE1 inactivator of the CDK/cyclin complex (Wee1 and Wee2) were also retained in duplicate after 2R . Strikingly, cyclins D1-D3 respond to extracellular mitogens, cytokines, hormones, and juxtacrine ligands, providing an interface between signal transduction and the cell cycle. These cyclins then pair with CDKs 4 and 6, driving the transition to G1 [94–96]. It would be very informative to test if cyclins D1-D3 and CDK4/6 simply increase robustness of the cell cycle. If not, there may be functional differences between the 2R-derived cyclin D/CDK complexes in terms of the upstream signaling pathways they integrate or the downstream target genes they activate .
3.7. 2R and Vertebrate Complexity
In contrast to the predictions of the dosage balance hypothesis, vertebrate genes having developmental expression were more likely to revert to single copy after whole-genome duplication [97, 98]. However, this observation may be qualified by the fact that, after the FSGD, almost all retained duplicates have diverged in spatial and/or temporal expression during embryogenesis, and many were key developmental genes that function as transcription factors or signaling molecules during embryogenesis . These general trends of retention and expression change, as well as the above functional analyses, clearly indicated that 2R fundamentally altered vertebrates’ signaling pathways and cell cycles . In consequence, it may have set the stage for the emergence of other key vertebrate evolutionary novelties (such as complex brains, the circulatory system, or heart, bone, cartilage, musculature, and adipose tissues; [71, 100]).
It should also be noted that the methodology used in these studies of the 2R-WGD  precluded an investigation of the amphioxus genome, as this genome was not included in release 6 of the TreeFam database. However, in other studies, the genome of the cephalochordate Branchiostoma floridae (e.g., amphioxus or lancelet) provided very strong evidence in support of the 2R hypothesis [101, 102]. Another strategically positioned pre-2R genome, that of sea urchin, is being developed as a developmental and systems biology model for understanding gene regulatory network evolution, which, together with the signal transduction pathways of this species, has been particularly well annotated [103–105]. Comparisons of sea urchin’s developmental regulatory networks with those of vertebrates is likely to reveal further insights into the impact of the 2R-WGD.
4. Concluding Thoughts
The broader significance of these changes for our understanding of the forces and mechanisms driving the evolutionary process could well be extremely significant. Firstly, we propose that WGDs, like human technical innovations such as the railroad, greatly expand of genotypic and phenotypic space that might be explored by evolution. For example, the 2R quadrupling of components of the vertebrate signaling network not only immediately expanded the available space of signaling network states, but also kick-started rapid co-evolution of nodes into novel topologies during the subsequent “diploidization.” We have also recently proposed that WGD has an important role in evolutionary transitions by relaxing epistatic constraints , effectively increasing the size of the neutral genetic space in which innovation can occur . Secondly, an exciting possibility exists that at least some WGDs may be instantaneous speciations: if so, they would be evolutionary events whose occurrence is somewhat in contrast to an exclusively gradualist view of evolution. Early authors of modern synthesis, coming from background in population genetics, were perhaps overly wedded to gradualism, where natural selection acts on small variations in large populations. The molecular mechanism of WGDs is most likely auto- or allopolyploidy. WGDs could therefore be interpreted as saltations, that is, sudden evolutionary changes occurring within a single generation. However, population genetic processes are of course of central importance during subsequent re-diploidization. During that gradual process of duplicate loss over millions of years, there may be losses driven by natural selection acting to fix null mutants for duplicated loci, a process which fits well with Neo-Darwinian views.
In a related vein, gene duplications may have a role in enhancing robustness—the organism’s resilience to genetic or environmental perturbations . At the simplest level, duplication provides short-term robustness through genetic “backups.” However, WGDs could also lead to an increase in distributed robustness, which is a consequence of the existence of multiple solutions to the same biological problem. A well-known example of this idea is the redundant paths through metabolic networks that confer robustness [48, 108]. It is fairly straightforward to envisage an analogous situation in signal transduction or the cell cycle: multiple regulatory mechanisms could in that case increase the level of control, allowing, for instance, the development of complex vertebrate embryos with many novel organs and tissue types.
Genome duplication might have also facilitated innovation in other ways. For instance, the establishment of crosstalk between signaling pathways  may have resulted from WGD. The post-WGD redundancy would have allowed the partial subdivision of duplicated pathways, resulting in a network of a higher degree of connectivity and robustness. Thus, it is striking that few novel signaling genes emerged through post-2R events , since SSD events lack the opportunity for this type of change. Another area for future investigation is the impact of 2R-WGD on non-coding genes . Published studies and our own observations indicate that no preferential retention of miRNA genes can be attributed to 2R-WGD . Instead, functional innovation in miRNA regulation appears to have occurred during the more recent mammalian diversification . This suggests a model where major evolutionary transitions exploit expansions in different classes of genomics elements: protein-coding genes at the transition to vertebrates and miRNA genes during diversification of mammals.
It is tempting to hypothesize that gene duplication can initially promote redundancy of system parts, allowing evolutionary tinkering, while genome duplications are correlated with an increase in distributed robustness. In the future, we propose testing this hypothesis by asking if more WGD-produced duplications are found in distinct signaling pathways when compared to SSD gene duplicates of similar age. More generally, both redundancy and robustness provide the evolutionary space for adaptations, and there are suggestions that WGD facilitated the colonization of novel environments and ecological niches [52, 113].
Genome duplication undoubtedly represents a tremendous evolutionary opportunity: the release of epistasis alone that results from WGD may have important implications . However, as the examples described here suggest, the resulting innovations are unlikely to fit neatly into the neofunctionalization/subfunctionalization paradigm [16, 114], nor are they likely to be fully understood without a detailed knowledge of the cellular systems in which they are active.
The authors would like to thank Michaël Bekaert, Patrick Edger, Carl-Henrik Heldin, Corey Hudson, and Chris Pires for helpful discussions. L. Huminiecki is supported by ENFIN, a Network of Excellence funded by the European Commission FP6 Programme, under the thematic area “Life sciences, genomics and biotechnology for health,” Contract no. LSHG-CT-2005-518254, the Swedish Research Council, and Swedish Bioinformatics Infrastructure for Life Sciences (BILS). G. C. Conant is supported by the Reproductive Biology Group of the Food for the 21st Century program at the University of Missouri.
- J. S. Taylor and J. Raes, “Duplication and divergence: the evolution of new genes and old ideas,” Annual Review of Genetics, vol. 38, pp. 615–643, 2004.
- S. Ohno, Evolution by Gene Duplication, Springer, New York, NY, USA, 1970.
- J. M. Aury, O. Jaillon, L. Duret et al., “Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia,” Nature, vol. 444, no. 7116, pp. 171–178, 2006.
- O. Jatllon, J. M. Aury, F. Brunet et al., “Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype,” Nature, vol. 431, no. 7011, pp. 946–957, 2004.
- S. Maere, S. De Bodt, J. Raes et al., “Modeling gene and genome duplications in eukaryotes,” Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 15, pp. 5454–5459, 2005.
- C. Seoighe and K. H. Wolfe, “Yeast genome evolution in the post-genome era,” Current Opinion in Microbiology, vol. 2, no. 5, pp. 548–554, 1999.
- B. C. Thomas, B. Pedersen, and M. Freeling, “Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes,” Genome Research, vol. 16, no. 7, pp. 934–946, 2006.
- K. H. Wolfe and D. C. Shields, “Molecular evidence for an ancient duplication of the entire yeast genome,” Nature, vol. 387, no. 6634, pp. 708–713, 1997.
- I. Wapinski, A. Pfeffer, N. Friedman, and A. Regev, “Natural history and evolutionary principles of gene duplication in fungi,” Nature, vol. 449, no. 7158, pp. 54–61, 2007.
- L. Hakes, J. W. Pinney, S. C. Lovell, S. G. Oliver, and D. L. Robertson, “All duplicates are not equal: the difference between small-scale and genome duplication,” Genome Biology, vol. 8, no. 10, article R209, 2007.
- M. Freeling, “Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition,” Annual Review of Plant Biology, vol. 60, pp. 433–453, 2009.
- J. A. Birchler and R. A. Veitia, “The gene balance hypothesis: from classical genetics to modern genomics,” Plant Cell, vol. 19, no. 2, pp. 395–402, 2007.
- M. Freeling and B. C. Thomas, “Gene-balanced duplications, like tetraploidy, provide predictable drive to increase morphological complexity,” Genome Research, vol. 16, no. 7, pp. 805–814, 2006.
- B. Papp, C. Pál, and L. D. Hurst, “Dosage sensitivity and the evolution of gene families in yeast,” Nature, vol. 424, no. 6945, pp. 194–197, 2003.
- J. B. S. Haldane, “The part played by recurrent mutation in evolution,” American Naturalist, vol. 67, pp. 5–9, 1933.
- H. Innan and F. Kondrashov, “The evolution of gene duplications: classifying and distinguishing between models,” Nature Reviews Genetics, vol. 11, no. 2, pp. 97–108, 2010.
- D. L. Des Marais and M. D. Rausher, “Escape from adaptive conflict after duplication in an anthocyanin pathway gene,” Nature, vol. 454, no. 7205, pp. 762–765, 2008.
- A. Force, M. Lynch, F. B. Pickett, A. Amores, Y. L. Yan, and J. Postlethwait, “Preservation of duplicate genes by complementary, degenerative mutations,” Genetics, vol. 151, no. 4, pp. 1531–1545, 1999.
- A. Stoltzfus, “On the possibility of constructive neutral evolution,” Journal of Molecular Evolution, vol. 49, no. 2, pp. 169–181, 1999.
- C. T. Hittinger and S. B. Carroll, “Gene duplication and the adaptive evolution of a classic genetic switch,” Nature, vol. 449, no. 7163, pp. 677–681, 2007.
- M. Sémon and K. H. Wolfe, “Consequences of genome duplication,” Current Opinion in Genetics and Development, vol. 17, no. 6, pp. 505–512, 2007.
- B. Dujon, D. Sherman, G. Fischer et al., “Genome evolution in yeasts,” Nature, vol. 430, pp. 35–44, 2004.
- M. Kellis, B. W. Birren, and E. S. Lander, “Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae,” Nature, vol. 428, no. 6983, pp. 617–624, 2004.
- F. S. Dietrich, S. Voegeli, S. Brachat et al., “The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome,” Science, vol. 304, no. 5668, pp. 304–307, 2004.
- M. Kasahara, “The 2R hypothesis: an update,” Current Opinion in Immunology, vol. 19, no. 5, pp. 547–552, 2007.
- F. Jacob, “Evolution and tinkering,” Science, vol. 196, no. 4295, pp. 1161–1166, 1977.
- U. Alon, “Biological networks: the tinkerer as an engineer,” Science, vol. 301, no. 5641, pp. 1866–1867, 2003.
- A. Van Hoof, “Conserved functions of yeast genes support the duplication, degeneration and complementation model for gene duplication,” Genetics, vol. 171, no. 4, pp. 1455–1461, 2005.
- P. J. Bhat and T. V. S. Murthy, “Transcriptional control of the GAL/MEL regulon of yeast Saccharomyces cerevisiae: mechanism of galactose-mediated signal transduction,” Molecular Microbiology, vol. 40, no. 5, pp. 1059–1066, 2001.
- F. T. Zenke, R. Engels, V. Vollenbroich, J. Meyer, C. P. Hollenberg, and K. D. Breunig, “Activation of Gal4p by galactose-dependent interaction of galactokinase and Gal80p,” Science, vol. 272, no. 5268, pp. 1662–1665, 1996.
- K. Wolfe, “Robustness—it's not where you think it is,” Nature Genetics, vol. 25, no. 1, pp. 3–4, 2000.
- Y. Guan, M. J. Dunham, and O. G. Troyanskaya, “Functional analysis of gene duplications in Saccharomyces cerevisiae,” Genetics, vol. 175, no. 2, pp. 933–943, 2007.
- G. D. Amoutzias, Y. He, J. Gordon, D. Mossialos, S. G. Oliver, and Y. Van De Peer, “Posttranslational regulation impacts the fate of duplicated genes,” Proceedings of the National Academy of Sciences of the United States of America, vol. 107, no. 7, pp. 2967–2971, 2010.
- G. C. Conant and K. H. Wolfe, “Functional partitioning of yeast co-expression networks after genome duplication,” PLoS Biology, vol. 4, no. 4, article e109, 2006.
- R. Geladé, S. Van de Velde, P. V. Van Dijck, and J. M. Thevelein, “Multi-level response of the yeast genome to glucose,” Genome Biology, vol. 4, no. 11, article 233, 2003.
- M. Johnston and J. H. Kim, “Glucose as a hormone: receptor-mediated glucose sensing in the yeast Saccharomyces cerevisiae,” Biochemical Society Transactions, vol. 33, no. 1, pp. 247–252, 2005.
- A. Merico, P. Sulo, J. Piškur, and C. Compagno, “Fermentative lifestyle in yeasts belonging to the Saccharomyces complex,” FEBS Journal, vol. 274, no. 4, pp. 976–989, 2007.
- P. Herrero, J. Galíndez, N. Ruiz, C. Martinez-Campa, and F. Moreno, “Transcriptional regulation of the Saccharomyces cerevisiae HXK1, HXK2 and GLK1 genes,” Yeast, vol. 11, no. 2, pp. 137–144, 1995.
- A. Maier, B. Völker, E. Boles, and G. F. Fuhrmann, “Characterisation of glucose transport in Saccharomyces cerevisiae with plasma membrane vesicles (countertransport) and intact cells (initial uptake) with single Hxt1, Hxt2, Hxt3, Hxt4, Hxt6, Hxt7 or Gal2 transporters,” FEMS Yeast Research, vol. 2, no. 4, pp. 539–550, 2002.
- S. Özcan and M. Johnston, “Function and regulation of yeast hexose transporters,” Microbiology and Molecular Biology Reviews, vol. 63, no. 3, pp. 554–569, 1999.
- R. C. MacLean and I. Gudelj, “Resource competition and social conflict in experimental populations of yeast,” Nature, vol. 441, no. 7092, pp. 498–501, 2006.
- T. Pfeiffer and S. Schuster, “Game-theoretical approaches to studying the evolution of biochemical systems,” Trends in Biochemical Sciences, vol. 30, no. 1, pp. 20–25, 2005.
- T. Pfeiffer, S. Schuster, and S. Bonhoeffer, “Cooperation and competition in the evolution of ATP-producing pathways,” Science, vol. 292, no. 5516, pp. 504–507, 2001.
- G. Hardin, “The tragedy of the commons,” Science, vol. 162, no. 3859, pp. 1243–1248, 1968.
- G. C. Conant, “Rapid reorganization of the transcriptional regulatory network after genome duplication in yeast,” Proceedings of the Royal Society B, vol. 277, no. 1683, pp. 869–876, 2010.
- W. K. Huh, J. V. Falvo, L. C. Gerke et al., “Global analysis of protein localization in budding yeast,” Nature, vol. 425, no. 6959, pp. 686–691, 2003.
- G. C. Conant and K. H. Wolfe, “Increased glycolytic flux as an outcome of whole-genome duplication in yeast,” Molecular Systems Biology, vol. 3, article 129, 2007.
- N. C. Duarte, M. J. Herrgård, and B. Ø. Palsson, “Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model,” Genome Research, vol. 14, no. 7, pp. 1298–1309, 2004.
- Å. Pérez-Bercoff Å, A. McLysaght, and G. C. Conant, “Patterns of indirect protein interactions suggest a spatial organization to metabolism,” Molecular BioSystems, vol. 7, pp. 3056–3064, 2011.
- G. C. Conant and K. H. Wolfe, “Probabilistic cross-species inference of orthologous genomic regions created by whole-genome duplication in yeast,” Genetics, vol. 179, no. 3, pp. 1681–1692, 2008.
- L. M. Blank, F. Lehmbeck, and U. Sauer, “Metabolic-flux and network analysis in fourteen hemiascomycetous yeasts,” FEMS Yeast Research, vol. 5, no. 6-7, pp. 545–558, 2005.
- J. Piškur, E. Rozpedowska, S. Polakova, A. Merico, and C. Compagno, “How did Saccharomyces evolve to become a good brewer?” Trends in Genetics, vol. 22, no. 4, pp. 183–186, 2006.
- M. J. A. Van Hoek and P. Hogeweg, “Metabolic adaptation after whole genome duplication,” Molecular Biology and Evolution, vol. 26, no. 11, pp. 2441–2453, 2009.
- J. Ihmels, S. Bergmann, M. Gerami-Nejad et al., “Molecular biology: rewiring of the yeast transcriptional network through the evolution of motif usage,” Science, vol. 309, no. 5736, pp. 938–940, 2005.
- J. M. Thomson, E. A. Gaucher, M. F. Burgan et al., “Resurrecting ancestral alcohol dehydrogenases from yeast,” Nature Genetics, vol. 37, no. 6, pp. 630–635, 2005.
- D. Fusco, L. Grassi, B. Bassetti, M. Caselle, and M. Cosentino Lagomarsino, “Ordered structure of the transcription network inherited from the yeast whole-genome duplication,” BMC Systems Biology, vol. 4, article 77, 2010.
- K. P. Byrne and K. H. Wolfe, “The Yeast Gene Order Browser: combining curated homology and syntenic context reveals gene fate in polyploid species,” Genome Research, vol. 15, no. 10, pp. 1456–1461, 2005.
- A. M. Evangelisti and G. C. Conant, “Nonrandom survival of gene conversions among yeast ribosomal proteins duplicated through genome doubling,” Genome Biology and Evolution, vol. 2, pp. 826–834, 2010.
- F. A. Kondrashov and A. S. Kondrashov, “Role of selection in fixation of gene duplications,” Journal of Theoretical Biology, vol. 239, no. 2, pp. 141–151, 2006.
- T. Y. Kim, C. W. Ha, and W. K. Huh, “Differential subcellular localization of ribosomal protein L7 paralogs in Saccharomyces cerevisiae,” Molecules and Cells, vol. 27, no. 5, pp. 539–546, 2009.
- S. Komili, N. G. Farny, F. P. Roth, and P. A. Silver, “Functional specificity among ribosomal proteins regulates gene expression,” Cell, vol. 131, no. 3, pp. 557–571, 2007.
- L. Ni and M. Snyder, “A genomic study of the bipolar bud site selection pattern in Saccharomyces cerevisiae,” Molecular Biology of the Cell, vol. 12, no. 7, pp. 2147–2170, 2001.
- Y. Van De Peer, “Computational approaches to unveiling ancient genome duplications,” Nature Reviews Genetics, vol. 5, no. 10, pp. 752–763, 2004.
- D. E. Soltis, V. A. Albert, J. Leebens-Mack et al., “Polyploidy and angiosperm diversification,” American Journal of Botany, vol. 96, no. 1, pp. 336–348, 2009.
- R. De Smet and Y. Van de Peer, “Redundancy and rewiring of genetic networks following genome-wide duplication events,” Current Opinion in Plant Biology, vol. 15, pp. 168–176, 2012.
- M. E. Schranz, S. Mohammadin, and P. P. Edger, “Ancient whole genome duplications, novelty and diversification: the WGD Radiation Lag-Time Model,” Current Opinion in Plant Biology, vol. 15, pp. 147–153, 2012.
- L. Chae, I. Lee, J. Shin, and S. Y. Rhee, “Toward understanding how molecuar networks evolve in plants,” Current Opinion in Plant Biology, vol. 15, pp. 177–184, 2012.
- L. M. Liberman, R. Sozzani, and P. N. Benfey, “Integrative systems biology: an attempt to describe a simple weed,” Current Opinion in Plant Biology, vol. 15, pp. 162–167, 2012.
- M. E. Schranz, P. P. Edger, J. C. Pires, N. M. van Dam, and C. W. Wheat, “Comparative genomics in the Brassicales: ancient genome duplications, glucossinolate diversification and Pierinae herbivore radiation,” in Genetics, Genomics and Breeding of Oilseed Brassicas, J. B. David Edwards, I. Parkin, and C. Kole, Eds., Science Publishers, Jersey, British Isles, UK, 2011.
- M. Bekaert, P. P. Edger, J. C. Pires, and G. C. Conant, “Two-phase resolution of polyploidy in the Arabidopsis metabolic network gives rise to relative and absolute dosage constraints,” Plant Cell, vol. 23, no. 5, pp. 1719–1728, 2011.
- J. W. Valentine, On the Origin of Phyla, University of Chicago Press, Chicago, Ill, USA, 2004.
- N. King, M. J. Westbrook, S. L. Young et al., “The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans,” Nature, vol. 451, no. 7180, pp. 783–788, 2008.
- B. F. Lang, C. O'Kelly, T. Nerad, M. W. Gray, and G. Burger, “The closest unicellular relatives of animals,” Current Biology, vol. 12, no. 20, pp. 1773–1778, 2002.
- O. Voigt, A. G. Collins, V. B. Pearse et al., “Placozoa—no longer a phylum of one,” Current Biology, vol. 14, no. 22, pp. R944–R945, 2004.
- B. Schierwater, “My favorite animal, Trichoplax adhaerens,” BioEssays, vol. 27, no. 12, pp. 1294–1302, 2005.
- S. L. Dellaporta, A. Xu, S. Sagasser et al., “Mitochondrial genome of Trichoplax adhaerens supports Placozoa as the basal lower metazoan phylum,” Proceedings of the National Academy of Sciences of the United States of America, vol. 103, no. 23, pp. 8751–8756, 2006.
- S. P. Leys, D. S. Rohksar, and B. M. Degnan, “Sponges,” Current Biology, vol. 15, no. 4, pp. R114–R115, 2005.
- C. Nielsen, “Six major steps in animal evolution: are we derived sponge larvae?” Evolution and Development, vol. 10, no. 2, pp. 241–257, 2008.
- J. S. Taylor, Y. Van de Peer, I. Braasch, and A. Meyer, “Comparative genomics provides evidence for an ancient genome duplication event in fish,” Philosophical Transactions of the Royal Society B, vol. 356, no. 1414, pp. 1661–1679, 2001.
- K. Vandepoele, W. De Vos, J. S. Taylor, A. Meyer, and Y. Van De Peer, “Major events in the genome evolution of vertebrates: paranome age and size differ considerably between ray-finned fishes and land vertebrates,” Proceedings of the National Academy of Sciences of the United States of America, vol. 101, no. 6, pp. 1638–1643, 2004.
- F. J. Chain and B. J. Evans, “Multiple mechanisms promote the retained expression of gene duplicates in the tetraploid frog Xenopus laevis,” PLoS Genetics, vol. 2, no. 4, article e56, 2006.
- E. S. Lander, L. M. Linton, B. Birren et al., “Initial sequencing and analysis of the human genome,” Nature, vol. 409, pp. 860–921, 2001.
- J. C. Venter, M. D. Adams, E. W. Myers et al., “The sequence of the human genome,” Science, vol. 291, pp. 1304–1351, 2001.
- I. Ruiz-Trillo, G. Burger, P. W. H. Holland et al., “The origins of multicellularity: a multi-taxon genome initiative,” Trends in Genetics, vol. 23, no. 3, pp. 113–118, 2007.
- A. Pires-daSilva and R. J. Sommer, “The evolution of signalling pathways in animal development,” Nature Reviews Genetics, vol. 4, no. 1, pp. 39–49, 2003.
- E. M. De Robertis, “Evo-Devo: variations on ancestral themes,” Cell, vol. 132, no. 2, pp. 185–195, 2008.
- S. B. Carroll, J. K. Grenier, and S. D. Weatherbee, From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design, Blackwell, Malden, Mass, USA, 2nd edition, 2005.
- L. Huminiecki and C. H. Heldin, “2R and remodeling of vertebrate signal transduction engine,” BMC Biology, vol. 8, article 146, 2010.
- P. W. H. Holland, J. Garcia-Fernandez, N. A. Williams, and A. Sidow, “Gene duplications and the origins of vertebrate development,” Development, vol. 120, pp. 125–133, 1994.
- P. ten Dijke and C. H. Heldin, Smad Signal Transduction: Smads in Proliferation, Differentiation and Disease, Springer, Dordrecht, The Netherlands, 2006.
- L. Huminiecki, L. Goldovsky, S. Freilich, A. Moustakas, C. Ouzounis, and C. H. Heldin, “Emergence, development and diversification of the TGF- signalling pathway within the animal kingdom,” BMC Evolutionary Biology, vol. 9, no. 1, article 28, 2009.
- E. Kim and M. Sheng, “PDZ domain proteins of synapses,” Nature Reviews Neuroscience, vol. 5, no. 10, pp. 771–781, 2004.
- R. D. Emes, A. J. Pocklington, C. N. G. Anderson et al., “Evolutionary expansion and anatomical specialization of synapse proteome complexity,” Nature Neuroscience, vol. 11, no. 7, pp. 799–806, 2008.
- J. D. Watson, T. A. Baker, S. P. Bell, A. Gann, M. Levine, and R. Losick, Molecular Biology of the Gene: International Edition, Benjamin Cummings, San Francisco, Calif, USA, 6th edition, 2007.
- F. Marks, U. Klingüller, and K. Müller-Decker, Cellular Signal Processing: An Introduction To the Molecular Mechanisms of Signal Transduction, Garland Science, New York, NY, USA, 2009.
- B. Alberts, A. Johnson, J. Lewis, M. Raff, K. Roberts, and P. Walter, Molecular Biology of the Cell, Garland Science, New York, NY, USA, 5th, edition, 2008.
- J. Roux and M. Robinson-Rechavi, “Developmental constraints on vertebrate genome evolution,” PLoS Genetics, vol. 4, no. 12, article e1000311, 2008.
- J. Roux and M. Robinson-Rechavi, “Age-dependent gain of alternative splice forms and biased duplication explain the relation between splicing and duplication,” Genome Research, vol. 21, no. 3, pp. 357–363, 2011.
- K. S. Kassahn, V. T. Dang, S. J. Wilkins, A. C. Perkins, and M. A. Ragan, “Evolution of gene function and regulatory control after whole-genome duplication: comparative analyses in vertebrates,” Genome Research, vol. 19, no. 8, pp. 1404–1418, 2009.
- C. P. Hickman Jr., L. S. Roberts, S. L. Keen, A. Larson, and D. Eisenhour, Animal Diversity, The McGraw-Hill, Columbus, Ohio, USA, 5th edition, 2008.
- N. H. Putnam, T. Butts, D. E. K. Ferrier et al., “The amphioxus genome and the evolution of the chordate karyotype,” Nature, vol. 453, no. 7198, pp. 1064–1071, 2008.
- L. Z. Holland, R. Albalat, K. Azumi et al., “The amphioxus genome illuminates vertebrate origins and cephalochordate biology,” Genome Research, vol. 18, pp. 1100–1111, 2008.
- A. Fernandez-Guerra, A. Aze, J. Morales et al., “The genomic repertoire for cell cycle control and DNA metabolism in S. purpuratus,” Developmental Biology, vol. 300, no. 1, pp. 238–251, 2006.
- E. Sodergren, G. M. Weinstock, E. H. Davidson et al., “The genome of the sea urchin Strongylocentrotus purpuratus,” Science, vol. 314, pp. 941–952, 2006.
- Q. Tu, C. T. Brown, E. H. Davidson, and P. Oliveri, “Sea urchin Forkhead gene family: phylogeny and embryonic expression,” Developmental Biology, vol. 300, no. 1, pp. 49–62, 2006.
- A. Wagner, “Neutralism and selectionism: a network-based reconciliation,” Nature Reviews Genetics, vol. 9, no. 12, pp. 965–974, 2008.
- A. Wagner, Robustness and Evolvability in Living Systems, Princeton University Press, Princeton, NJ, USA, 2005.
- J. S. Edwards and B. O. Palsson, “The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities,” Proceedings of the National Academy of Sciences of the United States of America, vol. 97, no. 10, pp. 5528–5533, 2000.
- I. Amit, R. Wides, and Y. Yarden, “Evolvable signaling networks of receptor tyrosine kinases: relevance of robustness to malignancy and to cancer therapy,” Molecular Systems Biology, vol. 3, article 151, 2007.
- X. Dong, P. Navratilova, D. Fredman, Ø. Drivenes, T. S. Becker, and B. Lenhard, “Exonic remnants of whole-genome duplication reveal cis-regulatory function of coding exons,” Nucleic Acids Research, vol. 38, no. 4, pp. 1071–1085, 2009.
- A. M. Heimberg, L. F. Sempere, V. N. Moy, P. C. J. Donoghue, and K. J. Peterson, “MicroRNAs and the advent of vertebrate morphological complexity,” Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 8, pp. 2946–2950, 2008.
- J. Li, G. Musso, and Z. Zhang, “Preferential regulation of duplicated genes by microRNAs in mammals,” Genome Biology, vol. 9, no. 8, article R132, 2008.
- J. A. Fawcett, S. Maere, and Y. Van De Peer, “Plants with double genomes might have had a better chance to survive the Cretaceous-Tertiary extinction event,” Proceedings of the National Academy of Sciences of the United States of America, vol. 106, no. 14, pp. 5737–5742, 2009.
- X. He and J. Zhang, “Rapid subfunctionalization accompanied by prolonged and substantial neofunctionalization in duplicate gene evolution,” Genetics, vol. 169, no. 2, pp. 1157–1164, 2005.