International Journal of Evolutionary Biology

An erratum for this article has been published.
Special Issue

Molecular Evolutionary Routes that Lead to Innovations

Research Article | Open Access

Volume 2012 | Article ID 298147 | 12 pages

Evolution of the FGF Gene Family

Academic Editor: Frédéric Brunet
Received 27 Apr 2012
Accepted 06 Jun 2012
Published 07 Aug 2012


Fibroblast Growth Factors (FGFs) are small, generally secreted proteins that act by binding to transmembrane tyrosine kinase receptors (FGFRs). Activation of FGFRs triggers several cytoplasmic cascades leading to the modification of cell behavior. FGFs play critical roles in a variety of developmental and physiological processes. Since their discovery in mammals, FGFs have been found in many metazoans and some arthropod viruses. Previous efforts to decipher the evolutionary history of this family reached limited conclusions because of poor taxonomic coverage. We took advantage of the availability of many new sequences from diverse metazoan lineages to further explore the possible evolutionary scenarios explaining the diversity of the FGF gene family. Our analyses, based on phylogenetics and synteny conservation approaches, allow us to propose a new classification of FGF genes into eight subfamilies, and to draw hypotheses for the evolutionary events leading to the present diversity of this gene family.

1. Introduction

Fibroblast growth factors (FGFs) form a family of generally extracellular signaling peptides, which are key regulators of many biological processes ranging from cell proliferation to the control of embryonic development in metazoans. Ever since the mitogenic activity of FGF-like factors was first observed in 1939 [1] and the first FGF factor was isolated in the 1970s [2], a large number of members of this gene family have been isolated and characterized in different metazoans.

FGFs are small proteins (between 17 and 34 kDa) characterized by a relatively well conserved central domain of 120 to 130 amino acids. This domain is organized into 12 antiparallel β strands forming a triangular structure called a β trefoil. In general, FGFs function through binding to a tyrosine kinase receptor (FGFR) on the surface of the cell membrane. Two FGF ligands bind a dimeric receptor in the presence of heparan sulphate proteoglycan (HSPG), allowing the transphosphorylation and activation of the intracellular tyrosine kinase domain of the receptor. Binding to FGFRs usually activates several intracellular cascades (e.g., Ras/MAPK, PI3K/Akt, and PLCγ/PKC), which may regulate the transcription of different target genes. Through the activation of these cytoplasmic pathways, the FGF signal controls several major cellular functions such as cell proliferation, migration, differentiation, or survival. An intracellular mode of action has also been described in the case of FGF1, but it is poorly documented [3].

Several studies using molecular phylogenetics as well as synteny conservation analyses have addressed the evolutionary history of the FGF gene family [4–8]. The first phylogeny-based classifications of the gene family were proposed before the whole complement of FGF genes was described in mammals, which led to incomplete conclusions [5, 8]. The first phylogenetic studies including all the mammalian FGFs proposed a division of the gene family into six [9] or seven [6] subfamilies. In 2005, Popovici and collaborators performed the first study including both protostome and deuterostome FGFs as well as FGFs from baculoviruses, an arthropod-specific group of viruses [4]. They proposed to divide the FGF gene family into eight subfamilies: subfamily A (including orthologs of FGF 1 and 2), subfamily B (orthologs of FGF 3, 7, 10, and 22), subfamily C (orthologs of FGF 4, 5, and 6), subfamily D (orthologs of FGF 8, 17, 18, and 24 from vertebrates but also of EGL-17, PYR, and THS from protostomes), subfamily E (orthologs of FGF 9, 16, and 20 but also of LET-756 from nematodes), subfamily F (orthologs of FGF 11, 12, 13, and 14), subfamily G (orthologs of FGF 15/19, 21, and 23), and subfamily H, which is specific to arthropod FGFs (i.e., BNL) and to FGFs found in arthropod-specific viruses [4]. This classification is widely accepted today; however, the phylogenetic position of FGF3 and FGF5 is not completely resolved, which calls into question the constitution of the two subfamilies B and C. Moreover, the description of FGF genes in the sea anemone Nematostella vectensis now raises the question of the timing of the appearance and diversification of the FGF gene family.

In this study we take advantage of the exponential increase of publicly available genomic sequences to present an update of the FGF gene content in different evolutionary lineages. Phylogenetic approaches, together with synteny conservation analyses of these data, allow us to propose a new classification of the FGF gene family which (i) confirms the paralogy relationships of the FGF4/5/6 subfamily members and (ii) suggests that orthologs of the mammalian FGF3 form a new subfamily.

2. The FGF Gene Content Varies among Different Metazoan Lineages

The recent development of high-throughput sequencing techniques has generated a large number of sequences available in different public databases. Among them we have searched for FGF domain coding sequences within the major metazoan phyla, in order to clarify the evolutionary history of this family. We have limited our study to the analysis of amino acid sequences deposited in the GenBank, Ensembl, and JGI databases for cnidarians, lophotrochozoans, ecdysozoans, and deuterostomes, although many EST sequences putatively coding for FGF proteins might also be found.

2.1. FGF Genes in Diploblastic Metazoans

FGF genes were previously described in two anthozoan species: Nematostella vectensis and Acropora millepora [10, 11]. In Nematostella, 13 genes encoding FGF ligands were predicted from the genome sequence [11] but their phylogenetic relationships with bilaterian FGFs are not fully established. Four of these genes group with the FGF8/17/18/24 subfamily and six group with the FGF1/2 subfamily with low support. In the hydrozoan Hydra magnipapillata we have found four predicted genes coding for FGFs (see Table 1). Among them, one (called FGF24) belongs to the FGF8/17/18/24 subfamily. Another one groups with several Nematostella FGF genes whose position is not robustly supported but might belong to the FGF1/2 subfamily (see Figure S1 in the supplementary material available online). For the other two, no clear relationship with either Nematostella or bilaterian FGFs can be proposed according to phylogenetic reconstructions. We also looked for ctenophore EST sequences putatively encoding FGF domains but we failed to find any in public databases.

Table 1: FGF sequences identified in this study.

| Species | Accession number | Description | Database | Best blastP hit accession | Best blastP hit name | Orthology |
|---|---|---|---|---|---|---|
| Hydra magnipapillata | XP_002165496.1 | Predicted: similar to fibroblast growth factor homologous factor 4 | GenBank | NP_001180935.1 | Fibroblast growth factor 12 (Macaca mulatta) | |
| | XP_002164870.1 | Predicted: similar to fibroblast growth factor 24 | GenBank | | | FGF8/17/18 |
| | XP_002166704.1 | Predicted: similar to fibroblast growth factor 1B, partial | GenBank | | | FGF1/2 |
| | XP_002170051.1 | Predicted: similar to Fibroblast growth factor 14 (Hydra magnipapillata) | GenBank | XP_001094679.1 | Predicted: fibroblast growth factor 20 (Macaca mulatta) | |
| Lottia gigantea | fgenesh2_pg.C_sca_ | | JGI | XP_002643284.1 | EGL-17 (Caenorhabditis briggsae) | |
| | | | JGI | XP_003455818.1 | Predicted: fibroblast growth factor 20-like (Oreochromis niloticus) | |
| Capitella teleta | fgeneshl_pg.C_scaffold_ | | JGI | XP_002922927.1 | Predicted: fibroblast growth factor 18-like (Ailuropoda melanoleuca) | |
| Trichinella spiralis | XP_003370033 | Fibroblast growth factor 20 | GenBank | NP_001098209.1 | Fibroblast growth factor 20a (Oryzias latipes) | |
| | EFV50493.1 | Fibroblast growth factor 18 | GenBank | | | FGF8/17/18 |
| Brugia malayi | XP_001894505.1 | Fibroblast growth factor family protein | GenBank | | | FGF9/16/20 |
| | XP_001899322.1 | Fibroblast growth factor family protein | GenBank | | | FGF8/17/18 |
| Apis mellifera | XP_623927.2 | Predicted: hypothetical protein LOC551529 | GenBank | | | FGF1/2 |
| | XP_001120331.2 | Predicted: hypothetical protein LOC724469 | GenBank | | | BNL |
| | XP_003695580.1 | Predicted: fibroblast growth factor 18-like | GenBank | | | FGF8/17/18 |
| Harpegnathos saltator | EFN80858.1 | Hypothetical protein EAI_11890 | GenBank | XP_003399646.1 | Predicted: hypothetical protein LOC100646960 (Bombus terrestris) | |
| | EFN81752.1 | Fibroblast growth factor 18 | GenBank | | | FGF8/17/18 |
| | EFN88402.1 | Heparin-binding growth factor 1 | GenBank | | | FGF1/2 |
| Pediculus humanus subsp. corporis | EEB17861.1 | Fibroblast growth factor, putative | GenBank | XP_003243356.1 | Predicted: hypothetical protein LOC100572243 (Acyrthosiphon pisum) | |
| | EEB19433.1 | Heparin-binding growth factor 1 precursor, putative | GenBank | | | FGF1/2 |
| | EEB18362.1 | Conserved hypothetical protein | GenBank | XP_002431100.1 | Predicted: hypothetical protein LOC100569010 (Acyrthosiphon pisum) | |
| Ixodes scapularis | XP_002433492.1 | Hypothetical protein IscW_ISCW015993 | GenBank | XP_003203489.1 | Predicted: glia-activating factor-like (Meleagris gallopavo) | |
| | XP_002400933.1 | Heparin-binding growth factor, putative | GenBank | | | FGF1/2 |
| Daphnia pulex | EFX75093.1 | Hypothetical protein DAPPUDRAFT_ | GenBank | XP_003243356.1 | Predicted: hypothetical protein LOC100572243 (Acyrthosiphon pisum) | |
| | EFX86332.1 | Hypothetical protein DAPPUDRAFT_ | GenBank | XP_001635198.1 | Predicted protein (Nematostella vectensis) | |
| Saccoglossus kowalevskii | ADB22412.1 | Fibroblast growth factor 8/17/18 protein | GenBank | | | FGF8/17/18 |
| | ADB22409.1 | Hypothetical protein | GenBank | XP_799351.2 | | |
| | ACY92516.1 | Fgf-Sk1 protein | GenBank | NP_001233192.1 | | |
| | ACY92517.1 | FGF9-like protein | GenBank | | | FGF9/16/20 |
| | ACY92515.1 | FGF13-like protein | GenBank | | | FGF9/16/20 |
| | ADB22411.1 | Fibroblast growth factor 20-like protein | GenBank | | | FGF9/16/20 |
| Oikopleura dioica | CBY43668.1 | Unnamed protein product | GenBank | XP_003441021.1 | Predicted: fibroblast growth factor 14-like (Oreochromis niloticus) | |
| | CBY37156.1 | Unnamed protein product | GenBank | | | FGF11/12/13/14 |
| | CBY40156.1 | Unnamed protein product | GenBank | | | FGF11/12/13/14 |
| | CBY12333.1 | Unnamed protein product | GenBank | | | FGF9/16/20 |
| | CBY34733.1 | Unnamed protein product | GenBank | NP_001007762.1 | Keratinocyte growth factor precursor (Danio rerio) | |
| | CBY23701.1 | Unnamed protein product | GenBank | XP_002594626.1 | Hypothetical protein BRAFLDRAFT_149779 (Branchiostoma floridae) | |

2.2. FGF Genes in Protostomes

In protostomes, FGF genes have only been described in ecdysozoans, particularly in arthropods. Three genes have been characterized in the model organism Drosophila melanogaster [12, 13], called Branchless (Bnl), Thisbe (Ths), and Pyramus (Pyr). In the coleopteran Tribolium castaneum, four FGF genes called Tc-FGF1a, Tc-FGF1b, Tc-FGF8, and Tc-Bnl [14] have also been identified. Ths and Pyr from Drosophila, as well as Tc-FGF8 from Tribolium, were shown to belong to the FGF8/17/18/24 subfamily, whereas Tc-FGF1a and Tc-FGF1b belong to the FGF1/2 subfamily. On the other hand, Branchless orthologs from both species show no clear evolutionary relationships with any of the vertebrate FGF gene subfamilies, leading Popovici and collaborators to propose a new subfamily including Bnl from arthropods and baculovirus-specific FGF genes [4]. In the genome of the nematode Caenorhabditis elegans, two FGF genes are found, called let-756 (lethal protein 756) and egl-17 (egg laying defective 17) [4, 15], which are members of the FGF9/16/20 and FGF8/17/18/24 subfamilies, respectively [4].

In order to obtain a more complete picture of the diversity of the FGF gene family in ecdysozoans, we searched other available sequences (see Table 1). In different nematode species we only found orthologs of the two known C. elegans genes (Figure S2). In arthropods, we found FGF coding genes in the crustacean Daphnia pulex, in the chelicerate Ixodes scapularis, and in insects from different orders such as Apis mellifera, Harpegnathos saltator, or Pediculus humanus (see Table 1). The orthology relationships of the two FGF genes we found in Daphnia cannot be clearly determined, whereas for all the other arthropods the different genes we found always belong to the Bnl, FGF1/2, or FGF8/17/18/24 subfamilies (Figure S2).

No study of the FGF gene set in lophotrochozoans has been published yet, so we searched for lophotrochozoan FGF coding sequences in GenBank and in the complete genome sequences of the mollusc Lottia gigantea and of the annelids Helobdella robusta and Capitella teleta. We found only one gene in Capitella, whose position in the FGF phylogenetic tree is not robustly supported but which probably belongs to the FGF8/17/18/24 subfamily. In Lottia gigantea, two FGF genes are present in the complete genome, and again their evolutionary relationship with the different subfamilies cannot be clearly determined, even though the best blast hits for these genes are always members of the FGF8/17/18/24 and FGF9/16/20 subfamilies (see Table 1). Taken together, these data demonstrate (i) that lophotrochozoans also possess some FGF coding genes, although quite divergent from the other protostome genes, and (ii) that members of only four subfamilies, FGF1/2, FGF8/17/18/24, FGF9/16/20, and Bnl, can be clearly found in protostomes.

2.3. FGF Genes in Deuterostomes

Deuterostomes comprise vertebrates, the related invertebrate chordates (urochordates and cephalochordates) and three other invertebrate taxa: hemichordates and echinoderms, which form the Ambulacraria group, and the recently described phylum of Xenoturbellida [16]. Nothing is known concerning the FGF gene content in Xenoturbella and we did not find any FGF coding sequence for this group. Conversely, recent studies have shown that one FGF gene exists in the sea urchin Strongylocentrotus purpuratus (i.e., an echinoderm) [17], and we have identified in the databases six FGF genes in the hemichordate Saccoglossus kowalevskii, of which one gene can be clearly assigned to the FGF8/17/18/24 subfamily. Three other genes are orthologs of the FGF9/16/20 subfamily, indicating that a hemichordate-specific duplication occurred for this gene; another one has been previously shown to be an ortholog of the FGF19/21/23 subfamily [18]; the sixth gene shows no clear orthology relationships with any FGF gene subfamily (see Table 1) [18].

In chordates, the FGF gene content is also different among the three subphyla. In cephalochordates, eight FGF genes have been found and orthology relationships using phylogenetics or conservation of synteny approaches have been suggested for six of them (i.e., FGF1/2, FGF8/17/18, FGF9/16/20, FGFA ortholog of FGF3/7/10/22, FGFB ortholog of FGF4/5/6, and FGFC ortholog of FGF19/21/23) [19]. In the urochordate Ciona intestinalis, six genes encoding FGF ligands have been described [20], and we identified one more gene in databases, called FGF-NA1, bringing the total FGF gene content to seven. Of them, only two were shown to be clear orthologs of the FGF8/17/18/24 and FGF11/12/13/14 subfamilies [20]. In another urochordate, the larvacean Oikopleura dioica, we found six FGF coding genes, among which two can be assigned to the FGF11/12/13/14 subfamily, and one to the FGF9/16/20 subfamily (see Table 1 and Figure S4). In vertebrates, an explosion in the number of genes encoding FGFs occurred and we can find between 19 and 27 FGF genes depending on the species. This explosion is not specific to the FGF gene family and is linked to the two rounds of genome duplication (three rounds in teleosts) that occurred in this lineage as previously demonstrated [4, 21]. In sarcopterygians we identified 19 FGF genes in the chicken and 23 in the coelacanth, whereas 22 FGF genes (FGF 1–23) have been characterized in mouse and human (the mouse FGF15 is the ortholog of the human FGF19). These 22 mammalian genes were previously used to reconstruct the evolutionary history of the family [4, 6], which led to the classification of FGFs into seven paralogy groups. However, in teleosts, an additional round of genome duplication (3R hypothesis) occurred [22], which, together with a high number of FGF gene losses, produced 27 FGF genes in the zebrafish [23].

3. The FGF Gene Family Is Composed of Eight Subfamilies

Due to the low sequence conservation of most of the FGF genes found in early divergent metazoan lineages, and the short length of the FGF domain, we have based our phylogenetic study on vertebrate FGFs, as in previous studies [4, 6]. However, the new FGF sequence data, particularly within chordates, allow us to suggest a new classification of the FGF gene family in metazoans, which is divided into 8 subfamilies instead of 7 (in addition to the arthropod- and baculovirus-specific family proposed by Popovici et al. [4]). These subfamilies are FGF1/2, FGF3, FGF4/5/6, FGF7/10/22, FGF8/17/18/24, FGF9/16/20, FGF11/12/13/14, and FGF19/21/23 (Figures 1 and S5).
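The eight-subfamily classification listed above can be written down as a simple lookup table. The following Python sketch is illustrative only: the member numbers come from the subfamily names in the text, while the names `FGF_SUBFAMILIES` and `subfamily_of` are hypothetical and not part of the original study. Mouse FGF15 is treated as the ortholog of human FGF19, as stated in Section 2.3.

```python
# Eight vertebrate FGF subfamilies as named in the text.
# The arthropod/baculovirus Bnl family is deliberately left out.
FGF_SUBFAMILIES = {
    "FGF1/2": [1, 2],
    "FGF3": [3],
    "FGF4/5/6": [4, 5, 6],
    "FGF7/10/22": [7, 10, 22],
    "FGF8/17/18/24": [8, 17, 18, 24],
    "FGF9/16/20": [9, 16, 20],
    "FGF11/12/13/14": [11, 12, 13, 14],
    "FGF19/21/23": [19, 21, 23],
}


def subfamily_of(fgf_number: int) -> str:
    """Return the subfamily name for a vertebrate FGF number (hypothetical helper)."""
    # Mouse FGF15 is the ortholog of human FGF19 (Section 2.3).
    if fgf_number == 15:
        fgf_number = 19
    for name, members in FGF_SUBFAMILIES.items():
        if fgf_number in members:
            return name
    raise KeyError(f"FGF{fgf_number} is not assigned to any subfamily")


if __name__ == "__main__":
    for n in (3, 5, 15, 24):
        print(f"FGF{n} -> {subfamily_of(n)}")
```

Under this encoding FGF3 sits alone in its own subfamily and FGF5 is grouped with FGF4 and FGF6, which are the two points the new classification argues for.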

In all the studies performed so far, the vertebrate FGF3 always grouped into either the subfamily FGF3/7/10/22 or the subfamily FGF3/4/6 [4, 6, 8]. In fact, the correct classification of FGF3 is still debated and assignment to one or another subfamily depends on the methods used. Therefore, most of the phylogenetic analyses published grouped FGF3 with FGF7, FGF10, and FGF22, but with very low node robustness. Other studies, using the genomic locations of this gene, grouped it with FGF4 and FGF6 and it has even been suggested that the FGF3/4/6 and FGF19/21/23 subfamilies can be assembled into a single subfamily FGF3/4/6/19/21/23 (with FGF5 grouping in this case with the FGF1/2 subfamily) [7]. Here, based particularly on results obtained through the study of gene content, phylogenetic distribution, and conservation of synteny between amphioxus and vertebrates [19], we propose a new evolutionary scenario in which FGF3 forms a new subfamily (Figures 1, 2, and S5). This scenario could reconcile the different evolutionary hypotheses suggested in previous studies.

In our hypothesis, an ancestral FGF gene (named FGF3/4/5/6) was duplicated in tandem before chordate diversification. Such duplication might have occurred before eumetazoan diversification or specifically in the chordate ancestor. Thus, the putative ancestor (either eumetazoan or chordate ancestor) had two FGF genes maintained in cluster: FGF3 and FGF4/5/6. This situation can still be observed in the cephalochordate Branchiostoma floridae in which FGFB and FGFE are clustered in a genomic region showing synteny conservation with the vertebrate locus containing the FGFs 3, 4 and 6 [19] (Figure 3). This hypothesis implies a loss of FGF3 in different lineages; the number of lineages that lost FGF3 depends on the timepoint at which this gene appeared (i.e., in urochordates in one hypothesis (Figures 2(b) and 5), or in urochordates, ambulacrarians, protostomes, and cnidarians in the other hypothesis, see Figure 5). According to this scenario the origin of FGF3 would be ancient (i.e., at least prior to chordate diversification) and not due to the vertebrate-specific genome duplications.

Another FGF gene whose phylogenetic position is debated is FGF5. Indeed, depending on the phylogenetic approach and on the gene set used for the phylogenetic reconstruction, it clusters either with FGF4/6 or with FGF1/2 [4, 23]. Moreover, conservation of synteny also suggests the paralogy of FGF1, 2, and 5 [7]. However, a deeper synteny analysis of the human FGF5 locus shows conservation of this locus with both the FGF1/2 and FGF4/6 loci (Figure 3). This mixed syntenic conservation, together with our phylogenetic analyses supporting the FGF4/5/6 subfamily (Figure 1), suggests that FGF5 is a real paralog of FGF4 and 6. The partial synteny conservation with the FGF1 and 2 loci might be explained by a genomic translocation of the FGF5 locus (including its neighbouring genes BMP3, PAQR3) close to the ANXA3 locus (Figures 2(a) and 3).

4. The Evolutionary History of the FGF Gene Family Is Characterized by Gene Duplications and Gene Losses

Phylogenetic reconstructions using FGF sequences from all metazoan phyla often fail to completely solve the orthology relationship between the different members of this family mainly because of the reduced size of the FGF domain and because of the high divergence of the sequences between the different lineages. However, using the phylogenetic distribution of FGF genes into eight subfamilies, we can propose evolutionary scenarios accounting for the FGF gene content found in the different metazoan lineages. Several hypotheses can be drawn explaining such a distribution of FGF orthologs. Here we focus mainly on two of these hypotheses: a first hypothesis where the eight FGF subfamilies are chordate-specific (Figures 4 and 5, hypothesis 1) and a second hypothesis where the eight subfamilies were ancestral to all eumetazoans (Figure 5, hypothesis 2). In both hypotheses, the evolutionary history of the FGF gene content in chordates is the same (Figure 4), but depending on the hypothesis, it changes for the other metazoan lineages (Figure 5).

As shown above, in cnidarians (diploblastic metazoans) we found orthologs of at least the FGF8/17/18 subfamily and probably the FGF1/2 subfamily. Thus, we can suggest that the eumetazoan ancestor possessed at least one ortholog of each of these two subfamilies.

Our analyses suggest that the arthropod ancestor already possessed at least three FGF genes belonging to the FGF1/2, FGF8/17/18 and Bnl subfamilies (Figure 5). Bnl is specific to arthropods and arthropod viruses and its origin is still unknown. Two possible evolutionary scenarios can be drawn for Bnl genes. In the first scenario, a Bnl ortholog might have existed ancestrally and then been lost in all metazoan lineages except arthropods. This gene was then captured by baculoviruses after the arthropod radiation [4]. In a second scenario, an arthropod FGF gene was translocated into baculoviruses and, following a period of fast evolution leading to the loss of any phylogenetic signal, reintegrated into the arthropod genome. In the ancestor of nematodes, two FGF genes, orthologs of the FGF9/16/20 and FGF8/17/18/24 families, were present. Taking these results into account, we can propose the existence of a minimal FGF gene set of three genes in the ancestor of ecdysozoans (orthologs of FGF1/2, FGF8/17/18/24 and FGF9/16/20). The few data obtained in lophotrochozoans do not allow us to draw clear conclusions about the FGF gene set of the protostome ancestor. However, we can suggest the presence of at least members of the FGF1/2, FGF8/17/18, and FGF9/16/20 subfamilies.

The two hypotheses proposed here for the evolutionary history of the FGF gene family (Figure 5) suggest that a single paralogous gene for each subfamily was kept in cephalochordates and that specific gene duplications or losses did not occur during evolution in this lineage (Figure 4). In fact, genetic conservation in amphioxus is not restricted to FGFs since different studies have shown that gene content in amphioxus tends to be associated with very few gene losses [24–28]. Concerning other chordates, even if the phylogenetic distribution of the seven urochordate FGF genes is not strongly supported (see Figure S4), we can assume that C. intestinalis has orthologs of the FGF4/5/6, FGF7/10/22, FGF8/17/18, FGF9/16/20, FGF11/12/13/14, and FGF19/21/23 subfamilies but that it lost the orthologs of the FGF1/2 and FGF3 subfamilies (Figure 4). Moreover, the seventh gene (Ci-FGFL), as proposed by Popovici et al., could be a specific duplication of FGF7/10/22 [4]. In sarcopterygian vertebrates, the gene set of the different species suggests that numerous gene losses occurred following the two rounds of genome duplication (from eight ancestral genes, after two rounds of duplication, we should find 32 genes, but depending on the species we find between 19 and 23 genes; Figure 4). Moreover, some lineage-specific gene losses also occurred in sarcopterygians; for example, the loss of FGF24 in tetrapods and losses of FGF11, 17, and 21 in chicken. In teleosts, gene losses were even more important, since instead of 46 genes (i.e., a duplication of the 23 FGF genes present in the osteichthyan ancestor [22]) we only find 27 in zebrafish [23]. Indeed, duplicated copies generated by this third genome duplication were only retained for FGF10, FGF6, FGF17, FGF18, and FGF20 (Figure 4).
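The gene-count argument in this paragraph is simple arithmetic: each round of whole-genome duplication doubles the gene set, and the gap between the expected and observed counts gives a lower bound on the net number of copies lost. The Python sketch below reproduces that reasoning with the counts quoted in the text; the function name `expected_after_wgd` and the dictionary layout are hypothetical. The shortfall is a net figure and not a count of individual loss events.

```python
def expected_after_wgd(ancestral_genes: int, rounds: int) -> int:
    """Gene count expected if every copy were kept after `rounds` genome duplications."""
    return ancestral_genes * 2 ** rounds


# Eight ancestral chordate genes, two rounds of duplication in vertebrates.
expected_sarcopterygian = expected_after_wgd(8, 2)

# 23 genes in the osteichthyan ancestor, one further round in teleosts.
expected_teleost = expected_after_wgd(23, 1)

# Observed FGF gene counts quoted in the text.
observed = {
    "chicken": (19, expected_sarcopterygian),
    "mouse/human": (22, expected_sarcopterygian),
    "coelacanth": (23, expected_sarcopterygian),
    "zebrafish": (27, expected_teleost),
}

# Net shortfall = expected copies minus observed copies.
shortfall = {species: exp - obs for species, (obs, exp) in observed.items()}

if __name__ == "__main__":
    for species, (obs, exp) in observed.items():
        print(f"{species}: expected {exp}, observed {obs}, net shortfall {shortfall[species]}")
```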

In non-chordate deuterostomes, the only FGF gene found in the sea urchin cannot be assigned to any FGF subfamily using phylogenetic reconstructions, whereas five of the six genes found in S. kowalevskii belong to the FGF8/17/18/24, FGF9/16/20, and FGF19/21/23 subfamilies (Figure S3) [18]. The remaining gene does not show clear phylogenetic relationships with the different FGF subfamilies. Therefore, whatever the evolutionary hypothesis (i.e., chordate-specific duplications versus early duplication giving rise to eight subfamilies in the ancestral eumetazoan), we can propose that there were at least three FGF genes in the ambulacrarian ancestor (i.e., orthologs of FGF8/17/18/24, FGF9/16/20, and FGF19/21/23) (Figure 5). This result suggests that the deuterostome ancestor probably had at least these three genes plus FGF1/2, which is present in chordates and in protostomes but seems to have been lost in the Ambulacraria. At this stage of the analysis it is difficult to say whether specific chordate duplications led to the eight chordate FGFs (hypothesis 1, Figure 5), or whether there were already eight genes in the deuterostome ancestor, several of them having been lost in Ambulacraria (hypothesis 2, Figure 5).

Here, for simplicity, we showed two extreme scenarios, one starting from the minimum gene set in the eumetazoan ancestor (only two genes) and the second starting from the maximum (eight genes). However, many other intermediate scenarios can be imagined. These two major evolutionary scenarios (Figure 5) imply different duplication/loss evolutionary histories. The first hypothesis implies two main points: (i) the ancestral eumetazoan had an FGF gene set of at least two genes (orthologs of FGF1/2 and FGF8/17/18/24) and (ii) important chordate-specific duplications occurred, generating the present diversity of the FGF gene family observed in this lineage, which is divided into eight subfamilies (hypothesis 1, Figure 5). The second scenario implies a high degree of gene loss during metazoan evolution. Thus, from eight ancestral FGF gene families already present in the eumetazoan ancestor, six gene losses occurred in cnidarians, five in protostomes and five in ambulacrarians (hypothesis 2, Figure 5). Moreover, both hypotheses require lineage-specific duplications. The second hypothesis is less parsimonious than the first, but no matter which is correct, what seems clear is that the evolutionary history of the FGF gene family required numerous events of gene duplication and gene loss at different times and in different evolutionary lineages. The next question to address is what the implications of this complicated evolutionary history of the FGF gene family are for the functional evolution of this signal and for the morphological evolution of metazoans.
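The loss counts quoted for hypothesis 2 can be recovered as a set difference between the eight subfamilies and the subfamilies inferred for each lineage. The sketch below is a minimal illustration, assuming that the retained sets are the minimal ancestral gene sets proposed in this section for cnidarians, protostomes, and ambulacrarians; the names `ALL_SUBFAMILIES`, `RETAINED`, and `losses_under_hypothesis_2` are hypothetical. The arthropod-specific Bnl family is not one of the eight subfamilies and is therefore left out.

```python
# The eight subfamilies assumed to be present in the eumetazoan ancestor
# under hypothesis 2.
ALL_SUBFAMILIES = frozenset({
    "FGF1/2", "FGF3", "FGF4/5/6", "FGF7/10/22",
    "FGF8/17/18/24", "FGF9/16/20", "FGF11/12/13/14", "FGF19/21/23",
})

# Subfamilies inferred for each lineage ancestor in the text
# (assumption: minimal gene sets as described in this section).
RETAINED = {
    "cnidarians": {"FGF1/2", "FGF8/17/18/24"},
    "protostomes": {"FGF1/2", "FGF8/17/18/24", "FGF9/16/20"},
    "ambulacrarians": {"FGF8/17/18/24", "FGF9/16/20", "FGF19/21/23"},
}


def losses_under_hypothesis_2(lineage: str) -> frozenset:
    """Subfamilies that must have been lost in `lineage` if all eight were ancestral."""
    return ALL_SUBFAMILIES - RETAINED[lineage]


if __name__ == "__main__":
    for lineage in RETAINED:
        lost = losses_under_hypothesis_2(lineage)
        print(f"{lineage}: {len(lost)} losses -> {sorted(lost)}")
```

Counting the sizes of these sets gives six losses in cnidarians and five each in protostomes and ambulacrarians, matching the figures in the text; under hypothesis 1 the same distribution is explained by chordate-specific duplications.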

5. Materials and Methods

5.1. Identification of FGF Sequences

FGF sequences were identified using BLASTP search in the NCBI and JGI [25] databases using all known FGF domain amino acid sequences. We also browsed the Pfam database [29] for entries possessing an FGF domain. Sequence accession numbers of FGF sequences identified in this study are shown in Table 1.

5.2. Phylogenetic Analyses of Vertebrate FGFs

FGF amino acid sequences were aligned using ClustalX [30] and regions of ambiguous homology were removed. A Neighbour-Joining tree was generated using MEGA version 5 [31] with a Poisson model and a discrete gamma-distribution model with four rate categories. A Maximum Likelihood (ML) tree was built using PHYML3.0 [32] with a JTT model as proposed by ProtTest2.4 [33]. The node robustness of both trees was estimated by a bootstrap test (100 replicates).
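To make the distance model concrete, the sketch below computes the textbook Poisson-corrected distance, d = -ln(1 - p), between two aligned protein sequences, together with the closed-form gamma-corrected variant, d = a[(1 - p)^(-1/a) - 1]. This is an assumption-laden illustration and not the MEGA implementation: the function names, the toy sequences, and the shape parameter are all hypothetical (the shape value used in the study is not reported in the text), and MEGA's handling of gaps and rate categories may differ in detail.

```python
import math


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing residues, ignoring columns with a gap in either sequence."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(a, b) for a, b in zip(seq_a, seq_b) if a != "-" and b != "-"]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    return sum(a != b for a, b in pairs) / len(pairs)


def poisson_distance(p: float) -> float:
    """Poisson-corrected distance: expected substitutions per site."""
    return -math.log(1.0 - p)


def gamma_distance(p: float, shape: float) -> float:
    """Gamma-corrected distance for among-site rate variation (shape is an assumed value)."""
    return shape * ((1.0 - p) ** (-1.0 / shape) - 1.0)


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration only.
    a = "MKLV-GHE"
    b = "MRLVAGHD"
    p = p_distance(a, b)
    print("p-distance:", p)
    print("Poisson distance:", poisson_distance(p))
    print("Gamma distance (shape=1.0, assumed):", gamma_distance(p, 1.0))
```

Both corrections inflate the raw proportion of differences to account for multiple substitutions at the same site; the gamma variant inflates it more when rates vary strongly among sites.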

5.3. Phylogenetic Analyses of Nonvertebrate FGFs

The FGF domain coding region of retrieved sequences was aligned with known FGF sequences from metazoans using T-Coffee [34]. The resulting alignment was manually corrected in SeaView [32]. Maximum Likelihood (ML) trees were generated using PHYML3.0 [32] with a LG+G model as proposed by ProtTest2.4 [33]. The robustness of the tree nodes was estimated using aLRT.


Acknowledgments

The laboratory of H. Escriva is supported by the Agence Nationale de la Recherche Grants ANR-2010-BLAN-1716 01 and ANR-2010-BLAN-1234 02. The authors also thank Peter Mills, who kindly revised the English of an earlier version of the paper.

Supplementary Materials

Figure S1: Phylogenetic relationships of vertebrate and cnidarian FGF genes. The FGF1/2 and FGF8/17/18/24 families are boxed in yellow. The aLRT support for the nodes of these families is circled in red.

Figure S2: Phylogenetic relationships of vertebrate and protostome FGF genes. The FGF1/2, FGF8/17/18/24, and FGF9/16/20 families are boxed in yellow. The aLRT support for the nodes of these families is circled in red.

Figure S3: Phylogenetic relationships of vertebrate and hemichordate FGF genes. The FGF8/17/18/24 and FGF9/16/20 families are boxed in yellow. The aLRT support for the nodes of these families is circled in red.

Figure S4: Phylogenetic relationships of vertebrate and Oikopleura FGF genes. The FGF11/12/13/14 and FGF9/16/20 families are boxed in yellow. The aLRT support for the nodes of these families is circled in red.

Figure S5: Phylogenetic relationships of vertebrate FGFs. Maximum likelihood tree showing the classification of the different vertebrate FGF genes into eight subfamilies (i.e., FGF1/2, FGF3, FGF4/5/6, FGF7/10/22, FGF8/17/18/24, FGF9/16/20, FGF11/12/13/14, and FGF19/21/23). Sequences of Homo sapiens, Mus musculus, Bos taurus, Gallus gallus, Xenopus tropicalis, and Danio rerio were used to perform the phylogeny. Branches of the eight subfamilies are highly supported (at least 68%) but internal branches within the different subfamilies do not always follow the evolution of species.


  1. O. A. Trowell and E. N. Willmer, “Studies on the Growth of Tissues in vitro,” The Journal of Experimental Biology, vol. 16, pp. 60–70, 1939.
  2. D. Gospodarowicz, K. L. Jones, and G. Sato, “Purification of a growth factor for ovarian cells from bovine pituitary glands,” Proceedings of the National Academy of Sciences of the United States of America, vol. 71, no. 6, pp. 2295–2299, 1974.
  3. E. Kolpakova, A. Wiedlocha, H. Stenmark, O. Klingenberg, P. O. Falnes, and S. Olsnes, “Cloning of an intracellular protein that binds selectively to mitogenic acidic fibroblast growth factor,” Biochemical Journal, vol. 336, part 1, pp. 213–222, 1998.
  4. C. Popovici, R. Roubin, F. Coulier, and D. Birnbaum, “An evolutionary history of the FGF superfamily,” BioEssays, vol. 27, no. 8, pp. 849–857, 2005.
  5. D. M. Ornitz and N. Itoh, “Fibroblast growth factors,” Genome Biology, vol. 2, no. 3, article 3005, 2001.
  6. N. Itoh and D. M. Ornitz, “Evolution of the Fgf and Fgfr gene families,” Trends in Genetics, vol. 20, no. 11, pp. 563–569, 2004.
  7. N. Itoh, “The Fgf families in humans, mice, and zebrafish: their evolutional processes and roles in development, metabolism, and disease,” Biological and Pharmaceutical Bulletin, vol. 30, no. 10, pp. 1819–1825, 2007.
  8. F. Coulier, P. Pontarotti, R. Roubin, H. Hartung, M. Goldfarb, and D. Birnbaum, “Of worms and men: an evolutionary perspective on the fibroblast growth factor (FGF) and FGF receptor families,” Journal of Molecular Evolution, vol. 44, no. 1, pp. 43–56, 1997.
  9. H. S. Kim, “The human FGF gene family: chromosome location and phylogenetic analysis,” Cytogenetics and Cell Genetics, vol. 93, no. 1-2, pp. 131–132, 2001.
  10. U. Technau, S. Rudd, P. Maxwell et al., “Maintenance of ancestral complexity and non-metazoan genes in two basal cnidarians,” Trends in Genetics, vol. 21, no. 12, pp. 633–639, 2005.
  11. D. Q. Matus, G. H. Thomsen, and M. Q. Martindale, “FGF signaling in gastrulation and neural development in Nematostella vectensis, an anthozoan cnidarian,” Development Genes and Evolution, vol. 217, no. 2, pp. 137–148, 2007.
  12. D. Sutherland, C. Samakovlis, and M. A. Krasnow, “branchless encodes a Drosophila FGF homolog that controls tracheal cell migration and the pattern of branching,” Cell, vol. 87, no. 6, pp. 1091–1101, 1996.
  13. A. Stathopoulos, B. Tam, M. Ronshaugen, M. Frasch, and M. Levine, “Pyramus and thisbe: FGF genes that pattern the mesoderm of Drosophila embryos,” Genes and Development, vol. 18, no. 6, pp. 687–699, 2004.
  14. A. Beermann and R. Schröder, “Sites of Fgf signalling and perception during embryogenesis of the beetle Tribolium castaneum,” Development Genes and Evolution, vol. 218, no. 3-4, pp. 153–167, 2008.
  15. R. D. Burdine, E. B. Chen, S. F. Kwok, and M. J. Stern, “egl-17 encodes an invertebrate fibroblast growth factor family member required specifically for sex myoblast migration in Caenorhabditis elegans,” Proceedings of the National Academy of Sciences of the United States of America, vol. 94, no. 6, pp. 2433–2437, 1997.
  16. S. J. Bourlat, T. Juliusdottir, C. J. Lowe et al., “Deuterostome phylogeny reveals monophyletic chordates and the new phylum Xenoturbellida,” Nature, vol. 444, no. 7115, pp. 85–88, 2006.
  17. F. Lapraz, E. Röttinger, V. Duboc et al., “RTK and TGF-β signaling pathways genes in the sea urchin genome,” Developmental Biology, vol. 300, no. 1, pp. 132–152, 2006.
  18. A. M. Pani, E. E. Mullarkey, J. Aronowicz, S. Assimacopoulos, E. A. Grove, and C. J. Lowe, “Ancient deuterostome origins of vertebrate brain signalling centres,” Nature, vol. 483, no. 7389, pp. 289–294, 2012.
  19. S. Bertrand, A. Camasses, I. Somorjai et al., “Amphioxus FGF signaling predicts the acquisition of vertebrate morphological traits,” Proceedings of the National Academy of Sciences of the United States of America, vol. 108, no. 22, pp. 9160–9165, 2011.
  20. Y. Satou, K. S. Imai, and N. Satoh, “Fgf genes in the basal chordate Ciona intestinalis,” Development Genes and Evolution, vol. 212, no. 9, pp. 432–438, 2002.
  21. P. Dehal and J. L. Boore, “Two rounds of whole genome duplication in the ancestral vertebrate,” PLoS Biology, vol. 3, no. 10, p. e314, 2005.
  22. O. Jaillon, J. M. Aury, F. Brunet et al., “Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype,” Nature, vol. 431, no. 7011, pp. 946–957, 2004.
  23. N. Itoh and M. Konishi, “The zebrafish fgf family,” Zebrafish, vol. 4, no. 3, pp. 179–186, 2007.
  24. N. Takatori, T. Butts, S. Candiani et al., “Comprehensive survey and classification of homeobox genes in the genome of amphioxus, Branchiostoma floridae,” Development Genes and Evolution, vol. 218, no. 11-12, pp. 579–590, 2008.
  25. N. H. Putnam, T. Butts, D. E. K. Ferrier et al., “The amphioxus genome and the evolution of the chordate karyotype,” Nature, vol. 453, no. 7198, pp. 1064–1071, 2008.
  26. S. Huang, S. Yuan, L. Guo et al., “Genomic analysis of the immune gene repertoire of amphioxus reveals extraordinary innate complexity and diversity,” Genome Research, vol. 18, no. 7, pp. 1112–1126, 2008.
  27. L. Z. Holland, R. Albalat, K. Azumi et al., “The amphioxus genome illuminates vertebrate origins and cephalochordate biology,” Genome Research, vol. 18, no. 7, pp. 1100–1111, 2008.
  28. S. D'Aniello, M. Irimia, I. Maeso et al., “Gene expansion and retention leads to a diverse tyrosine kinase superfamily in amphioxus,” Molecular Biology and Evolution, vol. 25, no. 9, pp. 1841–1854, 2008.
  29. M. Punta, P. C. Coggill, R. Y. Eberhardt et al., “The Pfam protein families database,” Nucleic Acids Research, vol. 40, pp. D290–D301, 2012.
  30. J. D. Thompson, T. J. Gibson, F. Plewniak, F. Jeanmougin, and D. G. Higgins, “The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools,” Nucleic Acids Research, vol. 25, no. 24, pp. 4876–4882, 1997.
  31. K. Tamura, D. Peterson, N. Peterson, G. Stecher, M. Nei, and S. Kumar, “MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods,” Molecular Biology and Evolution, vol. 28, no. 10, pp. 2731–2739, 2011.
  32. M. Gouy, S. Guindon, and O. Gascuel, “SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building,” Molecular Biology and Evolution, vol. 27, no. 2, pp. 221–224, 2010.
  33. F. Abascal, R. Zardoya, and D. Posada, “ProtTest: selection of best-fit models of protein evolution,” Bioinformatics, vol. 21, no. 9, pp. 2104–2105, 2005.
  34. C. Notredame, D. G. Higgins, and J. Heringa, “T-Coffee: a novel method for fast and accurate multiple sequence alignment,” Journal of Molecular Biology, vol. 302, no. 1, pp. 205–217, 2000.

Copyright © 2012 Silvan Oulion et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
