International Journal of Genomics

International Journal of Genomics / 2014 / Article

Research Article | Open Access

Volume 2014 |Article ID 267482 |

Sergio Stefanni, Raul Bettencourt, Miguel Pinheiro, Gianluca De Moro, Lucia Bongiorni, Alberto Pallavicini, "Transcriptome of the Deep-Sea Black Scabbardfish, Aphanopus carbo (Perciformes: Trichiuridae): Tissue-Specific Expression Patterns and Candidate Genes Associated to Depth Adaptation", International Journal of Genomics, vol. 2014, Article ID 267482, 21 pages, 2014.

Transcriptome of the Deep-Sea Black Scabbardfish, Aphanopus carbo (Perciformes: Trichiuridae): Tissue-Specific Expression Patterns and Candidate Genes Associated to Depth Adaptation

Academic Editor: Elena Pasyukova
Received25 May 2014
Accepted19 Jul 2014
Published17 Sep 2014


Deep-sea fishes provide a unique opportunity to study the physiology and evolutionary adaptation to extreme environments. We carried out a high throughput sequencing analysis on a 454 GS-FLX titanium plate using unnormalized cDNA libraries from six tissues of A. carbo. Assemblage and annotations were performed by Newbler and InterPro/Pfam analyses, respectively. The assembly of 544,491 high quality reads provided 8,319 contigs, 55.6% of which retrieved blast hits against the NCBI nonredundant database or were annotated with ESTscan. Comparison of functional genes at both the protein sequences and protein stability levels, associated with adaptations to depth, revealed similarities between A. carbo and other bathypelagic fishes. A selection of putative genes was standardized to evaluate the correlation between number of contigs and their normalized expression, as determined by qPCR amplification. The screening of the libraries contributed to the identification of new EST simple-sequence repeats (SSRs) and to the design of primer pairs suitable for population genetic studies as well as for tagging and mapping of genes. The characterization of the deep-sea fish A. carbo first transcriptome is expected to provide abundant resources for genetic, evolutionary, and ecological studies of this species and the basis for further investigation of depth-related adaptation processes in fishes.

1. Introduction

The deep-sea (>1000 m depth) covers about 70% of the Earth’s surface, representing one of the last large unexplored areas on the planet. Only within the last few decades the technology has advanced sufficiently to reach the deep-sea effectively, revealing unexpected high levels of biodiversity and extremely diverse habitats (canyons, cold seeps, hydrothermal vents, deep-water coral reefs, mud volcanoes, seamounts, and trenches) of significant conservation interest and potential high economic values. Deep-sea environments are characterized by extremely high hydrostatic pressures (1 MPa every 100 m), lack of light, and low temperatures (down to 1-2°C). Therefore, fish as well as any other organism living in the deep-sea had to adapt to tolerate conditions of this extreme habitat [1].

First studies on adaptation to high pressure and low temperatures are dated back in the ‘70s and they report comparison of common proteins present in shallow and deep-water fishes [2, 3]. Key enzymes in muscle tissues that exhibit adaptive differences among species at different depths are the lactate dehydrogenase (LDH) and malate dehydrogenase (MDH) presenting differences in structural stability (reviews in [46]). More recent studies on evolutionary adaptation of functional genes to high pressure report unique amino acid substitutions in -skeletal actin and myosin heavy chain (MyHC) proteins in deep-sea fishes [710]. For deep-sea species inhabiting hydrothermal vents and cold seeps, environments characterized by high pressure, chronic hypoxia, and high concentrations of toxic compounds, molecular and functional adaptation of hemoglobins (Hbs) are reviewed in Hourdez and Weber [11]. Despite these studies, our knowledge on wide scale gene expression patterns in deep-sea fish remains elusive.

The black scabbardfish, Aphanopus carbo (Lowe 1839), is a bathypelagic species belonging to the Trichiuridae family and is distributed in temperate-cold Atlantic waters at depths between 200 and 1800 m [12, 13]. A. carbo represents a commercially valuable species for several regions of the Iberic peninsula, especially in Madeira where catches have reached up to 1000 tons per year [14] amounting to ca. 55% of the total landings. Recently this species has become increasingly targeted by Portuguese, French, and Irish fishing fleets ([15] and literature therein) and fishery data have shown a constant decline in population [16]. The information available on the biology, maturity, spawning, and growth of this species [17, 18] is scattered. Recent studies are reporting a panmictic distribution of this species in the NE Atlantic with multiple breeding sites at low latitudes [19]. It is also worth mentioning that, in southern locations, this species lives in sympatry with A. intermedius, a close related species with very similar morphology [20], therefore attracting interest for evolutionary studies.

High-throughput sequencing approaches applied to transcriptomics now provide a global perspective on taxonomic and functional profiling of genes expectedly expressed under the influence of environmental conditions in which these organisms live. Also known as next-generation sequencing, these techniques allow for a massive characterization of expressed sequence tags (ESTs) providing an overview of those genes expressed in a given tissue at any given time [21]. In silico analyses of massive gene libraries may serve several interests among others. For instances, from discovery and identification of new genes, characterization of gene expression, to development of novel genetic markers for quantitative trait locus (QTL), and population of genomic analyses. The breadth of next-generation sequencing applications extends over a variety of biological questions including those addressing pertinent questions regarding a species’ ecology, life history, and evolution [22, 23].

Previous studies regarding transcriptome sequencing and gene expression studies in deep-water species were mostly limited to hydrothermal vents invertebrates [24], microbial communities in hydrothermal plumes [25], deep-sediments [26], and in the water column [27] leaving vertebrates species virtually under-represented. The present work represents a pioneer study for deep-sea fishes providing new insights into the role of differential gene expression on the environmental adaptation of deep-sea black scabbardfish.

Here we describe the assembly and annotation of the transcriptome of A. carbo obtained by sequencing mRNA libraries of six tissues (spleen, brain, heart, gonads, liver, and muscle) and explore functional genes whose sequence might be associated to depth adaptation. Additionally, we tested the correlation of selected candidate genes comparing the number of contigs against the gene expression normalized to a relative value of 1.0, as determined by qPCR amplification. Furthermore, the screening of the libraries allowed the identification of new EST-simple-sequence repeats (SSRs) and the design of primer pairs suitable for population genetic studies as well as tagging and mapping of genes.

2. Methods

2.1. Fish and Tissue Samples

Specimens of Aphanopus carbo (two males and two females) were collected in 2009 onboard of the RV “Arquipelago. A. carbo were fished at depth range 1100–1250 m using deep-water long-lines in proximity of the Condor Seamount, located approximately at 15 nm SW of the island of Fayal (Azores, Portugal). The four specimens used in this study were caught on the same longline set and once onboard, the freshly caught animals were dissected and portions (or complete organs) of spleen, brain, heart, gonads, liver, and muscle tissues were preserved in formamide solution and kept at −20°C until RNA extraction was performed.

To validate the correct identification of the species, a small portion of muscle tissue was also preserved in 95% ethanol for molecular screening following Stefanni et al. [28] protocols.

2.2. RNA Extraction and Sequencing

Total RNA was extracted from 20 to 40 mg of each of the six preserved tissues of a pool of four A. carbo individualsusing the RiboPure kit (Ambion, Applied Biosystems). Quantity and purity of the RNA was determined on a 1.4% agarose-MOPS-formaldehyde denaturing gels and by assessing the and ratios using the NanoVue spectrophotometer (GE Healthcare). Poly-A RNA was extracted from 15 μg of each total RNA sample using the Poly(A)Purist mRNA Purification kit according to manufacturer’s instructions (Ambion, Applied Biosystems). mRNAs were transcribed into cDNA utilizing Mint-2 cDNA synthesis kit (Evrogen, according to manufacturer’s instructions for NGS platforms. Six cDNA libraries were constructed from mRNA of individual pools of tissues and sequenced in a single 454 GS FLX Titanium run. Each of the cDNA libraries was characterised by unique sequence tags (MIDs) that allowed to trace back the sequences generated from single tissues after assembly.

cDNAs were sheared by nebulization to yield random fragments approximately 500–800 bp in length, by applying 30 psi (2,1 bar) of nitrogen for 1 minute on 4 μg of each library. The distribution of fragments was verified on a BioAnalyser DNA 7500 LabChip (Agilent Technologies). The fragmented cDNA sample was end-repaired with T4 DNA polymerase and T4 polynucleotide kinase and adaptor sequences ligated according to the manufacturer’s instructions [29]. The fragments were immobilized onto streptavidin beads and nick-repaired with Bst polymerase. The cDNA fragments were denaturated with alkali to yield single stranded cDNA (sscDNA) library. Quality of the library was assayed on a BioAnalyser RNA 6000 Pico LabChip (Agilent Technologies) and quantity measured by spectrofluorimetry with the Quant-iT RiboGreen RNA Assay kit (Invitrogen). A titration was set up at 1, 2, 4, and 8 copies per bead (cpb) in the clonal amplification by emulsion PCR to optimize yield and sequence quality. The percent enrichment of beads carrying the sscDNA was determined and the amount of library input calculated to 18%. A large scale emulsion PCR was set-up based on the previous value and sequenced at Biocant (Cantanhede, Portugal) using the 454 GS FLX Titanium pyrosequencing on a full 70X75 PicoTiterPlate, according to manufacturer’s instructions (Roche).

2.3. Bioinformatic Analyses

High quality reads were assembled using Newbler ver. 2.6 (Roche 454) sequence analysis software. All reads were identified and grouped by their unique MIDs to the tissue of origin. Trimming and masking the polyAs was a common procedure for the assembling tool.

The assemblage is characterized by read overlaps and multiple alignments made in nucleotide space. Consensus base-calling and quality value determination for contigs are performed in flow space. The use of flow space in determining the properties of the consensus sequence results in an improved accuracy for the final base-calls. The implementation of this software was performed using default parameters. Assembled contigs were annotated through sequence similarity searches against the National Centre for Biotechnology Information (NCBI) nonredundant (nr) protein database using the BLASTx [30] with a cut-off criterion of an expect-value (-value) < . The contigs that did not find a hit were further processed with ESTScan ( The two assemblages of amino acid sequences, resulting from the BLASTx searches at high level of stringency and the ESTScan, were processed by InterProScan for functional annotation of transcripts applying the function for the mapping of gene ontology (GO) terms. The GO method classifies genes within a hierarchy using a systematic nomenclature of attributes that can be assigned to all gene products independently from the organism of origin. To reduce the redundancy in the consensus sequences which correspond to the same gene we used BLASTClust to detect similar assemblies with 95% identity and 90% coverage. All the results from both assemblage methods were loaded into a SQL database developed for this purpose.

To validate the accuracy of the assembly, the resulting contigs were compared to previously sequenced transcriptomes of 6 teleosts including Danio rerio, Gasterosteus aculeatus, Oreochromis niloticus, Oryzias latipes, Takifugu rubripes, and Tetraodon nigroviridis, using tBLASTn [30] to find protein homologs at two levels of stringency (-value < and -value < ).

To identify protein conserved domain specific for each tissue analysed a new annotation was performed with Hmmer against the Pfam database (ver. 25.0). Protein domain representativeness for each tissue was obtained comparing protein domain abundance in a particular tissue versus all the tissues compiled together using a hypergeometric test.

2.4. cDNA Synthesis and qPCR Validation Tests

Fresh cDNA was synthesised from the six mRNAs that were used for pyrosequencing, cDNA synthesis was performed using primers with oligo(dT) and the ThermoScript RT-PCR System (Invitrogen) following the manufacturer’s instructions.

A set of 28 genes were selected including candidates that were tissue-specific and genes that were encountered in the tissue expressed at similar as well as at different amounts in all the six libraries, with the aim of covering most of the possible expression scenarios within the dataset. Frequencies of contigs for all candidates genes in the mRNA libraries were obtained by detecting orthologous gene sequences using the BLAST tool included in the A. carbo database.

For the design of all qPCR primers (Table 1) we used the web interface NCBI Primer-BLAST ( Alignments of the sequences provided by the output from the internal blast search were used to select all primer sets.

GenePrimer namePrimer Sequences (5′-3′)Size (bp)NCBI Accession #

Elongation factor 1-betaEF-1B L GCTTGGACATGTCGGTCTCGTC229 bpAll_gs454_000396
Ras-related GTP-binding protein ARab-1A L AGTAGCCGTTCCACCTTGTCGG247 bpAll_gs454_000598
Basic Transcription Factor 3-like 4BTF3 L CCCAAAGTTCAGGCCTCCCTGT273 bpAll_gs454_000873
Cu/Zn Superoxide DismutaseSOD-1 L AAACGTGACTGCAGGAGGGGAT240 bpAll_gs454_000925
2-Cys PeroxiredoxinPRDX1 L CCGATAACCTCGCAGCCGATAC243 bpAll_gs454_000558
Heat Shock Protein 90HSP90 L TGACGATGTCCCCACAGATGAGG221 bpAll_gs454_000008
Ferritin, heavy subunitFerr L CCTGCAGCTTGAGAAGAGCGTC203 bpAll_gs454_000681
GlobinHb-A L AAATTGTTGGCCATGCGGAGGA208 bpAll_gs454_001919
GlobinHb-B L TCGTCTACCCCTGGTGTCAGAG245 bpAll_gs454_001018
Ependymin-1 precursorEPD-1 L CAGGTGTGAGGCAGTGCAGT230 bpAll_gs454_000469
Fatty acid-binding protein, brainBLBP L CAACACTTCTTGGCCGGTTTGG239 bpAll_gs454_001220
CD63-like protein Sm-TSP-2TSPAN-8 L TCGCTGGCTGCTCTGAGAAAGA200 bpAll_gs454_000381
Tropomyosin 4 isoform 1TRPM-1 L GTGGAGGAGGAGTTGGACCGAG221 bpAll_gs454_000222
C-Myc-binding proteinMYCBP L CGCCAGTTTACCTGCGTTCCAA182 bpAll_gs454_001640
Cathepsin SCTSS L AACAGCCTACCCCTACACAGCC200 bpAll_gs454_000156
TransferrinSTF-1 L AGCTGCACCAGCTTCACAGTTG215 bpAll_gs454_000004
Warm Temperature Acclimation related-like 65 kDa proteinHPX L TGATACCGGGTGGAACCTGGTG207 bpAll_gs454_000060
Betaine Homocysteine S-methyltransferaseBHTM L GGGGGTTCGCTGTTACCAAGTG194 bpAll_gs454_000088
FUCL1 Fucolectin-1FUCL1 L CGCAAACCCTTTGGCTGGTGTA196 bpAll_gs454_000758
Aldolase BALDB L GCCATTGGTCTTGGCCCTGATC220 bpAll_gs454_000115
Type-4 ice-structuring protein LS-12 precursorISP LS12 L AAGACCTGACAAACCAGGCCCA198 bpAll_gs454_001277
Alcohol Dehydrogenase 8aADH L GGCAAGAAGGTGCTGCAGTTCA228 bpAll_gs454_000105-6
Glyceraldehyde-3-phosphate DehydrogenaseGAPDH L GTCAACCACTGACACGTTGGGG229 bpAll_gs454_000148
Lactate Dehydrogenase-ALDH-A L TCTTAACCTGGTGCAGCGCAAC219 bpAll_gs454_000149
Phosphoglycerate Mutase 2-1 (muscle)PglyM L ACACCTCTGTGCTGAAACGTGC212 bpAll_gs454_000309
Heat Shock Protein 70HSP70 L CGGTGTTGTGTGCTGGGTGAAA207 bpAll_gs454_000005
Fructose-bisphosphate Aldolase AFBPA L GGAACCAACGGCGAGACAACAA208 bpAll_gs454_002732
Phosphoglucose Isomerase-2PGI L CCACACTGGGCCAATTGTCTGG217 bpAll_gs454_000011

Gene expression, calculated as relative expression, was determined by means of real-time PCR using the CFX96 (Bio-Rad). Primer concentrations and sample dilutions were optimized to meet highest efficiency in the PCR reaction in a total reaction volume of 20 μL. Fluorescent signal was determined by the addition of SsoFast EvaGreen Supermix (Bio-Rad) which was included in the cocktail accordingly to manufacturer’s instructions. Baseline and threshold cycle were always set to automatic in the sequence detection software, CFX Manager (Bio-Rad). All plates contained a “no template control” (NTC) and each sample was tested in duplicate. Cycling conditions for gene amplifications were 95°C for 3 min followed by 35 cycles of 95°C for 10 s, 56°C for 10 s, and 68°C for 15 s. An additional protocol for melting curves analysis included a cycle at 95°C followed by a progressive reading of fluorescence for every cycle from 65°C to 95°C for 5 s at intervals of 0.5°C. Gene expression, normalized to a relative value of 1.0 for all the genes selected and for each tissue, was compared to the contigs frequencies generated by the assembly platform to determine the significance of correlation between qPCRs values and 454 sequencing reads from unnormalized cDNA libraries.

2.5. Characterization of Depth-Related Functional Genes

The predicted amino acid sequences of functional genes of A. carbo possibly related to depth adaptations were compared to those deposited in NCBI database searching for homologies. We aligned and compared translated sequences of lactate dehydrogenase (LDH-A and LDH-B), cytosolic malate dehydrogenase (MDHc), hemoglobins (Hb-A and Hb-B), actin (ACTA1), and myosin heavy chain (MyHC). Protein and nucleotide sequences were aligned using Clustal X [31] while sequence analysis and phylogenetic inferences were performed using CLC Main Workbench (v. 6.8.2, CLC Bio). The neighbor-joining (NJ) algorithm [32] was implemented to construct a phylogenetic tree using HKY substitution model and attributing a gap penalty of 10. The support for internal branches was assessed using the bootstrap [33] with 1000 replicates.

Nucleotide alignments and ML trees built, implementing the most appropriate substitution model under the Akaike Information Criterion (AIC), were used in the program “codeml” in PAML 4 [34] to assess selective pressure on those genes for which the complete sequences was available. Positive (or negative) selected sites were defined by the ratio between nonsynonymous versus synonymous substitutions ( or ). Two models were tested comparatively: M1 which groups codons in two classes ( and ), clustering sites under negative or neutral selection; and M2 which groups codons in three classes (, and ), adding a cluster for sites under positive selection to the ones defined in M1. Probabilistic measures of how well these models fits the evolutionary relationship of individual genes were calculated from the likelihood values of fitted models and the number of “free parameters” for all genes.

Protein stability was estimated using a virtual quantification software [35] calculating Gibbs free energy in terms of kinetic and thermodynamic quantities taking into account each amino acid contribution for the maintenance of the native structure of the protein. For a protein to maintain its stability there is a need of sufficient hydrophobic residues which will utilize free energy to guide proper folding [35].

2.6. EST-SSR Resources for Population Genetics

Among various molecular markers, simple sequence repeats (SSRs) are highly polymorphic, easier to develop, and very useful for researches such as genetic diversity assessment. Therefore A. carbo database was further used to detect such regions in the transcriptome sequencing data and provide a list of combination of primer sets on flanking regions that could be used for population genetic studies. To identify EST-SSRs, all the contigs were searched using MISA [36] and for primer design we used Primer3 [37]. The algorithm of the SSR finder identifies a good quality repeat when one locus is present with adjacent loci at an up or downstream distance higher than 100 bp and parameters were set to locate a minimum of 20 bp sequence repeats: di-mers (x12), 3-mers (x8), 4-mers (x5), 5-mers (x5), and 6-mers (x5). Primer design was performed setting parameters of a minimum size of 20 bp and melting temperatures of 60°C.

3. Results and Discussion

3.1. Sequences Assemblage and Functional Annotations

After sequence trimming, a total of 544,491 high quality reads were produced with an average length of 237 bp corresponding to 129.5 Mb. A total of 8,319 contigs were assembled with Newbler (ver 2.6) as high quality consensus sequences without the presence of singletons. A summary of EST data for each of the six tissues is reported in Table 2. A total of 2,440 assembled contigs were annotated against the NCBI nonredundant protein database at the cut-off (-value) < 10−6. Additional 3,843 contigs with no protein matches were further processed with ESTScan to find 2,715 homologous proteins. All the contigs were then processed by InterProScan for functional annotation of transcripts and mapping functional information to gene ontology (GO) terms. Of the 5,155 amino-acid sequences (out of the 6,283 totally sequenced), 1,728 could be annotated within the GO hierarchy (Figure 1), while 2,395 could be annotated accordingly to InterProScan. The complete GO mapping of A. carbo for individual tissues can be accessed on the dedicated online database ( The largest proportions of GO functional categories are of similar proportion to the unnormalized libraries of Tilapia [38]. In the Cellular Components GO group (Figure 1(a)), the genes involved in cell and cell part functional categories corresponded to the largest percentage of the pie chart (28.15% each). In the Molecular Function GO group (Figure 1(b)), almost half of the total genes were involved in binding (47.40%), followed by those related to catalytic activity (27.53%). Finally, for the Biological Processes GO group (Figure 1(c)), the largest portion of the genes were involved in metabolic processes (33.45%), tightly followed by the category of genes linked to the Cellular Process (31.93%).


Total EST15,03433,33773,263157,275134,52392,788544,491
Total bases3,426,5108,219,50017,647,60037,123,70031,792,60023,342,900129,412,000
Average contig length470619567465612689555
Contigs -value < 10−62204206229516174092,440
No similarity3670109406211741,128
GO annotation2023384736235093071,728
InterPro annotation2234176109086493952,395

To further validate the accuracy of the A. carbo assembly, we compared our dataset to the transcriptomes of six other prior sequenced teleosts including Danio rerio, Gasterosteus aculeatus, Oreochromis niloticus, Oryzias latipes, Takifugu rubripes, and Tetraodon nigroviridis. Similarity searches were performed comparing our assembled contigs against each of the available transcriptomes using tBLASTn [30]. The results highlighted strong similarity between the transcriptomes of A. carbo and other teleosts indicating that about 30% of the total contigs matched protein homologs (-values ranging between 2,268 and 2,457) and that our assembly presents the highest similarity (by little margin) with the transcriptome of Gasterosteus aculeatus (-value < ) (Table 3).

SpeciesSequences availableA. carbo
-value < 10−3
A. carbo
-value < 10−10

Danio rerio 42,7872,4572,199
Gasterosteus aculeatus 27,5762,4452,223
Oreochromis niloticus 26,7632,4362,202
Oryzias latipes 24,6742,4122,173
Takifugu rubripes 47,8412,2682,062
Tetraodon nigroviridis 23,1182,3002,073

A total of 1639 (20%) transcripts were annotated by Pfam protein domains matches and the set of protein domains characterizing each single tissue was identified by hypergeometric test (Table 4).


Pfam IDDomainFreqPfam IDDomainFreqPfam IDDomainFreq

PF00100Zona pellucida173.15 10−13PF00011HSP2034.03 10−4PF00084Sushi141.44 10−10
PF00125Histone86.06 10−5PF13405EF hand 456.77 10−4PF00089Trypsin207.59 10−9
PF01400Astacin33.22 10−3PF00412LIM31.52 10−3PF00079Serpin125.63 10−6
PF00069Pkinase30.01PF05556Calsarcin33.60 10−3PF07678A2M comp41.35 10−4
PF00653BIR20.02PF01576Myosin tail 133.60 10−3PF01042Ribonuc L-PSP31.26 10−3
PF13424TPR 1220.02PF00056Ldh 1 N33.60 10−3PF00045Hemopexin31.26 10−3
PF13695zf-3CxxC20.02PF05300DUF73725.50 10−3PF00059Lectin C61.69 10−3
PF09360zf-CDGSH20.02PF00992Troponin46.40 10−3PF00701DHDPS44.64 10−3
PF01712dNK20.02PF00595PDZ36.82 10−3PF00386C1q86.63 10−3
PF00538Linker histone20.02PF00022Actin30.01PF00021UPAR LY650.01
PF10178DUF237220.02PF02874ATP-synt ab N20.02PF00754F5 F8 type C90.01
PF04856Securin20.02PF13895Ig 230.02PF08702Fib alpha20.01
PF00250Fork head20.02PF00191Annexin20.03PF03982DAGAT20.01
PF01498HTH Tnp Tc3 230.03PF00212ANP20.03PF01048PNP UDP 120.01
PF00268Ribonuc red sm20.06PF05347Complex1 LYR20.07PF01014Uricase20.01


Pfam IDDomainFreqPfam IDDomainFreqPfam IDDomainFreq

PF01410Troponin71.99 10−5PF01669Myelin MBP44.03 10−5PF00042Globin51.33 10−6
PF02807COLFI39.67 10−4PF01453B lectin26.44 10−3PF00078RVT 151.33 10−6
PF00041ATP-gua PtransN33.59 10−3PF00612IQ26.44 10−3PF00993MHC II alpha45.02 10−6
PF02453fn333.59 10−3PF11414Suppressor APC26.44 10−3PF07686V set52.49 10−6
PF13895Reticulon33.59 10−3PF05768DUF83626.44 10−3PF07654C1 set54.71 10−4
PF00365Ig_247.97 10−3PF05196PTN MK N26.44 10−3PF00089Trypsin52.01 10−3
PF01216PFK29.84 10−3PF04300FBA26.44 10−3PF01391Collagen22.30 10−3
PF01267Calsequestrin29.84 10−3PF00287Na K-ATPase26.44 10−3PF00240ubiquitin33.28 10−3
PF00261F-actin cap A29.84 10−3PF11032ApoM38.53 10−3PF09307MHC2-interact26.67 10−3
PF00856Tropomyosin29.84 10−3PF00061Lipocalin40.01PF00643Zf-B box20.01
PF01661SET29.84 10−3PF00007Cys knot20.02PF13445Zf-RING LisH20.01
PF01667Macro20.03PF01275Myelin PLP20.02PF01498HTH Tnp Tc3 220.02
PF00036Ribosomal S27e20.05PF00300His Phos 120.02PF02301HORMA10.05
PF01576efhand20.05PF00230MIP20.02PF14259RRM 610.05

3.2. qPCR Assays and Validation Tests

Quantitative PCR assays were performed using cDNA samples at 1 : 400 dilution (up to 100 ng approximately in the qPCR master mix), conditions at which the PCR resulted to be more efficient. We tested seven series of dilutions ranging from 1 : 50 to 1 : 1000. Based on optimized qPCR conditions; all targeted cDNAs were tested across all tissues for the complete set of genes selection used in this study. Normalized expression to a relative value of 1.0 for 28 genes is shown in Figure 2 (see: Figure S1 in Supplementary Material available online at

Statistical tests were performed to identify the correlation between the mean value of normalized expression and a relative value of 1.0, as determined by qPCR against contig frequencies using Pearson’s coefficient and its significance (Figure 3). Most of the genes showed highly significant correlation; however one case indicated a reduced significance (elongation factor) and a few other cases resulted to be not significant (e.g., Ras-related GTP-binding protein, Basic Transcription Factor 3, Cu/Zn Superoxide Dismutase and 2-Cys Peroxiredoxin). Reduced or lack of significance appeared to be related to a low number of contigs from the 454 sequencing (Table 5), which might be regarded as an indication of failure to reach library saturation. Low values in contig frequency should correspond to a reduced level of expression as the libraries are nonnormalized; therefore caution should be used when exploring 454 outputs below a certain threshold. Our data derived from the reading of a single plate were insufficient for systematical comparison of all the genes, although the majority of the genes show a high correlation between the 454 data and the qPCR. However, it should be emphasized that this exercise was not meant to be a surrogate to the qPCR approach for the study of gene expression but more a proxy for a preliminary screening of differentially expressed genes in multiple libraries.

GeneContig codeSpleenBrainGonadsHeartLiverMuscle

ISP LS12isotig0300400002111

3.3. Candidate Genes Associated to Depth

The predicted amino acid sequences of A. carbo functional genes putatively related to depth adaptations were compared to those deposited in NCBI database for homologies searches. Complete alignments of orthologous protein sequences from the set of functional genes that have been reported as responsive to depth adaptations included representatives of Teleosts belonging to several families (Table 6). Exploring the levels of similarities expressed as percentage of amino acid identity or number of amino acid that differ among sequences between A. carbo and other fish species (Table 7) brings evidence supporting the fact that deep-living as well as polar species are not the most similar to the black scabbard fish. We attempted to reconstruct the phylogenies for those species using amino acid sequences of functional genes (LDH-A, LDH-B, MDHc-A, Hb-A and Hb-B) and corresponding mtCOI nucleotide sequences (Figure 4). The resulting trees showed similar topologies, suggesting that the signals embedded in these functional genes reflect evolutionary divergence among taxa rather than any enzyme adaptations relationships.

OrderFamilySpeciesCommon nameEnvironmentClimateDepth rangeGeneNCBI Acc. Nr

PerciformesTrichiuridaeAphanopus carbo Black scabbardfishMDeep-water200–1700COIEU854076
BeloniformesAdrianichthydaeOryzias latipes Japanese rice fishFW + BRSubtropicalshallowACTA1, MDHc, MyHC, COINM_001104806, NM_001163134, XM_004071618, AB498066
ScorpaeniformesHexagrammidaePleurogrammus azonus Okhotsk atka mackerelMTemperate0–240ACTA1AB073381
PerciformesScombridaeScomber scombrus Atlantic mackerlMTemperate0–200 (0–1000)ACTA1, COIEF607093, KC015895
PerciformesPercihcthyidaeSiniperca chuatsi Mandarin fishFWTemperate10ACTA1, MyHC, COIAY395872, AY454304, NC_015822
PerciformesSparidaeSparus aurata Gilthead seabreamMTemperate1–30 (1–150)ACTA1AF190473
PerciformesSphyraenidaeSphyraena idiastes Pelican barracudaMTropical3–24ACTA1, LDH-A, MDHc, mMDHAF503593, SIU80001, AF390559, AF390561
TetraodontiformesTetraodontidaeTakifugu rubripes Japanese pufferfishM + FW + BRTemperate0–200 (0–1000)Hb-A, mMDH, COIXM_003964767, XM_003965959, HM102315
ScorpaeniformesAnoplopomatidaeAnoplopoma fimbria SablefishMDeep-water0–2740Hb-B, COIBT082849, JQ353978
PerciformesSerranidaeEpinephelus coioides Orange-spotted grouperM + BRSubtropical1–100Hb-BGU982530
GasterosteiformesGasterosteidaeGasterosteus aculeatus Three-spined sticklebackM + FW + BRTemperate0–100Hb-BNM_001267638
PerciformesNototheniidaeNotothenia coriiceps Black rockcodMPolar0–550LDH-A, MyHC, COIAF079822, AJ243767, EU326390
PerciformesNototheniidaeNotothenia angustata Maori chiefMTemperate0–100Hb-A, Hb-BP62363, P29628
GadiformesMacrouridaeCoryphaenoides armatus Abyssal grenadierMDeep-water282–5180LDH-B, MyHC, COIAJ609232, AB330140, FJ164497
GadiformesGadidaeGadus morhua Atlantic codM + BRTemperate0–600LDH-B, COIAJ609233, KC015385
GadiformesGadidaeArctogadus glacialis Arctic codMDeep-water0–1000Hb-A, COIQ1AGS4, KC015200
PerciformesLatidaeLates calcarifer BarramundiM + FW + BRTropical10–40LDH-B, COIFJ439507, JQ431879
GadiformesGadidaeMerlangius merlangus WhitingMTemperate30–100 (10–200)LDH-B, COIAJ609234, JQ623954
CyprinodontiformesPoeciliidaePoecilia reticulata GuppyFW + BRTropicalShallowLDH-B, COIEF408825, JX968696
GadiformesMacrouridaeTrachyrincus murrayi Roughnose grenadierMDeep-water0–1630LDH-B, COIAJ609235, AP008990
PerciformesChannichthyidaeChionodraco rastrospinosus Ocellated icefishMPolar0–1000LDH-A, COIAF079829, EU326337
PerciformesPomacentridaeChromis caudalis Blue-axil chromisMTropical15–55LDH-AAY289558
PerciformesGobiidaeRhinogobiops nicholsii Blackeye gobyMSubtropical?-106LDH-AAF079534
CypriniformesCyprinidaeCyprinus carpio Common carpFW + BRSubtropicalshallowLDH-A, MyHC, COIAF076528, D89992, HQ960709
CyprinodontiformesFundulidaeFundulus heteroclitus MummichogM + FW + BRTemperateshallowLDH-A, LDH-B, COIL43525, L23792, EU524629
OsmeriformesOsmeridaeOsmerus mordax Rainbow smeltM + FW + BRTemperate0–425MDHc, mMDHBT075651, BT075600
SalmoniformesSalmonidaeSalmo salar Atlantic salmonM + FW + BRTemperate10–23 (0–210)MDHc, mMDHBT060183, BT048216
GadiformesMacrouridaeCoryphaenoides acrolepis Pacific grenadierMDeep-water900–1300MyHC, COIAB330141, JQ354060
GadiformesMacrouridaeCoryphaenoides yaquinae n.a.MDeep-water3400–5800MyHC, COIAB330139, GU440291
PerciformesCirrhitidaeParacirrhites forsteri Blackside hawkfishMTropical5–35MyHC, COIAJ243770, HQ561521
PerciformesCarangidaeSeriola dumerili Greater amberjackMSubtropical18–72 (1–360)MyHC, COIAB032020, KC015917
GadiformesGadidaeBoreogadus saida Polar codM + BRPolar0–400Hb-A, Hb-B, COIDQ125471, Q1AGS6, KC015250

(a) LDH-A (332 AA)

All_gs454_00149 vs. Diff.Id. %GapsAcc. nr.

Sphyraena idiastes 1695.180U80001
Rhinogobiops nicholsii 1794.880AF079534
Chromis caudalis 2193.670AY289558
Chionodraco rastrospinosus 2692.171AF079829
Notothenia coriiceps 2791.871AF079822
Fundulus heteroclitus 2791.870L43525
Cyprinus carpio 4486.750AF076528

(b) MDHc-A (333 AA)

All_gs454_00146 vs. Diff.Id. %GapsAcc. nr.

Oryzias latipes 1595.500NM_1163134
Sphyraena idiastes 2093.990AF390559
Osmerus mordax 3090.990BT075651
Salmo salar 3589.490BT060183

(c) Actin-1 (375 AA)

All_gs454_00104 vs. Diff.Id. %GapsAcc. nr.

Scomber scombrus 1 01000EF607093
Iniperca chuatsi 1 01000AY395872
Oryzias latipes 1 199.730NM_1104806
Pleurogrammus azonus 1 199.730AB073381
Coryphaenoides acrolepis 1 199.730AB021649
Coryphaenoides cinereus 1 199.730AB021651
Coryphaenoides armatus 2a 299.470AB086240
Coryphaenoides yaquinae 2a 299.470AB086242
Coryphaenoides acrolepis 2a 299.470AB021650
Coryphaenoides cinereus 2a 299.470AB021652
Cyprinus carpio 1 299.470AY395870
Sphyraena idiastes 1 399.200AF503593
Coryphaenoides armatus 2b 598.670AB086241
Coryphaenoides yaquinae 2b 598.670AB086243
Aphanopus carbo 2a 698.400All_gs454_00102
Sparus aurata 1 997.600AF190473

(d) LDH-B (334 AA)

All_gs454_00143 vs. Diff.Id. %GapsAcc. nr.

Lates calcarifer 1495.810FJ439507
Fundulus heteroclitus 2193.710L23792
Trachyrincus murrayi 4785.930AJ609235
Coryphaenoides armatus 5085.030AJ609232
Merlangius merlangus 5284.431AJ609234
Gadus morhua 5583.531AJ609233

(e) MyHC1 (305 AA)

All_gs454_00001 vs. Diff.Id. %GapsAcc. nr.

Paracirrhites forsteri 5394.950AJ243770
Seriola dumerilii 5494.860AB032020
Siniperca chuatsi 5994.380AY454304
Oryzias latipes 6294.102XM_4071618
Cyprinus carpio 6993.432D89992
Coryphaenoides acrolepis 8691.821AB330141
Notothenia coriiceps 8691.821AJ243767
Coryphaenoides yaquinae 8991.531AB330139
Coryphaenoides armatus 9291.251AB330140

(f) Hb-A1 (143 AA)

All_gs454_01074 vs. Diff.Id. %GapsAcc. nr.

Notothenia angustata 3972.731P623632
Boreogadus saida 4369.930DQ125471
Arctogadus glacialis 4469.930DQ125475
Takifugu rubripes 4667.830XM_3964767
Gadus morhua 5362.940O424252

(g) Hb-B1 (146 AA)

All_gs454_01018 vs. Diff.Id. %GapsAcc. nr.

Epinephelus coioides 2781.630GU982530
Anoplopoma fimbria 3178.910BT082849
Gasterosteus aculeatus 3178.910NM_1267638
Boreogadus saida 4072.790Q1AGS62
Notothenia angustata 4271.430P296282

The lactate dehydrogenase A (LDH-A: isotig01401) was encountered abundantly in the muscle tissue (Table 5, Figure 2) and its comparison with other orthologous sequences differs for a number of amino acids between 16 and 44. One indel at position 75 had to be added to the two polar species to obtain a complete alignment (Table 7). The percentage of identity was higher with the strictly marine species ranging between 95.2 and 93.7%. Unfortunately, no sequences from neither deep-sea nor abyssal fishes were available for comparison. On the other hand, lactate dehydrogenase B (LDH-B: isotig01423) in A. carbo was highly represented in the heart while very scarcely recorded in the other tissues (Table S1). The largest sequence divergence in terms of amino acid substitutions ranges between 47 and 55 when compared with deep-water macrourids, and similarly to LDH-A, an indel had to be added in the sequences of the gadiform species at position 76 to obtain a complete alignment (Table 7). Previous studies on benthopelagic fish report a significant decline of LDH activities with increasing water depth [39]. However, it is known that variation in enzyme activities of fish at a given depth is influenced by feeding behaviour and locomotory modes [40, 41].

Cytosolic malate dehygrogenase A (MDHc-A: isotig01010) was detected in most of the tissues, with the exception of the muscle (Table S1). An isoform of MDHc was also detected but its sequence covered only the last 110 amino acids (isotig01731). However the type of substitutions and tissue expression pattern (Table S1) according to Merrit and Quattro [42], lead us to believe that this cytosolic isoform is MDHc-B. It has been reported that the two teleost MDHc isozymes are the products of a gene duplication event after the separation of teleosts and tetrapods (see [42] and references therein) although the exact timing of this duplication has not been inferred.

Further comparison analyses were carried out only for MDHc-A, with orthologous sequences obtained from NCBI database where only shallow-water fish sequences were available. The complete alignment did not include any indel, and sequences differed from A. carbo between 15 and 35 amino acid substitutions, representing a percentage of identity ranging from 95.5 to 89.5%.

Two -skeletal actins were detected from the A. carbo database: the isoform 1 (isotig01459), expressed in shallow-water species and probably not functioning in abyssal fishes, and the isoform 2a (isotig01561). The mutations specific for A. carbo in the actin 2a were the substitutions Asp3Glu, Ala155Ser (a mutation also common with the isoform 2b [7]), Ser234Val, Val165Ile, Leu261Val, and finally Ala278Thr. This type of isoform was reported in other deep-sea species as Coryphaenoides acrolepis, C. cinereus, C. yaquinae, and C. armarus [7]. The isoform 2b, so far reported only in abyssal species, was not detected in A. carbo.

Actins 1 and 2a were significantly expressed in muscle tissue with evidence of its presence also in the heart (Table S1). Direct comparison of A. carbo actin 1 with orthologous sequences reported for other marine and deep-water species differed by just 1 amino acid at position 3 (Table 7). The number of substitutions increases to 2 when this sequence is compared to the isoform 2a of Coryphaenoides species (substitution at position 155) and to 5 amino acids replacement (Table 7) when compared to the isoform 2b, two of which are unique (positions 116 and 137) [7]. Actin has a function in polymerization of G-actin to F-actin in neutral salts. While the volume is increased following polymerization, the reaction is strongly affected by high pressure [43]. It is reported that the substitutions of Val54Ala or Leu67Pro reduced the volume change associated with actin polymerization [7].

The assemblage of the myosin heavy chain protein (MyHC: isotig02568 and isotig1394) obtained in A. carbo was incomplete (All_gs454_000756 and All_gs454_000001) containing a gap of 711 amino acids at the positions from 171 to 882 of the 1933 AA of the complete sequence. Unfortunately, within this gap there are the two loop regions with characteristic structures that are uniquely reported for deep-sea fish: loop-2 region is shorter and the loop-1 region has a proresidue [9]. MyHC was almost uniquely found in muscle tissue (Table S1) and for comparison analyses with orthologous genes from other fish we only used the last 305 AA (All_gs454_000184) of the complete sequence. The lowest number of amino acid substitutions ranged between 53 and 69 (corresponding to sequence identity between 94.95 to 93.43%) when the sequence from A. carbo is compared to its relative shallow water marine and freshwater species, while this value increases up to 92 amino acid substitutions (and corresponding sequence identity of 91.25%) when compared to its relatives from deep water or polar regions (Table 7).

Variation in globin sequences was analysed exploring the - and -chains (Hb-A: isotig00247 and Hb-B: isotig00163), whose relative expressions were detected more abundantly in the spleen, followed by liver, heart, brain, and muscle and virtually absent in the gonads (Figure 2, Table 5). Comparison analyses with orthologous sequences of deep-sea gadiforms and a notothenoid indicate that the number of amino acid substitutions ranged between 39 and 53 (Hb-A) and between 31 and 42 (Hb-B). The complete alignments included the additional single indel for the -chain of Notothenia angustata at position 102. Notothenioids acquired a completely different globin genotype with respect to other teleostean groups. The Antarctic ichthyofauna (dominated by a single taxonomically uniform group) lost its globin multiplicity in correlation with temperature stability. On the other hand, for the Arctic ichthyofauna it may have been advantageous to maintain a multiple globin system, helping to deal with environmental changes and metabolic demands [44].

Selective pressure in the site-by-site patterns among species was evaluated for LDH-A, -B, MDHc, and ACTA1 (isoform 1). There was no evidence of positive selection at the nucleotide site level in any of those genes whose global ω value was very low in all cases (from 0 in ACTA1 and 0.032 in LDH-B) (Table S2). The proportion of sites supporting positive selection, the model M2, was null (LDH-A and -B) or extremely low (0.3% in MDHc and 1.6% in ACTA1) (Table S2). These results indicate that the evolution of these four genes in teleosts is constrained by very stringent selective pressure (model probabilities for M1 versus M2 ranging from 87% in MDHc and 88% in all others).

In terms of protein stability calculated as Gibbs free energy and taking into consideration kinetic and thermodynamic quantities, we explored only the functional genes for which the complete sequences were obtained (Figure 5). In this context, protein stability is defined by the ability of a protein to retain its structural conformation or its activity when subjected to physical or chemical manipulations; therefore the energy consumed by activation to promoted folding has to be compensated by thermal stability provided by the energy of denaturation [35]. Hence, protein stability is quantitatively calculated by the standard Gibbs energy change (), allowing comparison of stabilities for different proteins [45].

The normalised difference in protein stability () of lactate dehydrogenase A (LDH-A) was lower in A. carbo compared to fish from polar waters and tropical marine (Figure 5). This gradual variation was primarily dictated by the lower contributions of leucine and lysine in the hydrophobic trend of kinetic calculation in the A. carbo sequence (Table S3). The substantial drop in of LDH-B in A. carbo compared to the orthologous sequences of polar, abyssal, and shallower marine fishes (Figure 5) was due to a larger contribution of glutamic acid (Table S3) promoting the thermal denaturation of the protein. Other studies investigating kinetic, physical properties and ability to withstand high pressure of LDH-B of two gadiformes support our calculations showing that the enzyme from the deep-sea species has a significant increased tolerance to pressure and higher thermostability [6]. To provide protection to A. carbo LDH-B from pressure and temperature denaturation, osmolytes might play an essential role. Experiments adding Trimethylamine-N-oxide (TMAO) to samples resulted in substantial increment of activity of LDH-B under conditions at which the enzyme was previously sensitive [6].

In the cytosolic malate dehygrogenase A (MDHc-A) there was a sharp decrease in protein stability of marine fishes going from shallower to deeper waters (no sequences of abyssal species were available). Lowest values are encountered in the sequences of the two euryhaline species included in the comparison (Figure 5). This progressive decrement in stability was associated to the larger contribution of aspartic acid (Table S3), also a promoter of thermal denaturation of the protein. The responses to pressure and temperature of soluble enzymes like LDH and MDH differ adaptively among species found at different depths.

The isoforms 1 and 2a of -skeletal actin of A. carbo showed very similar numerical values of compared to their homologues of deep-water (actin 1) and deep and abyssal water (actin 2a). Protein stability drops substantially when these two isoforms were compared with actin 2b, only present in abyssal species (Figure 5). Such reduction in stability was due to a higher number of hydrophobic amino acids contributing to the thermodynamic calculation (Table S3). Morita [7] reported that the substitutions of Gln137Lys and Ala155Ser generate a mechanism for stabilizing enzyme-substrate interactions under high pressure.

Comparing the stability of globin, -chain (Hb-A), in shallow, deep-water, and polar fish, the larger was found in polar and deep-water cods, followed by A. carbo before dropping to negative values for the shallower representatives of Antarctic and temperate environments (Figure 5). Stability of protein in A. carbo and its deep-water relatives was linked to larger contributions by leucine in the kinetic calculations promoting folding and thermal stability (Table S3). The stability as of globin sequences of the -chains (Hb-B) showed a very similar trend as in Hb-A (although there were no negative values), and the two deep-living species had very similar protein stability value. Variation in among environments was linked to the balanced contribution of hydrophobic amino acids promoting folding versus those lowering thermal stability (Table S3).

Although for key enzymes it was found that adaptive differences among species at different depths showed structural changes as well as structural stability (reviews in: [46]), several studies remarked on the importance of regulatory regions of the genome acting on gene regulation (see [46]). Promoter regions often work together with other cys-regulatory elements (e.g., transcription factors binding sites, enhancers, silencers, and insulators) to regulate the transcription and expression of mRNA in a specific tissue at different times and places throughout the genome. In addition, in regards to adaptation to depth, it is remarked the importance of osmolyte concentrations of methylamines (TMAO is the most relevant). These are protein stabilizers that counteract inhibition of proteins by hydrostatic pressure [4749].

Finally, we should also take into account the stress imposed on this fishes being captured at a depth range of 1100–1250 m and brought to surface in at least 3–5 hours. By the time specimens of black scabbardfish reach the surface they are already dead or dying; therefore the expression of some of the transcripts might be different if compared to a transcriptome of a fish sampled at a thousand meters depth. Notwithstanding, several experiments using model species have been targeting specific genes linked to stress factors [50] and this might prove useful in interpreting transcriptomes of animals undergoing stressful conditions before being sampled.

On the bases of this preliminary but encouraging results, we are planning to explore further deep water adaptivity employing different NGS sequence technology and experimental design, for instance using individual RADseq approach on a larger number of specimens caught at deep or shallow waters or by comparing sister species that live at different depth levels.

3.4. EST-SSR Resources for Population Genetics

In this study, a total of 153 EST-SSRs were detected in 142 contigs (3.87%) with a frequency of one EST-SSR per 27,69 kb sequence (Table 8). A further selection was made taking into account the type of repeats, size of the fragment, and quality of the primer sets, reducing the EST-SSR to 98. Some of those microsatellite markers were found in one tissue but not in others (16.3%), while others resolved to be found in more than one tissue (83.7%). Among the identified EST-SSRs, tri-nucleotide repeats represented the largest portion (40.8%), followed by di-nucleotide (28.6%), and tetra-nucleotide (25.7%), and only 2 EST-SSRs penta-nucleotide were identified (Table S4). Primer pairs have sizes ranging between 19 and 25 bp and melting temperatures from 57.1°C to 61.2°C, while expected PCR products are 100 to 280 bp in length. None of these EST-derived primer pairs have been tested for neutrality; therefore not all may be suitable for basic population genetic studies.

Searching itemNumbers

Total number of sequences examined7,920
Total size of examined sequences (bp)4,235,839
Total number of identified SSRs153
Number of SSR containing sequences142
Number of sequences containing more than 1 SSR9
Number of SSRs present in compound formation8

4. Conclusions

The transcriptome analysis of A. carbo, revealed a comprehensive set of genes expressed in six tissues, producing over 8,000 genes, 55.6% of which annotated by similarity to known proteins or nucleotide sequences from freely accessible database. The transcriptome of black scabbardfish sets the stage for expanding the genetic resources available while providing sets of genes that are likely tied with physiological adaptations to depth, particularly to low temperature and high pressure factors. This study represents the first transcriptome analysis for a deep-sea fish providing insights based on comparison analyses of homologous depth-related functional genes from shallow, deep-water, and abyssal fish highlighting similarities for A. carbo isozyme patterns and stability to other bathypelagic fishes. Direct sequence comparison suggested that the signals embedded in these functional genes reflect evolutionary divergence among taxa rather than any kind of enzyme adaptations. Organisms are adapted to deep-sea environment and their physiological tolerance may vary among taxa as well as their enzymatic activities under extreme conditions. Osmolyte concentrations as protein stabilizers that counteract inhibition of proteins by hydrostatic pressure [6, 4749] play a key role in deep-sea adaptation. Furthermore, contribution to adaptation is also provided by promoter regions that together with cys- and trans-regulatory elements work concertedly, at the mRNA transcription level, to drive the expression of a gene in a specific tissue or at a specific time [46].

The strong correlation detected between the values of standardized contigs frequency with the expression level of the gene by qPCR supports the use of our data to explore gene expression patterns in A. carbo.

Finally, considering the importance of this species for fishery management, a further exploration of this database has provided the characterization of new EST-SSR markers as additional resource for basic population genetic studies as well as tagging and mapping of genes.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Authors’ Contributions

Sergio Stefanni conceived the study and design, collected samples, participated in the data analysis, and drafted the paper. Raul Bettencourt contributed to gene validation experiments and discussion. Miguel Pinheiro and Gianluca De Moro developed the pipe-line analysis for all functional annotations and the SQL database. Lucia Bongiorni and Alberto Pallavicini participated in the structure, discussion, and paper drafting.


All molecular work was supported by the research Projects DEECON (European Science Foundation, under the EUROCORES programme, proposal no. 06-EuroDEEP-FP-008) and ReDEco (MarinERA programme funded by the EU FP6 ERA-NET Scheme, MARIN-ERA/MAR/0003/2008). Specimens of A. carbo were provided by the CONDOR Project (EEA Grants Financial Mechanism, Iceland, Liechtenstein and Norway, proposal no. PT0040). The authors are thankful to captain, crew, and technicians onboard of the R/V Arquipelago for their excellent contribution while at sea. Sergio Stefanni is a research fellow supported by the Marie Curie grant cofunded by the EU under the FP7-People-2012-COFUND; Cofunding of Regional, National and International Programmes, GA no. 600407 and the Bandiera Project RITMARE. IMAR/DOP is funded through the pluri-annual and programmatic funding scheme as research unit number 531 and associate laboratory number 9. The authors are also grateful to Conceição Egas for the support provided at Biocant. Last, but not least, the authors thank three anonymous reviewers for their comments and suggestions that greatly improved the paper.

Supplementary Materials

As supplementary material we included: amplification plots of all genes selected for this study in different tissue from Aphanopus carbo (Figure S1); a summary table of contig frequency in all tissues for those genes associated to depth not listed in Table 3 (Table S1); details on outputs related to the selection models for four teleost genes associated to depth (Table S2); values of Gibbs free energy calculations in terms of kinetic and thermodynamic quantities for all depth-related genes (Table S3); and the list of EST-SSR markers (Table S4).

  1. Supplementary Material


  1. G. N. Somero, “Adaptations to high hydrostatic pressure,” Annual Review of Physiology, vol. 54, pp. 557–577, 1992. View at: Publisher Site | Google Scholar
  2. J. J. Childress and M. H. Nygaard, “The chemical composition of midwater fishes as a function of depth of occurence off southern California,” Deep-Sea Research and Oceanographic Abstracts, vol. 20, no. 12, pp. 1093–1109, 1973. View at: Publisher Site | Google Scholar
  3. J. J. Childress and G. N. Somero, “Depth-related enzymic activities in muscle, brain and heart of deep-living pelagic marine teleosts,” Marine Biology, vol. 52, no. 3, pp. 273–283, 1979. View at: Publisher Site | Google Scholar
  4. G. N. Somero, “Biochemical ecology of deep-sea animals,” Experientia, vol. 48, no. 6, pp. 537–543, 1992. View at: Publisher Site | Google Scholar
  5. P. Sebert, “Fish at high pressure: a hundred year history,” Comparative Biochemistry and Physiology A, vol. 131, no. 3, pp. 575–585, 2002. View at: Publisher Site | Google Scholar
  6. A. A. Brindley, R. W. Pickersgill, J. C. Partridge, D. J. Dunstan, D. M. Hunt, and M. J. Warren, “Enzyme sequence and its relationship to hyperbaric stability of artificial and natural fish lactate dehydrogenases,” PLoS ONE, vol. 3, no. 4, Article ID e2042, 2008. View at: Publisher Site | Google Scholar
  7. T. Morita, “Structure-based analysis of high pressure adaptation of α-actin,” Journal of Biological Chemistry, vol. 278, no. 30, pp. 28060–28066, 2003. View at: Publisher Site | Google Scholar
  8. T. Morita, “Studies on molecular mechanisms underlying high pressure adaptation of α-actin from deep-sea fish,” Bulletin of Fisheries Research & Development Agency, vol. 13, pp. 35–77, 2004. View at: Google Scholar
  9. T. Morita, “Comparative sequence analysis of myosin heavy chain proteins from congeneric shallow- and deep-living rattail fish (genus Coryphaenoides),” The Journal of Experimental Biology, vol. 211, no. 9, pp. 1362–1367, 2008. View at: Publisher Site | Google Scholar
  10. T. Morita, “High-pressure adaptation of muscle proteins from deep-sea fishes, Coryphaenoides yaquinae and C. armatus,” Annals of the New York Academy of Sciences, vol. 1189, pp. 91–94, 2010. View at: Publisher Site | Google Scholar
  11. S. Hourdez and R. E. Weber, “Molecular and functional adaptations in deep-sea hemoglobins,” Journal of Inorganic Biochemistry, vol. 99, no. 1, pp. 130–141, 2005. View at: Publisher Site | Google Scholar
  12. D. W. Tucker, “Studies on Trichiuroid fishes—3. A preliminary revision of the family Trichiuridae,” Bulletin of the British Museum (Natural History) Zoology, vol. 4, pp. 73–131, 1956. View at: Google Scholar
  13. M. R. Martins, A. M. Leite, and M. L. Nunes, “Peixe-espada-preto. Algumas notas ácerca da pescaria do peixe-espada-preto,” Instituto Nacional de Investigação das Pescas, 1987. View at: Google Scholar
  14. FAO, “Fishstat,” FAO Fisheries Department, Fishery Information, Data and Statistics Unit, 2002. View at: Google Scholar
  15. M. Machete, T. Morato, and G. Menezes, “Experimental fisheries for black scabbardfish (Aphanopus carbo) in the Azores, Northeast Atlantic,” ICES Journal of Marine Science, vol. 68, no. 2, pp. 302–308, 2011. View at: Publisher Site | Google Scholar
  16. ICES, “Report of the Working Group on the Biology and Assssment of Deep-Sea Fisheries Resources (WGDEEP),” Tech. Rep. ICES CM 2008/ACOM :14, ICEC Headquarters, Copenhagen, Denmark, 2008. View at: Google Scholar
  17. I. Figueiredo, P. Bordalo-Machado, S. Reis et al., “Observations on the reproductive cycle of the black scabbardfish (Aphanopus carbo Lowe, 1839) in the NE Atlantic,” ICES Journal of Marine Science, vol. 60, no. 4, pp. 774–779, 2003. View at: Publisher Site | Google Scholar
  18. B. Morales-Nin and D. Sena-Carvalho, “Age and growth of the black scabbard fish (Aphanopus carbo) off Madeira,” Fisheries Research, vol. 25, no. 3-4, pp. 239–251, 1996. View at: Publisher Site | Google Scholar
  19. C. Longmore, C. N. Trueman, F. Neat et al., “Ocean-scale connectivity and life cycle reconstruction in a deep-sea fish,” Canadian Journal of Fisheries and Aquatic Sciences, pp. 1–12, 2014. View at: Publisher Site | Google Scholar
  20. S. Stefanni and H. Knutsen, “Phylogeography and demographic history of the deep-sea fish Aphanopus carbo (Lowe, 1839) in the NE Atlantic: vicariance followed by secondary contact or speciation?” Molecular Phylogenetics and Evolution, vol. 42, no. 1, pp. 38–46, 2007. View at: Publisher Site | Google Scholar
  21. S. H. Nagaraj, R. B. Gasser, and S. Ranganathan, “A hitchhiker's guide to expressed sequence tag (EST) analysis,” Briefings in Bioinformatics, vol. 8, no. 1, pp. 6–21, 2007. View at: Publisher Site | Google Scholar
  22. M. E. Hudson, “Sequencing breakthroughs for genomic ecology and evolutionary biology,” Molecular Ecology Resources, vol. 8, no. 1, pp. 3–17, 2008. View at: Publisher Site | Google Scholar
  23. H. Ellegren, “Sequencing goes 454 and takes large-scale genomics into the wild,” Molecular Ecology, vol. 17, no. 7, pp. 1629–1631, 2008. View at: Publisher Site | Google Scholar
  24. R. Bettencourt, M. Pinheiro, C. Egas et al., “High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus,” BMC Genomics, vol. 11, no. 1, article 559, 2010. View at: Publisher Site | Google Scholar
  25. R. A. Lesniewski, S. Jain, K. Anantharaman, P. D. Schloss, and G. J. Dick, “The metatranscriptome of a deep-sea hydrothermal plume is dominated by water column methanotrophs and lithotrophs,” The ISME Journal, vol. 6, no. 12, pp. 2257–2268, 2012. View at: Publisher Site | Google Scholar
  26. J. Wu, W. Gao, W. Zhang, and D. R. Meldrum, “Optimization of whole-transcriptome amplification from low cell density deep-sea microbial samples for metatranscriptomic analysis,” Journal of Microbiological Methods, vol. 84, no. 1, pp. 88–93, 2011. View at: Publisher Site | Google Scholar
  27. Y. Shi, J. McCarren, and E. F. Delong, “Transcriptional responses of surface water marine microbial assemblages to deep-sea water amendment,” Environmental Microbiology, vol. 14, no. 1, pp. 191–206, 2012. View at: Publisher Site | Google Scholar
  28. S. Stefanni, R. Bettencourt, H. Knutsen, and G. Menezes, “Rapid polymerase chain reaction-restriction fragment length polymorphism method for discrimination of the two Atlantic cryptic deep-sea species of scabbardfish,” Molecular Ecology Resources, vol. 9, no. 2, pp. 528–530, 2009. View at: Publisher Site | Google Scholar
  29. M. Margulies, M. Egholm, and W. E. Altman, “Genome sequencing in microfabricated high-density picolitre reactors,” Nature, vol. 437, pp. 376–380, 2005. View at: Google Scholar
  30. S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, “Basic local alignment search tool,” Journal of Molecular Biology, vol. 215, no. 3, pp. 403–410, 1990. View at: Publisher Site | Google Scholar
  31. J. D. Thompson, T. J. Gibson, F. Plewniak, F. Jeanmougin, and D. G. Higgins, “The CLUSTAL X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools,” Nucleic Acids Research, vol. 25, no. 24, pp. 4876–4882, 1997. View at: Publisher Site | Google Scholar
  32. N. Saitou and M. Nei, “The neighbor-joining method: a new method for reconstructing phylogenetic trees,” Molecular Biology and Evolution, vol. 4, no. 4, pp. 406–425, 1987. View at: Google Scholar
  33. J. Felsenstein, “Confidence limits on phylogenies: an approach using the bootstrap,” Evolution, vol. 39, pp. 783–791, 1985. View at: Google Scholar
  34. Z. Yang, “PAML 4: a program package for phylogenetic analysis by maximum likelihood,” Molecular Biology and Evolution, vol. 24, no. 8, pp. 1586–1591, 2007. View at: Publisher Site | Google Scholar
  35. S. Prashanth Kumar and M. Meenatchi, “Virtual quantification of protein stability using applied kinetic and thermodynamic parameters,” IIOAB Letters, vol. 1, pp. 21–28, 2011. View at: Google Scholar
  36. T. Thiel, W. Michalek, R. K. Varshney, and A. Graner, “Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.),” Theoretical and Applied Genetics, vol. 106, no. 3, pp. 411–422, 2003. View at: Google Scholar
  37. S. Rozen and H. Skaletsky, “Primer3 on the WWW for general users and for biologist programmers.,” Methods in Molecular Biology, vol. 132, pp. 365–386, 2000. View at: Google Scholar
  38. B.-Y. Lee, A. E. Howe, M. A. Conte et al., “An EST resource for tilapia based on 17 normalized libraries and assembly of 116,899 sequence tags,” BMC Genomics, vol. 11, article 278, 2010. View at: Publisher Site | Google Scholar
  39. J. C. Drazen and B. A. Seibel, “Depth-related trends in metabolism of benthic and benthopelagic deep-sea fishes,” Limnology and Oceanography, vol. 52, no. 5, pp. 2306–2316, 2007. View at: Publisher Site | Google Scholar
  40. K. M. Sullivan and G. N. Somero, “Enzyme activities of fish skeletal muscle and brain as influenced by depth of occurrence and habits of feeding and locomotion,” Marine Biology, vol. 60, no. 2-3, pp. 91–99, 1980. View at: Publisher Site | Google Scholar
  41. J. F. Siebenaller, G. N. Somero, and R. L. Haedrich, “Biochemical characteristics of macrourid fishes differing in their depths of distribution,” The Biological Bulletin, vol. 163, pp. 240–249, 1982. View at: Google Scholar
  42. T. J. S. Merrit and J. M. Quattro, “Evolution of the vertebrate cytosolic malate dehydrogenase gene family: duplication and divergence in actinopterygian fish,” Journal of Molecular Evolution, vol. 56, no. 3, pp. 265–276, 2003. View at: Publisher Site | Google Scholar
  43. T. Ikkai and T. Ooi, “The effects of pressure on F-G transformation of actin,” Biochemistry, vol. 5, no. 5, pp. 1551–1560, 1966. View at: Publisher Site | Google Scholar
  44. C. Verde, M. Balestrieri, D. de Pascale, D. Pagnozzi, G. Lecointre, and G. Di Prisco, “The oxygen transport system in three species of the boreal fish family Gadidae: molecular phylogeny of hemoglobin,” The Journal of Biological Chemistry, vol. 281, no. 31, pp. 22073–22084, 2006. View at: Publisher Site | Google Scholar
  45. H. J. Him, C. Steif, T. Vogl et al., “Fundamentals of protein stability,” Pure and Applied Chemistry, vol. 65, pp. 947–952, 1993. View at: Google Scholar
  46. H. E. Hoekstra and J. A. Coyne, “The locus of evolution: evo devo and the genetics of adaptation,” Evolution, vol. 61, no. 5, pp. 995–1016, 2007. View at: Publisher Site | Google Scholar
  47. P. H. Yancey, A. L. Fyfe-Johnson, R. H. Kelly et al., “Trimethylamine oxide counteracts effects of hydrostatic pressure on proteins of deep-sea teleosts,” Journal of Experimental Zoology, vol. 289, p. 172, 2001. View at: Google Scholar
  48. P. H. Yancey, W. R. Blake, and J. Conley, “Unusual organic osmolytes in deep-sea animals: adaptations to hydrostatic pressure and other perturbants,” Comparative Biochemistry and Physiology A: Molecular & Integrative Physiology, vol. 133, no. 3, pp. 667–676, 2002. View at: Publisher Site | Google Scholar
  49. P. H. Yancey, M. D. Rhea, K. M. Kemp, and D. M. Bailey, “Trimethylamine oxide, betaine and other osmolytes in deep-sea animals: depth trends and effects on enzymes under hydrostatic pressure,” Cellular and Molecular Biology, vol. 50, no. 4, pp. 371–376, 2004. View at: Google Scholar
  50. P. Prunet, M. T. Cairns, S. Winberg, and T. G. Pottinger, “Functional genomics of stress responses in fish,” Reviews in Fisheries Science, vol. 16, no. 1, pp. 157–166, 2008. View at: Publisher Site | Google Scholar

Copyright © 2014 Sergio Stefanni et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Related articles

No related content is available yet for this article.
 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder

Related articles

No related content is available yet for this article.

Article of the Year Award: Outstanding research contributions of 2021, as selected by our Chief Editors. Read the winning articles.