Analysis of Gene Expression in an Inbred Line of Soft-Shell Clams (Mya arenaria) Displaying Growth Heterosis: Regulation of Structural Genes and the NOD2 Pathway
Mya arenaria is a bivalve mollusk of commercial and economic importance, currently impacted by ocean warming, acidification, and invasive species. In order to inform studies on the growth of M. arenaria, we selected and inbred a population of soft-shell clams for a fast-growth phenotype. This population displayed significantly faster growth (), as measured by 35.4% greater shell size. To assess the biological basis of this growth heterosis, we characterized the complete transcriptomes of six individuals and identified differentially expressed genes by RNAseq. Pathways differentially expressed included structural gene pathways. Also differentially expressed was the nucleotide-binding oligomerization domain 2 (NOD2) receptor pathway that contributes to determination of growth, immunity, apoptosis, and proliferation. NOD2 pathway members that were upregulated included a subset of isoforms of RIPK2 (mean 3.3-fold increase in expression), ERK/MAPK14 (3.8-fold), JNK/MAPK8 (4.1-fold), and NFκB (4.08-fold). These transcriptomes will be useful resources for both the aquaculture community and researchers with an interest in mollusks and growth heterosis.
The soft-shell clam, Mya arenaria, is an infaunal, benthic marine bivalve that inhabits soft-sediments in the intertidal and shallow subtidal zone and ranges in the northwest Atlantic from North Carolina to Labrador [1–3]. M. arenaria is a member of the phylum Mollusca, ranging throughout the northern, boreal coastline spanning several continents . The clam is a filter-feeding bivalve, causing it to bioaccumulate environmental pollutants, and thus acts as a sentinel species . In Maine, M. arenaria is harvested commercially, and, in 2015, the soft-shell clam fishery was the second most important in dockside value ($22.5 million) behind the American lobster, Homarus americanus ($495.4 million) . Historically one of the most abundantly fished organisms caught along the coast of North America (over 1,200 metric tons in 2014 ), M. arenaria is prone to overharvest . In spite of their importance, clams and the entire phylum Mollusca are understudied and underrepresented in GenBank .
At present, the basis for differential growth rates (growth heterosis) in mollusks is poorly understood , but differential production of specific proteins has been shown to modulate growth. For example, it has been shown that overexpression of salmon growth hormone is sufficient to increase growth in rainbow trout  and that in cell culture of Pecten maximus L. digestive gland cells, insulin and IGF-I, but not EGF and bFGF, stimulated proliferation . In vertebrates, there are a number of genes, such as insulin-like growth factors (IGF) and IGF binding proteins, that have been found to be differentially expressed [12, 13]. Peterson et al. studied the differential mRNA expression of IGF-I and IGF-II in slow and fast-growth catfish . Studies examining the differential gene expression in a FG phenotype in Pacific oyster Crassostrea gigas [15, 16] identified genes overrepresented in screens of growth heterosis, 50% of which were ribosomal proteins, indicating the importance of translational regulation in fast growth . Several of these studies were on single growth factors, but Pace et al.  showed that a range of environmental and metabolic variables can affect growth heterosis and that these interactions are very complex. Other genes identified by Meyer and Manahan in fast-growth oysters included ATP synthase gamma, caveolin, and histone H2A . Thus, based on these previous studies in other organisms, we hypothesized that specific growth-related genes would be differentially expressed in the FG clams, such as those coding for growth factors and ribosomal proteins [9–12, 14, 15].
To that end, we developed a fast-growth inbred line of clams (M. arenaria) by classical selection. Siphon tissue was chosen because siphon growth is a reliable proxy for total growth  and because siphon can be easily dissected as clean, homogenous tissue. Differential gene expression analysis (RNAseq) of the selected F3 generation compared to unselected F1 clams was conducted following de novo transcriptome assembly. Real-time quantitative reverse-transcription polymerase chain reaction (qPCR) was used for comparison. The genes related to growth were examined in both a fast-growth (FG) inbred F3 line and an unselected (F1) line of M. arenaria. Pathways overrepresented in the screen were analyzed in depth.
Understanding the expression patterns of key genes will help illuminate the mechanisms involved in these processes in bivalves and have important implications for maintenance of this important food-stock. Furthermore, studies of the molecular events associated with the growth process have important implications for the balance between apoptosis and cell proliferation in growth and cancer.
2. Materials and Methods
M. arenaria adults were produced at the Downeast Institute for Applied Marine Research and Education (DEI) (Beals, Maine, USA), our shellfish production and research center, at The University of Maine at Machias, where this species is routinely cultured for stock enhancement programs in coastal communities. Large individuals were hand-selected and inbred for two generations, as described below. All FG clams in this study were from the F3 generation. To avoid batch effects, FG and F1 were subjected to identical field conditions and assigned random numbers upon harvest. The double blinding was maintained until grouping for data analysis.
Beginning in 2002 with wild stocks taken from eastern Maine, adults were spawned, and their larvae and juveniles reared at DEI. Juveniles (F1 generation) were placed in a field-based nursery through the summer and fall, and then overwintered seed was planted in April 2003 in protected field plots at an intertidal site in the town of Beals, Maine, USA.
2.2. Selection, Growth, and Survival
The initial size of the clams was measured at the “hatchery mark,” an area of pitted and gouged shell that forms a band near the umbo when hatchery raised clams are seeded into the wild. This mark has been shown to accurately reflect the size of the clam at seeding .
In June 2005, approximately 300 F1 animals were removed from the plots, and the 30 largest clams (size range = 50–55 mm shell length (SL)) were selected and stimulated to spawn. The juveniles from that spawning (F2 generation) were reared similarly and seeded in protected field plots at an intertidal site in the town of Cutler, Maine, in April 2006. In June 2008, approximately 300 animals were removed from the field plots in Cutler and another selection was made of the 30 largest clams (size range = 52–58 mm shell length). Those animals were stimulated to spawn, producing an F3 generation. The F3 larvae and juveniles were treated similarly through the summer and fall and then overwintered. Also, in June 2008, wild clams collected from a clam buying station in the town of Beals were stimulated to spawn, and these F1 individuals were treated identically through the summer, fall, and winter. On 29 May 2009, a field experiment was conducted at Duck Brook Flat in the town of Cutler, Maine, USA, near the low water mark to determine if growth and/or survival of the F3 stock were different than the F1 stock. Clams (10–12 mm SL) from the FG and F1 lines were added at a density of 1,320 m−2 separately to plastic horticultural pots (experimental units) filled with ambient sediments and pushed into the sediments to within 5 mm of the rim. One-half of the experimental units were covered with a protective flexible plastic netting (6.4 mm), while the other half had no covering of netting. This factorial combination of treatments was replicated 10 times, and the forty experimental units were arrayed in a single 8 × 5 matrix with 1 m spacing between rows and columns. After 201 days, the experiment was concluded (15 December 2009) when all units were removed from the flat, and the contents of each washed through a 2 mm sieve. All live and dead animals were counted. The initial and final SL were measured to the nearest 0.1 mm for each live clam, and the wet mass of each live individual was recorded to the nearest 0.001 g. All FG clams in this study are F3 generation. F1 clams in this study were reared in the hatchery under identical conditions but not hand-selected for size.
2.3. Clam Dissection
Clams from DEI were transported and stored at 4–10 degrees Celsius in plastic bags with moist paper towels in the bottom to prevent desiccation. Clams () were placed in a container and were immersed in 30 g/L MgCl2 in filter sterilized seawater for 5 minutes to anaesthetize the animals . Clams were then sprayed with 95% ethanol. The siphon sheath was removed from the siphon and the siphon was rinsed in the seawater and then in RNAse-free water. A cross section of the siphon tip, containing the fused incurrent and excurrent siphons, was removed and placed in a 2 mL tube, filled with RNAlater to preserve RNA integrity, and was stored at −20°C for less than two months.
2.4. Genome Sequences
Genomic sequence for M. arenaria was generated and kindly provided by Dr. Charles W. Walker of the University of New Hampshire and the Hubbard Center for Genome Studies at the University of New Hampshire. We loaded a local instance of NCBI BLAST with the contigs of the Mya arenaria genome to match transcriptomic contigs with their corresponding genomic contigs. tBLASTx was used to identify annotated genes on NCBI that most closely matched the contigs. CLCBio NGS Genomics Workbench (QIAgen, Hilden, Germany) was used to align the genomic and the transcriptomic contigs so as to identify intron and exon regions of the genes of interest when needed.
2.5. RNA Prep for RNAseq
Inner siphon tips were dissected from six individual clams, three F1 and three F3 FG clams. Tissue was homogenized in the TissueLyser LT (QIAgen, Hilden, Germany) and was run at 50 Hz for six minutes. RNA was purified with TRIzol using RNeasy Fibrous Tissue Mini Kit (QIAgen). RNA was assessed for quality by Bioanalyzer (Agilent, Santa Clara, CA, USA) at Mount Desert Island Biological Laboratory (Salisbury Cove, ME, USA). Illumina TruSeq RNA sequencing stranded library construction and transcriptome sequencing were conducted at the Delaware Biotechnology Institute at the University of Delaware using an Illumina HiSeq2000 according to manufacturer’s specifications.
2.6. RNAseq Analysis
FastQC was utilized to assess quality scores of RNAseq reads. Quality scores indicated no trimming to be necessary. De novo partial assembly was done using CLCBio. CLCBio NGS Genomics Workbench (v.6.0) (QIAgen) was used to compare transcript abundance between F1 and F3 transcriptomes. Expression value normalization is based on the reads per kilobase per million mapped reads (RPKM) to compensate for read length [22, 23]. The variance in transcripts between the FG (F3) selected transcriptome and the F1 transcriptome was analyzed using normalized Baggerley’s test. A Bonferroni correction for multiple comparisons was applied subsequent to Baggerley’s test for a more stringent screen. The cutoff point was Baggerley’s/Bonferroni less than or equal to 10−7.
Differentially expressed transcripts were categorized as matching annotated genes, as genes that are uncharacterized, or as having no matches. Uncharacterized genes were contigs that matched published genes in GenBank whose identity and function are not known. Placements were based on open tBLASTx searches of the GenBank database with an -value threshold of . After identifying a pathway that was differentially expressed, the expression level of other members of the pathway was examined by creating a local BLAST database of all Mya contigs and searching for orthologues (mollusk sequences when possible, but more often other invertebrate or mammalian sequences were used when no molluscan orthologues were found in GenBank). For this post hoc screen, Baggerley’s/Bonferroni was used.
The volcano plot was generated using Bioconductor  (v.3.2). The RNAseq Illumina reads from the current project have been submitted to the NCBI SRA (Sequence Read Archive), BioProject accession number: PRJNA221373.
2.7. RNA Preparation for qPCR
A sample of siphon tip tissue of about 30 mg was placed in a 2 mL microcentrifuge tube along with a 5 mm stainless steel bead and 300 μL of TRIzol reagent. The tubes were placed in the TissueLyser LT (QIAgen) and were run at 50 Hz for six minutes. The RNA was extracted using QIAgen RNAeasy Fibrous Tissue kit. 10 μL DNase (Ambion, Life Technologies, Carlsbad, CA, USA) stock solution was added to prevent genomic DNA contamination. Quality was assessed by QIAExcel (QIAgen) and/or by agarose gel. Purified RNA was assayed by Nanodrop (Thermo Scientific, Waltham, MA, USA) and was frozen at −80°C, typically with RNasin RNAse (Promega Life Sciences, Madison, WI, USA) inhibitor.
2.8. cDNA Preparation
Two-step qPCR was performed for each sample: reverse-transcription followed by qPCR. A sample of the RNA was diluted to 100 ng/μL and 1 μL was then used in the cDNA reaction. The reaction was conducted with a ProtoScript II first strand cDNA synthesis kit (NEB, Ipswitch, MA, USA) that was used according to the manufacturer’s specifications.
2.9. qPCR Reference Gene Validation and qPCR Conditions
The protocol and settings for the reference genes were optimized for temperature, primer concentration, and cDNA concentration and these genes were then run using cDNA from F1 clams and FG inbred line clams. Efficiency and linearity of linear fit for the cDNA concentration standard curve were assessed using standard techniques suggested by Bio-Rad (Hercules, CA, USA) . Efficiency was calculated as and linearity of fit was assessed as .
A subset of housekeeping genes from Araya et al.  was assessed for suitability as reference genes: actin gamma, elongation factor 1, ribosomal protein s-18 (RPS), and ubiquitin. PCR quantitation for these genes utilized primer sequences from Araya et al.  and they were synthesized by Integrated DNA Technologies (Coralville, Iowa). Genes were assessed for use as reference genes, along with candidate genes for differential expression.
Primers for experimental genes such as BIRC2 were designed using NCBI primer BLAST, under default settings, with the assembled transcriptome used as a PCR template, and were further BLASTed against a genomic assembly for Mya arenaria to test for primer specificity in our organism of interest, although the lack of annotated genome does not allow us to exclude the possibility of pseudogenes or to design exon-spanning primers to limit amplification of some genomic DNA. Primers were synthesized by Integrated DNA Technologies (Coralville, Iowa, USA).
A series of controls were run to ensure qPCR optimization and accuracy. Every run included at least one well containing a no template control (NTC) to control for primer dimer formation and gDNA contamination. Agarose gels and melting curves were run with the products of each gene of interest to ensure that the primers were amplifying a single product. RNA purity and integrity was assessed via Thermo Scientific NanoDrop 2000c spectrophotometer and/or QiAxcel Advanced, using manufacturer specifications. Standard concentrations of primers were kept through all of the experiments, and template concentration was assessed Via NanoDrop and then standardized. qPCR was conducted using SYBR green master mix (Promega). CT values, mean, standard deviation, and melting curves were generated by the instrument software CFX Manager (Bio-Rad). The CT values between technical replicates were found to be consistent by coefficient of determination () and efficiency. Reference genes, whose expression levels were constant between F3 (FG) and F1 individuals, were used to normalize the data. For some runs, alien RNA from Alien QRT-PCR Inhibitor Alert (Agilent Technologies, Santa Clara, CA) was used as per instructions to control for inhibitors in the M. arenaria cDNA. All qPCR runs were conducted on a Bio-Rad MiniOpticon; the reference dye HEX with a sample run was used to control for laser variation of the MiniOpticon. The alien RNA, the RPS standard, the no reverse transcriptase control (NRC), and no template controls (NTC) using the same primer set were analyzed together in the same 48-deep well plate (Bio-Rad) in order to minimize run-to-run variations. The threshold level generated by the instrument curves was manually evaluated for each run and adjusted to meet the linear portion of the curve for determination of the threshold cycle values (CT). Parallel samples were processed using the same batch of reagents to minimize sample-to-sample variations.
2.10. Annotation and Gene Ontology
Gene annotation was carried out using the BLAST2GO program , FASTA-formatted sequences representing the unique upregulated transcripts were uploaded to the program, and BLASTX or BLASTn searches were carried out. Some data mining was performed using BLASTX through a CLCBio workflow (QIAgen). Gene Ontology for candidate genes was assessed using AmiGo at Gene Ontology (http://www.geneontology.org/), GoTermFinder , UniProt (http://www.uniprot.org/), and GO Slim (http://go.princeton.edu/cgi-bin/GOTermMapper) . The NOD2-pathway figure was drawn using Pathvisio (http://www.pathvisio.org/).
The network analysis was conducted with the software STRING v.10 . We chose the top 100 most BLAST hits with the greatest expression difference. We searched for network interactions using the closest annotated genome in STRING, the purple sea urchin, Strongylocentrotus purpuratus (BioProjects # PRJNA13728, PRJNA56067, and PRJNA10736). We filtered the STRING v.8 human interactome to include only interactions which had a confidence score ≥0.4 (medium stringency).
3.1. Growth Heterosis
We examined the possibility of enhancing shell growth in M. arenaria through classic selective breeding. Selection resulted in a clam inbred line that displayed growth heterosis as assessed by shell length (SL) (Figure 1).
Growth, measured as mean final SL, was 35.4% greater in the F3 versus F1 line ( versus ) (Figure 1(a)). Also, mean percent survival was greater in the F3 versus F1 line by 126.4% ( versus ) (Figure 1(b)). Survival was not enhanced by netting () due to the accidental inclusion of green crab juveniles in seven of the ten netted units for both F1 and F3 line treatments. No differences were observed between selection lines in the mass-length relationship suggesting that selection for increased rate of shell production did not negatively interfere with tissue mass or growth. Notably, when released from artificial selection, in less than two generations, the FG inbred line was no longer significantly larger than the matched F1 population (dns).
Complete transcriptomes were sequenced from three F1 individuals and three FG individuals. The Phred quality scores from the paired-end reads were above 30 to 150 bp, indicating a base call accuracy of at least 99.9% (Figure S, in Supplementary Material available online at http://dx.doi.org/10.1155/2016/6720947). In the absence of a reference genome, de novo assembly was performed. A total of 122,012,641 matched, paired-end reads were available for contig assembly and mapping, with a median length of 142 bp (Figure S). Assembly utilized a cutoff of ≥200 bp to minimize low information assemblies. The assembly resulted in an average of 79,470 contigs with a maximum size of 25,395 bp and N50 of 1037 bp.
We identified 415 differentially expressed genes between the F1 and FG clams (Bonferroni value < 0.05). A volcano plot showed that the most highly differentially expressed genes were downregulated (Figure S). The genes were sorted by Bonferroni value and then sorted by fold-expression difference between F1 and FG inbred lines. More than 50% of the contigs had no useable BLAST hits (BLAST cutoff ) due to noncoding transcripts, transcriptome contigs that only represented untranslated regions, and poor sequence database representation of mollusks. Of the 415 unique, significantly differentially expressed transcripts in FG clams, putative annotation could be determined for 162 based on sequence similarity by BLASTX searches while the majority had no significant similarity to protein sequences in the nr database (cutoff -value = ). Genes with the largest positive and negative expression differences were often structural genes (Table 1).
3.3. Real-Time Quantitative PCR
To validate the differential expression observed by RNAseq, we chose an upregulated gene from our RNAseq data: baculoviral IAP repeat-containing protein 2 (BIRC2), (also known as c-IAP or IAPOP1). BIRC2 was of interest due to its position in the upregulated NOD2 pathway, as well as its physiological role in growth, immunity, and apoptosis. Comparing FG F3 individuals with F1 individuals, BIRC-2 demonstrated a CT (SD) = 7.6 (2.02). This result was consistent with the differential expression seen in our RNAseq, but the fold-difference was smaller when assayed by qPCR. The ribosomal protein S3A (RPS) was chosen as a reference gene, based on CT below 25, constant expression between individuals and between experimental and control groups, consistent melt temperature, and amplification efficiency (Table S). Between individuals, both FG and WT, the fold change was 0.3 1.4 (Table S).
3.4. Gene Ontology and Network Analysis
In order to categorize the function of the genes in our differential expression of RNAseq screen, we took 484 genes with the highest differences in expression between F3 FG and F1 clams. Of those genes with useable BLAST hits, 19% had Gene Ontology related to cell structure, 17% related to signaling or growth, 10% related to nutrient metabolism, 10% related to synthesis of critical macromolecules, and 2% related to energy metabolism (Figure 2).
Metabolic genes represented 21% of the characterized genes. For example, fatty acid synthase transcripts were 22-fold higher in the FG individuals (). By manual inspection, we also determined that five of the transcripts for the NOD-like receptor signaling pathway were differentially expressed in the screen (Figure 3).
A post hoc search of our RNAseq results for other pathway members yielded three that were consistently and significantly upregulated (NFκB, JNK, and ERK), one that had multiple isoforms upregulated but none of which reached significance singly (RIPK2), and eight other related proteins that were not significantly differentially expressed (ERBIN, iKKB, CASP8, TAK1, TRAF6, TRIP6, SGT1, and CARD6). One intermediate member of the pathway was not represented: no BLAST hits corresponded to TRAF.
The cluster analysis in STRING draws association data from several databases, including the Kyoto Encyclopedia of Genes and Genomes (KEGG) , and was used to establish genes sharing a common biological pathway (Figure 4).
The most complete network of differentially expressed genes was involved in cytoskeletal processes and protein translation.
We demonstrated that the F3 generation of classical selection was sufficient to generate transient growth heterosis. To the best of our knowledge, this is the first-ever demonstration of artificial selection for any morphological attribute in Mya arenaria. Coastal communities incorporating stock enhancement with cultured soft-shell clam juveniles may enjoy greater production in areas seeded with animals that have been genetically selected for fast growth.
Because we were interested in the physiological origin of the growth heterosis, we chose to assay for differential gene expression. Over just three generations, we did not expect heritable changes. In addition, once the selection was released, the clam size reverted to the mean in F4. This suggests that we have been selecting for higher gene expression.
The preponderance of the differentially expressed genes in the FG clams was structural genes. Curiously, in the FG clams, these genes are strongly and consistently downregulated. For example, myosins, actins, microtubules, and several related genes appear in the screen downregulated. One might expect that production of a number of the building blocks for growth of the organism would be increased to meet demand of growth—or, at least, maintained as a housekeeping function. However, there are numerous studies where strong variation in actin gene expression has been seen (e.g., [31–33]). In our hands, the RT-qPCR actin in the soft-shell clams proved to have significant individual variation and was rejected as a normalization gene (Table S). Furthermore, in growth states—particularly cancer—there are numerous examples of structural genes differentially regulated , although upregulation is perhaps more common, particularly with wounding or remodeling [35–37].
About a quarter of the characterized differentially expressed genes were metabolic genes. For example, two isoforms of fatty acid synthase, ATPase and elongation factor 2 (EF2), are all represented in the most significantly differentially expressed genes. This is consistent with other studies on growth heterosis that emphasize the importance of protein synthesis genes and protein processing  and turnover [17, 38].
The genes in the NOD-like receptor signaling pathway were overrepresented in the differential expression screen. The NOD pathway forms an interesting crossroads between innate immunity, growth, and apoptosis. Unfortunately, we were unable to find the sequence of some critical NOD2 players in the transcriptome, even though some have been found previously in scallops . RIPK2 is an interesting gene in this analysis because it is a convergence point for upstream genes that are differentially expressed. The screen pulled up seven contigs that mapped to RIPK2 most of which were not significant but six of the seven showed upregulation and near significance. In addition to contributing to growth, the NOD2 pathway leads to transcription of proinflammatory cytokines via TNF-alpha and NFκB . The pathway has been implicated in the inflammatory bowel condition Crohn’s Disease [41–43] and in cancers, particularly colorectal cancer [42, 44]. Other immune-related genes differentially expressed include a pathogenesis-related protein that was upregulated eightfold and interferon alpha-inducible protein that was downregulated 19-fold. The upregulation of immunity pathways is particularly interesting given the increased survival in the fast-growth F3 individuals seen in Figure 1(b). These results suggest that the regulatory genes in the NOD-like receptor signaling pathway may play a role in growth but we have no way of determining cause and effect from these data. Alternatively, it is possible that the upregulation of the innate immunity prevents pathogen invasion that would have otherwise limited growth; conversely, it is possible that the higher growth rate results in more pathogen exposure, which in turn upregulates the innate immune pathway.
We chose to validate BIRC2 by qPCR because it is an upregulated member of the NOD2 pathway at a crossroads between growth, immunity, and cancer, yet BIRC2 has been reported to have no phenotype when knocked down in C. elegans . This lack of phenotype is in part because BIRC2 appears to be functionally redundant with BIRC1 in mouse knockouts, though their regulation depends on cell type .
The connection between growth pathways and cancer pathways is not unexpected. Interestingly, recent work has shown that M. arenaria is one of only three organisms shown to be susceptible to transmissible cancers. Metzger et al.  identified a line of clonal M. arenaria cells that are at least partially responsible for the high prevalence of hemocyte cancers in clams along the coast of the Northeast. It is possible in current siphon liquid or perfused hemocytes could contain high copy number of cancerous hemocytes. However, because we used washed siphon tissue for the transcriptome, we do not anticipate significant artifactual RNA from possible cancer cells.
As a nonmodel organism, Mya arenaria presents obstacles to analysis. Genomic data is limited and nucleotide divergence in mollusks is high . The high number of unannotated or uncharacterized genes in the screen limits the scope of our interpretations, primarily due to noncoding genes and having some fragmented transcripts where the contigs only contain the UTR portion of the transcripts. The representation of mollusks in the NCBI database is low, particularly some of the bivalves of interest . There may be overrepresented genes or pathways that are not well annotated. For this reason, network analysis must be interpreted conservatively.
To analyze organismal gene expression, we turned to high-throughput transcriptome analysis (RNAseq). Lack of representation in the database of annotated mollusks prevented us from identifying over half of the transcripts.
We determined that RPS3A served as a stable gene in the qPCR analysis and chose BIRC2 as a differentially expressed gene to analyze. Between individuals, both FG and WT, the RPS showed low variability with a single melt peak and a single band by DNA analyzer. RPS3A produces a ribosomal protein that is a component of the small ribosomal subunit. It is a member of the S3AE family of ribosomal proteins and is located in the cytoplasm. In the realm of FG phenotype, RPS3A is an interesting gene. On one hand, it has appeared in screens for genes highly associated with growth heterosis  but RPS3A has also been used in screens of FG cells, especially cancer, as a housekeeping gene, as seen in a meta-analysis by Popovici et al. .
The upregulation of the BIRC2 gene seen in both the RNAseq screen and the qPCR was of particular interest because the gene product BIRC2 lies at a crossroads between growth, cancer, and immunity. The sevenfold increase in BIRC-2 expression assayed by qPCR was consistent with the differential expression seen in the RNAseq, but the fold-difference was smaller, a result that has been seen in other RNAseq/qPCR comparisons  and in part reflects the difference in the dynamic range of the two methods . The BIRC-2 knockout has been reported to have no phenotype [45, 52], so future interventions combining a knockdown with an immune challenge could prove instructive in evaluating the interactions between growth and innate immunity and in evaluating the partial functional redundancy of BIRC1 and BIRC2.
Our sequence database contributions and annotation will serve to improve the bivalve representation in GenBank (BioProject accession # PRJNA221373, SRA accession numbers # SAMN02361211-16). Our results show that suites of genes involved in structural remodeling, signaling, and apoptosis correlate with a fast-growth phenotype. Functional analysis of some of these genes, such as BIRC-2, will inform analysis of growth regulation in these ecologically and economically important species. Since these genes lie at the crossroads of immunity, growth, and cancer, there are a range of biomedical implications. In addition, elucidation of growth in bivalves can have implications for conservation and policy for M. arenaria.
The authors declare that there are no competing interests regarding the publication of this paper.
The authors thank Dr. Gerard Zegers, Benjamin King, and Dr. Charles Walker for critical reading of the manuscript. Access to the partially assembled M. arenaria genome was kindly provided by Dr. Charles Walker, U. New Hampshire. The Downeast Institute for Applied Marine Research and Education and Kyle Pepperman provided F1 and FG clams. Benjamin King, Staff Scientist, MDI Biological Laboratory, conducted the initial transcriptome assembly. Bruce Kingham, Director, U. Delaware Sequencing & Genotyping at the Delaware Biotechnology Institute, was responsible for library construction and transcriptome sequencing. Research reported in this publication was supported by an Institutional Development Award (IDeA) from the National Institute of General Medical Sciences of the National Institutes of Health under Grant no. P20GM0103423. In-kind support was provided by The University of Maine at Machias. Information technology and server support was provided by Michael Matis and grants administration was kindly provided by the Mount Desert Island Biological Laboratory and by Dr. Sherrie Sprangers and Mr. Thomas Potter at U. Maine Machias.
Supplemental material includes: RNAseq quality control data, PCR conditions and data, and a plot of global transcriptional changes seen in our RNAseq.
B. F. Beal, M. R. Parker, and K. W. Vencile, “Seasonal effects of intraspecific density and predator exclusion along a shore-level gradient on survival and growth of juveniles of the soft-shell clam, Mya arenaria L., in Maine, USA,” Journal of Experimental Marine Biology and Ecology, vol. 264, no. 2, pp. 133–169, 2001.View at: Publisher Site | Google Scholar
B. F. Beal and M. Gayle Kraus, “Interactive effects of initial size, stocking density, and type of predator deterrent netting on survival and growth of cultured juveniles of the soft-shell clam, Mya arenaria L., in eastern Maine,” Aquaculture, vol. 208, no. 1-2, pp. 81–111, 2002.View at: Publisher Site | Google Scholar
Commericial Fishing Landings Data: Maine Department of Marine Resources, 2015, http://www.maine.gov/dmr/commercial-fishing/landings/index.html
“Annual Commercial Landing Statistics, National Marine Fisheries Service, NOAA Office of Science and Technology,” 2016, https://www.st.nmfs.noaa.gov/commercial-fisheries/commercial-landings/annual-landings/indexView at: Google Scholar
B. C. Peterson, G. C. Waldbieser, and L. Bilodeau, “IGF-I and IGF-II mRNA expression in slow and fast growing families of USDA103 channel catfish (Ictalurus punctatus),” Comparative Biochemistry and Physiology A: Molecular and Integrative Physiology, vol. 139, no. 3, pp. 317–323, 2004.View at: Publisher Site | Google Scholar
B. C. Peterson, G. C. Waldbieser, and L. Bilodeau, “IGF-I and IGF-II mRNA expression in slow and fast growing families of USDA103 channel catfish (Ictalurus punctatus),” Comparative Biochemistry and Physiology Part A: Molecular & Integrative Physiology, vol. 139, no. 3, pp. 317–323, 2004.View at: Publisher Site | Google Scholar
D. Hedgecock, D. J. McGoldrick, D. T. Manahanb, J. Vavrab, N. Appelmansb, and B. L. Baynec, “Quantitative and molecular genetic analyses of heterosis in bivalve molluscs,” Journal of Experimental Marine Biology and Ecology, vol. 203, pp. 49–59, 1996.View at: Google Scholar
D. A. Pace, A. G. Marsh, P. K. Leong, A. J. Green, D. Hedgecock, and D. T. Manahan, “Physiological bases of genetically determined variation in growth of marine invertebrate larvae: a study of growth heterosis in the bivalve Crassostrea gigas,” Journal of Experimental Marine Biology and Ecology, vol. 335, no. 2, pp. 188–209, 2006.View at: Publisher Site | Google Scholar
B. F. Beal, R. Bayer, M. G. Kraus, and S. R. Chapman, “A unique shell marker in juvenile, hatchery-reared individuals of the softshell clam, Mya arenaria L.,” Fishery Bulletin, vol. 97, no. 2, pp. 380–386, 1999.View at: Google Scholar
M. T. Araya, A. Siah, D. Mateo et al., “Selection and evaluation of housekeeping genes for haemocytes of soft-shell clams (Mya arenaria) challenged with Vibrio splendidus,” Journal of Invertebrate Pathology, vol. 99, no. 3, pp. 326–331, 2008.View at: Google Scholar
K. Dheda, J. F. Huggett, S. A. Bustin, M. A. Johnson, G. Rook, and A. Zumla, “Validation of housekeeping genes for normalizing RNA expression in real-time PCR,” BioTechniques, vol. 37, no. 1, pp. 112–119, 2004.View at: Google Scholar
D. Branquinho, P. Freire, C. Sofia, and S. De Gastroenterologia, “NOD2 mutations and colorectal cancer—where do we stand?” World Journal of Gastrointestinal Surgery, vol. 8, no. 4, pp. 284–293, 2016.View at: Google Scholar
M. E. Guicciardi, N. W. Werneburg, S. F. Bronk et al., “Cellular Inhibitor of Apoptosis (cIAP)-mediated ubiquitination of phosphofurin acidic cluster sorting protein 2 (PACS-2) negatively regulates tumor necrosis factor-related apoptosis-inducing ligand (TRAIL) cytotoxicity,” PLoS ONE, vol. 9, no. 3, Article ID e92124, 2014.View at: Publisher Site | Google Scholar