About this Journal Submit a Manuscript Table of Contents
International Journal of Genomics
Volume 2013 (2013), Article ID 261730, 10 pages
http://dx.doi.org/10.1155/2013/261730
Research Article

Characterization of the OmyY1 Region on the Rainbow Trout Y Chromosome

1School of Biological Sciences, Washington State University Vancouver, 14204 NE, Salmon Creek Avenue, Vancouver, WA 98686-9600, USA
2Center for Reproductive Biology, Washington State University, Pullman, WA 99164-7520, USA
3School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, USA
4US Geological Survey, Western Fisheries Research Center, 6505 NE 65th Street, Seattle, WA 98115, USA
5Biology Department, Reed College, 3203 SE Woodstock Boulevard, Portland, OR 97202-8199, USA

Received 18 September 2012; Revised 26 December 2012; Accepted 17 January 2013

Academic Editor: G. Pesole

Copyright © 2013 Ruth B. Phillips et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

We characterized the male-specific region on the Y chromosome of rainbow trout, which contains both sdY (the sex-determining gene) and the male-specific genetic marker, OmyY1. Several clones containing the OmyY1 marker were screened from a BAC library from a YY clonal line and found to be part of an 800 kb BAC contig. Using fluorescence in situ hybridization (FISH), these clones were localized to the end of the short arm of the Y chromosome in rainbow trout, with an additional signal on the end of the X chromosome in many cells. We sequenced a minimum tiling path of these clones using Illumina and 454 pyrosequencing. The region is rich in transposons and rDNA, but also appears to contain several single-copy protein-coding genes. Most of these genes are also found on the X chromosome; and in several cases sex-specific SNPs in these genes were identified between the male (YY) and female (XX) homozygous clonal lines. Additional genes were identified by hybridization of the BACs to the cGRASP salmonid 4x44K oligo microarray. By BLASTn evaluations using hypothetical transcripts of OmyY1-linked candidate genes as query against several EST databases, we conclude at least 12 of these candidate genes are likely functional, and expressed.

1. Introduction

Salmonid fishes have the XX/XY system of sex determination [1]. Most rainbow trout have morphologically distinguishable sex chromosomes [2], with the short arm being longer in the X chromosome than the Y, primarily due to presence of 5S rDNA sequences on the X chromosome (reviewed in [3]). Some wild fish and several male clonal lines, including the Swanson YY and Arlee YY clones have a Y chromosome that resembles the X chromosome in having a longer short arm including the 5S rDNA sequences [4]. This fact and the viability seen in rainbow trout and Chinook salmon YY individuals [5, 6] suggest that the X and Y chromosomes share considerable genetic content.

The SEX locus is not found on a common linkage group in Pacific salmon and trout, but rather each species has the SEX locus on a different linkage group as shown by genetic mapping and localization of male-specific markers using in situ hybridization with clones to GH-Y [79]. (Rainbow and cutthroat trout are an exception to this [10], and hybrids between these two species are interfertile.) Although it is possible that each species has a different master sex determining gene, this is unlikely because there are several male-specific markers shared by the Oncorhynchus species. These include OmyY1, found in all Pacific trout and salmon [11], OtY2 [12] which is related to OmyY1, and GH-Y [13], which are found in most species of Pacific salmon, and OtY1, found only in Chinook salmon [14]. These results support the hypothesis that a small chromosomal segment containing the SEX locus, or the small short arm containing the SEX locus, is transposing to a new chromosome in each of these species (reviewed in [9, 15]).

The OmyY1 genetic marker, which has been used to sex fish from all of the Oncorhynchus species, was isolated from lambda libraries prepared from male fish of both rainbow trout (OSU × Hotcreek) and Chinook salmon (where the region is referred to as OtY3) [11]. Recently, evidence has been presented supporting the hypothesis that an immune function related-gene, sdY, which is found within the rainbow trout OmyY1 lambda clone, may be the master sex determining gene for many species in the Salmoninae [16]. We present evidence based on sequence information of BAC clones isolated from the OmyY1 region of rainbow trout that there are a number of sex-linked genes shared between the X and Y chromosomes, implying the sex chromosomes are in the early stages of differentiation. Some of the genes in the region adjacent to sdY show male-specific expression.

2. Materials and Methods

2.1. Doubled Haploid Rainbow Trout Clonal Lines and Crosses

Type I loci and other markers were sequenced in four doubled haploid parental lines produced by androgenesis (female Oregon State University (OSU), female Whale Rock (WR), male Swanson (SW), and male Arlee (AR)). Markers were mapped in a cross of (OSU × AR) to determine concordance with phenotypic sex. These fish were previously used to generate a dense genome-wide linkage map [1719] to analyze quantitative trait loci for a variety of traits [20, 21] and to characterize linkage of markers on the Y chromosome [4, 22]. In all crosses, sex phenotype was determined by internal examination of gonads.

2.2. BAC Library Screening for OmyY1 Marker Associated BACs

High density filters corresponding to the 5X BAC DH YY male Swanson (SW) library (EcoR1 set) for rainbow trout were screened using [ ] dCTP-labeled amplicons from the OmyY1 region. For screening the filters, two probes were generated by PCR amplification of SW genomic DNA using previously described primers to the OmyY1 region [11]. These amplicons were TOPO TA cloned and transformed in JM107 E. coli. After identification and isolation of OmyY1+clones, plasmid DNA was extracted and used as a probe to screen the filters. The product was purified (Qiagen) and then labeled with -dCTP (Amersham) using Ready-to-Go beads (GE Healthcare) according to the manufacturer’s suggestions. The labeled probe was purified using G-50 columns (ProbeQuant, GE Healthcare) and then used for hybridization. Filters were hybridized (5X SSC, 1% SDS, 0.5% sodium pyrophosphate, 0.5% nonfat milk and 10% dextran sulfate at ) and then washed (0.5% SSC/0.5% SDS, ) under stringent conditions. Positive clones were confirmed by PCR using primer sets corresponding to OmyY1. This resulted in many hits, but only the strongest signals were further evaluated by PCR and FISH. BACs were named according to the coordinates on the original plate. Clones were obtained from the National Center for Cool and Cold Water Aquaculture, ARS-USDA as stab cultures. BAC DNA for PCR confirmation was prepared from selected clones grown in 300 mls LB broth + chloramphenicol (12.5 μg/mL) overnight at 37C with shaking at 250 rpm, and DNA was isolated using a Qiagen Plasmid Maxi Kit.

2.3. PCR Screening of OmyY1 Marker Associated BACs

Further PCR screening of the selected clones was performed in a PTC100 or 200 Thermalcycler (MJ Research) using primers noted above. Reactions were carried out in 10 μL volumes containing 0.5 U Taq DNA Polymerase (GenScript), .4 uM primers, 1 μL 10x PCR buffer containing 1.5 mM MgC , 200 μM dNTPs, and 1 μL of 40 ng BAC DNA template. PCR conditions comprised of an initial denaturation step of for 5 min; 35 cycles with a denaturation step of for 30 sec, 45 sec at specific annealing temperature, and extension at for 45 sec followed by a final extension at for 5 minutes. PCR products were separated on a 1.2% agarose gel containing .5x TBE and GelRed Nucleic Acid stain (Phenix). The DNA fragments were visualized using a Gel Logic 100 system and UV trans-illuminator (Fisher Biotech).

2.4. Assembly of the OmyY1 BAC Contig

BAC clones RE223F6 and RE143K8 were confirmed using OmyY1-specific PCR primers and sent to Ming Chen Luo at University of California, Davis for assembly of a contig with the Swanson (SW) physical map [23]. Contig #6256 was identified as encompassing the OmyY1 BACs and included 60 additional BACs. Eleven BACs spanning the contig were ordered to use for FISH and end-sequencing and included RT399M07, RT278E18, RT423A24, RT399C14, RT439A05, RT578J04, RT450D22, RT015B10, RT492A21, RT004L12, and RT225N07. After BAC DNA isolation as described above, and determination of DNA concentration, aliquots were sent to the WSU Pullman Molecular Biology Core for end-sequencing using either T7, SP6, or M13 primers.

2.5. Localization of OmyY1 BAC Clones on Rainbow Trout Chromosomes

Blood was cultured from rainbow trout using standard methods [4]. Rainbow trout BAC DNA was labeled with Spectrum Orange using a nick translation kit (Abbott Molecular). Human placental DNA (2 μgs) and Cot-1 DNA (1 μg, prepared from rainbow trout) were added to the probe mixture for blocking. Hybridizations were carried out at overnight and posthybridization washes were as recommended by the manufacturer (Abbott Molecular) with minor modifications [24]. Antibodies to Spectrum Orange (Molecular Probes) were used to amplify the signal. Slides were counterstained with 4,6-diamidino-2-phenylindole (DAPI) at a concentration of 125 ng DAPI in 1 mL antifade solution (Abbott Molecular). Images were captured with a Jai camera and analyzed with Cytovision Genus (Applied Imaging, Inc.) software. Chromosomes were arranged according to size within the metacentric/submetacentric and acrocentric groups.

2.6. Characterization of the OmyY1 BAC Contig: Illumina Sequencing

A minimum tiling path of BAC clones from the OmyY1 region was sequenced by Amplicon Express, Inc. Pullman, WA by Focused Genome Sequencing (FGS). FGS is a next-generation sequencing (NGS) method developed at Amplicon Express that allows very high quality assembly of BAC clone sequence data using the Illumina HiSeq (San Diego, CA). The proprietary FGS process makes NGS tagged libraries of BAC clones and generates a consensus sequence of the BAC clones.

Five OmyY1 BACs spanning the OmyY1 physical map contig (RT578J04, RT450D22, RT015B10, including BAC clones RE223F06, and RE143K08 containing the OmyY1 locus) were sequenced using Roche 454 GS FLX next-generation sequencing technologies, by the WSU Pullman bioinformatics genomics core lab. This sequence evaluation yielded 94325 total reads, assembling 188 contigs ranging in size from 50.4 kb to <100 bp.

2.7. Characterization of the OmyY1 BAC Contig: Comparative Genomic Hybridization

We used a 4 x 44 k Agilent salmonid microarray [25] to determine gene homologies within the OmyY1 contig through Comparative Genomic Hybridization (CGH). This microarray was designed using 60 bp oligos from the 3′ end of most known cDNAs from Rainbow Trout or Atlantic salmon. We pooled six clones from the contig (RT492A21, RT450D22, RT015B10, RT423A24, RT004L12, and RE143K08) and labeled with both Alexa Fluor 5 and and Alexa Fluor 3 using the BioPrime Total Genomic Labeling System (Invitrogen). Samples were hybridized to the salmonid microarray in duplicate versus unrelated BAC clones using the manufacturer’s protocol for CGH (Agilent). Using the R statistical software package, microarray data were floored at 2 standard deviations above the local background then normalized with a LOESS normalization method. Color ratios were fit to a Linear Model and then analyzed using empirical Bayes statistics [26]. Positive hits on the array features with significant values and higher fluorescence intensity for OmyY1 contig samples than the competing samples from unrelated BAC clones.

False positives were removed from the initial list of positive hits by conducting a stand-alone BLAST of the 60 bp oligo sequences from the array against a first draft rainbow trout genome (M. R. Miller, unpublished) and removing any repetitive elements. A second BLAST inquiry was conducted in GenBank, and additional repetitive elements were removed from the list. Additionally, vector (E. coli) genomic DNA was hybridized to the microarray, and overlapping hits with those from the OmyY1 contig positive hit list were removed. The final positive hit list represented possible candidates for functional genes closely linked to OmyY1. Genes of interest were confirmed by searching the Illumina assembly and/or by PCR using primers designed to the EST target in the OmyY1 BAC clones and rainbow trout genomic DNA, followed by Sanger sequencing on an ABI 3130xl.

Once possible candidates for functional genes were confirmed within the OmyY1 BAC contig, the corresponding BAC sequences were compared to EST sequences from the 4 x 44 k salmonid microarray annotation file, the cGRASP rainbow trout EST database (http://web.uvic.ca/grasp/), or cDNA sequences from GenBank. In addition, the list of possible functional genes was BLAST-analyzed against both the rainbow trout and Atlantic salmon [27] draft genome assemblies to identify potential paralogs and eliminate additional repetitive sequences. Candidate genes were proposed to be functional within the OmyY1 contig if a full-length hypothetical mRNA could be identified within the BAC contig sequence. Criteria for functionality included presence of start and stop codons, with no in-frame stop codons. Gene identity required corresponding BAC sequence to be 100% identical to candidate genes, excluding compressions found in 454 and Illumina sequencing of homopolymers. For microarray hits not found in the BAC contig assembly, corresponding rainbow trout-specific expressed sequences were compared to Sanger sequenced PCR products from OmyY1 contig BAC clones or to OmyY1-linked scaffolds from the draft rainbow trout genome assembly.

2.8. PCR Confirmation and Sanger Sequencing of BAC Clones Containing Loci Identified in BLAST Analysis of Illumina Sequencing Results and SNP Identification

GenBank BLASTn and BLASTx analyses were performed on contig sequences obtained from the Illumina/454 OMY BACs assembly of the 6 OmyY1 BAC clones spanning the minimum tiling path of the physical map contig #6256. Following identification of regions of homology within contigs to various teleost annotated gene loci, the resulting sequences were subsequently screened for suspected and known repetitive elements in salmonids using a salmonid-specific repeat masker (http://lucy.ceh.uvic.ca/repeatmasker/cbr_repeatmasker.py).

PCR primers were designed either to the OMY BAC contig sequence itself, or to the GB cDNA or EST sequence available (see Table 3). Additionally, subsequent to the Comparative Genomic Hybridization (CGH) results, ESTs identified as positive hits were also used for template in target design. ESTs identified by CGH were aligned with the rainbow trout draft genome sequence, and scaffolds with significant homology provided another source for primer design.

PCR amplified products were treated with 2 U of FastAP (Fermentas) and 1 U Exonuclease I (USB) and sequenced directly using 2.3 uls of the sequencing RR mix from the Big Dye Terminator v3.1 cycle sequencing kit (ABI), 2 μls of the provided 5x buffer, .4 μm primer, and .5–2 μL of the PCR reaction. Reactions were run for 29 cycles with 4 minute extension at 72C, then cleaned by CleanSEQ magnetic bead separation (Agencourt) and run on an ABI 3130xl sequencer. Sequence reads were aligned using DNASTAR Seqman II v5 sequence analysis software.

3. Results

3.1. The OmyY1 BAC Contig

Three BAC clones containing the OmyY1 sequence were confirmed following screening of the Swanson rainbow trout YY RE library. These clones overlapped with a contig from the Swanson RT library which originally contained 30 clones. Most recently a third generation BAC physical map has been prepared with BACs from the three Swanson BAC libraries: RE (EcoRI), RB (BamHI), and RT (HindIII) (Y. Palti, pers. com.). This updated map expands the previous OmyY1 BAC-containing contig, now contains 60 clones, and is designated as #6256. Although a PCR product was obtained following amplification with OmyY1 primers in most of the clones, we were able to sequence the OmyY1 product from only two of the RE BAC clones. The proposed sex determining gene, sdY [16], is found in the same two RE BAC clones and is contained within the 22 kb OmyY1 GenBank deposition. A diagram showing the clones from physical map contig #6256 analyzed in this project is shown in Figure 1.

261730.fig.001
Figure 1: Diagram of the OmyY1 BAC contig showing clones that we examined in this project. The dashed lines above the BAC clones indicate Scaffolds from Mike Miller’s rainbow trout genome assembly. Indicates that the clone was sequenced by Illumina. Indicates scaffolds that did not include BACs known to be in the OmyY1 contig (6256), but which apparently had BACs that are in this region because sequences of genes identified in the OmyY1 BACs matched the sequence of genes in these scaffolds exactly.

3.2. Localization of OmyY1 BAC Clones on Rainbow Trout Chromosomes

A dozen BAC clones from physical map contig #6256 were used as probes in FISH (fluorescence in situ) experiments for hybridization to rainbow trout chromosomes. All of these clones hybridized to the end of the short arm of the Y chromosome of rainbow trout with some signal observed on the end of the short arm of the X chromosome in many cells (Figures 2(a) and 2(b)). Hybridization was done on chromosomes prepared from OSU × Hot Creek hybrids. The Hot Creek strain has a Y chromosome which is morphologically distinct from the X, so the X and Y chromosomes can be easily distinguished, with the short arm of the Y being smaller than the short arm of the X chromosome [2]. Some of the BAC clones contained ribosomal DNA and those clones also hybridized to the NORs on chromosome 20 in rainbow trout (Figure 2(a)). BAC clones in the middle of the contig hybridized only to the Y chromosome in many cells (Figure 2(b)). These clones did not contain ribosomal DNA. The BAC clones containing OmyY1 and sdY did not hybridize to the X or Y chromosomes in Chinook Salmon (data not shown). As mentioned above, these clones contained 18S ribosomal DNA and hybridized only to known sites of rDNA on Chinook chromosomes.

fig2
Figure 2: (a) Hybridization of the BAC clone RE143K08 (which is found near the 3′ end of the OmyY1 contig and contains the OmyY1 marker, the sdY gene and ribosomal DNA) to male rainbow trout chromosomes. Note the signals on the sex chromosome pair and chromosome pair 20, the site of the 18S rDNA locus in rainbow trout. (b) Hybridization of the BAC clone RT429A21 to male rainbow trout chromosomes. This BAC is close to the 5′ end of the OmyY1 contig (6256) (see Figure 1). Note the larger signal on the Y chromosome compared to the X.
3.3. Characterization of the OmyY1 BAC Contig: Illumina and 454 Sequencing Results

Five BACs spanning the OmyY1 physical map contig #6256 (RT450D22, RT015B10, RT492A21, RT004L12, and RT423A24) were Illumina sequenced and assembled by Amplicon Express (http://www.ampliconexpress.com/). The BAC clones were sequenced by Focused Genome Sequencing (FGS). FGS is a next-generation sequencing (NGS) method developed at Amplicon Express that allows very high quality assembly of BAC clone sequence data using the Illumina HiSeq (San Diego, CA). The proprietary FGS process makes NGS tagged libraries of BAC clones and generates a consensus sequence of the BAC clones. Illumina sequencing yielded an average of 2,412,973 reads per BAC clone assembling into 56 contigs ranging in size from approximately 245 kb to 1 kb. Additionally, five OmyY1 BACs spanning the OmyY1 physical map contig (RT578J04, RT450D22, RT015B10, including BAC clones RE223F06, and RE143K08 containing the OmyY1 locus) were sequenced using Roche 454 GS FLX next-generation sequencing technologies, by the WSU Pullman bioinformatics genomics core lab.

Assembled DNA sequence contigs were evaluated directly for gene coding content using NCBI-BLASTx [28] and also redundantly masked for repetitive elements using the salmonid-specific repeat masker prior to NCBI-BLASTx evaluation, to identify candidate gene sequences. BLASTx coding content homologies provided access to GenBank gene sequence data depositions, which were then used in NCBI distributed stand-alone BLAST-2.2.25 application evaluations. The contigs containing the OmyY1 BAC clone illumine sequence data were put into the Standalone BLAST databases format and then BLASTn was conducted on this OmyY1 database with downloaded candidate gene sequences for comprehensive gene sequence content homology ([16, 29] and http://web.uvic.ca/grasp/). When assembled, sequence contigs were found to sequentially contain gene coding-region content, and these adjacent assemblies were evaluated for flanking DNA sequence homology to allow assembly of larger continuous sequence reads spanning the candidate gene sequence. Candidate gene sequences were characterized for intron-exon boundaries by sequence alignment with GenBank mRNA or predicted gene sequence depositions.

By this method the following 12 predicted protein coding gene sequences have been identified (including sdY), partially annotated and deposited to GenBank (Table 1). Five of these genes have confirmed transcripts in at least one of five different databases that were searched and these are reported in Table 2. Additionally, one predicted pseudogene was identified in the contig #6256 sequence assemblies (Table 1), determined by presence of premature stop codons relative to annotated predicted proteins from other species (data not shown). Further BLAST analyses within the cGRASP EST cluster rainbow trout databases yielded an additional 5 OmyY1 linked transcripts for unknown genes: omyk-BX860091, omyk-CB488087, Contig22207, omyk-DV199273, and Contig19333 (Tables 1 and 2). BLASTx in GenBank revealed that omyk-BX860091 shows poor homology to laminin subunit beta-1-like (best homology is 63% identity to Oreochromis niloticus) and Contig22207 has partial homology to Sec16A (all homology <55% identity for <25% of the transcript). No predicted protein sequences for these unknown genes could be analyzed.

tab1
Table 1: List of predicted protein coding genes, pseudogenes, and unknown transcripts found in rainbow trout physical map contig #6256 sequence assemblies. Pseudogenes were differentiated from protein coding genes based on the presence of premature stop codons in open reading frames. The BAC clone column denotes the clone in the contig that gave a PCR product for the gene in question.
tab2
Table 2: List of predicted nonrepetitive transcribed genes in rainbow trout physical map contig #6256 with corresponding EST’s in at least one database. EST designations in bold had the highest homology to OmyY1 linked sequence, and those sequences were used for homology and coverage analyses.
tab3
Table 3: Primer information for markers in the OmyY1 BAC contig (#6256).The parental lines are the doubled haploid male (AR/SW) and female (OSU/WR) rainbow trout clonal lines in which the products were amplified. Clonal lines listed on either side of “/” denote which clonal lines have the SNP or indel. The last column shows the base pair location of the SNP(s) or indel in the amplified product.
3.4. Characterization of the OmyY1 BAC Contig: Comparative Genomic Hybridization Results

A 4 x 44 k Agilent salmonid microarray [25] was used to identify gene homologies within the OmyY1 contig. The preliminary list of hits from the OmyY1 contig contained 124 expressed sequences. Following the removal of duplicates within the array and false positive hits by removing overlapping hits with unrelated BAC clones, we were left with 48 ESTs (see Supplemental File 1 of the Supplementary Material available online at http://dx.doi.org/10.1155/2013/261730). This list was compared to genes found in the Illumina and 454 assemblies to confirm that the CGH yielded true positives, with six genes found on both lists: cAMP-responsive element-binding protein 3-like protein 4, F-Box/WD repeat containing protein 2, Na/K/2Cl co-transporter, CREB-regulated transcription coactivator 2, DENN domain-containing protein 4B, and Zinc transporter ZIP1. No other genes confirmed in the OmyY1 contig assemblies were represented on the 4 x 44 k oligo array by exact name or alias.

Stand-alone BLAST analysis was conducted with the 48 array hits using a draft Oncorhynchus mykiss genome assembly as the database. This draft genome assembly included sequence from 7 BAC clones within the rainbow trout physical map contig #6256; 5 of which were chosen for Illumina and 454 sequencing in our study (Figure 1). Microarray sequences matching any corresponding assembled genome scaffolds were confirmed as OmyY1-linked. Of these 7, scaffolds MMSRT112A (containing sequence from RT450D22) and MMSRT121C (containing sequence from RT492A21) showed significant sequence homology and coverage to 5 microarray hits each, 3 of which fall in an overlapping region between both BAC RT450D22 and BAC RT492A21clones (supplemental file 1).

These microarray results identified additional sex-linked or paralogous scaffolds that contained a significant number of microarray hits. Stand-alone BLAST results showed MMSRT069D had significant homology to 4 hits (Supplemental File 1). To check whether this scaffold was sex-linked or paralogous, primers were designed to the microarray EST sequences then partial gene sequences were amplified from the OmyY1 BAC clones and sequenced. Sequences from MMSRT069D showed 90% homology to OmyY1 BAC sequence indicating the scaffold was most likely a paralogous scaffold, amplifying in BAC clone RT15B10. There were also five rDNA and/or repetitive ESTs that contained homology to similar sets of draft genome scaffolds, including MMSRT092E, 100C, 059D, 036A, 103G, 136F, and 084G (Supplemental File 1). The 5 EST sequences had similar rates of homology with both the draft genome and OmyY1 contig assemblies, but none show with 100% homology with any assembly. Due to the difficulty in amplifying clean rDNA and repetitive sequences in the BACs, and because a great number of scaffolds showed identical homology to the ESTs, we could not determine whether these scaffolds were paralogous or identical to the OmyY1 region. It is worth noting, however, that MMSRT100C also contains IRF9 (interferon regulatory factor 9) which is the gene from which sdY was originally derived [16]. This suggests that even though these rDNA and repetitive sequences are found in many locations throughout the genome, there are specific sex-linked versions surrounding OmyY1 and sdY.

The 48 CGH hits queried in stand-alone BLAST evaluations of the OmyY1 contig sequence and draft genome assemblies were checked against GenBank (using BLASTn and BLASTx) for homology with known salmon and trout repetitive elements and checked against the rainbow trout draft genome. This search showed 17 hits represented transposable elements, 5 contained ribosomal DNA, and 12 more were repetitive, leaving 14 ESTs showing significant nonrepetitive sequence homology to the OmyY1 region (Supplemental file 1). Although repetitive hits were not included in Tables 1 and 2, many of these repetitive ESTs from the microarray seem to be expressed within the OmyY1 contig and may still be functionally and evolutionarily important (details of both BLAST searches are included in Supplemental File 1). It is also important to note that for some ESTs, repetitive elements only made up a small portion of the entire transcript.

Some ESTs within the 4 x 44 k annotation file were derived from Atlantic Salmon (Salmo salar), and for those ESTs with less than 99-100% homology to either the O. mykiss draft genome assembly or the OmyY1 contig Illumina assembly, rainbow trout-specific EST databases were searched to find expressed orthologs or paralogs. These databases include the cGRASP O. mykiss EST cluster databases, GenBank, Salem’s 454 database [29], and the O. mykiss sex-specific gonad EST databases at 35dpf from Yano et al. [16]. Found orthologs and paralogs were evaluated for strong homology and comprehensive gene coverage within OmyY1 contig sequence and for the lack of premature stop codons when predicted protein sequences were available to determine functionality of genes.

The final list of predicted functional, nonrepetitive genes within the OmyY1 contig contained 11 hits from the 4 x 44 k microarray (Table 2). Six of these ESTs (cr3l4, FBXW2, nkcc1a, TORC2-like, and ZIP1 (with 2 predicted splice variants)) were found in the Illumina assembly for the OmyY1 contig.

3.5. Analysis of SNPs from the OmyY1 BACs

For further confirmation of these loci within the BAC clones, Sanger sequencing was performed. PCR primers were designed either to the OMY BAC Illumina/454 contig sequence, GB cDNA, or EST sequence available, or to the O. mykiss draft genome homologous scaffolds (see Figure 1). The F-box and WD repeat domain containing 2 (Fbxw2) gene comprises a 7.6 kb region within BAC RT450D22. It was evaluated for sequence polymorphisms in the DH RT clonal lines AR, OSU, SW, and WR, [30, 31] yielding 10 SNPs; 4 of which were population specific and 6 of which appeared sex specific. All SNPs are referenced against the Fbx numbering system of the GenBank deposition and locations are tabulated (see Table 3).

Gene-specific amplification attempts of the Oncorhynchus mykiss oocyte protease inhibitor-1 gene, BAC RT492A1 (GenBank KC686349), failed to amplify a single locus, except within the BAC itself. Sequencing of PCR products of genomic DNA from the OSU female DH line yielded the cleanest reads, with primers F1/R1 showing discernible indels and SNPs and an overall homology of 94% as compared to the Y sequence, but it could not be determined whether this was a related sequence on the X chromosome or an autosomal homolog.

Sex-linkage and SNP evaluations of the cAMP-responsive element-binding protein 3-like protein 4 (cr3l4) gene, BAC RT15B10 (GenBank KC686342) of DH RT genomic DNAs preferentially amplified paralogous loci; no primer combinations yielded sex-linked polymorphisms for this gene.

Also within BAC RT15B10, the Zinc transporter ZIP1 (zip1) gene was evaluated by PCR and amplified product sequencing of RT clonal lines for sex-linked polymorphisms, which are referenced against the ZIP1 GenBank KC686350 deposition. The CREB-regulated transcription coactivator 2 (TORC2-like) gene was PCR evaluated for sex-linked polymorphisms in the RT clonal line panel, identifying 4 SNPs between the clonal lines, 1 sex-linked SNP, and a 17 bp deletion in SW only.

Population and RT clone-specific polymorphisms were found in the DENN domain-containing protein 4B-like gene amplified in BAC RT15B10. All polymorphisms are referenced against the GenBank KC686345 sequence. RT15B10 also contains an interleukin enhancer-binding factor 2 pseudogene (GenBank KC686347); however the 2 primer sets used to confirm sequence within RT15B10 amplified more than one locus in all the DH genomic DNA analyzed.

In addition to these EST templates and annotated loci, a number of primers were designed to other unannotated regions within the Illumina/454 assembly contigs as well as to the homologous MMSRT draft genome scaffolds. Of these, there were 2 primer sets that yielded sex-specific SNPs and these are included in Table 3.

4. Discussion

In this study we describe sequencing and CGH results from BAC clones forming a contig with the OmyY1 marker isolated by Brunelli and Wertzler [11] and the sdY sex determining gene isolated by Yano et al. [16]. Primers for the OmyY1 marker and the sdY gene amplified products from two of the BACs on the 3′ end of the contig. These BACs hybridized to the end of the short arm of the Y chromosome with some signal also on the end of the short arm of the X chromosome. Illumina sequencing of the BACs, because of the large number of repetitive sequences present, produced many small contigs. A number of putative genes were identified, and all genes PCR amplified appear to be present in both males and females, with only a small portion showing sex-specific sequence differences. Large contiguous sequences with over 95% homology to each other were found in nonoverlapping BACs suggesting that duplications of a number of regions, some of which containing putative genes, are present in contig #6256. Taken together these results suggest that the X and Y are in an early stage of differentiation, perhaps because infrequent episodes of crossing over have partially homogenized the content of the X and Y.

Because the rainbow trout X and Y chromosomes are usually morphologically distinct, it has been assumed that they are in a later stage of differentiation compared to species such as medaka where the X and Y are not distinguishable. In the case of medaka, the only difference between the X and Y is the insertion of the sex-determining gene DMY into the Y chromosome by transposition [32]. DMY was found in only one BAC clone. We analyzed a contig composed of a minimum tiling path of six BACs from the Swanson YY library. Sex-specific SNPs were identified in genes from BACs in the 5′ and middle of the contig, but few genes were found on the sdY on the 3′ end other than rDNA and transposable elements. Because the ends of contig #6256 contain very repetitive sequences, we were not able to extend the contig in either direction. It is possible that Y-specific genes are found in regions outside of the region that was sequenced. Future work could involve isolation of BACs and sequencing of these adjacent regions in both directions.

The expression of all of the genes in the OmyY1 region could be examined to determine if any are expressed preferentially in males during sex determination or sexual differentiation. Isolation of the corresponding region from the OSU BAC library prepared from an XX female clonal line and comparison of the content and expression of genes on the X and the Y in the region adjacent to the proposed sex determination gene, sdY would provide information about the degree of divergence between the X and Y chromosomes.

In the Swanson YY clonal line, the Y chromosome is not morphologically distinct from the X, with rDNA on the short arm adjacent to the centromere. This undifferentiated Y chromosome is possibly the result of crossing over between the X and Y in the population that gave rise to this clonal line. Future work on the male-specific region of the Y chromosome in YY clonal lines with the morphologically differentiated Y chromosome should reveal whether greater variation is present in the coding regions of the Y chromosome in such rainbow trout strains.

Finally, we would like to compare the male-specific content of the Y chromosome in the other Oncorhynchus species and determine the size of the region that is transposing to form new Y chromosomes in this genus. A BAC clone has been isolated that contains the sdY gene from a male Chinook salmon (RH Devlin, Fisheries and Oceans, pers. com.) and it will be interesting to compare the sequences of the adjacent regions in the two species. We are currently performing CGH experiments to compare the genes present in YY and XX genomic DNA from rainbow trout, coho salmon, and sockeye salmon which should also provide information on differences in gene content between the X and Y chromosomes of these species.

5. Conclusions

We analyzed sequence data from a contig of rainbow trout BAC clones from the Swanson YY clonal line that includes the male-specific OmyY1 genetic marker and the sex-determining gene, sdY. Our data shows that this region is rich with rDNA and repetitive elements, but also has many predicted protein coding genes. Most genes in the region seem to be present on both the X and Y chromosomes, although sex-specific SNPs are found in a small portion of them, suggesting that the X and Y are at an early stage of differentiation. None of these genes were found in the cosmid or BAC clones from the Y chromosome of Chinook salmon (Phillips and Devlin, unpublished) and the two BAC clones containing OmyY1 and sdY did not hybridize to the Y chromosome of Chinook salmon, suggesting that the shared region containing the sex-determining gene between these two species is very small.

Acknowledgments

This project was supported by Agriculture and Food Research Initiative Competitive Grant no. 2009-35205-05067 from the USDA National Institute of Food and Agriculture. The following individuals from Amplicon express worked on the sequencing: Evan Hart, Amy Mraz, Keith Stormo, Jason Dobry, and Travis Ruff. The authors would also like to thank Michael R. Miller for providing the rainbow trout draft genome assembly which greatly improved our analysis. The use of trade, firm, or corporation names in this publication is for the information and convenience of the reader. Such use does not constitute an official endorsement or approval by the U.S. Department of Interior or the U.S. Geological Survey of any product or service to the exclusion of others that may be suitable.

References

  1. R. H. Devlin and Y. Nagahama, “Sex determination and sex differentiation in fish: an overview of genetic, physiological, and environmental influences,” Aquaculture, vol. 208, no. 3-4, pp. 191–364, 2002. View at Publisher · View at Google Scholar · View at Scopus
  2. G. H. Thorgaard, “Heteromorphic sex chromosomes in male rainbow trout,” Science, vol. 196, no. 4292, pp. 900–902, 1977. View at Scopus
  3. R. B. Phillips, M. A. Noakes, M. Morasch, A. Felip, and G. H. Thorgaard, “Does differential selection on the 5S rDNA explain why the rainbow trout sex chromosome heteromorphism is NOT linked to the SEX locus?” Cytogenetics and Genome Research, vol. 105, pp. 122–125, 2004.
  4. A. Felip, A. Fujiwara, W. P. Young et al., “Polymorphism and differentiation of rainbow trout Y chromosomes,” Genome, vol. 47, no. 6, pp. 1105–1113, 2004. View at Publisher · View at Google Scholar · View at Scopus
  5. J. E. Parsons and G. H. Thorgaard, “Production of androgenetic diploid rainbow trout,” Journal of Heredity, vol. 76, no. 3, pp. 177–181, 1985. View at Scopus
  6. R. H. Devlin, C. A. Biagi, and D. E. Smailus, “Genetic mapping of Y-chromosomal DNA markers in Pacific salmon,” Genetica, vol. 111, no. 1–3, pp. 43–58, 2001. View at Publisher · View at Google Scholar · View at Scopus
  7. J. Stein, R. B. Phillips, and R. H. Devlin, “Identification of the Y chromosome in chinook salmon (Oncorhynchus tshawytscha),” Cytogenetics and Cell Genetics, vol. 92, no. 1-2, pp. 108–110, 2001. View at Scopus
  8. R. B. Phillips, M. R. Morasch, L. K. Park, K. A. Naish, and R. H. Devlin, “Identification of the sex chromosome pair in coho salmon (Oncorhynchus kisutch): lack of conservation of the sex linkage group with chinook salmon (Oncorhynchus tshawytscha),” Cytogenetic and Genome Research, vol. 111, no. 2, pp. 166–170, 2005. View at Publisher · View at Google Scholar · View at Scopus
  9. R. B. Phillips, J. DeKoning, M. R. Morasch, L. K. Park, and R. H. Devlin, “Identification of the sex chromosome pair in chum salmon (Oncorhynchus keta) and pink salmon (Oncorhynchus gorbuscha),” Cytogenetic and Genome Research, vol. 116, no. 4, pp. 298–304, 2007. View at Publisher · View at Google Scholar · View at Scopus
  10. M. A. Alfaqih, R. B. Phillips, P. A. Wheeler, and G. H. Thorgaard, “The cutthroat trout Y chromosome is conserved with that of rainbow trout,” Cytogenetic and Genome Research, vol. 121, no. 3-4, pp. 255–259, 2008. View at Publisher · View at Google Scholar · View at Scopus
  11. J. P. Brunelli, K. J. Wertzler, K. Sundin, and G. H. Thorgaard, “Y-specific sequences and polymorphisms in rainbow trout and Chinook salmon,” Genome, vol. 51, no. 9, pp. 739–748, 2008. View at Publisher · View at Google Scholar · View at Scopus
  12. J. P. Brunelli and G. H. Thorgaard, “A new Y-chromosome-specific marker for Pacific salmon,” Transactions of the American Fisheries Society, vol. 133, no. 5, pp. 1247–1253, 2004. View at Publisher · View at Google Scholar · View at Scopus
  13. Shao Jun Du, R. H. Devlin, and C. L. Hew, “Genomic structure of growth hormone genes in chinook salmon (Oncorhynchus tshawytscha): presence of two functional genes, GH-I and GH-II, and a male- specific pseudogene, GH-Y,” DNA and Cell Biology, vol. 12, no. 8, pp. 739–751, 1993. View at Scopus
  14. R. H. Devlin, B. K. McNeil, T. D. D. Groves, and E. M. Donaldson, “Isolation of a Y-chromosomal DNA probe capable of determining genetic sex in Chinook salmon (Oncorhynchus tshawytscha),” Canadian Journal of Fisheries and Aquatic Sciences, vol. 48, pp. 1606–1612, 1991.
  15. R. A. Woram, K. Gharbi, T. Sakamoto et al., “Comparative genome analysis of the primary sex-determining locus in Salmonid fishes,” Genome Research, vol. 13, no. 2, pp. 272–280, 2003. View at Publisher · View at Google Scholar · View at Scopus
  16. A. Yano, R. Guyomard, B. Nichol et al., “An immune-related gene evolved into the master sex-determining gene in rainbow trout, Oncorhynchus mykiss,” Current Biology, vol. 22, pp. 1–6, 2012.
  17. W. P. Young, P. A. Wheeler, V. H. Coryell, P. Keim, and G. H. Thorgaard, “A detailed linkage map of rainbow trout produced using doubled haploids,” Genetics, vol. 148, no. 2, pp. 839–850, 1998. View at Scopus
  18. K. M. Nichols, W. P. Young, R. G. Danzmann et al., “A consolidated linkage map for rainbow trout (Oncorhynchus mykiss),” Animal Genetics, vol. 34, no. 2, pp. 102–115, 2003. View at Publisher · View at Google Scholar · View at Scopus
  19. Y. Palti, S. A. Gahr, J. D. Hansen, and C. E. Rexroad, “Characterization of a new BAC library for rainbow trout: evidence for multi-locus duplication,” Animal Genetics, vol. 35, no. 2, pp. 130–133, 2004. View at Publisher · View at Google Scholar · View at Scopus
  20. K. M. Nichols, J. Bartholomew, and G. H. Thorgaard, “Mapping multiple genetic loci associated with Ceratomyxa shasta resistance in Oncorhynchus mykiss,” Diseases of Aquatic Organisms, vol. 56, no. 2, pp. 145–154, 2003. View at Scopus
  21. A. M. Zimmerman, J. P. Evenhuis, G. H. Thorgaard, and S. S. Ristow, “A single major chromosomal region controls natural killer cell-like activity in rainbow trout,” Immunogenetics, vol. 55, no. 12, pp. 825–835, 2004. View at Publisher · View at Google Scholar · View at Scopus
  22. R. B. Phillips, J. J. Dekoning, A. B. Ventura et al., “Recombination is suppressed over a large region of the rainbow trout y chromosome,” Animal Genetics, vol. 40, no. 6, pp. 925–932, 2009. View at Publisher · View at Google Scholar · View at Scopus
  23. Y. Palti, M. C. Luo, Y. Hu et al., “A first generation BAC-based physical map of the rainbow trout genome,” BMC Genomics, vol. 10, article 462, 2009. View at Publisher · View at Google Scholar · View at Scopus
  24. R. B. Phillips, K. M. Nichols, J. J. DeKoning et al., “Assignment of rainbow trout linkage groups to specific chromosomes,” Genetics, vol. 174, no. 3, pp. 1661–1670, 2006. View at Publisher · View at Google Scholar · View at Scopus
  25. B. F. Koop, K. R. Von Schalburg, J. Leong et al., “A salmonid EST genomic study: genes, duplications, phylogeny and microarrays,” BMC Genomics, vol. 9, article 545, 2008. View at Publisher · View at Google Scholar · View at Scopus
  26. G. K. Smyth, “Linear models and empirical Bayes methods for assessing differential expression in microarray experiments,” Statistical Applications in Genetics and Molecular Biology, vol. 3, pp. 1–26, 2004.
  27. W. S. Davidson, B. F. Koop, S. J. Jones et al., “Sequencing the genome of Atlantic salmon (Salmo salar),” Genome Biology, vol. 11, no. 9, article 403, 2010.
  28. S. F. Altschul, T. L. Madden, A. A. Schäffer et al., “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs,” Nucleic Acids Research, vol. 25, no. 17, pp. 3389–3402, 1997. View at Publisher · View at Google Scholar · View at Scopus
  29. M. Salem, C. E. Rexroad III, J. Wang, G. H. Thorgaard, and J. Yao, “Characterization of the rainbow trout transcriptome using Sanger and 454-pyrosequencing approaches,” BMC Genomics, vol. 11, no. 1, article 564, 2010. View at Publisher · View at Google Scholar · View at Scopus
  30. W. P. Young, P. A. Wheeler, V. H. Coryell, P. Keim, and G. H. Thorgaard, “A detailed linkage map of rainbow trout produced using doubled haploids,” Genetics, vol. 148, no. 2, pp. 839–850, 1998. View at Scopus
  31. M. R. Miller, J. P. Brunelli, P. A. Wheeler, et al., “A conserved haplotype controls parallel adaptation in geographically distant salmonid populations,” Molecular Ecology, vol. 21, pp. 237–249, 2012.
  32. M. Kondo, U. Hornung, I. Nanda et al., “Genomic organization of the sex-determining and adjacent regions of the sex chromosomes of medaka,” Genome Research, vol. 16, no. 7, pp. 815–826, 2006. View at Publisher · View at Google Scholar · View at Scopus