How Can Genomic Tools Contribute to the Conservation of Endangered OrganismsView this Special Issue
A Quantitative Genomic Approach for Analysis of Fitness and Stress Related Traits in a Drosophila melanogaster Model Population
The ability of natural populations to withstand environmental stresses relies partly on their adaptive ability. In this study, we used a subset of the Drosophila Genetic Reference Panel, a population of inbred, genome-sequenced lines derived from a natural population of Drosophila melanogaster, to investigate whether this population harbors genetic variation for a set of stress resistance and life history traits. Using a genomic approach, we found substantial genetic variation for metabolic rate, heat stress resistance, expression of a major heat shock protein, and egg-to-adult viability investigated at a benign and a higher stressful temperature. This suggests that these traits will be able to evolve. In addition, we outline an approach to conduct pathway associations based on genomic linear models, which has potential to identify adaptive genes and pathways, and therefore can be a valuable tool in conservation genomics.
Understanding the genetic architecture of quantitative complex traits is a central topic in modern biology, with applications ranging within evolutionary genetics, animal and plant breeding, conservation biology, and human health. Linkage analyses and candidate gene studies have been used to gather information about the genetic basis of many complex phenotypes in a range of organisms, but usually with limited power to identify the causal loci. Linkage analyses rely on the joint inheritance of a small number of markers within families with known kinship. Candidate gene studies rely on a set of preselected genes; thus, many causal genes are likely to be missed because of incomplete genetic knowledge of the trait [1–3]. With the availability of full genome sequence data, genetic polymorphisms among individuals, or within populations, can be investigated, and genome-wide association studies (GWAS) have become increasingly popular as a tool for finding causal loci .
The principle of GWAS is to associate the phenotypic variation with genetic polymorphism (single-nucleotide polymorphisms (SNPs) and/or other polymorphic molecular variants) with the assumption that SNPs are in linkage disequilibrium (LD) with nearby causal variants or that the SNPs themselves are causal. For an increasing number of species, the genome has been sequenced, and combined with advances in bioinformatics, GWAS can be performed on model- as well as nonmodel organisms [4, 5]. However, studies using model organisms, including Drosophila melanogaster, continue to play an important role in gaining increased knowledge about the genetic architecture of complex traits. Studies on this species have contributed to elucidating the genetic architecture of complex traits, including traits associated with stress resistance [6, 7]. The D. melanogaster Genetic Reference Panel (DGRP)  has often been used in such studies. DGRP is a genetic tool which allows researchers to assay a large number of replicated individuals with the same genotype from hundreds of independent, inbred, genome-sequenced lines.
The outcome of GWAS is typically a list of the most significant SNPs. This ignores much of the remaining variance due to the conservative statistical nature of such analyses. This is unfortunate because genetic variants with small effects are likely to be missed, and even variants with large effect may not be among the top hits [8, 9]. Although valuable, the potential gain from GWAS is therefore currently not utilized sufficiently, and alternative analytical approaches are warranted to further exploit the potential use of full genome sequence data. Approaches which utilize prior biological knowledge to group SNPs (hereafter referred to as SNP-set) have been proposed to alleviate the issue of type-I error rates in analysis of large genome data sets [8–11].
Increasing evidence from studies relying on full genome approaches indicates that most traits have very complex genetic architectures and that they are influenced by often hundreds of interacting genes, each with a small effect on the phenotype and by genetic and environmental interactions [6–8, 12–15]. Accordingly, associated variants are nonrandomly distributed across the genome and are enriched within genes that interact through pathways and biological networks . Such associated variants would be captured easier if based on a SNP-set based association approach where SNPs are grouped according to their physical proximity to a gene within a pathway. It can be argued that such an approach will increase the probability of finding true associations. In addition, by reducing the number of independent tests performed from the total number of SNPs (often millions) to the number of pathways (typically thousands), less restrictive statistical corrections for type-1 errors are needed. Analyzing the combined effect of many SNPs with small effect sizes might therefore increase the probability of finding causal variants .
In the present study we use a subset of the DGRP (21–27 lines depending on the trait assessed) to investigate five traits related to life history and environmental stress resistance in D. melanogaster. We use a SNP-set approach in which we aggregate SNPs based on knowledge of biological processes (here gene ontologies (GOs)) and associate the phenotypic variation with genomic variation. We investigate whether these traits harbor genetic variation for five fitness-related phenotypes: egg-to-adult viability under benign and stressful temperatures, heat stress resistance, expression of a major heat shock protein (Hsp70), and metabolic rate. The capability of a genotype to produce an egg that successfully develops into an adult fly and the ability to withstand increasing stressful temperatures are key fitness traits of major importance for the abundance and distribution of insect species . Metabolic rate is known to influence functional traits such as longevity [18–20] and resistance to desiccation and starvation [21, 22]. Knowledge of the genetic variation and identification of possible pathways involved in explaining phenotypic variation in these traits is therefore of major importance for understanding species distribution and local adaptation and genomic tools are predicted to have important applications in conservation genetics in the future .
2. Material and Methods
2.1. Drosophila Stocks
A subset of the Drosophila Genetic Reference Panel (DGRP)  was obtained from Bloomington Drosophila Stock Center (NIH P40OD018537) (see Table S1 of the Supplementary Material available online at http://dx.doi.org/10.1155/2016/2157494). The flies were maintained on a standard oatmeal-sugar-yeast-agar Drosophila medium at 25°C and a 12-hour light/dark cycle.
2.2. Phenotypic Assays
2.2.1. Egg-to-Adult Viability
The proportion of eggs which successfully develop to adulthood was assessed at two developmental temperatures: a benign temperature (25°C) and a lightly stressful temperature (28°C). For each line and temperature 200 eggs were collected. At benign temperature, forty eggs were placed in five vials per line, and at the high temperature twenty eggs were placed in ten vials per line. Emerging flies were counted daily until all flies had emerged. To approximate Gaussian distribution data arcsine2 was transformed. At benign and stressful temperatures 27 and 26 lines were assayed, respectively.
2.2.2. Heat Stress Resistance
Approximately 24 females from 27 lines were placed into individual glass tubes (24 × 15 mm) with lids. Tubes were randomized and placed on a rack that was submerged into a water bath heated to 37°C. Flies were constantly monitored after submersion, and heat stress resistance was measured as the time from placement in the water bath until the animal became comatose (i.e., not moving any body parts).
2.2.3. Metabolic Rate
From the rate of CO2-production (), metabolic rate was estimated using repeated stop-flow respirometry, as described in Jensen et al. . Measurements were conducted in 16 parallel metabolic chambers (glass cylinder, 20 × 70 mm) over a period of 24 hours. Measurements were performed on groups of individuals with approximately 18 five-day-old females per line per metabolic chamber. On average, nine replicates per line were obtained. Each day measurements were obtained for 13 lines (and three empty controls chambers). To avoid dehydration flies had access to a solution of 4% sugar and 2% agar (0.3 mL placed on a 15 × 15 mm paper). The estimate of standard metabolic rate was obtained using the average of the three lowest measurements of over the 24-hour period at °C (see Jensen et al.  for details and discussion).
The stop-flow respirometry system enabled analysis of the cumulative CO2 production for a given period as the metabolic chambers were sequentially open (3 minutes for measurement) and closed (45 minutes while CO2 accumulates). The system was controlled by two parallel 8-channel multiplexers (RM Gas Flow Multiplexer, Sable Systems, Las Vegas, Nevada, USA). Opening allowed airflow of CO2-striped dry air (soda lime column removes CO2, MERCK Millipore, Darmstadt, Germany) to flush the chambers at a fixed rate of 200 mL min−1 controlled by a mass flow controller (MFC 2-channel v. 1.0, Sable Systems, Las Vegas, Nevada, USA) connected to a flow controller (Side-Trak®, Sierra Instruments, Monterey, California, USA). Air leaving the metabolic chambers passed through a calcium chloride column (AppliChem, Darmstadt, Germany) before entering the CO2 analyzer (Li-6251 CO2 Analyzer, LI-COR Environmental, Lincoln, Nebraska, USA) to remove water. The temperature inside one metabolic chamber was registered by a data logger (iButton® Data Loggers, Maxim, Sunnyvale, California, USA) and data were extracted by OneWireViewer (Maxim, Sunnyvale, California, USA). All flies were stored in a −80°C freezer after measurements and the dry weight (Sartorius Microbalance, type MC5, accuracy ± 1 μg) was obtained after drying for 24 hours at 60°C such that could be expressed as μL CO2 produced per hour per mg dry weight.
2.2.4. Expression of Heat Shock Protein 70 Following Heat Stress
From each of 21 DGRP lines 10 five-day-old (±24 h) adult females were transferred to plastic tubes with screw cap and exposed to 35°C for 1 hour (a temperature known to induce a heat shock response; see ). Flies were subsequently allowed to recover at 25°C for 1 hour before being frozen at −80°C. Hsp70 expression was quantified in three replicates of approximately 10 flies for each line by ELISA, using the monoclonal antibody 7.FB which specifically binds Hsp70 in D. melanogaster , using the procedure described in Sørensen et al. . Prior to analysis the data were corrected for plate effect by equalizing the mean (the same mean on the three plates).
2.3. Quantitative Genomic Analyses
2.3.1. Genomic Data
SNP data, major inversions, and Wolbachia status were obtained from http://dgrp2.gnets.ncsu.edu/ [6, 28]. The complete set of DGRP lines harbors 1,496,037 polymorphic markers with a minor allele frequency > 0.05. Because we used a subset of the DGRP lines monomorphic markers were observed. These were removed prior to analyses. The number of lines assayed varied among traits; thus the number of SNPs analyzed differed as well (metabolic rate: 1,179,43 SNPs, heat stress resistance: 1,231,310 SNPs, Hsp70 expression: 1,115,889 SNPs, egg-to-adult viability at benign condition; 1,231,310 SNPs, and egg-to-adult viability at light stressful condition: 1,216,721 SNPs).
2.3.2. Quantitative Genetic Parameters
Variance components and genomic effects were estimated using the REML algorithm implemented in the Regress package  for R  by fitting the linear mixed model , where was a vector of phenotypic values, was a vector of fixed effects (i.e., Wolbachia status, five major polymorphic inversions (, , , , and ), and experimental block effects), was a vector of random genomic effects, and was a vector of the residuals. and were design matrices linking fixed and genomic effects to the observations. The genomic and residual effects were defined as and . The additive genomic relationship matrix, , was computed based on all genomic markers as , where was the total number of markers and was a centered and scaled genotype matrix (i.e., and ). Each column vector of was , where was the minor allele frequency of the th marker and was the th column vector of the allele count matrix, , containing the genotypes encoded as 0, 1, or 2, counting the number of minor alleles .
The genomic variance captured by common SNPs was computed as , and genomic (and raw phenotypic) correlations among the traits were computed as Spearman’s rank correlation. The 95% confidence interval (CI) for was obtained using a bootstrap procedure; observations were sampled 10,000 times with replacement obtaining the same number of observations as the true data. In each round was estimated, and the 95% CI was then obtained as the 2.5% and 97.5% quantiles of the bootstrap samples.
2.3.3. Pathway Association
We used a pathway based approach to identify sets of SNPs with association with the traits. Firstly, SNPs were annotated to genes within a 5 KB region using variant annotation from FlyBase (v.FB5.46) . Secondly, based on the gene annotation, SNPs were grouped into gene ontologies (GOs) using the annotation packages GO.db  and org.Dm.eg.db  from Bioconductor . Three types of GO classes were obtained: biological processes (BP), molecular function (MF), and cellular components (CC). Sets of SNPs were only included if they contained more than 10 genes, and the number of markers was >199. The numbers of markers included in the pathway association for BP, MF, and CC were 616.010, 562.938, and 549.372 SNPs, respectively.
The pathway based approach tests whether a particular SNP-set has a more extreme signal of association than a random group of SNPs. A summary statistic (), measuring the degree of association of one set of SNPs, was computed as the sum of the marker effects () within the given pathway; thus . The marker effects () were computed from the predicted genetic effect as .
Using a permutation approach the observed summary statistic for a SNP-set was compared to an empirical distribution of summary statistics for a random set of SNPs of the same size. As a consequence of LD, nearby SNPs will likely be correlated; this will affect the distribution of the summary statistics. To account for this correlation structure we used a procedure where we let a vector of observed marker effects be ordered according to the SNPs physical position in the genome, which then were linked to GOs. The elements in this vector were numbered . The permutation consisted of two steps. (1) Randomly pick an element () from this vector. Let this th marker effect () be the first element in the permuted vector and the remaining elements ordered according to their original numbering. Thus, all elements from the original vector were shifted to a new position and starting with and ending with . The mapping of GOs was kept fixed according to original mapping. (2) A summary statistic was computed for each SNP-set based on the original mapping of GOs. Hereby, the link between SNPs and pathway was broken while retaining the correlation structure among marker effects. Steps (1) and (2) were repeated 10,000 times and from this empirical distribution of summary test statistics a value was obtained. The empirical value corresponds to a one-tailed test of the proportion of randomly sampled summary statistics that were larger than the observed summary statistics. We assigned individual pathways as significant if .
2.3.4. Overlapping Pathways
A consequence of the small number of lines investigated was limited statistical power to detect causal variants. Therefore, we investigated whether we observed shared patterns across traits in the rankings of the pathways. For each class of pathways (i.e., BP, MF, and CC) an incidence matrix with rows corresponding to the number of SNP-sets and columns corresponding to the number of traits () was constructed. If the summary statistic for a SNP-set was below the threshold level (here ) the corresponding element in the incidence matrix was set to 1, otherwise to 0. The observed overlap was then compared to an empirical distribution of the overlap. For a total of 10,000 times the elements within each column were permutated and the overlap among columns was recorded. The probability of the overlap was estimated under the null hypothesis of independent association among traits. We determined the empirical value of a one-tailed test as the fraction of all random permutations that was larger than or equal to the observed overlap among traits at the 5% level.
2.3.5. Partitioning of Genetic Variance within Pathways
To dissect the genetic contribution of the associated pathways, the genetic variation within pathway was decomposed to gene level. Pathway-specific genetic effects of the genes constituting the pathways () were computed as , where was the genomic effect of the th marker computed and was the number of markers within the gene . Thus, if a GO has the genetic effect and consists of genes, then . A measure of the genetic variation for each feature per gene adjusted for the number of SNPs within gene () was computed as .
3. Results and Discussion
In the present study we used a subset of the DGRP to investigate whether genetic variation existed for five fitness-related phenotypes, namely, the proportion of eggs that develop to adulthood at two environmental conditions, a benign and a mildly stressful temperature, resistance to acute heat stress, induction of a major heat shock protein (Hsp70), and metabolic rate estimated from CO2 emission rate.
We found substantial phenotypic variation for all five traits (Figure 1, Table S2). In addition, genotype-by-environmental interaction (GxE) was observed for egg-to-adult viability, as the DGRP lines were not affected equally by the higher, stressful temperature (Figure 1(a)). The phenotypic variation was decomposed into a genomic () and a residual effect () from which the proportion of phenotypic variation captured by common SNPs was computed, that is, . Metabolic rate, heat stress resistance, and Hsp70 expression all showed intermediate heritability estimates, whereas the two egg-to-adult viability traits both had high estimates (Table 1). Despite the low number of DGRP lines assayed the bootstrap CI supported the magnitude of the heritability estimates, except for Hsp70 expression (Table 1). The very broad CI for Hsp70 expression was probably a consequence of the few number of lines assayed for this trait (Table S1).
Correlation of the raw phenotypic values showed a significant correlation between the two egg-to-adult viability traits (Table 1). By correlating individual genomic effects significant negative correlations were found between expression of Hsp70 and the two egg-to-adult viability traits (Table 1), and a high positive correlation between the viability traits at the two temperature conditions was found (Table 1). The discrepancy between the correlations based on the raw phenotypic values and the genomic effects could be due to accounting for random and fixed effects by the mixed model. However, no strong signals were found when testing the individual fixed effects (Table S3). Infection with Wolbachia and the five major inversions are however known to affect trait phenotypes ; thus, these were kept in the model. Also, accounting for the fixed effects will take up additional degrees of freedom. However, the pathway association is not dependent on degrees of freedom; thus, this does not influence the power of the pathway test.
Based on the trait-specific individual genomic effects the trait-specific genomic marker effects were computed. Using the pathway association approach we tried, despite the rather low number of lines being assessed, to investigate whether some pathways had a more extreme signal of association than others. Pathways were divided into three categories: biological processes (BP, 689 GOs), molecular functions (MF, 239 GOs), and cellular components (CC, 161 GOs). With an arbitrary significance threshold of the expected numbers of false-positives were three GOs in BP, one GO in MF, and below one GO in CC. For metabolic rate nine BP, two MF and four CC had a (Table S4), which were more than expected by chance. Three MF and one BP were associated with heat stress resistance, and three of each class were associated with Hsp70 expression (Table S4). Two or fewer GO within each class were associated with the egg-to-adult viability traits (Table S4). With exception of metabolic rate, the number of GOs below the threshold value was near the expected number of false-positives. Because of the apparent limited statistical power, we investigated whether we could identify general patterns of “association” across traits. Using a less conservative threshold level we computed overlaps of GOs with a across traits (Tables 2, 3, 4, and S5). We found statistical evidence for overlapping GOs between metabolic rate and Hsp70 expression (15 BP and four MF in common, Tables 2 and 3) and between egg-to-adult viability investigated at the two temperatures (11 BP, six MF, and four CC in common) (Tables 2, 3, and 4).
Genetic variation is necessary for populations to adapt to variable and at times stressful environments. Using a model system, we found substantial phenotypic variation for the five traits related to fitness and environmental stress resistance (Figure 1). The DGRP lines were originally inbred to an inbreeding coefficient of ~1 ; thus, the phenotypic variation was a consequence of genomic difference among the lines. The DGRP was established from a natural D. melanogaster population ; thus our results illustrate natural genetic variation, and therefore adaptive potential, for metabolic rate, heat stress resistance, Hsp70 expression, and egg-to-adult viability at benign and stress conditions (Table 2). Compared to, for example, morphological traits, fitness components are generally believed to have low heritability due to, for example, directional selection that removes additive genetic variation . However, here we report high heritability estimates for the egg-to-adult viability traits assessed at benign and stressful temperatures. Others have reported substantial lower estimates for lifespan, fecundity, and egg-to-adult viability [37, 38]. These contradictions could result from overestimation caused by limited sample size in our study. However, the estimated heritability for Hsp70 expression was in the range of what has previously been reported , thereby supporting our estimates. With respect to metabolic rate our estimate was higher than other reported vales for Drosophila [40, 41] but was consistent with estimates obtained from bird populations [42, 43]. Lastly, the estimate for resistance to abrupt exposure to high temperatures was similar to other estimates for Drosophila  and different species of fish [45, 46]. Overall these results point to evolutionary adaptive potential for the traits investigated. Recent studies do however suggest that terrestrial ectotherms, endotherms, and plant species have limited potential to change their upper thermal limits [47, 48], which could be a consequence of hard physiological boundaries . Why do our results then point to rather high evolutionary potentials in traits related to coping with high temperatures? For such comparison to be valid, the traits compared need to be similar, and it has been shown that measuring response to heat stress using a static approach (i.e., abrupt exposure to a high temperature) and that using a ramping approach (i.e., a gradual increase in temperature) are not two identical traits . It has also been shown that the more ecological relevant approach (ramping) results in lower heritability estimates than the static approach , which we used in this study. Thus for heat resistance the results obtained in this study may not be ecologically relevant.
We found strong, significant negative genomic correlation between expression of a major heat shock protein and the two egg-to-adult viability traits and high positive genomic correlation between the two viability traits (Table 2). Egg-to-adult viability at 28°C was on average reduced by 21% compared to the viability at 25°C (Figure 1); however, the magnitude was not equal for all lines, indicative of genotype-by-environmental interaction. Studies have shown that resistance to one type of environmental stresses often has fitness costs in terms of reduced longevity or other life history traits [50–52]. Therefore, it could be hypothesized that energy spent on expression of stress response proteins, for example, Hsp70, reduces the resources available for survival. Thus, a similar correlation could be expected between metabolic rate and egg-to-adult viability, but this was not observed. However, we did observe a significant overlap in the top GOs for metabolic rate and Hsp70 expression, which does suggest genomic correlations at pathway level.
Biological interpretation from associated GOs is difficult, especially with limited statistical power to detect pathways. However, in our study, we have shown an efficient method to compute individual markers’ effects and subsequently a summary statistic for a set of markers. Despite limited statistical power, which we admittedly have in this study, some biological relevant pathways were identified. For example, GO:0000149 is a group of genes related to the SNARE complex, which was associated to heat stress resistance (Table S4). Several members of the heat shock protein family are implicated in the sustenance of synaptic function, and during synaptic transmission of vesicular content, for example, neurotransmitters, heat shock proteins bind to the SNARE complex . Another GO associated with heat stress was a group of genes related to oxidoreductase activity (GO:0016614, Table S4). In plants it has been shown that heat shock proteins can protect oxidoreductase complexes . With respect to the GOs associated with metabolic rate several different types were identified ranging within transportation of vesicles (GO:0006891), cell division (GO:0008356), and regulation of several pathways (e.g., GO:0048786, GO:0016791, and GO:0010951). This might illustrate that metabolic rate is a complex trait influenced by many genes and that the phenotypic variation in metabolic rate is a function of variation in other biological functions within individuals.
Partitioning of genetic variation within the associated pathway to genetic variation per gene (corrected for number of genetic markers within gene) showed that most of the GOs had the same overall pattern, namely, relative few genes within each GO contributed to the overall genetic variation (Figure 2 and Table S6). Moreover, a limited number of genes explained more than 20% of the within GO genetic variation (Tables 5 and S6). Only one gene, Frizzled, was in common among traits, between metabolic rate and Hsp70 expression, in this reduced list of genes (Table 5). Frizzled is associated with the Wnt signaling pathway, which promotes cell proliferation, alters key metabolic proteins, and is involved in whole-body energy homeostasis . One of the genes that explained a large proportion of the genetic variation, associated with egg-to-adult viability at stressful conditions, was Mkp3, which is a gene in the MAPK cascade. Heat shock activates heat shock proteins which are regulated by the MAPK cascade . Also, associated with heat stress resistance was snp25, a gene in the SNARE complex. As discussed in the previous paragraph, heat shock proteins bind to the SNARE complex. Lastly, one of the genes capturing a large fraction of the genetic variance within a GO for Hsp70 expression was Irc, which has been predicted to be associated with oxidative stress , and Hsp70 expression is known to be a biomarker for oxidative stress .
Despite the obvious statistical power limitations associated with the number of DGRP lines assessed in this study, we found more GOs than expected by chance, some of which seem to have important biological functions. More importantly, we found substantial genetic variation, thus evolutionary adaptive potential, for all five traits investigated supported by the relative narrow bootstrap CI for four of the traits.
With the advances in sequencing technologies genomic approaches have been suggested as promising tools for conservation genetics. Neutral markers have predominately been applied in population and conservation genetics to describe loss of genetic variation, population structure, and so forth. However, using neutral markers to monitor the effect of environmental changes in a population is limited because the loss of variation will only decrease significantly if the population size is greatly reduced . Therefore, genomic approaches, in which all polymorphic markers are used, may increase the accuracy on estimates of genetic diversity . Further, with genomic approaches, it is possible to assess adaptive genes, which must harbor loci that contribute with a substantial part of the genetic variation to the traits of interest [57, 58]. Therefore, in a conservation perspective, there is a need to identify genes influencing life history and stress resistance traits. Obtaining genotypes and phenotypes of individuals from wild populations is often challenging but achievable. For organisms lacking reference sequences and proper annotations, gene regions may be predicted using traditional bioinformatic approaches. However, to achieve reliable and accurate results, the sample size must be large and therefore such approaches may not be feasible for natural populations. Thus, using genomic knowledge of key traits from model populations, such as the DGRP, to wild populations may be an alternative. Investigating and identifying genes and gene complexes associated with key traits for laboratory organisms may be used as guides for identification of variants with adaptive significance in wild populations.
In the work presented here, we used a subset of the DGRP to investigate traits related to fitness and environmental stress resistance. As the environment changes, populations need to be able to adapt to these changes to survive. Using the DGRP we found substantial genetic variation for metabolic rate, heat stress resistance, expression of Hsp70, and egg-to-adult viability at two environmental conditions. In addition, we found evidence for genotype-by-environmental interaction for viability. Using a genomic pathways association approach, we attempted to locate pathways displaying association with the traits investigated. This approach can be extended to nonmodel organisms or provided as a genomic tool for identification of adaptive genes in model organisms and thus provide a potential use for conservation genomics.
The authors declare that there is no conflict of interests regarding the publication of this paper.
The authors thank Doth Andersen for technical assistance. This work was funded by the Carlsberg Foundation, the Danish Natural Research Council (Sapere Aude grants to Johannes Overgaard and Torsten Nygaard Kristensen), and the Center for Genomic Selection in Animals and Plants (GenSAP) funded by The Danish Council for Strategic Research.
The supplementary material contain information of which DGRP lines assessed, additional experimental design informations (Table S1), and the phenotypic line means for the traits assayed (Table S2). The supplementary information also contains the likelihood ratio test for the fixed effects (Table S3), overview of GOs with a p<0.005 (Table S4) and GOs with a p<0.05 that are common across traits (Table S5). Lastly, the exact figures for Figure 2 are given in Table S6, that is, the associated GOs and the proportion of variance explained per gene adjusted for the number of SNPs within the gene.
J. M. Kwon and A. M. Goate, “The candidate gene approach,” Alcohol Research and Health, vol. 24, no. 3, pp. 164–168, 2000.View at: Google Scholar
K. Dalman, K. Himmelstrand, Å. Olson, M. Lind, M. Brandström-Durling, and J. Stenlid, “A genome-wide association study identifies genomic regions for virulence in the non-model organism Heterobasidion annosum s.s,” PLoS ONE, vol. 8, no. 1, Article ID e53525, 2013.View at: Publisher Site | Google Scholar
H. L. Allen, K. Estrada, G. Lettre et al., “Hundreds of variants clustered in genomic loci and biological pathways affect human height,” Nature, vol. 467, pp. 832–838, 2010.View at: Google Scholar
A. A. Hoffmann and P. A. Parsons, Evolutionary Genetics and Environmental Stress, Oxford University Press, Oxford, UK, 1991.
P. Jensen, J. Overgaard, V. Loeschcke, M. F. Schou, H. Malte, and T. N. Kristensen, “Inbreeding effects on standard metabolic rate investigated at cold, benign and hot temperatures in Drosophila melanogaster,” Journal of Insect Physiology, vol. 62, no. 1, pp. 11–20, 2014.View at: Publisher Site | Google Scholar
T. N. Kristensen, J. Dahlgaard, and V. Loeschcke, “Inbreeding affects Hsp70 expression in two species of Drosophila even at benign temperatures,” Evolutionary Ecology Research, vol. 4, no. 8, pp. 1209–1216, 2002.View at: Google Scholar
J. G. Sørensen, P. Michalak, J. Justesen, and V. Loeschcke, “Expression of the heat-shock protein HSP70 in Drosophila buzzatii lines selected for thermal resistance,” Hereditas, vol. 131, no. 2, pp. 155–164, 1999.View at: Google Scholar
D. Clifford and P. McCullagh, The Regress Package R Package Version 1.3-14., 2014.
R Core Team, R: A Language and Environment for Statistical Computing, Version 3.2.0, R Foundation for Statistical Computing, Vienna, Austria, 2015, http://www.R-project.org/.
M. Carlson, GO.db: A Set of Annotation Maps Describing the Entire Gene Ontology, R Package Version 3.2.2, 2015.
M. Carlson, org.Dm.eg.db: Genome Wide Annotation for Fly, R Package Version 3.2.3, 2015.
R. Frankham, J. D. Ballou, and D. A. Briscoe, Introduction to Conservation Genetics, Cambridge University Press, New York, NY, USA, 4th edition, 2004.
A. A. Khazaeli, W. Van Voorhies, and J. W. Curtsinger, “Longevity and metabolism in Drosophila melanogaster: genetic correlations between life span and age-specific metabolic rate in populations artificially selected for long life,” Genetics, vol. 169, no. 1, pp. 231–242, 2005.View at: Publisher Site | Google Scholar