BioMed Research International
Volume 2014 (2014), Article ID 319534, 7 pages
http://dx.doi.org/10.1155/2014/319534
Improved Variant Calling Accuracy by Merging Replicates in Whole-Exome Sequencing Studies
1Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN 37203, USA
2Department of Molecular Physiology and Biophysics, Center for Human Genetics Research, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
3Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
Received 28 May 2014; Revised 15 July 2014; Accepted 15 July 2014; Published 4 August 2014
Academic Editor: Ernesto Picardi
Copyright © 2014 Yanfeng Zhang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Linked References
- J. Shendure and H. Ji, “Next-generation DNA sequencing,” Nature Biotechnology, vol. 26, no. 10, pp. 1135–1145, 2008. View at Publisher · View at Google Scholar · View at Scopus
- J. S. Parla, I. Iossifov, I. Grabill, M. S. Spector, M. Kramer, and W. R. McCombie, “A comparative analysis of exome capture,” Genome biology, vol. 12, article R97, 2011. View at Publisher · View at Google Scholar · View at Scopus
- M. N. Bainbridge, M. Wang, D. L. Burgess et al., “Whole exome capture in solution with 3 Gbp of data,” Genome Biology, vol. 11, no. 6, article R62, 2010. View at Publisher · View at Google Scholar · View at Scopus
- J. K. Teer and J. C. Mullikin, “Exome sequencing: the sweet spot before whole genomes,” Human Molecular Genetics, vol. 19, no. 2, pp. R145–R151, 2010. View at Publisher · View at Google Scholar · View at Scopus
- W. Fu, T. D. O'Connor, G. Jun et al., “Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants,” Nature, vol. 493, no. 7431, pp. 216–220, 2013. View at Publisher · View at Google Scholar · View at Scopus
- S. B. Ng, E. H. Turner, P. D. Robertson et al., “Targeted capture and massively parallel sequencing of 12 human exomes,” Nature, vol. 461, no. 7261, pp. 272–276, 2009. View at Publisher · View at Google Scholar · View at Scopus
- X. Yi, Y. Liang, E. Huerta-Sanchez et al., “Sequencing of 50 human exomes reveals adaptation to high altitude,” Science, vol. 329, no. 5987, pp. 75–78, 2010. View at Publisher · View at Google Scholar · View at Scopus
- J. A. Tennessen, A. W. Bigham, T. D. O'Connor et al., “Evolution and functional impact of rare coding variation from deep sequencing of human exomes,” Science, vol. 337, no. 6090, pp. 64–69, 2012. View at Publisher · View at Google Scholar
- T. W. Yu, M. H. Chahrour, M. E. Coulter et al., “Using whole-exome sequencing to identify inherited causes of autism,” Neuron, vol. 77, no. 2, pp. 259–273, 2013. View at Publisher · View at Google Scholar · View at Scopus
- E. R. Thompson, M. A. Doyle, G. L. Ryland et al., “Exome sequencing identifies rare deleterious mutations in DNA repair genes FANCC and BLM as potential breast cancer susceptibility alleles,” PLoS Genetics, vol. 8, no. 9, Article ID e1002894, 2012. View at Publisher · View at Google Scholar · View at Scopus
- S. J. Sanders, M. T. Murtha, A. R. Gupta et al., “De novo mutations revealed by whole-exome sequencing are strongly associated with autism,” Nature, vol. 484, no. 7397, pp. 237–241, 2012. View at Publisher · View at Google Scholar · View at Scopus
- M. J. Bamshad, S. B. Ng, A. W. Bigham et al., “Exome sequencing as a tool for Mendelian disease gene discovery,” Nature Reviews Genetics, vol. 12, no. 11, pp. 745–755, 2011. View at Publisher · View at Google Scholar · View at Scopus
- A. Kiezun, K. Garimella, R. Do et al., “Exome sequencing and the genetic basis of complex traits,” Nature Genetics, vol. 44, no. 6, pp. 623–630, 2012. View at Publisher · View at Google Scholar · View at Scopus
- E. J. Vallender, “Expanding whole exome resequencing into non-human primates,” Genome Biology, vol. 12, article R87, 2011. View at Publisher · View at Google Scholar · View at Scopus
- R. D. George, G. McVicker, R. Diederich et al., “Trans genomic capture and sequencing of primate exomes reveals new targets of positive selection,” Genome Research, vol. 21, no. 10, pp. 1686–1694, 2011. View at Publisher · View at Google Scholar · View at Scopus
- J. E. McCormack, M. G. Harvey, B. C. Faircloth, N. G. Crawford, T. C. Glenn, and R. T. Brumfield, “A phylogeny of birds based on over 1,500 loci collected by target enrichment and high-throughput sequencing,” PLoS ONE, vol. 8, no. 1, Article ID e54848, 2013. View at Publisher · View at Google Scholar · View at Scopus
- M. J. Clark, R. Chen, H. Y. K. Lam et al., “Performance comparison of exome DNA sequencing technologies,” Nature Biotechnology, vol. 29, no. 10, pp. 908–916, 2011. View at Publisher · View at Google Scholar · View at Scopus
- Y. Guo, J. Long, J. He et al., “Exome sequencing generates high quality data in non-target regions,” BMC Genomics, vol. 13, article 194, 2012. View at Publisher · View at Google Scholar · View at Scopus
- Y. T. Gao, X. O. Shu, Q. Dai et al., “Association of menstrual and reproductive factors with breast cancer risk: results from the Shanghai breast cancer study,” International Journal of Cancer, vol. 87, no. 2, pp. 295–300, 2000. View at Google Scholar
- W. Zheng, J. Long, Y. Gao et al., “Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1,” Nature Genetics, vol. 41, no. 3, pp. 324–328, 2009. View at Publisher · View at Google Scholar · View at Scopus
- H. Li and R. Durbin, “Fast and accurate short read alignment with Burrows-Wheeler transform,” Bioinformatics, vol. 25, no. 14, pp. 1754–1760, 2009. View at Publisher · View at Google Scholar · View at Scopus
- H. Li, B. Handsaker, A. Wysoker et al., “The sequence alignment/map format and SAMtools,” Bioinformatics, vol. 25, no. 16, pp. 2078–2079, 2009. View at Publisher · View at Google Scholar · View at Scopus
- M. A. DePristo, E. Banks, R. Poplin et al., “A framework for variation discovery and genotyping using next-generation DNA sequencing data,” Nature Genetics, vol. 43, no. 5, pp. 491–498, 2011. View at Google Scholar
- A. McKenna, M. Hanna, E. Banks et al., “The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data,” Genome Research, vol. 20, no. 9, pp. 1297–1303, 2010. View at Publisher · View at Google Scholar · View at Scopus
- P. Frommolt, A. T. Abdallah, J. Altmüller et al., “Assessing the enrichment performance in targeted resequencing experiments,” Human Mutation, vol. 33, no. 4, pp. 635–641, 2012. View at Publisher · View at Google Scholar · View at Scopus
- S. C. Schuster, W. Miller, A. Ratan et al., “Complete Khoisan and Bantu genomes from southern Africa,” Nature, vol. 463, no. 7283, pp. 943–947, 2010. View at Publisher · View at Google Scholar · View at Scopus
- K. J. McKernan, H. E. Peckham, G. L. Costa et al., “Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding,” Genome Research, vol. 19, no. 9, pp. 1527–1541, 2009. View at Publisher · View at Google Scholar · View at Scopus
- The 1000 Genomes Project Consortium, “An integrated map of genetic variation from 1,092 human genomes,” Nature, vol. 491, no. 7422, pp. 56–65, 2012. View at Google Scholar
- X. Liu, S. Han, Z. Wang et al., “Variant callers for next-generation sequencing data: a comparison study,” PLoS ONE, vol. 8, no. 9, Article ID e75619, 2013. View at Google Scholar
- A. Sirmaci, Y. J. K. Edwards, H. Akay, and M. Tekin, “Challenges in whole exome sequencing: an example from hereditary deafness,” PLoS ONE, vol. 7, no. 2, Article ID e32000, 2012. View at Publisher · View at Google Scholar · View at Scopus
- M. N. Bainbridge, M. Wang, Y. Wu et al., “Targeted enrichment beyond the consensus coding DNA sequence exome reveals exons with higher variant densities,” Genome Biology, vol. 12, no. 7, article R68, 2011. View at Publisher · View at Google Scholar · View at Scopus
- J. Freudenberg, P. K. Gregersen, and Y. Freudenberg-Hua, “A simple method for analyzing exome sequencing data shows distinct levels of nonsynonymous variation for human immune and nervous system genes,” PLoS ONE, vol. 7, no. 6, Article ID e38087, 2012. View at Publisher · View at Google Scholar · View at Scopus
- A. Roberts, C. Trapnell, J. Donaghey, J. L. Rinn, and L. Pachter, “Improving RNA-Seq expression estimates by correcting for fragment bias,” Genome Biology, vol. 12, no. 3, article R22, 2011. View at Publisher · View at Google Scholar · View at Scopus
- A. Mortazavi, B. A. Williams, K. McCue, L. Schaeffer, and B. Wold, “Mapping and quantifying mammalian transcriptomes by RNA-Seq,” Nature Methods, vol. 5, no. 7, pp. 621–628, 2008. View at Publisher · View at Google Scholar · View at Scopus
- J. T. Leek, R. B. Scharpf, H. C. Bravo et al., “Tackling the widespread and critical impact of batch effects in high-throughput data,” Nature Reviews Genetics, vol. 11, no. 10, pp. 733–739, 2010. View at Publisher · View at Google Scholar · View at Scopus