Table of Contents Author Guidelines Submit a Manuscript
Erratum

An erratum for this article has been published. To view the erratum, please click here.

BioMed Research International
Volume 2014, Article ID 736473, 16 pages
http://dx.doi.org/10.1155/2014/736473
Research Article

A De Novo Genome Assembly Algorithm for Repeats and Nonrepeats

1School of Information Science and Technology, Sun Yat-Sen University, Guangzhou High Education Mega City, No. 132 Waihuan East Road, Panyu District, GuangZhou 510006, China
2SYSU-CMU Shunde International Joint Research Institute, Shunde 528300, China

Received 19 December 2013; Revised 1 April 2014; Accepted 8 April 2014; Published 25 May 2014

Academic Editor: Li-Ching Wu

Copyright © 2014 Shuaibin Lian et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. D. R. Bentley, “Whole-genome re-sequencing,” Current Opinion in Genetics and Development, vol. 16, no. 6, pp. 545–552, 2006. View at Publisher · View at Google Scholar · View at Scopus
  2. T. D. Harris, P. R. Buzby, H. Babcock et al., “Single-molecule DNA sequencing of a viral genome,” Science, vol. 320, no. 5872, pp. 106–109, 2008. View at Publisher · View at Google Scholar · View at Scopus
  3. M. L. Metzker, “Sequencing technologies the next generation,” Nature Reviews Genetics, vol. 11, no. 1, pp. 31–46, 2010. View at Publisher · View at Google Scholar · View at Scopus
  4. E. R. Mardis, “The impact of next-generation sequencing technology on genetics,” Trends in Genetics, vol. 24, no. 3, pp. 133–141, 2008. View at Publisher · View at Google Scholar · View at Scopus
  5. O. Morozova and M. A. Marra, “Applications of next-generation sequencing technologies in functional genomics,” Genomics, vol. 92, no. 5, pp. 255–264, 2008. View at Publisher · View at Google Scholar · View at Scopus
  6. R. L. Strausberg, S. Levy, and Y.-H. Rogers, “Emerging DNA sequencing technologies for human genomic medicine,” Drug Discovery Today, vol. 13, no. 13-14, pp. 569–577, 2008. View at Publisher · View at Google Scholar · View at Scopus
  7. E. Pettersson, J. Lundeberg, and A. Ahmadian, “Generations of sequencing technologies,” Genomics, vol. 93, no. 2, pp. 105–111, 2009. View at Publisher · View at Google Scholar · View at Scopus
  8. The 1000 Genomes Project Consurtium, “A map of human genome variation from population-scale sequencing,” Nature, vol. 467, pp. 1061–1073, 2010. View at Publisher · View at Google Scholar
  9. Genome 10K Community of Scientists, “Genome 10K: a proposal to obtain whole-genome sequence for 10, 000 vertebrate species,” Journal of Heredity, vol. 100, pp. 659–674, 2009. View at Google Scholar
  10. T. J. Treangen and S. L. Salzberg, “Repetitive DNA and next-generation sequencing: computational challenges and solutions,” Nature Reviews Genetics, vol. 13, no. 1, pp. 36–46, 2012. View at Publisher · View at Google Scholar · View at Scopus
  11. J. T. Simpson, K. Wong, S. D. Jackman, J. E. Schein, S. J. M. Jones, and I. Birol, “ABySS: a parallel assembler for short read sequence data,” Genome Research, vol. 19, no. 6, pp. 1117–1123, 2009. View at Publisher · View at Google Scholar · View at Scopus
  12. S. Gnerre, I. MacCallum, D. Przybylski et al., “High-quality draft assemblies of mammalian genomes from massively parallel sequence data,” Proceedings of the National Academy of Sciences of the United States of America, vol. 108, no. 4, pp. 1513–1518, 2011. View at Publisher · View at Google Scholar · View at Scopus
  13. S. Koren, T. J. Treangen, and M. Pop, “Bambus 2: scaffolding metagenomes,” Bioinformatics, vol. 27, no. 21, Article ID btr520, pp. 2964–2971, 2011. View at Publisher · View at Google Scholar · View at Scopus
  14. J. R. Miller, A. L. Delcher, S. Koren et al., “Aggressive assembly of pyrosequencing reads with mates,” Bioinformatics, vol. 24, no. 24, pp. 2818–2824, 2008. View at Publisher · View at Google Scholar · View at Scopus
  15. J. T. Simpson and R. Durbin, “Efficient de novo assembly of large genomes using compressed data structures,” Genome Research, vol. 22, no. 3, pp. 549–556, 2012. View at Publisher · View at Google Scholar · View at Scopus
  16. R. Li, H. Zhu, J. Ruan et al., “De novo assembly of human genomes with massively parallel short read sequencing,” Genome Research, vol. 20, no. 2, pp. 265–272, 2010. View at Publisher · View at Google Scholar · View at Scopus
  17. D. R. Zerbino and E. Birney, “Velvet: algorithms for de novo short read assembly using de Bruijn graphs,” Genome Research, vol. 18, no. 5, pp. 821–829, 2008. View at Publisher · View at Google Scholar · View at Scopus
  18. L. D. Stein, Z. Bao, D. Blasiar et al., “The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics,” PLoS Biology, vol. 1, no. 2, article E45, 2003. View at Publisher · View at Google Scholar · View at Scopus
  19. International Human Genome Consortium, “Initial sequencing and analysis of the human genome,” Nature, vol. 409, pp. 860–921, 2001. View at Publisher · View at Google Scholar
  20. A. J. Iafrate, L. Feuk, M. N. Rivera et al., “Detection of large-scale variation in the human genome,” Nature Genetics, vol. 36, no. 9, pp. 949–951, 2004. View at Publisher · View at Google Scholar · View at Scopus
  21. L. Feuk, A. R. Carson, and S. W. Scherer, “Structural variation in the human genome,” Nature Reviews Genetics, vol. 7, no. 2, pp. 85–97, 2006. View at Publisher · View at Google Scholar · View at Scopus
  22. D. G. Albertson and D. Pinkel, “Genomic microarrays in human genetic disease and cancer,” Human Molecular Genetics, vol. 12, no. 2, pp. R145–R152, 2003. View at Google Scholar · View at Scopus
  23. C. R. Marshall, A. Noor, J. B. Vincent et al., “Structural variation of chromosomes in autism spectrum disorder,” The American Journal of Human Genetics, vol. 82, no. 2, pp. 477–488, 2008. View at Publisher · View at Google Scholar · View at Scopus
  24. J. Sebat, B. Lakshmi, D. Malhotra et al., “Strong association of de novo copy number mutations with autism,” Science, vol. 316, no. 5823, pp. 445–449, 2007. View at Publisher · View at Google Scholar · View at Scopus
  25. T. F. Smith and M. S. Waterman, “Identification of common molecular subsequences,” Journal of Molecular Biology, vol. 147, no. 1, pp. 195–197, 1981. View at Google Scholar · View at Scopus
  26. A. L. Price, N. C. Jones, and P. A. Pevzner, “De novo identification of repeat families in large genomes,” Bioinformatics, vol. 21, no. 1, pp. i351–i358, 2005. View at Publisher · View at Google Scholar · View at Scopus
  27. S. L. Salzberg, A. M. Phillippy, A. Zimin et al., “GAGE: a critical evaluation of genome assemblies and assembly algorithms,” Genome Research, vol. 22, no. 3, pp. 557–567, 2012. View at Publisher · View at Google Scholar · View at Scopus
  28. R. Pinard, A. de Winter, G. J. Sarkis et al., “Assessment of whole genome amplification-induced bias through high-throughput, massively parallel whole genome sequencing,” BMC Genomics, vol. 7, article 216, 2006. View at Publisher · View at Google Scholar · View at Scopus
  29. J. M. Lage, J. H. Leamon, T. Pejovic et al., “Whole genome analysis of genetic alterations in small DNA samples using hyperbranched strand displacement amplification and array-CGH,” Genome Research, vol. 13, no. 2, pp. 294–307, 2003. View at Publisher · View at Google Scholar · View at Scopus
  30. L. Zhang, X. Cui, K. Schmitt, R. Hubert, W. Navidi, and N. Arnheim, “Whole genome amplification from a single cell: implications for genetic analysis,” Proceedings of the National Academy of Sciences of the United States of America, vol. 89, no. 13, pp. 5847–5851, 1992. View at Publisher · View at Google Scholar · View at Scopus
  31. H. Telenius, N. P. Carter, C. E. Bebb, M. Nordenskjold, B. A. J. Ponder, and A. Tunnacliffe, “Degenerate oligonucleotide-primed PCR: general amplification of target DNA by a single degenerate primer,” Genomics, vol. 13, no. 3, pp. 718–725, 1992. View at Publisher · View at Google Scholar · View at Scopus