ISRN Bioinformatics
Volume 2012 (2012), Article ID 816402, 9 pages
http://dx.doi.org/10.5402/2012/816402
Research Article

Enhancing De Novo Transcriptome Assembly by Incorporating Multiple Overlap Sizes

1 Department of Computer Science and Information Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei 10617, Taiwan
2 Institute of Plant and Microbial Biology, Academia Sinica, 128 Academia Road, Section 2, Nankang, Taipei 115, Taiwan
3 Institute of Information Science, Academia Sinica, 128 Academia Road, Section 2, Nankang, Taipei 115, Taiwan

Received 20 December 2011; Accepted 9 February 2012

Academic Editors: F. Couto, Q. Dong, H. Hegyi, D. Labudde, C. Ortutay, F. Plewniak, and K. Yura

Copyright © 2012 Chien-Chih Chen et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. M. L. Metzker, “Emerging technologies in DNA sequencing,” Genome Research, vol. 15, no. 12, pp. 1767–1776, 2005.
  2. E. R. Mardis, “The impact of next-generation sequencing technology on genetics,” Trends in Genetics, vol. 24, no. 3, pp. 133–141, 2008.
  3. R. L. Warren, G. G. Sutton, S. J. M. Jones, and R. A. Holt, “Assembling millions of short DNA sequences using SSAKE,” Bioinformatics, vol. 23, no. 4, pp. 500–501, 2007.
  4. W. R. Jeck, J. A. Reinhardt, D. A. Baltrus et al., “Extending assembly of short DNA sequences to handle error,” Bioinformatics, vol. 23, no. 21, pp. 2942–2944, 2007.
  5. J. C. Dohm, C. Lottaz, T. Borodina, and H. Himmelbauer, “SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing,” Genome Research, vol. 17, no. 11, pp. 1697–1706, 2007.
  6. D. Hernandez, P. François, L. Farinelli, M. Østerås, and J. Schrenzel, “De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer,” Genome Research, vol. 18, no. 5, pp. 802–809, 2008.
  7. D. R. Zerbino and E. Birney, “Velvet: algorithms for de novo short read assembly using de Bruijn graphs,” Genome Research, vol. 18, no. 5, pp. 821–829, 2008.
  8. M. J. Chaisson and P. A. Pevzner, “Short read fragment assembly of bacterial genomes,” Genome Research, vol. 18, no. 2, pp. 324–330, 2008.
  9. J. Butler, I. MacCallum, M. Kleber et al., “ALLPATHS: de novo assembly of whole-genome shotgun microreads,” Genome Research, vol. 18, no. 5, pp. 810–820, 2008.
  10. J. T. Simpson, K. Wong, S. D. Jackman, J. E. Schein, S. J. M. Jones, and I. Birol, “ABySS: a parallel assembler for short read sequence data,” Genome Research, vol. 19, no. 6, pp. 1117–1123, 2009.
  11. B. G. Jackson, P. S. Schnable, and S. Aluru, “Parallel short sequence assembly of transcriptomes,” BMC Bioinformatics, vol. 10, no. 1, article S14, 2009.
  12. I. Birol, S. D. Jackman, C. B. Nielsen et al., “De novo transcriptome assembly with ABySS,” Bioinformatics, vol. 25, no. 21, pp. 2872–2877, 2009.
  13. L. J. Collins, P. J. Biggs, C. Voelckel, and S. Joly, “An approach to transcriptome analysis of non-model organisms using short-read sequences,” Genome Informatics, vol. 21, pp. 3–14, 2008.
  14. C. Furusawa and K. Kaneko, “Zipf's law in gene expression,” Physical Review Letters, vol. 90, no. 8, Article ID 088102, 4 pages, 2003.
  15. Y. Surget-Groba and J. I. Montoya-Burgos, “Optimization of de novo transcriptome assembly from next-generation sequencing data,” Genome Research, vol. 20, no. 10, pp. 1432–1440, 2010.
  16. E. S. Lander and M. S. Waterman, “Genomic mapping by fingerprinting random clones: a mathematical analysis,” Genomics, vol. 2, no. 3, pp. 231–239, 1988.
  17. Y. Erlich, P. P. Mitra, M. delaBastide, W. R. McCombie, and G. J. Hannon, “Alta-Cyclic: a self-optimizing base caller for next-generation sequencing,” Nature Methods, vol. 5, no. 8, pp. 679–682, 2008.
  18. D. D. Sommer, A. L. Delcher, S. L. Salzberg, and M. Pop, “Minimus: a fast, lightweight genome assembler,” BMC Bioinformatics, vol. 8, article 64, 2007.
  19. M. Pop, A. Phillippy, A. L. Delcher, and S. L. Salzberg, “Comparative genome assembly,” Briefings in Bioinformatics, vol. 5, no. 3, pp. 237–248, 2004.
  20. C. Trapnell, B. A. Williams, G. Pertea et al., “Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation,” Nature Biotechnology, vol. 28, no. 5, pp. 511–515, 2010.
  21. K. D. Pruitt, T. Tatusova, and D. R. Maglott, “NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins,” Nucleic Acids Research, vol. 35, no. 1, pp. D61–D65, 2007.
  22. R. D. Morin, M. Bainbridge, A. Fejes et al., “Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing,” BioTechniques, vol. 45, no. 1, pp. 81–94, 2008.
  23. Z. Zhang, S. Schwartz, L. Wagner, and W. Miller, “A greedy algorithm for aligning DNA sequences,” Journal of Computational Biology, vol. 7, no. 1-2, pp. 203–214, 2000.