Table of Contents
ISRN Bioinformatics
Volume 2013 (2013), Article ID 252183, 13 pages
http://dx.doi.org/10.1155/2013/252183
Review Article

Modern Computational Techniques for the HMMER Sequence Analysis

1Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
2Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598, USA
3Department of Electrical and Computer Engineering, Gonzaga University, Spokane, WA 99258, USA

Received 17 June 2013; Accepted 30 July 2013

Academic Editors: A. Pulvirenti and K. Yura

Copyright © 2013 Xiandong Meng and Yanqing Ji. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. NCBI Genebank, 2012, http://www.ncbi.nlm.nih.gov/genbank/.
  2. Illumina Inc, 2012, http://www.illumina.com/.
  3. T. F. Smith and M. S. Waterman, “Identification of common molecular subsequences,” Journal of Molecular Biology, vol. 147, no. 1, pp. 195–197, 1981. View at Google Scholar · View at Scopus
  4. S. B. Needleman and C. D. Wunsch, “A general method applicable to the search for similarities in the amino acid sequence of two proteins,” Journal of Molecular Biology, vol. 48, no. 3, pp. 443–453, 1970. View at Google Scholar · View at Scopus
  5. A. Krogh, M. Brown, I. S. Mian, K. Sjolander, and D. Haussler, “Hidden Markov Models in computational biology applications to protein modeling,” Journal of Molecular Biology, vol. 235, no. 5, pp. 1501–1531, 1994. View at Publisher · View at Google Scholar · View at Scopus
  6. S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, “Basic local alignment search tool,” Journal of Molecular Biology, vol. 215, no. 3, pp. 403–410, 1990. View at Publisher · View at Google Scholar · View at Scopus
  7. W. R. Pearson, “Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms,” Genomics, vol. 11, no. 3, pp. 635–650, 1991. View at Google Scholar · View at Scopus
  8. D.-F. Feng and R. F. Doolittle, “Progressive sequence alignment as a prerequisitetto correct phylogenetic trees,” Journal of Molecular Evolution, vol. 25, no. 4, pp. 351–360, 1987. View at Publisher · View at Google Scholar · View at Scopus
  9. J. P. Walters, B. Qudah, and V. Chaudhary, “Accelerating the HMMER sequence analysis suite using conventional processors,” in Proceedings of the 20th International Conference on Advanced Information Networking and Applications, pp. 289–294, April 2006. View at Publisher · View at Google Scholar · View at Scopus
  10. U. Srinivasan, P.-S. Chen, Q. Diao et al., “Characterization and analysis of HMMER and SVM-RFE parallel bioinformatics applications,” in Proceedings of the IEEE International Symposium on Workload Characterization (IISWC '05), pp. 87–98, October 2005. View at Publisher · View at Google Scholar · View at Scopus
  11. W. Zhu, Y. Niu, J. Lu, and G. R. Gao, “Implementing parallel hmm-pfam on the EARTH multithreaded architecture,” in Proceedings of the IEEE Bioinformatics Conference (CSB '03), pp. 549–550, 2003.
  12. J. P. Walters, R. Darole, and V. Chaudhary, “Improving MPI-HMMER's scalabilitywith parallel I/O,” in Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS '09), pp. 1–11, May 2009. View at Publisher · View at Google Scholar · View at Scopus
  13. J. P. Walters, X. Meng, V. Chaudhary et al., “MPI-HMMER-boost: distributed FPGA acceleration,” Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, vol. 48, no. 3, pp. 223–238, 2007. View at Publisher · View at Google Scholar · View at Scopus
  14. B. Wun, J. Buhler, and P. Crowley, “Exploiting coarse-grained parallelism to accelerate protein motif finding with a network processor,” in Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT '05), pp. 173–184, September 2005. View at Publisher · View at Google Scholar · View at Scopus
  15. J. P. Walters, V. Balu, S. Kompalli, and V. Chaudhary, “Evaluating the use of GPUs in liver image segmentation and HMMER database searches,” in Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS '09), pp. 1–2. View at Publisher · View at Google Scholar · View at Scopus
  16. N. Ganesan, R. D. Chamberlain, J. Buhler, and M. Taufer, “Accelerating HMMER on GPUs by implementing hybrid data and task parallelism,” in Proceedings of the ACM International Conference on Bioinformatics and Computational Biology (ACM-BCB '10), pp. 418–421, August 2010. View at Publisher · View at Google Scholar · View at Scopus
  17. V. Sachdeva, M. Kistler, E. Speight, and T.-H. K. Tzeng, “Exploring the viability of the Cell Broadband Engine for bioinformatics applications,” Parallel Computing, vol. 34, no. 11, pp. 616–626, 2008. View at Publisher · View at Google Scholar · View at Scopus
  18. T. Oliver, L. Y. Yeow, and B. Schmidt, “Integrating FPGA acceleration into HMMer,” Parallel Computing, vol. 34, no. 11, pp. 681–691, 2008. View at Publisher · View at Google Scholar · View at Scopus
  19. Y. Sun, P. Li, G. Gu, Y. Wen, Y. Liu, and D. Liu, “Accelerating HMMer on FPGAs using systolic array based architecture,” in Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS '09), pp. 1–8, May 2009. View at Publisher · View at Google Scholar · View at Scopus
  20. S. R. Eddy, “Hidden Markov models,” Current Opinion in Structural Biology, vol. 6, no. 3, pp. 361–365, 1996. View at Publisher · View at Google Scholar · View at Scopus
  21. S. R. Eddy, “Profile hidden Markov models,” Bioinformatics, vol. 14, no. 9, pp. 755–763, 1998. View at Google Scholar · View at Scopus
  22. R. Durbin, S. Eddy, A. Krogh, and A. Mitchson, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids, Cambridge University Press, 1998.
  23. A. Grama, G. Karypis, V. Kumar, and A. Gupta, Introduction to Parallel Computing, Addison Wesley, 2nd edition, 2003.
  24. D. Butenhof, Programming With POSIX Threads, Addison Wesley Professional, 1997.
  25. R. Chandra, R. Menon, L. Dagum, D. Kohr, D. Maydan, and J. McDonald, Parallel Programming in OpenMP, Morgan Kaufmann, 2000.
  26. M. P. I. Forum, “MPI: Message-Passing Interface Standard,” 1995, http://www.mpi-forum.org/.
  27. T. Takagi and T. Maruyama, “Accelerating hmmer search using FPGA,” in Proceedings of the 19th International Conference on Field Programmable Logic and Applications (FPL '09), pp. 332–337, September 2009. View at Publisher · View at Google Scholar · View at Scopus
  28. T. Oliver, L. Y. Yeow, and B. Schmidt, “High performance database searching with HMMer on FPGAs,” in Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS '07), pp. 1–7, March 2007. View at Publisher · View at Google Scholar · View at Scopus
  29. S. A. Manavski and G. Valle, “CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment,” BMC Bioinformatics, vol. 9, supplement 2, article S10, 2008. View at Publisher · View at Google Scholar · View at Scopus
  30. NVIDIA CUDA ZONE, 2012, http://www.nvidia.com/object/cuda_home.html.
  31. P. Cohen, “OpenCL: what you need to know,” Macworld, 2008, http://www.macworld.com/article/134858/2008/08/snowleopard_opencl.html.
  32. G. Amdahl, “The validity of the single processor approach to achieving large scale computing capabilities,” in Proceedings of the AFIPS Computing Conference, vol. 30, pp. 483–485, 1967.
  33. “IA-32 Intel Architecture Software Developer’s Manual Volume 1: Basic Architecture,” 2004.
  34. Pfam, “The PFAM HMM library: a large collection of multiple sequence alignments and hidden markov models covering many common protein families,” 2013, http://pfam.sanger.ac.uk/.
  35. NCBI, “The NR (non-redundant) database,” 2006, ftp://ftp.ncbi.nih.gov/blast/db/FASTA/nr.gz.
  36. K. B. Theobald, EARTH: An Efficient Architecture for Running Threads, School of Computer Science, McGill University, Montreal, Canada, 1999.
  37. A. Geist, A. Beguelin, J. Dongarra, W. Jiang, R. Mancheck, and Y. Sunderam, PVM: Parallel Virtual Machine. A USer’s Guide and Tutorial for Networked Parallel Computing, Edited by J. Kowalik, MIT Press, 1994.
  38. J. J. Dongarra, H. W. Meuer, and E. Strohmaier, “TOP500 supercomputer sites,” Supercomputer, vol. 12, no. 1, pp. 91–120, 1996. View at Google Scholar · View at Scopus
  39. Folding@home, 2012, http://folding.stanford.edu/English/HomePage.
  40. HMMER, 2012, http://hmmer.janelia.org/.
  41. K. Jiang, O. Thorsen, A. Peters, B. Smith, and C. P. Sosa, “An efficient parallel implementation of the hidden Markov methods for genomic sequence search on a massively parallel system,” IEEE Transactions on Parallel and Distributed Systems, vol. 19, no. 1, pp. 15–23, 2008. View at Publisher · View at Google Scholar · View at Scopus
  42. O. Lascu, N. Allsopp, P. Vezolle et al., “Unfolding the IBM Server Blue Gene Solution,” IBM Corp., Int’l Technical Support Organization, 2005, http://www.redbooks.ibm.com/abstracts/sg246686.html?Open.
  43. The Portland Group, “Accelerator Compilers with OpenACC Directives,” 2012, http://www.pgroup.com/resources/accel.htm.
  44. VHDL, 2012, http://www.vhdl.org/.
  45. Verilog, 2012, http://www.verilog.com/.
  46. Xilinx, 2012, http://www.xilinx.com/.
  47. Altera, 2012, http://www.altera.com/.