Review Article

Modern Computational Techniques for the HMMER Sequence Analysis

Table 1

Performance comparison among different software approaches.

Acceleration strategiesSupporting software/packageAccelerated programsDatasetsHardware environmentAchieved speedup

Instruction-level parallelismSSE2 Instructionshmmpfam, hmmsearch Pfam and nr databases2.66 GHz Intel Xeon processor with 2.5 GB of memory1.2x~1.3x [9]

Shared memory parallelismOpenMPhmmpfam 600 HMM profiles and 250 sequences 16 x86 3.0 GHz processors, 32 MB L4 cache shared among 4 CPUs, 4 MB L3 cache, 8 GB of memory14x [10]
EARTHhmmpfam 50 HMM profiles and 38192 sequencesA cluster that consists of 128 nodes, each with two 500 MHz Pentium III processors222.8x [11]

Distributed memory parallelismPVMhmmsearch 1 HMM profile and 100 MB of nr databaseA cluster with 4 nodes, each node consists of two 2.66 GHz Intel Xeon processors with 2.5 GB memory per node4.56x [9]
MPIhmmsearch 5.90x [9]
MPI + I/O optimizationshmmsearch One 236-state HMM profile and nr database A cluster that consists of 1056 nodes, each equipped with two 3.2 GHz Intel Xeon processors, 2 GB RAM221x [12]
hmmpfam 1.6 GB of Pfam database and nr databases328x [12]

Heterogeneous approachMPI  +  SSE2hmmsearch 1 HMM profile and 100 MB of nr databaseA cluster with 4 nodes, each node consists of two 2.66 GHz Intel Xeon processors with 2.5 GB memory per node7.71x [13]