Review Article

Computational and Bioinformatics Frameworks for Next-Generation Whole Exome and Genome Sequencing

Table 1


ProgramSource typeDescriptionWebsite

BowtieOpen sourceUngapped alignmenthttp://bowtie.cbcb.umd.edu/
Refined use of FM Index using the BWT
Fast and memory-efficient alignment
Quality value output

Bowtie2Open sourceExtends Bowtie approach to be useful for gapped alignmenthttp://bowtie-bio.sourceforge.net/bowtie2/index.shtml

SEALOpen sourceComparison of alignment algorithms using simulated short read sequencing runshttp://compbio.case.edu/seal/

SOAP3Open sourceGapped and ungapped alignmenthttp://www.cs.hku.hk/2bwt-tools/soap3/; http://soap.genomics.org.cn/soap3.html
Specialized for detecting and genotyping SNPs
Hash table accelerates searching using BWT-based index
Reports multiple possibilities rather than single best match
Increased speed using GPU

BWA, BWA-SWOpen sourceMost common/standard method usedhttp://maq.sourceforge.net/
Index with BWT that is faster than the hash-based index used for MAQ
Quality score reported

mrFAST, mrsFASTOpen sourceSeed-and-extend alignment method with hash table inex for reference genome
Reports all read mappings instead of single best mapping Useful for CNVs, structural variants

NovoalignCommercially
available
Novocraft’s proprietary software with hashing strategyhttp://www.novocraft.com/
High accuracy for single end mapping
Focused on sensitivity

SHRiMP/SHRiMP2Open sourceSpecialized for SOLiD color-space reads using spaced seeds and SWAhttp://compbio.cs.toronto.edu/shrimp/
Also applicable for regular letter-space reads
Handles higher level of polymorphisms

MAQOpen sourceUngapped alignmenthttp://maq.sourceforget.net/
Hash-based index with quality score for mapping

StampyOpen sourceHybrid mapping algorithm and statistical model, complementary to BWAhttp://www.well.ox.ac.uk/project-stampy/

ELANDCommercially
available
Hash-based alignment programhttp://www.illumina.com/

LAST alignerOpen sourceProbablistic alignment quality scores as well as usual score matrix
Useful for preprocessing for SNP/indel calling
http://last.cbrc.jp/

SARUMANOpen SourceMapping approach for single-end reads that returns all possible alignments with given error threshold with high speed using GPUshttp://www.cebitec.uni-bielefeld.de/brf/saruman/saruman.html