List of Sequence Alignment Software: Comparison
Please note this is a comparison between Version 1 by Vivi Li and Version 2 by Vivi Li.

This list of sequence alignment software is a compilation of software tools and web portals used in pairwise sequence alignment and multiple sequence alignment. See structural alignment software for structural alignment of proteins.

  • structural alignment
  • sequence alignment
  • web portals

1. Database Search Only

Name Description Sequence type* Authors Year
BLAST Local search with fast k-tuple heuristic (Basic Local Alignment Search Tool) Both Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ[1] 1990
HPC-BLAST NCBI compliant multinode and multicore BLAST wrapper. Distributed with the latest version of BLAST, this wrapper facilitates parallelization of the algorithm on modern hybrid architectures with many nodes and many cores within each node.[2] Protein Burdyshaw CE, Sawyer S, Horton MD, Brook RG, Rekapalli B 2017
CS-BLAST Sequence-context specific BLAST, more sensitive than BLAST, FASTA, and SSEARCH. Position-specific iterative version CSI-BLAST more sensitive than PSI-BLAST Protein Angermueller C, Biegert A, Soeding J[3] 2013
CUDASW++ GPU accelerated Smith Waterman algorithm for multiple shared-host GPUs Protein Liu Y, Maskell DL and Schmidt B 2009/2010
DIAMOND BLASTX and BLASTP aligner based on double indexing Protein Buchfink B, Xie C, Huson DH, Reuter K, Drost HG [4][5] 2015/2021
FASTA Local search with fast k-tuple heuristic, slower but more sensitive than BLAST Both    
GGSEARCH, GLSEARCH Global:Global (GG), Global:Local (GL) alignment with statistics Protein    
Genome Magician Software for ultra fast local DNA sequence motif search and pairwise alignment for NGS data (FASTA, FASTQ). DNA Hepperle D (www.sequentix.de) 2020
Genoogle Genoogle uses indexing and parallel processing techniques for searching DNA and Proteins sequences. It is developed in Java and open source. Both Albrecht F 2015
HMMER
HH-suite Pairwise comparison of profile Hidden Markov models; very sensitive Protein Söding J[6][7] 2005/2012
IDF Inverse Document Frequency Both    
Infernal Profile SCFG search RNA Eddy S  
KLAST High-performance general purpose sequence similarity search tool Both   2009/2014
LAMBDA High performance local aligner compatible to BLAST, but much faster; supports SAM/BAM Protein Hannes Hauswedell, Jochen Singer, Knut Reinert[8] 2014
MMseqs2 Software suite to search and cluster huge sequence sets. Similar sensitivity to BLAST and PSI-BLAST but orders of magnitude faster Protein Steinegger M, Mirdita M, Galiez C, Söding J[9] 2017
USEARCH Ultra-fast sequence analysis tool Both Edgar, R. C. (2010). "Search and clustering orders of magnitude faster than BLAST". Bioinformatics 26 (19): 2460–2461. doi:10.1093/bioinformatics/btq461. PMID 20709691.  publication 2010
OSWALD OpenCL Smith-Waterman on Altera's FPGA for Large Protein Databases Protein Rucci E, García C, Botella G, De Giusti A, Naiouf M, Prieto-Matías M[10] 2016
parasail Fast Smith-Waterman search using SIMD parallelization Both Daily J 2015
PSI-BLAST Position-specific iterative BLAST, local search with position-specific scoring matrices, much more sensitive than BLAST Protein Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ[11] 1997
PSI-Search Combining the Smith-Waterman search algorithm with the PSI-BLAST profile construction strategy to find distantly related protein sequences, and preventing homologous over-extension errors. Protein Li W, McWilliam H, Goujon M, Cowley A, Lopez R, Pearson WR[12] 2012
R&R Retrieve and Relate (R&R) is a high performance yet sensitive multi-database search engine, capable of searching in parallel through DNA,RNA and Protein sequences. Both   2019
ScalaBLAST Highly parallel Scalable BLAST Both Oehmen et al.[13] 2011
Sequilab Linking and profiling sequence alignment data from NCBI-BLAST results with major sequence analysis servers/services Nucleotide, peptide   2010
SAM Local and global search with profile Hidden Markov models, more sensitive than PSI-BLAST Both Karplus K, Krogh A[14] 1999
SSEARCH Smith-Waterman search, slower but more sensitive than FASTA Both    
SWAPHI First parallelized algorithm employing the emerging Intel Xeon Phis to accelerate Smith-Waterman protein database search Protein Liu Y and Schmidt B 2014
SWAPHI-LS First parallel Smith-Waterman algorithm exploiting Intel Xeon Phi clusters to accelerate the alignment of long DNA sequences DNA Liu Y, Tran TT, Lauenroth F, Schmidt B 2014
SWIMM Smith-Waterman implementation for Intel Multicore and Manycore architectures Protein Rucci E, García C, Botella G, De Giusti A, Naiouf M and Prieto-Matías M[15] 2015
SWIMM2.0 Enhanced Smith-Waterman on Intel's Multicore and Manycore architectures based on AVX-512 vector extensions Protein Rucci E, García C, Botella G, De Giusti A, Naiouf M and Prieto-Matías M[16] 2018
SWIPE Fast Smith-Waterman search using SIMD parallelization Both Rognes T 2011

*Sequence type: protein or nucleotide

2. Pairwise Alignment

Name Description Sequence type* Alignment type** Author Year
ACANA Fast heuristic anchor based pairwise alignment Both Both Huang, Umbach, Li 2005
AlignMe Alignments for membrane protein sequences Protein Both M. Stamm, K. Khafizov, R. Staritzbichler, L.R. Forrest 2013
ALLALIGN For DNA, RNA and protein molecules up to 32MB, aligns all sequences of size K or greater. Similar alignments are grouped together for analysis. Automatic repetitive sequence filter. Both Local E. Wachtel 2017
Bioconductor Biostrings::pairwiseAlignment Dynamic programming Both Both + Ends-free P. Aboyoun 2008
BioPerl dpAlign Dynamic programming Both Both + Ends-free Y. M. Chan 2003
BLASTZ, LASTZ Seeded pattern-matching Nucleotide Local Schwartz et al.[17][18] 2004,2009
CUDAlign DNA sequence alignment of unrestricted size in single or multiple GPUs Nucleotide Local, SemiGlobal, Global E. Sandes[19][20][21] 2011-2015
DNADot Web-based dot-plot tool Nucleotide Global R. Bowen 1998
DOTLET Java-based dot-plot tool Both Global M. Pagni and T. Junier 1998
FEAST Posterior based local extension with descriptive evolution model Nucleotide Local A. K. Hudek and D. G. Brown 2010
Genome Compiler Genome Compiler Align chromatogram files (.ab1, .scf) against a template sequence, locate errors, and correct them instantly. Nucleotide Local Genome Compiler Corporation 2014
G-PAS GPU-based dynamic programming with backtracking Both Local, SemiGlobal, Global W. Frohmberg, M. Kierzynka et al. 2011
GapMis Does pairwise sequence alignment with one gap Both SemiGlobal K. Frousios, T. Flouri, C. S. Iliopoulos, K. Park, S. P. Pissis, G. Tischler 2012
Genome Magician Software for ultra fast local DNA sequence motif search and pairwise alignment for NGS data (FASTA, FASTQ). DNA Local, SemiGlobal, Global Hepperle D (www.sequentix.de) 2020
GGSEARCH, GLSEARCH Global:Global (GG), Global:Local (GL) alignment with statistics Protein Global in query W. Pearson 2007
JAligner Java open-source implementation of Smith-Waterman Both Local A. Moustafa 2005
K*Sync Protein sequence to structure alignment that includes secondary structure, structural conservation, structure-derived sequence profiles, and consensus alignment scores Protein Both D. Chivian & D. Baker[22] 2003
LALIGN Multiple, non-overlapping, local similarity (same algorithm as SIM) Both Local non-overlapping W. Pearson 1991 (algorithm)
NW-align Standard Needleman-Wunsch dynamic programming algorithm Protein Global Y Zhang 2012
mAlign modelling alignment; models the information content of the sequences Nucleotide Both D. Powell, L. Allison and T. I. Dix 2004
matcher Waterman-Eggert local alignment (based on LALIGN) Both Local I. Longden (modified from W. Pearson) 1999
MCALIGN2 explicit models of indel evolution DNA Global J. Wang et al. 2006
MegAlign Pro (Lasergene Molecular Biology) Software to align DNA, RNA, protein, or DNA + protein sequences via pairwise and multiple sequence alignment algorithms including MUSCLE, Mauve, MAFFT, Clustal Omega, Jotun Hein, Wilbur-Lipman, Martinez Needleman-Wunsch, Lipman-Pearson and Dotplot analysis. Both Both DNASTAR 1993-2016
MUMmer suffix tree based Nucleotide Global S. Kurtz et al. 2004
needle Needleman-Wunsch dynamic programming Both SemiGlobal A. Bleasby 1999
Ngila logarithmic and affine gap costs and explicit models of indel evolution Both Global R. Cartwright 2007
NW Needleman-Wunsch dynamic programming Both Global A.C.R. Martin 1990-2015
parasail C/C++/Python/Java SIMD dynamic programming library for SSE, AVX2 Both Global, Ends-free, Local J. Daily 2015
Path Smith-Waterman on protein back-translation graph (detects frameshifts at protein level) Protein Local M. Gîrdea et al.[23] 2009
PatternHunter Seeded pattern-matching Nucleotide Local B. Ma et al.[24][25] 2002–2004
ProbA (also propA) Stochastic partition function sampling via dynamic programming Both Global U. Mückstein 2002
PyMOL "align" command aligns sequence & applies it to structure Protein Global (by selection) W. L. DeLano 2007
REPuter suffix tree based Nucleotide Local S. Kurtz et al. 2001
SABERTOOTH Alignment using predicted Connectivity Profiles Protein Global F. Teichert, J. Minning, U. Bastolla, and M. Porto 2009
Satsuma Parallel whole-genome synteny alignments DNA Local M.G. Grabherr et al. 2010
SEQALN Various dynamic programming Both Local or global M.S. Waterman and P. Hardy 1996
SIM, GAP, NAP, LAP Local similarity with varying gap treatments Both Local or global X. Huang and W. Miller 1990-6
SIM Local similarity Both Local X. Huang and W. Miller 1991
SPA: Super pairwise alignment Fast pairwise global alignment Nucleotide Global Shen, Yang, Yao, Hwang 2002
SSEARCH Local (Smith-Waterman) alignment with statistics Protein Local W. Pearson 1981 (Algorithm)
Sequences Studio Java applet demonstrating various algorithms from[26] Generic sequence Local and global A.Meskauskas 1997 (reference book)
SWIFOLD Smith-Waterman Acceleration on Intel's FPGA with OpenCL for Long DNA Sequences Nucleotide Local E. Rucci[27][28] 2017-2018
SWIFT suit Fast Local Alignment Searching DNA Local K. Rasmussen,[29] W. Gerlach 2005,2008
stretcher Memory-optimized Needleman-Wunsch dynamic programming Both Global I. Longden (modified from G. Myers and W. Miller) 1999
tranalign Aligns nucleic acid sequences given a protein alignment Nucleotide NA G. Williams (modified from B. Pearson) 2002
UGENE Opensource Smith-Waterman for SSE/CUDA, Suffix array based repeats finder & dotplot Both Both UniPro 2010
water Smith-Waterman dynamic programming Both Local A. Bleasby 1999
wordmatch k-tuple pairwise match Both NA I. Longden 1998
YASS Seeded pattern-matching Nucleotide Local L. Noe and G. Kucherov[30] 2004

*Sequence type: protein or nucleotide **Alignment type: local or global

3. Multiple Sequence Alignment

Name Description Sequence type* Alignment type** Author Year License
ABA A-Bruijn alignment Protein Global B.Raphael et al. 2004 Proprietary, freeware for education, research, nonprofit
ALE manual alignment ; some software assistance Nucleotides Local J. Blandy and K. Fogel 1994 (latest version 2007) Free, GPL2
ALLALIGN For DNA, RNA and protein molecules up to 32MB, aligns all sequences of size K or greater, MSA or within a single molecule. Similar alignments are grouped together for analysis. Automatic repetitive sequence filter. Both Local E. Wachtel 2017 Free
AMAP Sequence annealing Both Global A. Schwartz and L. Pachter 2006  
anon. fast, optimal alignment of three sequences using linear gap costs Nucleotides Global D. Powell, L. Allison and T. I. Dix 2000  
BAli-Phy Tree+multi-alignment; probabilistic-Bayesian; joint estimation Both + Codons Global BD Redelings and MA Suchard 2005 (latest version 2018) Free, GPL
Base-By-Base Java-based multiple sequence alignment editor with integrated analysis tools Both Local or global R. Brodie et al. 2004 Proprietary, freeware, must register
CHAOS, DIALIGN Iterative alignment Both Local (preferred) M. Brudno and B. Morgenstern 2003  
ClustalW Progressive alignment Both Local or global Thompson et al. 1994 Free, LGPL
CodonCode Aligner Multi-alignment; ClustalW & Phrap support Nucleotides Local or global P. Richterich et al. 2003 (latest version 2009)  
Compass COmparison of Multiple Protein sequence Alignments with assessment of Statistical Significance Protein Global R.I. Sadreyev, et al. 2009  
DECIPHER Progressive-iterative alignment Both Global Erik S. Wright 2014 Free, GPL
DIALIGN-TX and DIALIGN-T Segment-based method Both Local (preferred) or Global A.R.Subramanian 2005 (latest version 2008)  
DNA Alignment Segment-based method for intraspecific alignments Both Local (preferred) or Global A.Roehl 2005 (latest version 2008)  
DNA Baser Sequence Assembler Multi-alignment; Full automatic sequence alignment; Automatic ambiguity correction; Internal base caller; Command line seq alignment Nucleotides Local or global Heracle BioSoft SRL 2006 (latest version 2018) Commercial (some modules are freeware)
DNADynamo linked DNA to Protein multiple alignment with MUSCLE, Clustal and Smith-Waterman Both Local or global DNADynamo 2004 (newest version 2017)  
EDNA Energy Based Multiple Sequence Alignment for DNA Binding Sites Nucleotides Local or global Salama, RA. et al. 2013  
FAMSA Progressive alignment for extremely large protein families (hundreds of thousands of members) Protein Global Deorowicz et al. 2016  
FSA Sequence annealing Both Global R. K. Bradley et al. 2008  
Geneious Progressive-Iterative alignment; ClustalW plugin Both Local or global A.J. Drummond et al. 2005 (latest version 2017)  
Kalign Progressive alignment Both Global T. Lassmann 2005  
MAFFT Progressive-iterative alignment Both Local or global K. Katoh et al. 2005 Free, BSD
MARNA Multi-alignment of RNAs RNA Local S. Siebert et al. 2005  
MAVID Progressive alignment Both Global N. Bray and L. Pachter 2004  
MegAlign Pro (Lasergene Molecular Biology) Software to align DNA, RNA, protein, or DNA + protein sequences via pairwise and multiple sequence alignment algorithms including MUSCLE, Mauve, MAFFT, Clustal Omega, Jotun Hein, Wilbur-Lipman, Martinez Needleman-Wunsch, Lipman-Pearson and Dotplot analysis. Both Local or global DNASTAR 1993-2016  
MSA Dynamic programming Both Local or global D.J. Lipman et al. 1989 (modified 1995)  
MSAProbs Dynamic programming Protein Global Y. Liu, B. Schmidt, D. Maskell 2010  
MULTALIN Dynamic programming-clustering Both Local or global F. Corpet 1988  
Multi-LAGAN Progressive dynamic programming alignment Both Global M. Brudno et al. 2003  
MUSCLE Progressive-iterative alignment Both Local or global R. Edgar 2004  
Opal Progressive-iterative alignment Both Local or global T. Wheeler and J. Kececioglu 2007 (latest stable 2013, latest beta 2016)  
Pecan Probabilistic-consistency DNA Global B. Paten et al. 2008  
Phylo A human computing framework for comparative genomics to solve multiple alignment Nucleotides Local or global McGill Bioinformatics 2010  
PMFastR Progressive structure aware alignment RNA Global D. DeBlasio, J Braund, S Zhang 2009  
Praline Progressive-iterative-consistency-homology-extended alignment with preprofiling and secondary structure prediction Protein Global J. Heringa 1999 (latest version 2009)  
PicXAA Nonprogressive, maximum expected accuracy alignment Both Global S.M.E. Sahraeian and B.J. Yoon 2010  
POA Partial order/hidden Markov model Protein Local or global C. Lee 2002  
Probalign Probabilistic/consistency with partition function probabilities Protein Global Roshan and Livesay 2006 Free, public domain
ProbCons Probabilistic/consistency Protein Local or global C. Do et al. 2005 Free, public domain
PROMALS3D Progressive alignment/hidden Markov model/Secondary structure/3D structure Protein Global J. Pei et al. 2008  
PRRN/PRRP Iterative alignment (especially refinement) Protein Local or global Y. Totoki (based on O. Gotoh) 1991 and later  
PSAlign Alignment preserving non-heuristic Both Local or global S.H. Sze, Y. Lu, Q. Yang. 2006  
RevTrans Combines DNA and Protein alignment, by back translating the protein alignment to DNA. DNA/Protein (special) Local or global Wernersson and Pedersen 2003 (newest version 2005)  
SAGA Sequence alignment by genetic algorithm Protein Local or global C. Notredame et al. 1996 (new version 1998)  
SAM Hidden Markov model Protein Local or global A. Krogh et al. 1994 (most recent version 2002)  
Se-Al Manual alignment Both Local A. Rambaut 2002  
StatAlign Bayesian co-estimation of alignment and phylogeny (MCMC) Both Global A. Novak et al. 2008  
Stemloc Multiple alignment and secondary structure prediction RNA Local or global I. Holmes 2005 Free, GPL 3 (parte de DART)
T-Coffee More sensitive progressive alignment Both Local or global C. Notredame et al. 2000 (newest version 2008) Free, GPL 2
UGENE Supports multiple alignment with MUSCLE, KAlign, Clustal and MAFFT plugins Both Local or global UGENE team 2010 (newest version 2020) Free, GPL 2
VectorFriends VectorFriends Aligner, MUSCLE plugin, and ClustalW plugin Both Local or global BioFriends team 2013 Proprietary, freeware for academic use
GLProbs Adaptive pair-Hidden Markov Model based approach Protein Global Y. Ye et al. 2013  

*Sequence type: protein or nucleotide. **Alignment type: local or global

4. Genomics Analysis

Name Description Sequence type*
EAGLE [31] An ultra-fast tool to find relative absent words in genomic data Nucleotide
ACT (Artemis Comparison Tool) Synteny and comparative genomics Nucleotide
AVID Pairwise global alignment with whole genomes Nucleotide
BLAT Alignment of cDNA sequences to a genome. Nucleotide
DECIPHER Alignment of rearranged genomes using 6 frame translation Nucleotide
FLAK Fuzzy whole genome alignment and analysis Nucleotide
GMAP Alignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy. Nucleotide
Splign Alignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy. Able to recognize and separate gene duplications. Nucleotide
Mauve Multiple alignment of rearranged genomes Nucleotide
MGA Multiple Genome Aligner Nucleotide
Mulan Local multiple alignments of genome-length sequences Nucleotide
Multiz Multiple alignment of genomes Nucleotide
PLAST-ncRNA Search for ncRNAs in genomes by partition function local alignment Nucleotide
Sequerome Profiling sequence alignment data with major servers/services Nucleotide, peptide
Sequilab Profiling sequence alignment data from NCBI-BLAST results with major servers-services Nucleotide, peptide
Shuffle-LAGAN Pairwise glocal alignment of completed genome regions Nucleotide
SIBsim4, Sim4 A program designed to align an expressed DNA sequence with a genomic sequence, allowing for introns Nucleotide
SLAM Gene finding, alignment, annotation (human-mouse homology identification) Nucleotide  
SRPRISM An efficient aligner for assemblies with explicit guarantees, aligning reads without splices Nucleotide

*Sequence type: protein or nucleotide

 

 

5. Motif Finding

NameDescriptionSequence type*
PMSMotif search and discoveryBoth
FMMMotif search and discovery (can get also positive & negative sequences as input for enriched motif search)Nucleotide
BLOCKSUngapped motif identification from BLOCKS databaseBoth
eMOTIFExtraction and identification of shorter motifsBoth
Gibbs motif samplerStochastic motif extraction by statistical likelihoodBoth
HMMTOPPrediction of transmembrane helices and topology of proteinsProtein
I-sitesLocal structure motif libraryProtein
JCoilsPrediction of Coiled coil and Leucine ZipperProtein
MEME/MASTMotif discovery and searchBoth
CUDA-MEMEGPU accelerated MEME (v4.4.0) algorithm for GPU clustersBoth
MERCIDiscriminative motif discovery and searchBoth
PHI-BlastMotif search and alignment toolBoth
PhyloscanMotif search toolNucleotide
PRATTPattern generation for use with ScanPrositeProtein
ScanPrositeMotif database search toolProtein
TEIRESIASMotif extraction and database searchBoth
BASALTMultiple motif and regular expression searchBoth

*Sequence type: protein or nucleotide

6. Benchmarking

NameAuthors
PFAM 30.0 (2016) 
SMART (2015)Letunic, Copley, Schmidt, Ciccarelli, Doerks, Schultz, Ponting, Bork
BAliBASE 3 (2015)Thompson, Plewniak, Poch
Oxbench (2011)Raghava, Searle, Audley, Barber, Barton
Benchmark collection (2009)Edgar
HOMSTRAD (2005)Mizuguchi
PREFAB 4.0 (2005)Edgar
SABmark (2004)Van Walle, Lasters, Wyns

7. Short-read Sequence Alignment

Name DescripAutionhors paired-end optionUse FASTQ qualityGappedMulti-threadedLicenseReferenceYear
PFAriocM 30.0 (2016) Computes Smith-Waterman gapped alignments and mapping qualities on one or more GPUs. Supports BS-seq alignments. Processes 100,000 to 500,000 reads per second (varies with data, hardware, and configured sensitivity).  YesNoYesYesFree, BSD[32]2015
BarraCUDSMART (2015) A GPGPU accLelerated Burrows-Wheeler transform (FM-index) short read alignment program based on BWA, supports alignment of indels with gap openings and extensions.tunic, Copley, Schmidt, Ciccarelli, Doerks, Schultz, Ponting, Bork YesNoYesYes, POSIX Threads and CUDAFree, GPL  
BAliBMapASE 3 (2015) Uses a sThort kmers to rapidly index genome; no size or scaffold count limit. Higher sensitivity and specificity than Burrows-Wheeler aligners, with similar or greater speed. Performs affine-transform-optimized global alignment, which is slower but more accurate than Smith-Waterman. Handles Illumina, 454, PacBio, Sanger, and Ion Torrent data. Splice-aware; capable of processing long indels and RNA-seq. Pure Java; runs on any platform. Used by tmpson, Plewniak, Poche Joint Genome Institute. YesYesYesYesFree, BSD 2010
BFASTOxbench (2011) Explicit time Rand accuracy tradeoff with a prior accuracy estimation, supported by indexing the reference sequences. Optimally compresses indexes. Can handle billions of short reads. Can handle insertions, deletions, SNPs, and color errors (can map ABI SOLiD color space reads). Performs a full Smith Watermaghava, Searle, Audley, Barber, Barton alignment.    Yes, POSIX ThreadsFree, GPL[33]2009
Benchmark collectigBWAon (2009) Runs the Burrows-Wheeler Aligner-BWA on a Hadoop cluster. It supports the algorithms BWA-MEM, BWA-ALN, and BWA-SW, working with paidgared and single reads. It implies an important reduction in the computational time when running in a Hadoop cluster, adding scalability and fault-tolerance. YesLow quality bases trimmingYesYesFree, GPL 3[34]2015
BLAHOMSTNRAD (2005) BLAST's nucleotMide alignment program, slow and not accurate for short reads, and uses a sequence database (EST, Sanger sequence) rather than a reference genome.zuguchi        
PREFABLAT 4.0 (2005) MaEde by Jim Kent. Can handle one mismatch in initial alignment step.gar    Yes, client-serverProprietary, freeware for academic and noncommercial use[35]2002
SABowtiemark (2004) Uses Va Burrows-Wheeler transform to create a permanent, reusable index of the genome; 1.3 GB memory footprint for human genome. Alignn Walle, Lasters, Wyns more than 25 million Illumina reads in 1 CPU hour. Supports Maq-like and SOAP-like alignment policies YesYesNoYes, POSIX ThreadsFree, Artistic[36]2009
BWAUses a Burrows-Wheeler transform to create an index of the genome. It's a bit slower than Bowtie but allows indels in alignment.YesLow quality bases trimmingYesYesFree, GPL[37]2009
BWA-PSSMA probabilistic short read aligner based on the use of position specific scoring matrices (PSSM). The aligner is adaptable in the sense that it can take into account the quality scores of the reads and models of data specific biases, such as those observed in Ancient DNA, PAR-CLIP data or genomes with biased nucleotide compositions.[38]YesYesYesYesFree, GPL[38]2014
CASHXQuantify and manage large quantities of short-read sequence data. CASHX pipeline contains a set of tools that can be used together, or separately as modules. This algorithm is very accurate for perfect hits to a reference genome.   NoProprietary, freeware for academic and noncommercial use  
CloudburstShort-read mapping using Hadoop MapReduce   Yes, Hadoop MapReduceFree, Artistic  
CUDA-ECShort-read alignment error correction using GPUs.   Yes, GPU enabled   
CUSHAWA CUDA compatible short read aligner to large genomes based on Burrows-Wheeler transformYesYesNoYes (GPU enabled)Free, GPL[39]2012
CUSHAW2Gapped short-read and long-read alignment based on maximal exact match seeds. This aligner supports both base-space (e.g. from Illumina, 454, Ion Torrent and PacBio sequencers) and ABI SOLiD color-space read alignments.YesNoYesYesFree, GPL 2014
CUSHAW2-GPUGPU-accelerated CUSHAW2 short-read aligner.YesNoYesYesFree, GPL  
CUSHAW3Sensitive and accurate base-space and color-space short-read alignment with hybrid seedingYesNoYesYesFree, GPL[40]2012
drFASTRead mapping alignment software that implements cache obliviousness to minimize main/cache memory transfers like mrFAST and mrsFAST, however designed for the SOLiD sequencing platform (color space reads). It also returns all possible map locations for improved structural variation discovery.YesYes, for structural variationYesNoFree, BSD  
ELANDImplemented by Illumina. Includes ungapped alignment with a finite read length.       
ERNEExtended Randomized Numerical alignEr for accurate alignment of NGS reads. It can map bisulfite-treated reads.YesLow quality bases trimmingYesMultithreading and MPI-enabledFree, GPL 3  
GASSSTFinds global alignments of short DNA sequences against large DNA banks   MultithreadingCeCILL version 2 License.[41]2011
GEMHigh-quality alignment engine (exhaustive mapping with substitutions and indels). More accurate and several times faster than BWA or Bowtie 1/2. Many standalone biological applications (mapper, split mapper, mappability, and other) provided.YesYesYesYesFree, GPL3[42]2012
Genalice MAPUltra fast and comprehensive NGS read aligner with high precision and small storage footprint.YesLow quality bases trimmingYesYesProprietary, commercial  
Geneious AssemblerFast, accurate overlap assembler with the ability to handle any combination of sequencing technology, read length, any pairing orientations, with any spacer size for the pairing, with or without a reference genome.   YesProprietary, commercial  
GensearchNGSComplete framework with user-friendly GUI to analyse NGS data. It integrates a proprietary high quality alignment algorithm and plug-in ability to integrate various public aligner into a framework allowing to import short reads, align them, detect variants, and generate reports. It is made for resequencing projects, namely in a diagnostic setting.YesNoYesYesProprietary, commercial  
GMAP and GSNAPRobust, fast short-read alignment. GMAP: longer reads, with multiple indels and splices (see entry above under Genomics analysis); GSNAP: shorter reads, with one indel or up to two splices per read. Useful for digital gene expression, SNP and indel genotyping. Developed by Thomas Wu at Genentech. Used by the National Center for Genome Resources (NCGR) in Alpheus.YesYesYesYesProprietary, freeware for academic and noncommercial use  
GNUMAPAccurately performs gapped alignment of sequence data obtained from next-generation sequencing machines (specifically of Solexa-Illumina) back to a genome of any size. Includes adaptor trimming, SNP calling and Bisulfite sequence analysis. Yes, also supports Illumina *_int.txt and *_prb.txt files with all 4 quality scores for each base Multithreading and MPI-enabled [43]2009
HIVE-hexagonUses a hash table and bloom matrix to create and filter potential positions on the genome. For higher efficiency uses cross-similarity between short reads and avoids realigning non unique redundant sequences. It is faster than Bowtie and BWA and allows indels and divergent sensitive alignments on viruses, bacteria, and more conservative eukaryotic alignments.YesYesYesYesProprietary, freeware for academic and noncommercial users registered to HIVE deployment instance[44]2014
IMOSImproved Meta-aligner and Minimap2 On Spark. A long read distributed aligner on Apache Spark platform with linear scalability w.r.t. single node execution. YesYesYesFree  
IsaacFully uses all the computing power available on one server node; thus, it scales well over a broad range of hardware architectures, and alignment performance improves with hardware abilitiesYesYesYesYesFree, GPL  
LASTUses adaptative seeds and copes more efficiently with repeat-rich sequences (e.g. genomes). For example: it can align reads to genomes without repeat-masking, without becoming overwhelmed by repetitive hits.YesYesYesYesFree, GPL[45]2011
MAQUngapped alignment that takes into account quality scores for each base.    Free, GPL  
mrFAST, mrsFASTGapped (mrFAST) and ungapped (mrsFAST) alignment software that implements cache obliviousness to minimize main/cache memory transfers. They are designed for the Illumina sequencing platform and they can return all possible map locations for improved structural variation discovery.YesYes, for structural variationYesNoFree, BSD  
MOMMOM or maximum oligonucleotide mapping is a query matching tool that captures a maximal length match within the short read.   Yes   
MOSAIKFast gapped aligner and reference-guided assembler. Aligns reads using a banded Smith-Waterman algorithm seeded by results from a k-mer hashing scheme. Supports reads ranging in size from very short to very long.   Yes   
MPscanFast aligner based on a filtration strategy (no indexing, use q-grams and Backward Nondeterministic DAWG Matching)     [46]2009
Novoalign & NovoalignCSGapped alignment of single end and paired end Illumina GA I & II, ABI Colour space & ION Torrent reads. High sensitivity and specificity, using base qualities at all steps in the alignment. Includes adapter trimming, base quality calibration, Bi-Seq alignment, and options for reporting multiple alignments per read. Use of ambiguous IUPAC codes in reference for common SNPs can improve SNP recall and remove allelic bias.YesYesYesMulti-threading and MPI versions available with paid licenseProprietary, freeware single threaded version for academic and noncommercial use  
NextGENeDeveloped for use by biologists performing analysis of next generation sequencing data from Roche Genome Sequencer FLX, Illumina GA/HiSeq, Life Technologies Applied BioSystems’ SOLiD System, PacBio and Ion Torrent platforms.YesYesYesYesProprietary, commercial  
NextGenMapFlexible and fast read mapping program (twice as fast as BWA), achieves a mapping sensitivity comparable to Stampy. Internally uses a memory efficient index structure (hash table) to store positions of all 13-mers present in the reference genome. Mapping regions where pairwise alignments are required are dynamically determined for each read. Uses fast SIMD instructions (SSE) to accelerate alignment calculations on CPU. If available, alignments are computed on GPU (using OpenCL/CUDA) further reducing runtime 20-50%.YesNoYesYes, POSIX Threads, OpenCL/CUDA, SSEFree[47]2013
Omixon Variant ToolkitIncludes highly sensitive and highly accurate tools for detecting SNPs and indels. It offers a solution to map NGS short reads with a moderate distance (up to 30% sequence divergence) from reference genomes. It poses no restrictions on the size of the reference, which, combined with its high sensitivity, makes the Variant Toolkit well-suited for targeted sequencing projects and diagnostics.YesYesYesYesProprietary, commercial  
PALMapperEfficiently computes both spliced and unspliced alignments at high accuracy. Relying on a machine learning strategy combined with a fast mapping based on a banded Smith-Waterman-like algorithm, it aligns around 7 million reads per hour on one CPU. It refines the originally proposed QPALMA approach.   YesFree, GPL  
Partek FlowFor use by biologists and bioinformaticians. It supports ungapped, gapped and splice-junction alignment from single and paired-end reads from Illumina, Life technologies Solid TM, Roche 454 and Ion Torrent raw data (with or without quality information). It integrates powerful quality control on FASTQ/Qual level and on aligned data. Additional functionality include trimming and filtering of raw reads, SNP and InDel detection, mRNA and microRNA quantification and fusion gene detection.YesYesYesMultiprocessor-core, client-server installation possibleProprietary, commercial, free trial version  
PASSIndexes the genome, then extends seeds using pre-computed alignments of words. Works with base space, color space (SOLID), and can align genomic and spliced RNA-seq reads.YesYesYesYesProprietary, freeware for academic and noncommercial use  
PerMIndexes the genome with periodic seeds to quickly find alignments with full sensitivity up to four mismatches. It can map Illumina and SOLiD reads. Unlike most mapping programs, speed increases for longer read lengths.   YesFree, GPL[48] 
PRIMEXIndexes the genome with a k-mer lookup table with full sensitivity up to an adjustable number of mismatches. It is best for mapping 15-60 bp sequences to a genome.NoNoYesNo, multiple processes per search [1]2003
QPalmaCan use quality scores, intron lengths, and computation splice site predictions to perform and performs an unbiased alignment. Can be trained to the specifics of a RNA-seq experiment and genome. Useful for splice site/intron discovery and for gene model building. (See PALMapper for a faster version).   Yes, client-serverFree, GPL 2  
RazerSNo read length limit. Hamming or edit distance mapping with configurable error rates. Configurable and predictable sensitivity (runtime/sensitivity tradeoff). Supports paired-end read mapping.    Free, LGPL  
REAL, cREALREAL is an efficient, accurate, and sensitive tool for aligning short reads obtained from next-generation sequencing. The programme can handle an enormous amount of single-end reads generated by the next-generation Illumina/Solexa Genome Analyzer. cREAL is a simple extension of REAL for aligning short reads obtained from next-generation sequencing to a genome with circular structure. Yes YesFree, GPL  
RMAPCan map reads with or without error probability information (quality scores) and supports paired-end reads or bisulfite-treated read mapping. There are no limitations on read length or number of mismatches.YesYesYes Free, GPL 3  
rNAA randomized Numerical Aligner for Accurate alignment of NGS readsYesLow quality bases trimmingYesMultithreading and MPI-enabledFree, GPL 3  
RTG InvestigatorExtremely fast, tolerant to high indel and substitution counts. Includes full read alignment. Product includes comprehensive pipelines for variant detection and metagenomic analysis with any combination of Illumina, Complete Genomics and Roche 454 data.YesYes, for variant callingYesYesProprietary, freeware for individual investigator use  
SegemehlCan handle insertions, deletions, mismatches; uses enhanced suffix arraysYesNoYesYesProprietary, freeware for noncommercial use[49]2009
SeqMapUp to 5 mixed substitutions and insertions-deletions; various tuning options and input-output formats    Proprietary, freeware for academic and noncommercial use  
ShrecShort read error correction with a suffix tree data structure   Yes, Java   
SHRiMPIndexes the reference genome as of version 2. Uses masks to generate possible keys. Can map ABI SOLiD color space reads.YesYesYesYes, OpenMPFree, [[BSD licenses| style="background: #9FF; color: black; vertical-align: middle; text-align: center; " class="free table-free"|Free, BSD]] derivative

[50] [51]

2009-2011
SLIDERSlider is an application for the Illumina Sequence Analyzer output that uses the "probability" files instead of the sequence files as an input for alignment to a reference sequence or a set of reference sequences.YesYesNoNo [52][53]2009-2010
SOAP, SOAP2, SOAP3, SOAP3-dpSOAP: robust with a small (1-3) number of gaps and mismatches. Speed improvement over BLAT, uses a 12 letter hash table. SOAP2: using bidirectional BWT to build the index of reference, and it is much faster than the first version. SOAP3: GPU-accelerated version that could find all 4-mismatch alignments in tens of seconds per one million reads. SOAP3-dp, also GPU accelerated, supports arbitrary number of mismatches and gaps according to affine gap penalty scores.YesNoYes, SOAP3-dpYes, POSIX Threads; SOAP3, SOAP3-dp need GPU with CUDA supportFree, GPL[54][55] 
SOCSFor ABI SOLiD technologies. Significant increase in time to map reads with mismatches (or color errors). Uses an iterative version of the Rabin-Karp string search algorithm.   YesFree, GPL  
SparkBWAIntegrates the Burrows-Wheeler Aligner—BWA on an Apache Spark framework running atop Hadoop. Version 0.2 of October 2016, supports the algorithms BWA-MEM, BWA-backtrack, and BWA-ALN. All of them work with single-reads and paired-end reads.YesLow quality bases trimmingYesYesFree, GPL 3[56]2016
SSAHA, SSAHA2Fast for a small number of variants    Proprietary, freeware for academic and noncommercial use  
StampyFor Illumina reads. High specificity, and sensitive for reads with indels, structural variants, or many SNPs. Slow, but speed increased dramatically by using BWA for first alignment pass.YesYesYesNoProprietary, freeware for academic and noncommercial use[57]2010
SToRMFor Illumina or ABI SOLiD reads, with SAM native output. Highly sensitive for reads with many errors, indels (full from 0 to 15, extended support otherwise). Uses spaced seeds (single hit) and a very fast SSE-SSE2-AVX2-AVX-512 banded alignment filter. For fixed-length reads only, authors recommend SHRiMP2 otherwise.NoYesYesYes, OpenMPFree[58]2010
Subread, SubjuncSuperfast and accurate read aligners. Subread can be used to map both gDNA-seq and RNA-seq reads. Subjunc detects exon-exon junctions and maps RNA-seq reads. They employ a novel mapping paradigm named seed-and-vote.YesYesYesYesFree, GPL 3  
TaipanDe-novo assembler for Illumina reads    Proprietary, freeware for academic and noncommercial use  
UGENEVisual interface both for Bowtie and BWA, and an embedded alignerYesYesYesYesFree, GPL  
VelociMapperFPGA-accelerated reference sequence alignment mapping tool from TimeLogic. Faster than Burrows-Wheeler transform-based algorithms like BWA and Bowtie. Supports up to 7 mismatches and/or indels with no performance penalty. Produces sensitive Smith-Waterman gapped alignments.YesYesYesYesProprietary, commercial  
XpressAlignFPGA based sliding window short read aligner which exploits the embarrassingly parallel property of short read alignment. Performance scales linearly with number of transistors on a chip (i.e. performance guaranteed to double with each iteration of Moore's Law without modification to algorithm). Low power consumption is useful for datacentre equipment. Predictable runtime. Better price/performance than software sliding window aligners on current hardware, but not better than software BWT-based aligners currently. Can manage large numbers (>2) of mismatches. Will find all hit positions for all seeds. Single-FPGA experimental version, needs work to develop it into a multi-FPGA production version.    Proprietary, freeware for academic and noncommercial use  
ZOOM100% sensitivity for a reads between 15 and 240 bp with practical mismatches. Very fast. Support insertions and deletions. Works with Illumina & SOLiD instruments, not 454.   Yes (GUI), no (CLI)Proprietary, commercial[59] 

7. Alignment Viewers, Editors

Please see List of alignment visualization software.

8. Short-read Sequence Alignment

NameDescriptionpaired-end optionUse FASTQ qualityGappedMulti-threadedLicenseReferenceYear
AriocComputes Smith-Waterman gapped alignments and mapping qualities on one or more GPUs. Supports BS-seq alignments. Processes 100,000 to 500,000 reads per second (varies with data, hardware, and configured sensitivity).YesNoYesYesFree, BSD[32]2015
BarraCUDAA GPGPU accelerated Burrows-Wheeler transform (FM-index) short read alignment program based on BWA, supports alignment of indels with gap openings and extensions.YesNoYesYes, POSIX Threads and CUDAFree, GPL  
BBMapUses a short kmers to rapidly index genome; no size or scaffold count limit. Higher sensitivity and specificity than Burrows-Wheeler aligners, with similar or greater speed. Performs affine-transform-optimized global alignment, which is slower but more accurate than Smith-Waterman. Handles Illumina, 454, PacBio, Sanger, and Ion Torrent data. Splice-aware; capable of processing long indels and RNA-seq. Pure Java; runs on any platform. Used by the Joint Genome Institute.YesYesYesYesFree, BSD 2010
BFASTExplicit time and accuracy tradeoff with a prior accuracy estimation, supported by indexing the reference sequences. Optimally compresses indexes. Can handle billions of short reads. Can handle insertions, deletions, SNPs, and color errors (can map ABI SOLiD color space reads). Performs a full Smith Waterman alignment.   Yes, POSIX ThreadsFree, GPL[33]2009
BigBWARuns the Burrows-Wheeler Aligner-BWA on a Hadoop cluster. It supports the algorithms BWA-MEM, BWA-ALN, and BWA-SW, working with paired and single reads. It implies an important reduction in the computational time when running in a Hadoop cluster, adding scalability and fault-tolerance.YesLow quality bases trimmingYesYesFree, GPL 3[34]2015
BLASTNBLAST's nucleotide alignment program, slow and not accurate for short reads, and uses a sequence database (EST, Sanger sequence) rather than a reference genome.       
BLATMade by Jim Kent. Can handle one mismatch in initial alignment step.   Yes, client-serverProprietary, freeware for academic and noncommercial use[35]2002
BowtieUses a Burrows-Wheeler transform to create a permanent, reusable index of the genome; 1.3 GB memory footprint for human genome. Aligns more than 25 million Illumina reads in 1 CPU hour. Supports Maq-like and SOAP-like alignment policiesYesYesNoYes, POSIX ThreadsFree, Artistic[36]2009
BWAUses a Burrows-Wheeler transform to create an index of the genome. It's a bit slower than Bowtie but allows indels in alignment.YesLow quality bases trimmingYesYesFree, GPL[37]2009
BWA-PSSMA probabilistic short read aligner based on the use of position specific scoring matrices (PSSM). The aligner is adaptable in the sense that it can take into account the quality scores of the reads and models of data specific biases, such as those observed in Ancient DNA, PAR-CLIP data or genomes with biased nucleotide compositions.[38]YesYesYesYesFree, GPL[38]2014
CASHXQuantify and manage large quantities of short-read sequence data. CASHX pipeline contains a set of tools that can be used together, or separately as modules. This algorithm is very accurate for perfect hits to a reference genome.   NoProprietary, freeware for academic and noncommercial use  
CloudburstShort-read mapping using Hadoop MapReduce   Yes, Hadoop MapReduceFree, Artistic  
CUDA-ECShort-read alignment error correction using GPUs.   Yes, GPU enabled   
CUSHAWA CUDA compatible short read aligner to large genomes based on Burrows-Wheeler transformYesYesNoYes (GPU enabled)Free, GPL[39]2012
CUSHAW2Gapped short-read and long-read alignment based on maximal exact match seeds. This aligner supports both base-space (e.g. from Illumina, 454, Ion Torrent and PacBio sequencers) and ABI SOLiD color-space read alignments.YesNoYesYesFree, GPL 2014
CUSHAW2-GPUGPU-accelerated CUSHAW2 short-read aligner.YesNoYesYesFree, GPL  
CUSHAW3Sensitive and accurate base-space and color-space short-read alignment with hybrid seedingYesNoYesYesFree, GPL[40]2012
drFASTRead mapping alignment software that implements cache obliviousness to minimize main/cache memory transfers like mrFAST and mrsFAST, however designed for the SOLiD sequencing platform (color space reads). It also returns all possible map locations for improved structural variation discovery.YesYes, for structural variationYesNoFree, BSD  
ELANDImplemented by Illumina. Includes ungapped alignment with a finite read length.       
ERNEExtended Randomized Numerical alignEr for accurate alignment of NGS reads. It can map bisulfite-treated reads.YesLow quality bases trimmingYesMultithreading and MPI-enabledFree, GPL 3  
GASSSTFinds global alignments of short DNA sequences against large DNA banks   MultithreadingCeCILL version 2 License.[41]2011
GEMHigh-quality alignment engine (exhaustive mapping with substitutions and indels). More accurate and several times faster than BWA or Bowtie 1/2. Many standalone biological applications (mapper, split mapper, mappability, and other) provided.YesYesYesYesFree, GPL3[42]2012
Genalice MAPUltra fast and comprehensive NGS read aligner with high precision and small storage footprint.YesLow quality bases trimmingYesYesProprietary, commercial  
Geneious AssemblerFast, accurate overlap assembler with the ability to handle any combination of sequencing technology, read length, any pairing orientations, with any spacer size for the pairing, with or without a reference genome.   YesProprietary, commercial  
GensearchNGSComplete framework with user-friendly GUI to analyse NGS data. It integrates a proprietary high quality alignment algorithm and plug-in ability to integrate various public aligner into a framework allowing to import short reads, align them, detect variants, and generate reports. It is made for resequencing projects, namely in a diagnostic setting.YesNoYesYesProprietary, commercial  
GMAP and GSNAPRobust, fast short-read alignment. GMAP: longer reads, with multiple indels and splices (see entry above under Genomics analysis); GSNAP: shorter reads, with one indel or up to two splices per read. Useful for digital gene expression, SNP and indel genotyping. Developed by Thomas Wu at Genentech. Used by the National Center for Genome Resources (NCGR) in Alpheus.YesYesYesYesProprietary, freeware for academic and noncommercial use  
GNUMAPAccurately performs gapped alignment of sequence data obtained from next-generation sequencing machines (specifically of Solexa-Illumina) back to a genome of any size. Includes adaptor trimming, SNP calling and Bisulfite sequence analysis. Yes, also supports Illumina *_int.txt and *_prb.txt files with all 4 quality scores for each base Multithreading and MPI-enabled [43]2009
HIVE-hexagonUses a hash table and bloom matrix to create and filter potential positions on the genome. For higher efficiency uses cross-similarity between short reads and avoids realigning non unique redundant sequences. It is faster than Bowtie and BWA and allows indels and divergent sensitive alignments on viruses, bacteria, and more conservative eukaryotic alignments.YesYesYesYesProprietary, freeware for academic and noncommercial users registered to HIVE deployment instance[44]2014
IMOSImproved Meta-aligner and Minimap2 On Spark. A long read distributed aligner on Apache Spark platform with linear scalability w.r.t. single node execution. YesYesYesFree  
IsaacFully uses all the computing power available on one server node; thus, it scales well over a broad range of hardware architectures, and alignment performance improves with hardware abilitiesYesYesYesYesFree, GPL  
LASTUses adaptative seeds and copes more efficiently with repeat-rich sequences (e.g. genomes). For example: it can align reads to genomes without repeat-masking, without becoming overwhelmed by repetitive hits.YesYesYesYesFree, GPL[45]2011
MAQUngapped alignment that takes into account quality scores for each base.    Free, GPL  
mrFAST, mrsFASTGapped (mrFAST) and ungapped (mrsFAST) alignment software that implements cache obliviousness to minimize main/cache memory transfers. They are designed for the Illumina sequencing platform and they can return all possible map locations for improved structural variation discovery.YesYes, for structural variationYesNoFree, BSD  
MOMMOM or maximum oligonucleotide mapping is a query matching tool that captures a maximal length match within the short read.   Yes   
MOSAIKFast gapped aligner and reference-guided assembler. Aligns reads using a banded Smith-Waterman algorithm seeded by results from a k-mer hashing scheme. Supports reads ranging in size from very short to very long.   Yes   
MPscanFast aligner based on a filtration strategy (no indexing, use q-grams and Backward Nondeterministic DAWG Matching)     [46]2009
Novoalign & NovoalignCSGapped alignment of single end and paired end Illumina GA I & II, ABI Colour space & ION Torrent reads. High sensitivity and specificity, using base qualities at all steps in the alignment. Includes adapter trimming, base quality calibration, Bi-Seq alignment, and options for reporting multiple alignments per read. Use of ambiguous IUPAC codes in reference for common SNPs can improve SNP recall and remove allelic bias.YesYesYesMulti-threading and MPI versions available with paid licenseProprietary, freeware single threaded version for academic and noncommercial use  
NextGENeDeveloped for use by biologists performing analysis of next generation sequencing data from Roche Genome Sequencer FLX, Illumina GA/HiSeq, Life Technologies Applied BioSystems’ SOLiD System, PacBio and Ion Torrent platforms.YesYesYesYesProprietary, commercial  
NextGenMapFlexible and fast read mapping program (twice as fast as BWA), achieves a mapping sensitivity comparable to Stampy. Internally uses a memory efficient index structure (hash table) to store positions of all 13-mers present in the reference genome. Mapping regions where pairwise alignments are required are dynamically determined for each read. Uses fast SIMD instructions (SSE) to accelerate alignment calculations on CPU. If available, alignments are computed on GPU (using OpenCL/CUDA) further reducing runtime 20-50%.YesNoYesYes, POSIX Threads, OpenCL/CUDA, SSEFree[47]2013
Omixon Variant ToolkitIncludes highly sensitive and highly accurate tools for detecting SNPs and indels. It offers a solution to map NGS short reads with a moderate distance (up to 30% sequence divergence) from reference genomes. It poses no restrictions on the size of the reference, which, combined with its high sensitivity, makes the Variant Toolkit well-suited for targeted sequencing projects and diagnostics.YesYesYesYesProprietary, commercial  
PALMapperEfficiently computes both spliced and unspliced alignments at high accuracy. Relying on a machine learning strategy combined with a fast mapping based on a banded Smith-Waterman-like algorithm, it aligns around 7 million reads per hour on one CPU. It refines the originally proposed QPALMA approach.   YesFree, GPL  
Partek FlowFor use by biologists and bioinformaticians. It supports ungapped, gapped and splice-junction alignment from single and paired-end reads from Illumina, Life technologies Solid TM, Roche 454 and Ion Torrent raw data (with or without quality information). It integrates powerful quality control on FASTQ/Qual level and on aligned data. Additional functionality include trimming and filtering of raw reads, SNP and InDel detection, mRNA and microRNA quantification and fusion gene detection.YesYesYesMultiprocessor-core, client-server installation possibleProprietary, commercial, free trial version  
PASSIndexes the genome, then extends seeds using pre-computed alignments of words. Works with base space, color space (SOLID), and can align genomic and spliced RNA-seq reads.YesYesYesYesProprietary, freeware for academic and noncommercial use  
PerMIndexes the genome with periodic seeds to quickly find alignments with full sensitivity up to four mismatches. It can map Illumina and SOLiD reads. Unlike most mapping programs, speed increases for longer read lengths.   YesFree, GPL[48] 
PRIMEXIndexes the genome with a k-mer lookup table with full sensitivity up to an adjustable number of mismatches. It is best for mapping 15-60 bp sequences to a genome.NoNoYesNo, multiple processes per search [1]2003
QPalmaCan use quality scores, intron lengths, and computation splice site predictions to perform and performs an unbiased alignment. Can be trained to the specifics of a RNA-seq experiment and genome. Useful for splice site/intron discovery and for gene model building. (See PALMapper for a faster version).   Yes, client-serverFree, GPL 2  
RazerSNo read length limit. Hamming or edit distance mapping with configurable error rates. Configurable and predictable sensitivity (runtime/sensitivity tradeoff). Supports paired-end read mapping.    Free, LGPL  
REAL, cREALREAL is an efficient, accurate, and sensitive tool for aligning short reads obtained from next-generation sequencing. The programme can handle an enormous amount of single-end reads generated by the next-generation Illumina/Solexa Genome Analyzer. cREAL is a simple extension of REAL for aligning short reads obtained from next-generation sequencing to a genome with circular structure. Yes YesFree, GPL  
RMAPCan map reads with or without error probability information (quality scores) and supports paired-end reads or bisulfite-treated read mapping. There are no limitations on read length or number of mismatches.YesYesYes Free, GPL 3  
rNAA randomized Numerical Aligner for Accurate alignment of NGS readsYesLow quality bases trimmingYesMultithreading and MPI-enabledFree, GPL 3  
RTG InvestigatorExtremely fast, tolerant to high indel and substitution counts. Includes full read alignment. Product includes comprehensive pipelines for variant detection and metagenomic analysis with any combination of Illumina, Complete Genomics and Roche 454 data.YesYes, for variant callingYesYesProprietary, freeware for individual investigator use  
SegemehlCan handle insertions, deletions, mismatches; uses enhanced suffix arraysYesNoYesYesProprietary, freeware for noncommercial use[49]2009
SeqMapUp to 5 mixed substitutions and insertions-deletions; various tuning options and input-output formats    Proprietary, freeware for academic and noncommercial use  
ShrecShort read error correction with a suffix tree data structure   Yes, Java   
SHRiMPIndexes the reference genome as of version 2. Uses masks to generate possible keys. Can map ABI SOLiD color space reads.YesYesYesYes, OpenMPFree, [[BSD licenses| style="background: #9FF; color: black; vertical-align: middle; text-align: center; " class="free table-free"|Free, BSD]] derivative

[50] [51]

2009-2011
SLIDERSlider is an application for the Illumina Sequence Analyzer output that uses the "probability" files instead of the sequence files as an input for alignment to a reference sequence or a set of reference sequences.YesYesNoNo [52][53]2009-2010
SOAP, SOAP2, SOAP3, SOAP3-dpSOAP: robust with a small (1-3) number of gaps and mismatches. Speed improvement over BLAT, uses a 12 letter hash table. SOAP2: using bidirectional BWT to build the index of reference, and it is much faster than the first version. SOAP3: GPU-accelerated version that could find all 4-mismatch alignments in tens of seconds per one million reads. SOAP3-dp, also GPU accelerated, supports arbitrary number of mismatches and gaps according to affine gap penalty scores.YesNoYes, SOAP3-dpYes, POSIX Threads; SOAP3, SOAP3-dp need GPU with CUDA supportFree, GPL[54][55] 
SOCSFor ABI SOLiD technologies. Significant increase in time to map reads with mismatches (or color errors). Uses an iterative version of the Rabin-Karp string search algorithm.   YesFree, GPL  
SparkBWAIntegrates the Burrows-Wheeler Aligner—BWA on an Apache Spark framework running atop Hadoop. Version 0.2 of October 2016, supports the algorithms BWA-MEM, BWA-backtrack, and BWA-ALN. All of them work with single-reads and paired-end reads.YesLow quality bases trimmingYesYesFree, GPL 3[56]2016
SSAHA, SSAHA2Fast for a small number of variants    Proprietary, freeware for academic and noncommercial use  
StampyFor Illumina reads. High specificity, and sensitive for reads with indels, structural variants, or many SNPs. Slow, but speed increased dramatically by using BWA for first alignment pass.YesYesYesNoProprietary, freeware for academic and noncommercial use[57]2010
SToRMFor Illumina or ABI SOLiD reads, with SAM native output. Highly sensitive for reads with many errors, indels (full from 0 to 15, extended support otherwise). Uses spaced seeds (single hit) and a very fast SSE-SSE2-AVX2-AVX-512 banded alignment filter. For fixed-length reads only, authors recommend SHRiMP2 otherwise.NoYesYesYes, OpenMPFree[58]2010
Subread, SubjuncSuperfast and accurate read aligners. Subread can be used to map both gDNA-seq and RNA-seq reads. Subjunc detects exon-exon junctions and maps RNA-seq reads. They employ a novel mapping paradigm named seed-and-vote.YesYesYesYesFree, GPL 3  
TaipanDe-novo assembler for Illumina reads    Proprietary, freeware for academic and noncommercial use  
UGENEVisual interface both for Bowtie and BWA, and an embedded alignerYesYesYesYesFree, GPL  
VelociMapperFPGA-accelerated reference sequence alignment mapping tool from TimeLogic. Faster than Burrows-Wheeler transform-based algorithms like BWA and Bowtie. Supports up to 7 mismatches and/or indels with no performance penalty. Produces sensitive Smith-Waterman gapped alignments.YesYesYesYesProprietary, commercial  
XpressAlignFPGA based sliding window short read aligner which exploits the embarrassingly parallel property of short read alignment. Performance scales linearly with number of transistors on a chip (i.e. performance guaranteed to double with each iteration of Moore's Law without modification to algorithm). Low power consumption is useful for datacentre equipment. Predictable runtime. Better price/performance than software sliding window aligners on current hardware, but not better than software BWT-based aligners currently. Can manage large numbers (>2) of mismatches. Will find all hit positions for all seeds. Single-FPGA experimental version, needs work to develop it into a multi-FPGA production version.    Proprietary, freeware for academic and noncommercial use  
ZOOM100% sensitivity for a reads between 15 and 240 bp with practical mismatches. Very fast. Support insertions and deletions. Works with Illumina & SOLiD instruments, not 454.   Yes (GUI), no (CLI)Proprietary, commercial[59] 

References

  1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ; Gish; Miller; Myers; Lipman (October 1990). "Basic local alignment search tool". Journal of Molecular Biology 215 (3): 403–10. doi:10.1016/S0022-2836(05)80360-2. PMID 2231712.  https://dx.doi.org/10.1016%2FS0022-2836%2805%2980360-2
  2. HPC-BLAST code repository https://github.com/UTennessee-JICS/HPC-BLAST
  3. Angermüller, C.; Biegert, A.; Söding, J. (Dec 2012). "Discriminative modelling of context-specific amino acid substitution probabilities". Bioinformatics 28 (24): 3240–7. doi:10.1093/bioinformatics/bts622. PMID 23080114.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbts622
  4. Buchfink, Xie and Huson (2015). "Fast and sensitive protein alignment using DIAMOND". Nature Methods 12 (1): 59–60. doi:10.1038/nmeth.3176. PMID 25402007.  https://dx.doi.org/10.1038%2Fnmeth.3176
  5. B Buchfink, K Reuter and HG Drost (2021). "Sensitive protein alignments at tree-of-life scale using DIAMOND". Nature Methods 18 (4): 366–368. doi:10.1038/s41592-021-01101-x. PMID 33828273.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=8026399
  6. Söding J (April 2005). "Protein homology detection by HMM-HMM comparison". Bioinformatics 21 (7): 951–60. doi:10.1093/bioinformatics/bti125. PMID 15531603.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbti125
  7. Remmert, Michael; Biegert, Andreas; Hauser, Andreas; Söding, Johannes (2011-12-25). "HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment". Nature Methods 9 (2): 173–175. doi:10.1038/nmeth.1818. ISSN 1548-7105. PMID 22198341.  https://dx.doi.org/10.1038%2Fnmeth.1818
  8. Hauswedell, Hannes; Singer, Jochen; Reinert, Knut (2014-09-01). "Lambda: the local aligner for massive biological data". Bioinformatics 30 (17): 349–355. doi:10.1093/bioinformatics/btu439. PMID 25161219.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=4147892
  9. Steinegger, Martin; Soeding, Johannes (2017-10-16). "MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets". Nature Biotechnology 35 (11): 1026–1028. doi:10.1038/nbt.3988. PMID 29035372. https://www.nature.com/nbt/journal/vaop/ncurrent/full/nbt.3988.html. 
  10. Rucci, Enzo; Garcia, Carlos; Botella, Guillermo; Giusti, Armando E. De; Naiouf, Marcelo; Prieto-Matias, Manuel (2016-06-30). "OSWALD: OpenCL Smith–Waterman on Altera's FPGA for Large Protein Databases". International Journal of High Performance Computing Applications 32 (3): 337–350. doi:10.1177/1094342016654215. ISSN 1094-3420. http://hpc.sagepub.com/content/early/2016/06/30/1094342016654215. 
  11. Altschul SF; Madden TL; Schäffer AA et al. (September 1997). "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs". Nucleic Acids Research 25 (17): 3389–402. doi:10.1093/nar/25.17.3389. PMID 9254694.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=146917
  12. Li W; McWilliam H; Goujon M et al. (June 2012). "PSI-Search: iterative HOE-reduced profile SSEARCH searching". Bioinformatics 28 (12): 1650–1651. doi:10.1093/bioinformatics/bts240. PMID 22539666.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=3371869
  13. Oehmen, C.; Nieplocha, J. (August 2006). "ScalaBLAST: A scalable implementation of BLAST for high-performance data-intensive bioinformatics analysis". IEEE Transactions on Parallel & Distributed Systems 17 (8): 740–749. doi:10.1109/TPDS.2006.112. https://zenodo.org/record/1232261. 
  14. Hughey, R.; Karplus, K.; Krogh, A. (2003). SAM: sequence alignment and modeling software system. Technical report UCSC-CRL-99-11 (Report). University of California, Santa Cruz, CA. http://compbio.soe.ucsc.edu/papers/sam_doc/sam_doc.html. 
  15. Rucci, Enzo; García, Carlos; Botella, Guillermo; De Giusti, Armando; Naiouf, Marcelo; Prieto-Matías, Manuel (2015-12-25). "An energy-aware performance analysis of SWIMM: Smith–Waterman implementation on Intel's Multicore and Manycore architectures". Concurrency and Computation: Practice and Experience 27 (18): 5517–5537. doi:10.1002/cpe.3598. ISSN 1532-0634. http://sedici.unlp.edu.ar/handle/10915/82869. 
  16. Rucci, Enzo; García, Carlos; Botella, Guillermo; De Giusti, Armando; Naiouf, Marcelo; Prieto-Matías, Manuel (2015-12-25). "SWIMM 2.0: enhanced Smith-Waterman on Intel's Multicore and Manycore architectures based on AVX-512 vector extensions". International Journal of Parallel Programming 47 (2): 296–317. doi:10.1007/s10766-018-0585-7. ISSN 1573-7640. http://sedici.unlp.edu.ar/handle/10915/82888. 
  17. Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison RC, Haussler D, Miller W; Kent; Smit; Zhang; Baertsch; Hardison; Haussler; Miller (2003). "Human-mouse alignments with BLASTZ". Genome Research 13 (1): 103–107. doi:10.1101/gr.809403. PMID 12529312.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=430961
  18. Harris R S (2007). Improved pairwise alignment of genomic DNA (Thesis).
  19. Sandes, Edans F. de O.; de Melo, Alba Cristina M.A. (May 2013). "Retrieving Smith-Waterman Alignments with Optimizations for Megabase Biological Sequences Using GPU". IEEE Transactions on Parallel and Distributed Systems 24 (5): 1009–1021. doi:10.1109/TPDS.2012.194.  https://dx.doi.org/10.1109%2FTPDS.2012.194
  20. Sandes, Edans F. de O.; Miranda, G.; De Melo, A.C.M.A.; Martorell, X.; Ayguade, E. (May 2014). "CUDAlign 3.0: Parallel Biological Sequence Comparison in Large GPU Clusters". Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on. p. 160. doi:10.1109/CCGrid.2014.18.  https://dx.doi.org/10.1109%2FCCGrid.2014.18
  21. Sandes, Edans F. de O.; Miranda, G.; De Melo, A.C.M.A.; Martorell, X.; Ayguade, E. (August 2014). "Fine-grain Parallel Megabase Sequence Comparison with Multiple Heterogeneous GPUs". Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. pp. 383–384. doi:10.1145/2555243.2555280.  https://dx.doi.org/10.1145%2F2555243.2555280
  22. Chivian, D; Baker, D (2006). "Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection". Nucleic Acids Research 34 (17): e112. doi:10.1093/nar/gkl480. PMID 16971460.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=1635247
  23. Girdea, M; Noe, L; Kucherov, G (January 2010). "Back-translation for discovering distant protein homologies in the presence of frameshift mutations". Algorithms for Molecular Biology 5 (6): 6. doi:10.1186/1748-7188-5-6. PMID 20047662.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2821327
  24. Ma, B.; Tromp, J.; Li, M. (2002). "PatternHunter: faster and more sensitive homology search". Bioinformatics 18 (3): 440–445. doi:10.1093/bioinformatics/18.3.440. PMID 11934743.  https://dx.doi.org/10.1093%2Fbioinformatics%2F18.3.440
  25. Li, M.; Ma, B.; Kisman, D.; Tromp, J. (2004). "Patternhunter II: highly sensitive and fast homology search". Journal of Bioinformatics and Computational Biology 2 (3): 417–439. doi:10.1142/S0219720004000661. PMID 15359419.  https://dx.doi.org/10.1142%2FS0219720004000661
  26. Gusfield, Dan (1997). Algorithms on strings, trees and sequences. Cambridge university press. ISBN 978-0-521-58519-4. 
  27. Rucci, Enzo; Garcia, Carlos; Botella, Guillermo; Naiouf, Marcelo; De Giusti,Armando; Prieto-Matias, Manuel (2018). "SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences". BMC Systems Biology 12 (Suppl 5): 96. doi:10.1186/s12918-018-0614-6. PMID 30458766.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=6245597
  28. Rucci, Enzo; Garcia, Carlos; Botella, Guillermo; Naiouf, Marcelo; De Giusti,Armando; Prieto-Matias, Manuel. "Accelerating Smith-Waterman Alignment of Long DNA Sequences with OpenCL on FPGA". 5th International Work-Conference on Bioinformatics and Biomedical Engineering. pp. 500–511. doi:10.1007/978-3-319-56154-7_45.  https://dx.doi.org/10.1007%2F978-3-319-56154-7_45
  29. Rasmussen K, Stoye J, Myers EW; Stoye; Myers (2006). "Efficient q-Gram Filters for Finding All epsilon-Matches over a Given Length". Journal of Computational Biology 13 (2): 296–308. doi:10.1089/cmb.2006.13.296. PMID 16597241.  https://dx.doi.org/10.1089%2Fcmb.2006.13.296
  30. Noe L, Kucherov G; Kucherov (2005). "YASS: enhancing the sensitivity of DNA similarity search". Nucleic Acids Research 33 (suppl_2): W540–W543. doi:10.1093/nar/gki478. PMID 15980530.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=1160238
  31. Pratas, Diogo; Silva, Jorge (2020). "Persistent minimal sequences of SARS-CoV-2". Bioinformatics 36 (21): 5129–5132. doi:10.1093/bioinformatics/btaa686. PMID 32730589.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=7559010
  32. Wilton, Richard; Budavari, Tamas; Langmead, Ben; Wheelan, Sarah J.; Salzberg, Steven L.; Szalay, Alexander S. (2015). "Arioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space". PeerJ 3: e808. doi:10.7717/peerj.808. PMID 25780763.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=4358639
  33. Homer, Nils; Merriman, Barry; Nelson, Stanley F. (2009). "BFAST: An Alignment Tool for Large Scale Genome Resequencing". PLOS ONE 4 (11): e7767. doi:10.1371/journal.pone.0007767. PMID 19907642. Bibcode: 2009PLoSO...4.7767H.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2770639
  34. Abuín, J.M.; Pichel, J.C.; Pena, T.F.; Amigo, J. (2015). "BigBWA: approaching the Burrows–Wheeler aligner to Big Data technologies". Bioinformatics 31 (24): 4003–5. doi:10.1093/bioinformatics/btv506. PMID 26323715.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbtv506
  35. Kent, W. J. (2002). "BLAT---The BLAST-Like Alignment Tool". Genome Research 12 (4): 656–664. doi:10.1101/gr.229202. ISSN 1088-9051. PMID 11932250.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=187518
  36. Langmead, Ben; Trapnell, Cole; Pop, Mihai; Salzberg, Steven L (2009). "Ultrafast and memory-efficient alignment of short DNA sequences to the human genome". Genome Biology 10 (3): R25. doi:10.1186/gb-2009-10-3-r25. ISSN 1465-6906. PMID 19261174.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2690996
  37. Li, H.; Durbin, R. (2009). "Fast and accurate short read alignment with Burrows-Wheeler transform". Bioinformatics 25 (14): 1754–1760. doi:10.1093/bioinformatics/btp324. ISSN 1367-4803. PMID 19451168.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2705234
  38. Kerpedjiev, Peter; Frellsen, Jes; Lindgreen, Stinus; Krogh, Anders (2014). "Adaptable probabilistic mapping of short reads using position specific scoring matrices". BMC Bioinformatics 15 (1): 100. doi:10.1186/1471-2105-15-100. ISSN 1471-2105. PMID 24717095.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=4021105
  39. Liu, Y.; Schmidt, B.; Maskell, D. L. (2012). "CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform". Bioinformatics 28 (14): 1830–1837. doi:10.1093/bioinformatics/bts276. ISSN 1367-4803. PMID 22576173.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbts276
  40. Liu, Y.; Schmidt, B. (2012). "Long read alignment based on maximal exact match seeds". Bioinformatics 28 (18): i318–i324. doi:10.1093/bioinformatics/bts414. ISSN 1367-4803. PMID 22962447.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=3436841
  41. Rizk, Guillaume; Lavenier, Dominique (2010). "GASSST: global alignment short sequence search tool". Bioinformatics 26 (20): 2534–2540. doi:10.1093/bioinformatics/btq485. PMID 20739310.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2951093
  42. Marco-Sola, Santiago; Sammeth, Michael; Guigó, Roderic; Ribeca, Paolo (2012). "The GEM mapper: fast, accurate and versatile alignment by filtration". Nature Methods 9 (12): 1185–1188. doi:10.1038/nmeth.2221. ISSN 1548-7091. PMID 23103880.  https://dx.doi.org/10.1038%2Fnmeth.2221
  43. Clement, N. L.; Snell, Q.; Clement, M. J.; Hollenhorst, P. C.; Purwar, J.; Graves, B. J.; Cairns, B. R.; Johnson, W. E. (2009). "The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next-generation sequencing". Bioinformatics 26 (1): 38–45. doi:10.1093/bioinformatics/btp614. ISSN 1367-4803. PMID 19861355.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=6276904
  44. Santana-Quintero, Luis; Dingerdissen, Hayley; Thierry-Mieg, Jean; Mazumder, Raja; Simonyan, Vahan (2014). "HIVE-Hexagon: High-Performance, Parallelized Sequence Alignment for Next-Generation Sequencing Data Analysis". PLOS ONE 9 (6): 1754–1760. doi:10.1371/journal.pone.0099033. PMID 24918764. Bibcode: 2014PLoSO...999033S.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=4053384
  45. Kielbasa, S.M.; Wan, R.; Sato, K.; Horton, P.; Frith, M.C. (2011). "Adaptive seeds tame genomic sequence comparison". Genome Research 21 (3): 487–493. doi:10.1101/gr.113985.110. PMID 21209072.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=3044862
  46. Rivals, Eric; Salmela, Leena; Kiiskinen, Petteri; Kalsi, Petri; Tarhio, Jorma (2009). mpscan: Fast Localisation of Multiple Reads in Genomes. Lecture Notes in Computer Science. 5724. 246–260. doi:10.1007/978-3-642-04241-6_21. ISBN 978-3-642-04240-9. Bibcode: 2009LNCS.5724..246R.  https://dx.doi.org/10.1007%2F978-3-642-04241-6_21
  47. Sedlazeck, Fritz J.; Rescheneder, Philipp; von Haeseler, Arndt (2013). "NextGenMap: fast and accurate read mapping in highly polymorphic genomes". Bioinformatics 29 (21): 2790–2791. doi:10.1093/bioinformatics/btt468. PMID 23975764.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbtt468
  48. Chen, Yangho; Souaiaia, Tade; Chen, Ting (2009). "PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds". Bioinformatics 25 (19): 2514–2521. doi:10.1093/bioinformatics/btp486. PMID 19675096.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2752623
  49. Searls, David B.; Hoffmann, Steve; Otto, Christian; Kurtz, Stefan; Sharma, Cynthia M.; Khaitovich, Philipp; Vogel, Jörg; Stadler, Peter F. et al. (2009). "Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures". PLOS Computational Biology 5 (9): e1000502. doi:10.1371/journal.pcbi.1000502. ISSN 1553-7358. PMID 19750212. Bibcode: 2009PLSCB...5E0502H.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2730575
  50. Rumble, Stephen M.; Lacroute, Phil; Dalca, Adrian V.; Fiume, Marc; Sidow, Arend; Brudno, Michael (2009). "SHRiMP: Accurate Mapping of Short Color-space Reads". PLOS Computational Biology 5 (5): e1000386. doi:10.1371/journal.pcbi.1000386. PMID 19461883. Bibcode: 2009PLSCB...5E0386R.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2678294
  51. David, Matei; Dzamba, Misko; Lister, Dan; Ilie, Lucian; Brudno, Michael (2011). "SHRiMP2: Sensitive yet Practical Short Read Mapping". Bioinformatics 27 (7): 1011–1012. doi:10.1093/bioinformatics/btr046. PMID 21278192.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbtr046
  52. Malhis, Nawar; Butterfield, Yaron S. N.; Ester, Martin; Jones, Steven J. M. (2009). "Slider – Maximum use of probability information for alignment of short sequence reads and SNP detection". Bioinformatics 25 (1): 6–13. doi:10.1093/bioinformatics/btn565. PMID 18974170.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2638935
  53. Malhis, Nawar; Jones, Steven J. M. (2010). "High Quality SNP Calling Using Illumina Data at Shallow Coverage". Bioinformatics 26 (8): 1029–1035. doi:10.1093/bioinformatics/btq092. PMID 20190250.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbtq092
  54. Li, R.; Li, Y.; Kristiansen, K.; Wang, J. (2008). "SOAP: short oligonucleotide alignment program". Bioinformatics 24 (5): 713–714. doi:10.1093/bioinformatics/btn025. ISSN 1367-4803. PMID 18227114.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbtn025
  55. Li, R.; Yu, C.; Li, Y.; Lam, T.-W.; Yiu, S.-M.; Kristiansen, K.; Wang, J. (2009). "SOAP2: an improved ultrafast tool for short read alignment". Bioinformatics 25 (15): 1966–1967. doi:10.1093/bioinformatics/btp336. ISSN 1367-4803. PMID 19497933.  https://dx.doi.org/10.1093%2Fbioinformatics%2Fbtp336
  56. Abuín, José M.; Pichel, Juan C.; Pena, Tomás F.; Amigo, Jorge (2016-05-16). "SparkBWA: Speeding Up the Alignment of High-Throughput DNA Sequencing Data". PLOS ONE 11 (5): e0155461. doi:10.1371/journal.pone.0155461. ISSN 1932-6203. PMID 27182962. Bibcode: 2016PLoSO..1155461A.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=4868289
  57. Lunter, G.; Goodson, M. (2010). "Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads". Genome Research 21 (6): 936–939. doi:10.1101/gr.111120.110. ISSN 1088-9051. PMID 20980556.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=3106326
  58. Noe, L.; Girdea, M.; Kucherov, G. (2010). "Designing efficient spaced seeds for SOLiD read mapping". Advances in Bioinformatics 2010: 708501. doi:10.1155/2010/708501. PMID 20936175.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2945724
  59. Lin, H.; Zhang, Z.; Zhang, M.Q.; Ma, B.; Li, M. (2008). "ZOOM! Zillions of oligos mapped". Bioinformatics 24 (21): 2431–2437. doi:10.1093/bioinformatics/btn416. PMID 18684737.  http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=2732274
More
Video Production Service