MAFFT offers a range of multiple alignment strategies, L-INS-i
(accurate; recommended for <200 sequences), FFT-NS-i (standard speed and
accuracy), FFT-NS-2 (fast; recommended for >2,000 sequences), etc.
According to BAliBASE and other benchmark tests, L-INS-i is one of the
most accurate methods currently available.
MAFFT has been described:
K. Katoh and H. Toh 2008 (Briefings in Bioinformatics 9:286-298)
Recent developments in the MAFFT multiple sequence alignment program.
K. Katoh, K. Misawa, K. Kuma and T. Miyata (Nucleic Acids Res. 30:
3059-3066, 2002) MAFFT: a novel method for rapid multiple sequence
alignment based on fast Fourier transform.
MAPMAKER/EXP is a linkage analysis package designed to help construct primary
linkage maps of markers segregating in experimental crosses. MAPMAKER/EXP
performs full multipoint linkage analysis (simultaneous estimation of all
recombination fractions from the primary data) for dominant, recessive, and co-
dominant (e.g. RFLP-like) markers. MAPMAKER/EXP is an experimental-cross-only
successor to the original MAPMAKER program.
MAPMAKER/QTL is a companion program to MAPMAKER/EXP which allows one to map
genes controlling polygenic quantitative traits in F2 intercrosses and BC1
backcrosses relative to a genetic linkage map. More information on MAPMAKER/QTL
can be found in the technical report (included with MAPMAKER/QTL).
Migrate estimates effective population sizes and past migration rates between
two or "n" populations assuming a migration matrix model with asymmetric
migration rates and different subpopulation sizes. The n-population migrate
can use sequence data, microsatellite data or electrophoretic data.
SeqAn is an open source C++ library of efficient algorithms
and data structures for the analysis of sequences with the
focus on biological data. The library is licensed under the
3-clause BSD license except the applications which are GPL.
This port now only contains the header library and API
documentation. The apps are separate.
MOPAC7, semi-empirical (MNDO, AM1, PM3, MINDO/3) quantum-chemical
calculation written by James J. P. Stewart and co-workers.
MUSCLE is multiple alignment software for protein and nucleotide sequences.
The name stands for multiple sequence comparison by log-expectation.
A range of options is provided that give you the choice of optimizing
accuracy, speed, or some compromise between the two. Default parameters are
those that give the best average accuracy in the published tests. MUSCLE
can achieve both better average accuracy and better speed than CLUSTALW or
T-Coffee, depending on the chosen options.
Citation:
Edgar, R. C. (2004) MUSCLE: multiple sequence alignment with high accuracy
and high throughput. Nucleic Acids Research 32(5): 1792-1797.
Edgar, R. C. (2004) MUSCLE: a multiple sequence alignment method with
reduced time and space complexity. BMC Bioinformatics 5(1): 113.
The NAR paper gives only a brief overview of the algorithm and
implementation details. For a full discussion of the method and many of
the non-default options that it offers, please see the BMC paper.
The Basic Local Alignment Search Tool (BLAST) finds regions of local
similarity between sequences. The program compares nucleotide or protein
sequences to sequence databases and calculates the statistical
significance of matches. BLAST can be used to infer functional and
evolutionary relationships between sequences as well as help identify
members of gene families.
The NCBI (National Center for Biotechnology Information) development toolkit,
containing various libraries needed by NCBI applications, as well as a
software suite containing, amongst other things, NCBI BLAST 2.0.
From the README:
The NCBI Software Development Toolkit was developed for the production and
distribution of GenBank, Entrez, BLAST, and related services by NCBI. We
make it freely available to the public without restriction to facilitate
the use of NCBI by the scientific community. However, please understand
that while we feel we have done a high quality job, this is not commercial
software.
The documentation lags considerably behind the software and we must make
any changes required by our data production needs. Nontheless, many people
have found it a useful and stable basis for a number of tools and
applications.
AcePerl is an object-oriented Perl interface for the ACEDB genome database
system. It provides functionality for connecting to remote ACEDB databases,
performing queries, fetching ACE objects, and updating databases.
p5-Bio-Das-Lite is an implementation of a client for the DAS protocol (XML
over HTTP primarily for biological-data).