MAFFT multiple sequence alignment software version 7: improvements in performance and usability.
Katoh K, Standley DM -Molecular biology and evolution(2013)
(A) A part of output of the ––treeout option showing the phylogenetic positions of new sequences (new#) in the tree of the existing alignment (backbone#), estimated before the alignment calculation. This file also shows a Newick format tree of the existing alignment (not shown in this figure). For each new sequence, the nearest sequence in the existing alignment (nearest sequence), approximate distance to the nearest sequence (approximate distance), and the members of the sister group (sister group) are shown. (B) Graphical representation of (A).
mst010-F3:(A) A part of output of the ––treeout option showing the phylogenetic positions of new sequences (new#) in the tree of the existing alignment (backbone#), estimated before the alignment calculation. This file also shows a Newick format tree of the existing alignment (not shown in this figure). For each new sequence, the nearest sequence in the existing alignment (nearest sequence), approximate distance to the nearest sequence (approximate distance), and the members of the sister group (sister group) are shown. (B) Graphical representation of (A).
View Article:PubMed Central - PubMed
Affiliation:Immunology Frontier Research Center, Osaka University, Suita, Osaka, Japan. kazutaka.katoh@aist.go.jp
Additional Figures:
Article
Collection
Results
Bottom Line:We report a major update of the MAFFT multiple sequence alignment program.This report shows actual examples to explain how these features work, alone and in combination.Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations.
Abstract
We report a major update of the MAFFT multiple sequence alignment program. This version has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update. This report shows actual examples to explain how these features work, alone and in combination. Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations. We discuss how to avoid misalignments, and our ongoing efforts to overcome such limitations.
Mentions
Users can check the guide tree by using the ––treeout option. In the case of ––addfragments, the estimated phylogenetic positions of new sequences are shown together with the estimated tree of the existing alignment. The alignment calculation is performed based on this phylogenetic estimation. It is also possible to compute such phylogenetic information only, without alignment, by the ––retree 0 option. An example of output is shown in Figure 3A.Fig. 3.
MeSH
Major
Sequence Alignment/methods*
Software*
Minor
Algorithms
Amino Acid Sequence
Base Sequence
DNA, Fungal/genetics
DNA, Ribosomal/genetics
DNA, Ribosomal Spacer/genetics
Fungi/genetics
Humans
Models, Genetic
Molecular Sequence Data
Phylogeny
Protein Structure, Tertiary
Quality Improvement
RNA, Bacterial/genetics
Ribonucleases/chemistry/genetics
Ribosome Subunits, Small, Bacterial/genetics