MUSCLE (alignment software)
MUSCLE (multiple sequence comparison by log-expectation) is public domain, multiple sequence alignment software for protein and nucleotide sequences. The method was published by Robert C. Edgar in two papers in 2004. The first paper, published in Nucleic Acids Research, introduced the sequence alignment algorithm. The second paper, published in BMC Bioinformatics, presented more technical details.
The MUSCLE algorithm proceeds in three stages: the 'draft progressive', 'improved progressive' and 'refinement' stages. In the 'draft progressive' stage, the algorithm produces a draft multiple alignment, with the emphasis on speed rather than accuracy. In the 'improved progressive' stage, the Kimura distance is used to reestimate the binary tree used to create the draft alignment, in turn producing a more accurate multiple alignment. The final 'refinement' stage refines the improved alignment produced in the second step. Multiple alignments are available at the end of each stage. The time complexity of the first two stages of the algorithm is O(N2L + NL2); the space complexity is O(N2 + NL + L2). The 'refinement' stage adds a further O(N3L) term to the time complexity. MUSCLE is often used as a replacement for Clustal, since it typically (but not always) gives better sequence alignments, depending on the chosen options. In addition, MUSCLE is significantly faster than Clustal, especially for larger alignments.
MUSCLE is integrated into Geneious and MacVector and is available in Sequencher, MEGA and UGENE as a plugin. MUSCLE is also available as a web service provided by EMBL-EBI. As of September 2014, the two papers describing MUSCLE have been cited more than 12,000 times in total.
- Edgar RC (2004). "MUSCLE: multiple sequence alignment with high accuracy and high throughput". Nucleic Acids Research 32 (5): 1792–97. doi:10.1093/nar/gkh340. PMC 390337. PMID 15034147.
- Edgar RC (2004). "MUSCLE: a multiple sequence alignment method with reduced time and space complexity". BMC Bioinformatics 5 (1): 113. doi:10.1186/1471-2105-5-113. PMC 517706. PMID 15318951.
- "MUSCLE < Multiple Sequence Alignment < EMBL-EBI". Retrieved 1 September 2014.
- "Robert C. Edgar - Google Scholar Citations". Retrieved 1 September 2014.