Long branch attraction
||This article includes a list of references, related reading or external links, but its sources remain unclear because it lacks inline citations. (October 2014)|
Long branch attraction (LBA) causes species to seem more closely related in a phylogeny than they really are due to mutations or traits occurring independently (convergent evolution). These shared traits can be misinterpreted as being shared due to common ancestry. This occurs more often in long branches of a phylogeny. Until recently, long branch attraction was considered hypothetical due to insufficient evidence. However, today many different factors are taken into account in confirmation of the distance between two species (Bergsten 2005).
In phylogenetic and clustering analyses, LBA is a result of the way clustering algorithms work: terminals or taxa with many autapomorphies (character states unique to a single branch) may by chance exhibit the same states as those on another branch (homoplasy). A phylogenetic analysis will group these taxa together as a clade unless other synapomorphies outweigh the homoplastic features to group together true sister taxa.
These problems may be minimized by using methods that correct for multiple substitutions at the same site, by adding taxa related to those with the long branches that add additional true synapomorphies to the data, or by using alternative slower evolving traits (e.g. more conservative gene regions).
The result of LBA in evolutionary analyses is that rapidly evolving lineages may be inferred to be closely related, regardless of their true relationships. For example, in DNA sequence-based analyses, the problem arises when sequences from two (or more) lineages evolve rapidly. There are only four possible nucleotides and when DNA substitution rates are high, the probability that two lineages will evolve the same nucleotide at the same site increases. When this happens, parsimony may erroneously interpret this homoplasy as a synapomorphy (i.e., evolving once in the common ancestor of the two lineages).
The opposite effect may also be observed, in that if two (or more) branches exhibit particularly slow evolution among a wider, fast evolving group, those branches may be misinterpreted as closely related. As such, "long branch attraction" can in some ways be better expressed as "branch length attraction". However, it is typically long branches that exhibit attraction.
The recognition of long-branch attraction implies that there is some other evidence that suggests that the phylogeny is incorrect. For example morphological data may suggest that taxa marked as closely related are not truly sister taxa. Hennig's Auxiliary Principle suggests that synapomorphies should be viewed as de facto evidence of grouping unless there is specific contrary evidence (Hennig, 1966; Schuh and Brower, 2009).
One example of this phenomenon is the relationship between four skippers (butterflies): Agathymus mariae, Ancyloxpha numitor, Thorybes pylades, and Pyrrhopyge zenodorus. When comparing these species scientists used a multitude of procedures to compare one to another in order to get the most accurate results.
They began with analyzing a certain length of DNA in each species. Compared side by side they counted the matching nucleotides in each strand and came up with a phylogenetic tree based on the similarity shared between each DNA strand. This resulted in a tree supporting the close relationship between A. mariae and P. zenodorus.
The next step in the process involved another reconstruction method, distance-based. The amount of expected changes within each given DNA sequence was estimated. The species with similar amounts of changes were grouped together and were calculated to have a bootstrap value of 80%, also supporting tree 1.
The next method used in this procedure is called Maximum likelihood. So far in the data analysis, the trees have been in parsimony, meaning they have been the simplest forms. However, maximum likelihood is a process that takes into account what changes are the most likely to occur. It is not necessarily the easiest tree but is one that is the mostly likely to statistically occur. The maximum likelihood tree supports at tree linking A. numitor and A. mariae. This is the first method to have results that conflicts with that of the previously executed methods.
Another method to compare results is called Bayesian method. This method is very similar to maximum likelihood. It deals with the statistical data and creates a tree that represents the most likely occurrence. It differs from maximum likelihood in that it predicts how likely it would happen in the future. In this data set analysis, the Bayesian method resulted in a tree that also supports the close relationship of A. numitor and A. mariae.
When all this data is gathered and compared, we find that the second relationship is the most logical relationship. This experiment with skippers supports the importance of deciphering all the data before concluding that a certain tree is the correct one. Morphological traits are very important aspects of constructing trees, but parsimony is not always correct. It is helpful in using a few methods to determine an accurate tree (Grishin 2009).
- Bergsten, J. (2005): A review of long-branch attraction. Cladistics 21(2): 163-193. PDF fulltext
- Felsenstein, J. (2004): Inferring Phylogenies. Sinauer Associates, Sunderland, MA.
- Hennig, W. (1966): Phylogenetic Systematics. University of Illinois Press, Urbana, IL.
- Schuh, R. T. and Brower, A. V. Z. (2009): Biological Systematics: Principles and Applications, (2nd edn.) Cornell University Press, Ithaca, NY.
- Bergsten J. (2005): "A review of long-branch attraction". Blackwell Publishing [cited 2014 Oct 1] 21(2):163-193. Available from: http://onlinelibrary.wiley.com/doi/10.1111/j.1096-0031.2005.00059.x/pdf
- Grishin, Nick V. "Long Branch Attraction." Long Branch Attraction. Butterflies of America, 17 Aug. 2009. Web. 15 Sept. 2014. <http://butterfliesofamerica.com/knowhow/LBA.htm>.