© 2004 Society of Systematic Biologists
Tracing the Decay of the Historical Signal in Biological Sequence Data
1 School of Biological Sciences, University of Sydney NSW 2006 Australia
2 Sydney University Biological Informatics and Technology Centre, University of Sydney NSW 2006 Australia
Edited by Peter Lockhart: Associate Editor
| Abstract |
|---|
Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invariant sites, and covariotides, may interfere adversely with the recovery of the historical signal. The effect of the interaction of these signals on phylogenetic inference is not well understood and may, in many cases, even be underappreciated. In this study, we investigate this matter and present results based on Monte Carlo simulations. We explored the success of four phylogenetic methods in recovering the true tree from data that had evolved under conditions where the equilibrium base frequencies and substitution rates were allowed to vary among lineages. Seven scenarios with increasingly complex conditions were investigated. All of the methods tested, with the exception of neighbor-joining using LogDet distances, were sensitive to compositional convergence in nonsister lineages. Maximum parsimony was also susceptible to attraction between long edges. In many cases, however, phylogenetic inference methods can still recover the true tree when misleading signals are present, in some instances even when the historical signal is no longer dominant. These results highlight the growing need for simple methods to detect violation of the phylogenetic assumptions.
Keywords: Compositional heterogeneity; edge lengths; Monte Carlo simulation; networks; phylogenetic signal; rate heterogeneity; substitutional saturation
Received August 25, 2003; Revised December 14, 2003; Accepted March 27, 2004
3 Present address: Henry Wellcome Ancient Biomolecules Centre, Department of Zoology, University of Oxford, OX1 3PS, United Kingdom; E-mail: simon.ho{at}zoo.ox.ac.uk
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
P. G. Foster, C. J. Cox, and T. M. Embley The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods Phil Trans R Soc B, August 12, 2009; 364(1527): 2197 - 2207. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. C. Sheffield, H. Song, S. L. Cameron, and M. F. Whiting Nonstationary Evolution and Compositional Heterogeneity in Beetle Mitochondrial Phylogenomics Syst Biol, August 1, 2009; 58(4): 381 - 394. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Cox, P. G. Foster, R. P. Hirt, S. R. Harris, and T. M. Embley The archaebacterial origin of eukaryotes PNAS, December 23, 2008; 105(51): 20356 - 20361. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Shavit, D. Penny, M. D. Hendy, and B. R. Holland The Problem of Rooting Rapid Radiations Mol. Biol. Evol., November 1, 2007; 24(11): 2400 - 2411. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. White, S. Hills, R Gaddam, B. Holland, and D. Penny Treeness Triangles: Visualizing the Loss of Phylogenetic Signal Mol. Biol. Evol., September 1, 2007; 24(9): 2029 - 2039. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Susko and A. J. Roger On Reduced Amino Acid Alphabets for Phylogenetic Inference Mol. Biol. Evol., September 1, 2007; 24(9): 2139 - 2150. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A. Huttley, M. J. Wakefield, and S. Easteal Rates of Genome Evolution and Branching Order from Whole Genome Analysis Mol. Biol. Evol., August 1, 2007; 24(8): 1722 - 1730. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Jayaswal, J. Robinson, and L. Jermiin Estimation of Phylogeny and Invariant Sites under the General Markov Model of Nucleotide Sequence Evolution Syst Biol, April 1, 2007; 56(2): 155 - 162. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Baurain, H. Brinkmann, and H. Philippe Lack of Resolution in the Animal Phylogeny: Closely Spaced Cladogeneses or Undetected Systematic Errors? Mol. Biol. Evol., January 1, 2007; 24(1): 6 - 9. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. K. Ho, C. E. Adams, J. B. Lew, T. J. Matthews, C. C. Ng, A. Shahabi-Sirjani, L. H. Tan, Y. Zhao, S. Easteal, S. R Wilson, et al. SeqVis: Visualization of compositional heterogeneity in large alignments of nucleotides Bioinformatics, September 1, 2006; 22(17): 2162 - 2163. [Abstract] [Full Text] [PDF] |
||||
![]() |
T Martin Embley Multiple secondary origins of the anaerobic lifestyle in eukaryotes Phil Trans R Soc B, June 29, 2006; 361(1470): 1055 - 1067. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Ababneh, L. S. Jermiin, C. Ma, and J. Robinson Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences Bioinformatics, May 15, 2006; 22(10): 1225 - 1231. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. R. Holland, L. S. Jermiin, and V. Moulton Improved Consensus Network Techniques for Genome-Scale Phylogeny Mol. Biol. Evol., May 1, 2006; 23(5): 848 - 855. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Phillips, P. A. McLenachan, C. Down, G. C. Gibb, and D. Penny Combined Mitochondrial and Nuclear DNA Sequences Resolve the Interrelations of the Major Australasian Marsupial Radiations Syst Biol, February 1, 2006; 55(1): 122 - 137. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Lockhart and M. Steel A Tale of Two Processes Syst Biol, December 1, 2005; 54(6): 948 - 951. [Full Text] [PDF] |
||||
![]() |
S. Y. W. Ho, M. J. Phillips, A. Cooper, and A. J. Drummond Time Dependency of Molecular Rate Estimates and Systematic Overestimation of Recent Divergence Times Mol. Biol. Evol., July 1, 2005; 22(7): 1561 - 1568. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. S. Jermiin, S. Y.W. Ho, F. Ababneh, J. Robinson, and A. W.D. Larkum The Biasing Effect of Compositional Heterogeneity on Phylogenetic Estimates May be Underestimated Syst Biol, August 1, 2004; 53(4): 638 - 643. [Full Text] [PDF] |
||||




