Skip Navigation

Systematic Biology 2004 53(4):623-637; doi:10.1080/10635150490503035
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (34)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Ho, S. Y.W.
Right arrow Articles by Jermiin, L. S.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Ho, S. Y.W.
Right arrow Articles by Jermiin, L. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2004 Society of Systematic Biologists

Tracing the Decay of the Historical Signal in Biological Sequence Data

Simon Y.W. Ho1,3 and Lars S. Jermiin1,2

1 School of Biological Sciences, University of Sydney NSW 2006 Australia
2 Sydney University Biological Informatics and Technology Centre, University of Sydney NSW 2006 Australia

Edited by Peter Lockhart: Associate Editor


   Abstract

Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invariant sites, and covariotides, may interfere adversely with the recovery of the historical signal. The effect of the interaction of these signals on phylogenetic inference is not well understood and may, in many cases, even be underappreciated. In this study, we investigate this matter and present results based on Monte Carlo simulations. We explored the success of four phylogenetic methods in recovering the true tree from data that had evolved under conditions where the equilibrium base frequencies and substitution rates were allowed to vary among lineages. Seven scenarios with increasingly complex conditions were investigated. All of the methods tested, with the exception of neighbor-joining using LogDet distances, were sensitive to compositional convergence in nonsister lineages. Maximum parsimony was also susceptible to attraction between long edges. In many cases, however, phylogenetic inference methods can still recover the true tree when misleading signals are present, in some instances even when the historical signal is no longer dominant. These results highlight the growing need for simple methods to detect violation of the phylogenetic assumptions.

Keywords: Compositional heterogeneity; edge lengths; Monte Carlo simulation; networks; phylogenetic signal; rate heterogeneity; substitutional saturation

Received August 25, 2003; Revised December 14, 2003; Accepted March 27, 2004


3 Present address: Henry Wellcome Ancient Biomolecules Centre, Department of Zoology, University of Oxford, OX1 3PS, United Kingdom; E-mail: simon.ho{at}zoo.ox.ac.uk


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Phil Trans R Soc BHome page
P. G. Foster, C. J. Cox, and T. M. Embley
The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods
Phil Trans R Soc B, August 12, 2009; 364(1527): 2197 - 2207.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
N. C. Sheffield, H. Song, S. L. Cameron, and M. F. Whiting
Nonstationary Evolution and Compositional Heterogeneity in Beetle Mitochondrial Phylogenomics
Syst Biol, August 1, 2009; 58(4): 381 - 394.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
C. J. Cox, P. G. Foster, R. P. Hirt, S. R. Harris, and T. M. Embley
The archaebacterial origin of eukaryotes
PNAS, December 23, 2008; 105(51): 20356 - 20361.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
L. Shavit, D. Penny, M. D. Hendy, and B. R. Holland
The Problem of Rooting Rapid Radiations
Mol. Biol. Evol., November 1, 2007; 24(11): 2400 - 2411.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
W. White, S. Hills, R Gaddam, B. Holland, and D. Penny
Treeness Triangles: Visualizing the Loss of Phylogenetic Signal
Mol. Biol. Evol., September 1, 2007; 24(9): 2029 - 2039.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
E. Susko and A. J. Roger
On Reduced Amino Acid Alphabets for Phylogenetic Inference
Mol. Biol. Evol., September 1, 2007; 24(9): 2139 - 2150.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
G. A. Huttley, M. J. Wakefield, and S. Easteal
Rates of Genome Evolution and Branching Order from Whole Genome Analysis
Mol. Biol. Evol., August 1, 2007; 24(8): 1722 - 1730.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
V. Jayaswal, J. Robinson, and L. Jermiin
Estimation of Phylogeny and Invariant Sites under the General Markov Model of Nucleotide Sequence Evolution
Syst Biol, April 1, 2007; 56(2): 155 - 162.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
D. Baurain, H. Brinkmann, and H. Philippe
Lack of Resolution in the Animal Phylogeny: Closely Spaced Cladogeneses or Undetected Systematic Errors?
Mol. Biol. Evol., January 1, 2007; 24(1): 6 - 9.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. W. K. Ho, C. E. Adams, J. B. Lew, T. J. Matthews, C. C. Ng, A. Shahabi-Sirjani, L. H. Tan, Y. Zhao, S. Easteal, S. R Wilson, et al.
SeqVis: Visualization of compositional heterogeneity in large alignments of nucleotides
Bioinformatics, September 1, 2006; 22(17): 2162 - 2163.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
T Martin Embley
Multiple secondary origins of the anaerobic lifestyle in eukaryotes
Phil Trans R Soc B, June 29, 2006; 361(1470): 1055 - 1067.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F. Ababneh, L. S. Jermiin, C. Ma, and J. Robinson
Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences
Bioinformatics, May 15, 2006; 22(10): 1225 - 1231.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
B. R. Holland, L. S. Jermiin, and V. Moulton
Improved Consensus Network Techniques for Genome-Scale Phylogeny
Mol. Biol. Evol., May 1, 2006; 23(5): 848 - 855.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
M. J. Phillips, P. A. McLenachan, C. Down, G. C. Gibb, and D. Penny
Combined Mitochondrial and Nuclear DNA Sequences Resolve the Interrelations of the Major Australasian Marsupial Radiations
Syst Biol, February 1, 2006; 55(1): 122 - 137.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
P. Lockhart and M. Steel
A Tale of Two Processes
Syst Biol, December 1, 2005; 54(6): 948 - 951.
[Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Y. W. Ho, M. J. Phillips, A. Cooper, and A. J. Drummond
Time Dependency of Molecular Rate Estimates and Systematic Overestimation of Recent Divergence Times
Mol. Biol. Evol., July 1, 2005; 22(7): 1561 - 1568.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
L. S. Jermiin, S. Y.W. Ho, F. Ababneh, J. Robinson, and A. W.D. Larkum
The Biasing Effect of Compositional Heterogeneity on Phylogenetic Estimates May be Underestimated
Syst Biol, August 1, 2004; 53(4): 638 - 643.
[Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.