Skip Navigation

Systematic Biology 2004 53(3):485-495; doi:10.1080/10635150490445779
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (56)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Foster, P. G.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Foster, P. G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2004 Society of Systematic Biologists

Modeling Compositional Heterogeneity

Peter G. Foster

Department of Zoology, The Natural History Museum Cromwell Road, London SW7 5BD, United Kingdom; E-mail: p.foster{at}nhm.ac.uk

Edited by Ted Schultz: Associate Editor


   Abstract

Compositional heterogeneity among lineages can compromise phylogenetic analyses, because models in common use assume compositionally homogeneous data. Models that can accommodate compositional heterogeneity with few extra parameters are described here, and used in two examples where the true tree is known with confidence. It is shown using likelihood ratio tests that adequate modeling of compositional heterogeneity can be achieved with few composition parameters, that the data may not need to be modelled with separate composition parameters for each branch in the tree. Tree searching and placement of composition vectors on the tree are done in a Bayesian framework using Markov chain Monte Carlo (MCMC) methods. Assessment of fit of the model to the data is made in both maximum likelihood (ML) and Bayesian frameworks. In an ML framework, overall model fit is assessed using the Goldman-Cox test, and the fit of the composition implied by a (possibly heterogeneous) model to the composition of the data is assessed using a novel tree- and model-based composition fit test. In a Bayesian framework, overall model fit and composition fit are assessed using posterior predictive simulation. It is shown that when composition is not accommodated, then the model does not fit, and incorrect trees are found; but when composition is accommodated, the model then fits, and the known correct phylogenies are obtained.

Keywords: Compositional heterogeneity; Markov chain Monte Carlo; maximum likelihood; model assessment; model selection; phylogenetics

Received July 21, 2003; Revised October 31, 2003; Accepted December 14, 2003
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Phil Trans R Soc BHome page
P. G. Foster, C. J. Cox, and T. M. Embley
The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods
Phil Trans R Soc B, August 12, 2009; 364(1527): 2197 - 2207.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
N. C. Sheffield, H. Song, S. L. Cameron, and M. F. Whiting
Nonstationary Evolution and Compositional Heterogeneity in Beetle Mitochondrial Phylogenomics
Syst Biol, August 1, 2009; 58(4): 381 - 394.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
T. A. Richards, D. M. Soanes, P. G. Foster, G. Leonard, C. R. Thornton, and N. J. Talbot
Phylogenomic Analysis Demonstrates a Pattern of Rare and Ancient Horizontal Gene Transfer between Plants and Fungi
PLANT CELL, July 1, 2009; 21(7): 1897 - 1911.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
W. Delport, K. Scheffler, and C. Seoighe
Models of coding sequence evolution
Brief Bioinform, January 1, 2009; 10(1): 97 - 109.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
C. J. Cox, P. G. Foster, R. P. Hirt, S. R. Harris, and T. M. Embley
The archaebacterial origin of eukaryotes
PNAS, December 23, 2008; 105(51): 20356 - 20361.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. M. Archibald
The eocyte hypothesis and the origin of eukaryotic cells
PNAS, December 23, 2008; 105(51): 20049 - 20050.
[Full Text] [PDF]


Home page
Mol Biol EvolHome page
B. Kolaczkowski and J. W. Thornton
A Mixed Branch Length Model of Heterotachy Improves Phylogenetic Accuracy
Mol. Biol. Evol., June 1, 2008; 25(6): 1054 - 1066.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Blanquart and N. Lartillot
A Site- and Time-Heterogeneous Model of Amino Acid Replacement
Mol. Biol. Evol., May 1, 2008; 25(5): 842 - 858.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
N. Lartillot and H. Philippe
Improvement of molecular phylogenetic inference and the phylogeny of Bilateria
Phil Trans R Soc B, April 27, 2008; 363(1496): 1463 - 1472.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
J. Ripplinger and J. Sullivan
Does Choice in Model Selection Affect Maximum Likelihood Analysis?
Syst Biol, February 1, 2008; 57(1): 76 - 85.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
N. Rodriguez-Ezpeleta, H. Brinkmann, B. Roure, N. Lartillot, B. F. Lang, and H. Philippe
Detecting and Overcoming Systematic Errors in Genome-Scale Phylogenies
Syst Biol, June 1, 2007; 56(3): 389 - 399.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
V. Gowri-Shankar and M. Rattray
A Reversible Jump Method for Bayesian Phylogenetic Inference with a Nonhomogeneous Substitution Model
Mol. Biol. Evol., June 1, 2007; 24(6): 1286 - 1299.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
V. Jayaswal, J. Robinson, and L. Jermiin
Estimation of Phylogeny and Invariant Sites under the General Markov Model of Nucleotide Sequence Evolution
Syst Biol, April 1, 2007; 56(2): 155 - 162.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
R. B. Bevan, D. Bryant, and B. F. Lang
Accounting for Gene Rate Heterogeneity in Phylogenetic Inference
Syst Biol, April 1, 2007; 56(2): 194 - 205.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
V. Ruano-Rubio and M. A. Fares
Artifactual Phylogenies Caused by Correlated Distribution of Substitution Rates among Sites and Lineages: The Good, the Bad, and the Ugly
Syst Biol, February 1, 2007; 56(1): 68 - 82.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Blanquart and N. Lartillot
A Bayesian Compound Stochastic Process for Modeling Nonstationary and Nonhomogeneous Sequence Evolution
Mol. Biol. Evol., November 1, 2006; 23(11): 2058 - 2071.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
B. Boussau and M. Gouy
Efficient Likelihood Computations with Nonreversible Models of Evolution
Syst Biol, October 1, 2006; 55(5): 756 - 768.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
A. J Roger and L. A Hug
The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation
Phil Trans R Soc B, June 29, 2006; 361(1470): 1039 - 1054.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F. Ababneh, L. S. Jermiin, C. Ma, and J. Robinson
Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences
Bioinformatics, May 15, 2006; 22(10): 1225 - 1231.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
M. J. Phillips, P. A. McLenachan, C. Down, G. C. Gibb, and D. Penny
Combined Mitochondrial and Nuclear DNA Sequences Resolve the Interrelations of the Major Australasian Marsupial Radiations
Syst Biol, February 1, 2006; 55(1): 122 - 137.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
V. Gowri-Shankar and M. Rattray
On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic Inference
Mol. Biol. Evol., February 1, 2006; 23(2): 352 - 364.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
V. Hampl, D. S. Horner, P. Dyal, J. Kulda, J. Flegr, P. G. Foster, and T. M. Embley
Inference of the Phylogenetic Position of Oxymonads Based on Nine Genes: Support for Metamonada and Excavata
Mol. Biol. Evol., December 1, 2005; 22(12): 2508 - 2518.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
T. M. Collins, O. Fedrigo, and G. J. P. Naylor
Choosing the Best Genes for the Job: The Case for Stationary Genes in Genome-Scale Phylogenetics
Syst Biol, June 1, 2005; 54(3): 493 - 500.
[Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Y. W. Ho, M. J. Phillips, A. J. Drummond, and A. Cooper
Accuracy of Rate Estimation Using Relaxed-Clock Models with a Critical Focus on the Early Metazoan Radiation
Mol. Biol. Evol., May 1, 2005; 22(5): 1355 - 1363.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Pons and A. P. Vogler
Complex Pattern of Coalescence and Fast Evolution of a Mitochondrial rRNA Pseudogene in a Recent Radiation of Tiger Beetles
Mol. Biol. Evol., April 1, 2005; 22(4): 991 - 1000.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
D. Pisani
Identifying and Removing Fast-Evolving Sites Using Compatibility Analysis: An Example from the Arthropoda
Syst Biol, December 1, 2004; 53(6): 978 - 989.
[Full Text] [PDF]


Home page
Syst BiolHome page
L. S. Jermiin, S. Y.W. Ho, F. Ababneh, J. Robinson, and A. W.D. Larkum
The Biasing Effect of Compositional Heterogeneity on Phylogenetic Estimates May be Underestimated
Syst Biol, August 1, 2004; 53(4): 638 - 643.
[Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.