| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© 2004 Society of Systematic Biologists
Modeling Compositional Heterogeneity
Department of Zoology, The Natural History Museum Cromwell Road, London SW7 5BD, United Kingdom; E-mail: p.foster{at}nhm.ac.uk
Edited by Ted Schultz: Associate Editor
| Abstract |
|---|
Compositional heterogeneity among lineages can compromise phylogenetic analyses, because models in common use assume compositionally homogeneous data. Models that can accommodate compositional heterogeneity with few extra parameters are described here, and used in two examples where the true tree is known with confidence. It is shown using likelihood ratio tests that adequate modeling of compositional heterogeneity can be achieved with few composition parameters, that the data may not need to be modelled with separate composition parameters for each branch in the tree. Tree searching and placement of composition vectors on the tree are done in a Bayesian framework using Markov chain Monte Carlo (MCMC) methods. Assessment of fit of the model to the data is made in both maximum likelihood (ML) and Bayesian frameworks. In an ML framework, overall model fit is assessed using the Goldman-Cox test, and the fit of the composition implied by a (possibly heterogeneous) model to the composition of the data is assessed using a novel tree- and model-based composition fit test. In a Bayesian framework, overall model fit and composition fit are assessed using posterior predictive simulation. It is shown that when composition is not accommodated, then the model does not fit, and incorrect trees are found; but when composition is accommodated, the model then fits, and the known correct phylogenies are obtained.
Keywords: Compositional heterogeneity; Markov chain Monte Carlo; maximum likelihood; model assessment; model selection; phylogenetics
Received July 21, 2003; Revised October 31, 2003; Accepted December 14, 2003
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
P. G. Foster, C. J. Cox, and T. M. Embley The primary divisions of life: a phylogenomic approach employing composition-heterogeneous methods Phil Trans R Soc B, August 12, 2009; 364(1527): 2197 - 2207. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. C. Sheffield, H. Song, S. L. Cameron, and M. F. Whiting Nonstationary Evolution and Compositional Heterogeneity in Beetle Mitochondrial Phylogenomics Syst Biol, August 1, 2009; 58(4): 381 - 394. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Richards, D. M. Soanes, P. G. Foster, G. Leonard, C. R. Thornton, and N. J. Talbot Phylogenomic Analysis Demonstrates a Pattern of Rare and Ancient Horizontal Gene Transfer between Plants and Fungi PLANT CELL, July 1, 2009; 21(7): 1897 - 1911. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Delport, K. Scheffler, and C. Seoighe Models of coding sequence evolution Brief Bioinform, January 1, 2009; 10(1): 97 - 109. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Cox, P. G. Foster, R. P. Hirt, S. R. Harris, and T. M. Embley The archaebacterial origin of eukaryotes PNAS, December 23, 2008; 105(51): 20356 - 20361. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Archibald The eocyte hypothesis and the origin of eukaryotic cells PNAS, December 23, 2008; 105(51): 20049 - 20050. [Full Text] [PDF] |
||||
![]() |
B. Kolaczkowski and J. W. Thornton A Mixed Branch Length Model of Heterotachy Improves Phylogenetic Accuracy Mol. Biol. Evol., June 1, 2008; 25(6): 1054 - 1066. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Blanquart and N. Lartillot A Site- and Time-Heterogeneous Model of Amino Acid Replacement Mol. Biol. Evol., May 1, 2008; 25(5): 842 - 858. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Lartillot and H. Philippe Improvement of molecular phylogenetic inference and the phylogeny of Bilateria Phil Trans R Soc B, April 27, 2008; 363(1496): 1463 - 1472. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ripplinger and J. Sullivan Does Choice in Model Selection Affect Maximum Likelihood Analysis? Syst Biol, February 1, 2008; 57(1): 76 - 85. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rodriguez-Ezpeleta, H. Brinkmann, B. Roure, N. Lartillot, B. F. Lang, and H. Philippe Detecting and Overcoming Systematic Errors in Genome-Scale Phylogenies Syst Biol, June 1, 2007; 56(3): 389 - 399. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Gowri-Shankar and M. Rattray A Reversible Jump Method for Bayesian Phylogenetic Inference with a Nonhomogeneous Substitution Model Mol. Biol. Evol., June 1, 2007; 24(6): 1286 - 1299. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Jayaswal, J. Robinson, and L. Jermiin Estimation of Phylogeny and Invariant Sites under the General Markov Model of Nucleotide Sequence Evolution Syst Biol, April 1, 2007; 56(2): 155 - 162. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. B. Bevan, D. Bryant, and B. F. Lang Accounting for Gene Rate Heterogeneity in Phylogenetic Inference Syst Biol, April 1, 2007; 56(2): 194 - 205. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Ruano-Rubio and M. A. Fares Artifactual Phylogenies Caused by Correlated Distribution of Substitution Rates among Sites and Lineages: The Good, the Bad, and the Ugly Syst Biol, February 1, 2007; 56(1): 68 - 82. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Blanquart and N. Lartillot A Bayesian Compound Stochastic Process for Modeling Nonstationary and Nonhomogeneous Sequence Evolution Mol. Biol. Evol., November 1, 2006; 23(11): 2058 - 2071. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Boussau and M. Gouy Efficient Likelihood Computations with Nonreversible Models of Evolution Syst Biol, October 1, 2006; 55(5): 756 - 768. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J Roger and L. A Hug The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation Phil Trans R Soc B, June 29, 2006; 361(1470): 1039 - 1054. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Ababneh, L. S. Jermiin, C. Ma, and J. Robinson Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences Bioinformatics, May 15, 2006; 22(10): 1225 - 1231. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Phillips, P. A. McLenachan, C. Down, G. C. Gibb, and D. Penny Combined Mitochondrial and Nuclear DNA Sequences Resolve the Interrelations of the Major Australasian Marsupial Radiations Syst Biol, February 1, 2006; 55(1): 122 - 137. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Gowri-Shankar and M. Rattray On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic Inference Mol. Biol. Evol., February 1, 2006; 23(2): 352 - 364. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Hampl, D. S. Horner, P. Dyal, J. Kulda, J. Flegr, P. G. Foster, and T. M. Embley Inference of the Phylogenetic Position of Oxymonads Based on Nine Genes: Support for Metamonada and Excavata Mol. Biol. Evol., December 1, 2005; 22(12): 2508 - 2518. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Collins, O. Fedrigo, and G. J. P. Naylor Choosing the Best Genes for the Job: The Case for Stationary Genes in Genome-Scale Phylogenetics Syst Biol, June 1, 2005; 54(3): 493 - 500. [Full Text] [PDF] |
||||
![]() |
S. Y. W. Ho, M. J. Phillips, A. J. Drummond, and A. Cooper Accuracy of Rate Estimation Using Relaxed-Clock Models with a Critical Focus on the Early Metazoan Radiation Mol. Biol. Evol., May 1, 2005; 22(5): 1355 - 1363. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pons and A. P. Vogler Complex Pattern of Coalescence and Fast Evolution of a Mitochondrial rRNA Pseudogene in a Recent Radiation of Tiger Beetles Mol. Biol. Evol., April 1, 2005; 22(4): 991 - 1000. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Pisani Identifying and Removing Fast-Evolving Sites Using Compatibility Analysis: An Example from the Arthropoda Syst Biol, December 1, 2004; 53(6): 978 - 989. [Full Text] [PDF] |
||||
![]() |
L. S. Jermiin, S. Y.W. Ho, F. Ababneh, J. Robinson, and A. W.D. Larkum The Biasing Effect of Compositional Heterogeneity on Phylogenetic Estimates May be Underestimated Syst Biol, August 1, 2004; 53(4): 638 - 643. [Full Text] [PDF] |
||||






