© 2004 Society of Systematic Biologists
A Phylogenetic Mixture Model for Detecting Pattern-Heterogeneity in Gene Sequence or Character-State Data
School of Animal and Microbial Sciences, University of Reading, Whiteknights, Reading RG6 6AJ, England; E-mail: m.pagel@rdg.ac.uk (M.P.)
Edited by Keith Crandall: Associate Editor
| Abstract |
|---|
We describe a general likelihood-based mixture model for inferring phylogenetic trees from gene-sequence or other character-state data. The model accommodates cases in which different sites in the alignment evolve in qualitatively distinct ways, but does not require prior knowledge of these patterns or partitioning of the data. We call this qualitative variability in the pattern of evolution across sites "pattern-heterogeneity" to distinguish it from both a homogeneous process of evolution and from one characterized principally by differences in rates of evolution. We present studies to show that the model correctly retrieves the signals of pattern-heterogeneity from simulated gene-sequence data, and we apply the method to protein-coding genes and to a ribosomal 12S data set. The mixture model outperforms conventional partitioning in both of these data sets. We implement the mixture model such that it can simultaneously detect rate- and pattern-heterogeneity. The model simplifies to a homogeneous model or a rate-variability model as special cases, and therefore always performs at least as well as these two approaches, and often considerably improves upon them. We make the model available within a Bayesian Markov-chain Monte Carlo framework for phylogenetic inference, as an easy-to-use computer program.
Keywords: Bayesian inference; MCMC; mixture model; phylogeny; rate-heterogeneity; secondary structure; sequence evolution
Received June 30, 2003; Revised October 23, 2003; Accepted January 29, 2004
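The abstract's central idea, that each site is scored under several candidate patterns without being assigned to a partition beforehand, can be illustrated with a minimal sketch. It assumes the standard finite-mixture form, in which a site's likelihood is the weighted sum of its likelihoods under each pattern; the abstract itself does not spell out this formula. The function name `mixture_log_likelihood`, the input layout, and the toy numbers are hypothetical and are not taken from the authors' program.

```python
import numpy as np


def mixture_log_likelihood(site_likelihoods, weights):
    """Log-likelihood of an alignment under a finite mixture of patterns.

    site_likelihoods: array of shape (n_sites, n_patterns); entry [i, j] is
        the likelihood of site i under pattern j. These values are assumed
        to have been computed elsewhere (for example, on a fixed tree).
    weights: array of shape (n_patterns,), non-negative and summing to 1.
    """
    site_likelihoods = np.asarray(site_likelihoods, dtype=float)
    weights = np.asarray(weights, dtype=float)
    if np.any(weights < 0) or not np.isclose(weights.sum(), 1.0):
        raise ValueError("weights must be non-negative and sum to 1")
    # Each site's likelihood is a weighted sum over all patterns, so no site
    # has to be placed in a partition in advance.
    per_site = site_likelihoods @ weights
    return float(np.sum(np.log(per_site)))


# Toy per-site likelihoods, invented purely for illustration:
# three sites (rows) scored under two patterns (columns).
site_lik = np.array([
    [0.020, 0.001],
    [0.002, 0.030],
    [0.015, 0.012],
])

# A single pattern is the homogeneous special case.
homogeneous = mixture_log_likelihood(site_lik[:, :1], [1.0])

# Putting all the weight on the first pattern gives the same value.
collapsed = mixture_log_likelihood(site_lik, [1.0, 0.0])

# An equal-weight mixture of the two patterns.
mixture = mixture_log_likelihood(site_lik, [0.5, 0.5])
```

In this sketch, setting one weight to 1 reproduces the single-pattern value exactly, which mirrors the abstract's statement that the homogeneous model is a special case of the mixture. In a full analysis the weights would be estimated together with the tree and the pattern parameters rather than fixed by hand.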