© 2004 Society of Systematic Biologists
The Importance of Proper Model Assumption in Bayesian Phylogenetics
Section of Integrative Biology, University of Texas 1 University Station C0930, Austin, Texas 78712, USA; E-mail: alemmon{at}evotutor.org (A.R.L.), chorusfrog{at}mail.utexas.edu (E.C.M.)
Edited by Jack Sullivan: Associate Editor
| Abstract |
|---|
We studied the importance of proper model assumption in the context of Bayesian phylogenetics by examining > 5,000 Bayesian analyses and six nested models of nucleotide substitution. Model misspecification can strongly bias bipartition posterior probability estimates. These biases were most pronounced when rate heterogeneity was ignored. The type of bias seen at a particular bipartition appeared to be strongly influenced by the lengths of the branches surrounding that bipartition. In the Felsenstein zone, posterior probability estimates of bipartitions were biased when the assumed model was underparameterized but were unbiased when the assumed model was overparameterized. For the inverse Felsenstein zone, however, both underparameterization and overparameterization led to biased bipartition posterior probabilities, although the bias caused by overparameterization was less pronounced and disappeared with increased sequence length. Model parameter estimates were also affected by model misspecification. Underparameterization caused a bias in some parameter estimates, such as branch lengths and the gamma shape parameter, whereas overparameterization caused a decrease in the precision of some parameter estimates. We caution researchers to assure that the most appropriate model is assumed by employing both a priori model choice methods and a posteriori model adequacy tests.
Keywords: Bayesian phylogenetic inference; convergence; Markov chain Monte Carlo; maximum likelihood; model choice; posterior probability
Received March 8, 2003; Revised September 24, 2003; Accepted November 16, 2003
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
D. M. Simon, S. A. Kelchner, and S. Zimmerly A Broadscale Phylogenetic Analysis of Group II Intron RNAs and Intron-Encoded Reverse Transcriptases Mol. Biol. Evol., December 1, 2009; 26(12): 2795 - 2808. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Fletcher and Z. Yang INDELible: A Flexible Simulator of Biological Sequence Evolution Mol. Biol. Evol., August 1, 2009; 26(8): 1879 - 1888. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Joly and A. Bruneau Measuring Branch Support in Species Trees Obtained by Gene Tree Parsimony Syst Biol, May 25, 2009; (2009) syp013v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. R. Lemmon, J. M. Brown, K. Stanger-Hall, and E. M. Lemmon The Effect of Ambiguous Data on Phylogenetic Estimates Obtained by Maximum Likelihood and Bayesian Inference Syst Biol, May 22, 2009; (2009) syp017v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Brown and R. ElDabaje PuMA: Bayesian analysis of partitioned (and unpartitioned) model adequacy Bioinformatics, February 15, 2009; 25(4): 537 - 538. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang Empirical evaluation of a prior for Bayesian phylogenetic inference Phil Trans R Soc B, December 27, 2008; 363(1512): 4031 - 4039. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Dornburg, F. Santini, and M. E. Alfaro The Influence of Model Averaging on Clade Posteriors: An Example Using the Triggerfishes (Family Balistidae) Syst Biol, December 1, 2008; 57(6): 905 - 919. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Scherson, R. Vidal, and M. J. Sanderson Phylogeny, biogeography, and rates of diversification of New World Astragalus (Leguminosae) with an emphasis on South American radiations Am. J. Botany, August 1, 2008; 95(8): 1030 - 1039. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Posada jModelTest: Phylogenetic Model Averaging Mol. Biol. Evol., July 1, 2008; 25(7): 1253 - 1256. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. M. Belfiore, L. Liu, and C. Moritz Multilocus Phylogenetics of a Rapid Radiation in the Genus Thomomys (Rodentia: Geomyidae) Syst Biol, April 1, 2008; 57(2): 294 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ripplinger and J. Sullivan Does Choice in Model Selection Affect Maximum Likelihood Analysis? Syst Biol, February 1, 2008; 57(1): 76 - 85. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. McGuire, C. C. Witt, D. L. Altshuler, and J. V. Remsen Phylogenetic Systematics and Biogeography of Hummingbirds: Bayesian and Maximum Likelihood Analyses of Partitioned Data and Selection of an Appropriate Partitioning Strategy Syst Biol, October 1, 2007; 56(5): 837 - 856. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Kolaczkowski and J. W. Thornton Effects of Branch Length Uncertainty on Bayesian Posterior Probabilities for Phylogenetic Hypotheses Mol. Biol. Evol., September 1, 2007; 24(9): 2108 - 2118. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. F. Hugall, R. Foster, and M. S. Y. Lee Calibration Choice, Rate Smoothing, and the Pattern of Tetrapod Diversification According to the Long Nuclear Gene RAG-1 Syst Biol, August 1, 2007; 56(4): 543 - 563. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Brown and A. R. Lemmon The Importance of Data Partitioning and the Utility of Bayes Factors in Bayesian Phylogenetics Syst Biol, August 1, 2007; 56(4): 643 - 655. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang Fair-Balance Paradox, Star-tree Paradox, and Bayesian Phylogenetics Mol. Biol. Evol., August 1, 2007; 24(8): 1639 - 1655. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Pisani, J. A. Cotton, and J. O. McInerney Supertrees Disentangle the Chimerical Origin of Eukaryotic Genomes Mol. Biol. Evol., August 1, 2007; 24(8): 1752 - 1760. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Marshall, C. Simon, and T. R. Buckley Accurate Branch Length Estimation in Partitioned Bayesian Analyses Requires Accommodation of Among-Partition Rate Variation and Attention to Branch Length Priors Syst Biol, December 1, 2006; 55(6): 993 - 1003. [Full Text] [PDF] |
||||
![]() |
L. Mateiu and B. Rannala Inferring Complex DNA Substitution Processes on Phylogenies Using Uniformization and Data Augmentation Syst Biol, April 1, 2006; 55(2): 259 - 269. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Brandley, A. D. Leache, D. L. Warren, and J. A. McGuire Are Unequal Clade Priors Problematic for Bayesian Phylogenetics? Syst Biol, February 1, 2006; 55(1): 138 - 146. [Full Text] [PDF] |
||||
![]() |
L. J. Revell, L. J. Harmon, and R. E. Glor Under-parameterized Model of Sequence Evolution Leads to Bias in the Estimation of Diversification Rates from Molecular Phylogenies Syst Biol, December 1, 2005; 54(6): 973 - 983. [Full Text] [PDF] |
||||
![]() |
D. W. Weisrock, L. J. Harmon, and A. Larson Resolving Deep Phylogenetic Relationships in Salamanders: Analyses of Mitochondrial and Nuclear Genomic Data Syst Biol, October 1, 2005; 54(5): 758 - 777. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang and B. Rannala Branch-Length Prior Influences Bayesian Posterior Probability of Phylogeny Syst Biol, June 1, 2005; 54(3): 455 - 470. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Flynn, J. A. Finarelli, S. Zehr, J. Hsu, and M. A. Nedbal Molecular Phylogeny of the Carnivora (Mammalia): Assessing the Impact of Increased Sampling on Resolving Enigmatic Relationships Syst Biol, April 1, 2005; 54(2): 317 - 337. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. R. Holland, F. Delsuc, V. Moulton, and A. Baker Visualizing Conflicting Evolutionary Hypotheses in Large Collections of Trees: Using Consensus Networks to Study the Origins of Placentals and Hexapods Syst Biol, February 1, 2005; 54(1): 66 - 76. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Roelants and F. Bossuyt Archaeobatrachian Paraphyly and Pangaean Diversification of Crown-Group Frogs Syst Biol, February 1, 2005; 54(1): 111 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Huelsenbeck and B. Rannala Frequentist Properties of Bayesian Posterior Probabilities of Phylogenetic Trees Under Simple and Complex Substitution Models Syst Biol, December 1, 2004; 53(6): 904 - 913. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Pisani Identifying and Removing Fast-Evolving Sites Using Compatibility Analysis: An Example from the Arthropoda Syst Biol, December 1, 2004; 53(6): 978 - 989. [Full Text] [PDF] |
||||




