© 2004 Society of Systematic Biologists
Data Partitions and Complex Models in Bayesian Analysis: The Phylogeny of Gymnophthalmid Lizards
1 Department of Biology, University of Central Florida 4000 Central Florida Boulevard, Orlando, FL 32816–2368, USA; E-mail: tcastoe{at}mail.ucf.edu (T.A.C.), cparkins{at}pegasus.cc.ucf.edu (C.L.P.)
2 Biology Department Vassar College Box 555, 124 Raymond Avenue, Poughkeepsie, NY 12604–0555, USA; E-mail: tiffperu{at}yahoo.com
Edited by Jack Sites: Associate Editor
| Abstract |
|---|
Phylogenetic studies incorporating multiple loci, and multiple genomes, are becoming increasingly common. Coincident with this trend in genetic sampling, model-based likelihood techniques including Bayesian phylogenetic methods continue to gain popularity. Few studies, however, have examined model fit and sensitivity to such potentially heterogeneous data partitions within combined data analyses using empirical data. Here we investigate the relative model fit and sensitivity of Bayesian phylogenetic methods when alternative site-specific partitions of among-site rate variation (with and without autocorrelated rates) are considered. Our primary goal in choosing a best-fit model was to employ the simplest model that was a good fit to the data while optimizing topology and/or Bayesian posterior probabilities. Thus, we were not interested in complex models that did not practically affect our interpretation of the topology under study. We applied these alternative models to a four-gene data set including one protein-coding nuclear gene (c-mos), one protein-coding mitochondrial gene (ND4), and two mitochondrial rRNA genes (12S and 16S) for the diverse yet poorly known lizard family Gymnophthalmidae. Our results suggest that the best-fit model partitioned among-site rate variation separately among the c-mos, ND4, and 12S + 16S gene regions. We found this model yielded identical topologies to those from analyses based on the GTR+I+G model, but significantly changed posterior probability estimates of clade support. This partitioned model also produced more precise (less variable) estimates of posterior probabilities across generations of long Bayesian runs, compared to runs employing a GTR+I+G model estimated for the combined data. We use this three-way gamma partitioning in Bayesian analyses to reconstruct a robust phylogenetic hypothesis for the relationships of genera within the lizard family Gymnophthalmidae. We then reevaluate the higher-level taxonomic arrangement of the Gymnophthalmidae. Based on our findings, we discuss the utility of nontraditional parameters for modeling among-site rate variation and the implications and future directions for complex model building and testing.
Keywords: Autocorrelated gamma; Bayesian analysis; combining data; Gymnophthalmidae; likelihood models; partitioning data; Reptilia; site-specific gamma
Received May 18, 2003; Revised March 15, 2003; Accepted December 14, 2003
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. C. Brandley, D. L. Warren, A. D. Leache, and J. A. McGuire Homoplasy and Clade Support Syst Biol, June 29, 2009; (2009) syp019v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Li, G. Lu, and G. Orti Optimal Data Partitioning and a Test Case for Ray-Finned Fishes (Actinopterygii) Based on Ten Nuclear Loci Syst Biol, August 1, 2008; 57(4): 519 - 539. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Benavides, R. Baum, D. McClellan, and J. W. Sites Molecular Phylogenetics of the Lizard Genus Microlophus (Squamata:Tropiduridae): Aligning and Retrieving Indel Signal from Nuclear Introns Syst Biol, October 1, 2007; 56(5): 776 - 797. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Brown and A. R. Lemmon The Importance of Data Partitioning and the Utility of Bayes Factors in Bayesian Phylogenetics Syst Biol, August 1, 2007; 56(4): 643 - 655. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Chelo, L. Ze-Ze, and R. Tenreiro Congruence of evolutionary relationships inside the Leuconostoc-Oenococcus-Weissella clade assessed by phylogenetic analysis of the 16S rRNA gene, dnaA, gyrB, rpoC and dnaK Int J Syst Evol Microbiol, February 1, 2007; 57(2): 276 - 286. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Vitorino, I. M. Chelo, F. Bacellar, and L. Ze-Ze Rickettsiae phylogeny: a multigenic approach Microbiology, January 1, 2007; 153(1): 160 - 168. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. C. Marshall, C. Simon, and T. R. Buckley Accurate Branch Length Estimation in Partitioned Bayesian Analyses Requires Accommodation of Among-Partition Rate Variation and Attention to Branch Length Priors Syst Biol, December 1, 2006; 55(6): 993 - 1003. [Full Text] [PDF] |
||||
![]() |
L. J. Vitt and E. R. Pianka From the Cover: Deep history impacts present-day ecology and biodiversity PNAS, May 31, 2005; 102(22): 7877 - 7881. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Huelsenbeck and B. Rannala Frequentist Properties of Bayesian Posterior Probabilities of Phylogenetic Trees Under Simple and Complex Substitution Models Syst Biol, December 1, 2004; 53(6): 904 - 913. [Abstract] [Full Text] [PDF] |
||||



