© 2005 Society of Systematic Biologists
Hidden Likelihood Support in Genomic Data: Can Forty-Five Wrongs Make a Right?
1 Department of Biology, University of California, Riverside, Riverside, California 92521, USA; E-mail: john.gatesy@ucr.edu
2 Evolutionary Genomics Department, DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, California 94598, USA
Edited by Thomas Buckley: Associate Editor
Abstract
Combined analysis of multiple phylogenetic data sets can reveal emergent character support that is not evident in separate analyses of individual data sets. Previous parsimony analyses have shown that this hidden support often accounts for a large percentage of the overall phylogenetic signal in cladistic studies. Here, reanalysis of a large comparative genomic data set for yeast (genus Saccharomyces) demonstrates that hidden support can be an important factor in maximum likelihood analyses of multiple data sets as well. Emergent signal in a concatenation of 106 genes was responsible for up to 64% of the likelihood support at a particular node (the difference in log likelihood scores between optimal topologies that included and excluded a supported clade). A grouping of four yeast species (S. cerevisiae, S. paradoxus, S. mikatae, and S. kudriavzevii) was robustly supported by combined analysis of all 106 genes, but separate analyses of individual genes suggested numerous conflicts. Forty-eight genes strictly contradicted S. cerevisiae + S. paradoxus + S. mikatae + S. kudriavzevii in separate analyses, but combined likelihood analyses that included up to 45 of the "wrong" data sets supported this group. Extensive hidden support also emerged in a combined likelihood analysis of 41 genes that each recovered the exact same topology in separate analyses of the individual genes. These results show that isolated analyses of individual data sets can mask congruence and distort interpretations of clade stability, even in strictly model-based phylogenetic methods. Consensus and supertree procedures that ignore hidden phylogenetic signals are, at best, incomplete.
Keywords: Character partitions; combined analysis; hidden support; maximum likelihood; phylogenomics; yeast
Received March 17, 2004; Revised June 22, 2004; Accepted October 7, 2004
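The likelihood support defined in the abstract (the difference in log likelihood between the best topologies that include and exclude a clade) can be illustrated with a small sketch. The sketch assumes that hidden support is the combined-analysis support minus the sum of the supports from separate analyses, by analogy with hidden branch support in parsimony. The function names and all numbers are hypothetical and are not taken from the yeast data set.

```python
# Minimal sketch of likelihood support and hidden likelihood support.
# ASSUMPTION: hidden support = combined support - sum of separate supports.
# All names and values below are hypothetical illustrations.

def likelihood_support(lnl_with_clade, lnl_without_clade):
    """Log-likelihood difference between the best tree that includes
    the clade and the best tree that excludes it."""
    return lnl_with_clade - lnl_without_clade


def hidden_likelihood_support(combined_ls, separate_ls):
    """Support that appears only when the data sets are analyzed together."""
    return combined_ls - sum(separate_ls)


# Hypothetical per-gene supports from separate analyses; a negative value
# means that gene, analyzed alone, favors a tree lacking the clade.
separate_ls = [2.0, -1.5, 0.5]

# Hypothetical log likelihoods for the concatenated data set.
combined_ls = likelihood_support(-10000.0, -10010.0)

hidden = hidden_likelihood_support(combined_ls, separate_ls)
fraction_hidden = hidden / combined_ls

print(f"combined support: {combined_ls}")
print(f"sum of separate supports: {sum(separate_ls)}")
print(f"hidden support: {hidden} ({fraction_hidden:.0%} of combined)")
```

In this toy case one of the three genes contradicts the clade on its own, yet most of the combined support is hidden, which is the pattern the abstract describes at a much larger scale.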




