| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© 2008 Society of Systematic Biologists
Testing Congruence in Phylogenomic Analysis
1 Department of Biochemistry and Molecular Biology, Dalhousie University Halifax NS, Canada B3H 1X5; E-mail: andrew.roger{at}dal.ca (A.J.R.)
2 Department of Mathematics and Statistics and Genome Atlantic, Dalhousie University Halifax NS, Canada B3H 3J5
3 Department für Biologie I, Botanik, Ludwig-Maximilians-Universität München Menzingerstraße 67, D-80638 München, Germany
Edited by Mark Fishbein
| Abstract |
|---|
Phylogenomic analyses of large sets of genes or proteins have the potential to revolutionize our understanding of the tree of life. However, problems arise because estimated phylogenies from individual loci often differ because of different histories, systematic bias, or stochastic error. We have developed CONCATERPILLAR, a hierarchical clustering method based on likelihood-ratio testing that identifies congruent loci for phylogenomic analysis. CONCATERPILLAR also includes a test for shared relative evolutionary rates between genes indicating whether they should be analyzed separately or by concatenation. In simulation studies, the performance of this method is excellent when a multiple comparison correction is applied. We analyzed a phylogenomic data set of 60 translational protein sequences from the major supergroups of eukaryotes and identified three congruent subsets of proteins. Analysis of the largest set indicates improved congruence relative to the full data set and produced a phylogeny with stronger support for five eukaryote supergroups including the Opisthokonts, the Plantae, the stramenopiles + Apicomplexa (chromalveolates), the Amoebozoa, and the Excavata. In contrast, the phylogeny of the second largest set indicates a close relationship between stramenopiles and red algae, to the exclusion of alveolates, suggesting gene transfer from the red algal secondary symbiont to the ancestral stramenopile host nucleus during the origin of their chloroplast. Investigating phylogenomic data sets for conflicting signals has the potential to both improve phylogenetic accuracy and inform our understanding of genome evolution.
Keywords: Concatenated analysis; endosymbiotic gene transfer; hierarchical clustering; lateral gene transfer; likelihood ratio testing; maximum likelihood; phylogenetic congruence; phylogenomics; separate analysis; superkingdom; supermatrix analysis
Received February 16, 2007; Revised April 23, 2007; Accepted August 14, 2007
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. W. Brown, F. W. Spiegel, and J. D. Silberman Phylogeny of the "Forgotten" Cellular Slime Mold, Fonticula alba, Reveals a Key Evolutionary Branch within Opisthokonta Mol. Biol. Evol., December 1, 2009; 26(12): 2699 - 2709. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Deschamps and D. Moreira Signal Conflicts in the Phylogeny of the Primary Photosynthetic Eukaryotes Mol. Biol. Evol., December 1, 2009; 26(12): 2745 - 2753. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Albu, X. J. Min, G. B. Golding, and D. Hickey Nucleotide Substitution Bias within the Genus Drosophila Affects the Pattern of Proteome Evolution Gen Biol Evol, August 21, 2009; 2009(0): 288 - 293. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Hampl, L. Hug, J. W. Leigh, J. B. Dacks, B. F. Lang, A. G. B. Simpson, and A. J. Roger Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic "supergroups" PNAS, March 10, 2009; 106(10): 3859 - 3864. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Galtier and V. Daubin Dealing with incongruence in phylogenomic analyses Phil Trans R Soc B, December 27, 2008; 363(1512): 4023 - 4029. [Abstract] [Full Text] [PDF] |
||||



