Skip Navigation

Systematic Biology 2005 54(5):743-757; doi:10.1080/10635150500234609
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (49)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Brinkmann, H.
Right arrow Articles by Philippe, H.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Brinkmann, H.
Right arrow Articles by Philippe, H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2005 Society of Systematic Biologists

An Empirical Assessment of Long-Branch Attraction Artefacts in Deep Eukaryotic Phylogenomics

Henner Brinkmann1, Mark van der Giezen2, Yan Zhou1, Gaëtan Poncelin de Raucourt1 and Hervé Philippe1

1 Canadian Institute for Advanced Research, Centre Robert Cedergren, Département de Biochimie, Université de Montréal, Succursale Centre-Ville Montréal, Québec H3C3J7, Canada; E-mail: herve.philippe{at}umontreal.ca (H.P.)
2 School of Biological and Chemical Sciences, Queen Mary, University of London Mile End Road, E1 4NS, London, UK

Edited by Marshal Hedin


   Abstract

In the context of exponential growing molecular databases, it becomes increasingly easy to assemble large multigene data sets for phylogenomic studies. The expected increase of resolution due to the reduction of the sampling (stochastic) error is becoming a reality. However, the impact of systematic biases will also become more apparent or even dominant. We have chosen to study the case of the long-branch attraction artefact (LBA) using real instead of simulated sequences. Two fast-evolving eukaryotic lineages, whose evolutionary positions are well established, microsporidia and the nucleomorph of cryptophytes, were chosen as model species. A large data set was assembled (44 species, 133 genes, and 24,294 amino acid positions) and the resulting rooted eukaryotic phylogeny (using a distant archaeal outgroup) is positively misled by an LBA artefact despite the use of a maximum likelihood–based tree reconstruction method with a complex model of sequence evolution. When the fastest evolving proteins from the fast lineages are progressively removed (up to 90%), the bootstrap support for the apparently artefactual basal placement decreases to virtually 0%, and conversely only the expected placement, among all the possible locations of the fast-evolving species, receives increasing support that eventually converges to 100%. The percentage of removal of the fastest evolving proteins constitutes a reliable estimate of the sensitivity of phylogenetic inference to LBA. This protocol confirms that both a rich species sampling (especially the presence of a species that is closely related to the fast-evolving lineage) and a probabilistic method with a complex model are important to overcome the LBA artefact. Finally, we observed that phylogenetic inference methods perform strikingly better with simulated as opposed to real data, and suggest that testing the reliability of phylogenetic inference methods with simulated data leads to overconfidence in their performance. Although phylogenomic studies can be affected by systematic biases, the possibility of discarding a large amount of data containing most of the nonphylogenetic signal allows recovering a phylogeny that is less affected by systematic biases, while maintaining a high statistical support.

Keywords: Distant outgroup; eukaryotic tree; long-branch attraction; microsporidia; multigene data sets; nucleomorph; rooting; species sampling; systematic biases

Received November 5, 2004; Revised February 7, 2005; Accepted April 30, 2005
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
S. Simon, S. Strauss, A. von Haeseler, and H. Hadrys
A Phylogenomic Approach to Resolve the Basal Pterygote Divergence
Mol. Biol. Evol., December 1, 2009; 26(12): 2719 - 2730.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
D. A. Caron, P. D. Countway, P. Savai, R. J. Gast, A. Schnetzer, S. D. Moorthi, M. R. Dennett, D. M. Moran, and A. C. Jones
Defining DNA-Based Operational Taxonomic Units for Microbial-Eukaryote Ecology
Appl. Envir. Microbiol., September 15, 2009; 75(18): 5797 - 5808.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
J. M. Archibald and C. E. Lane
Going, Going, Not Quite Gone: Nucleomorphs as a Case Study in Nuclear Genome Reduction
J. Hered., September 1, 2009; 100(5): 582 - 590.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. A. Castoe, A. P. J. de Koning, H.-M. Kim, W. Gu, B. P. Noonan, G. Naylor, Z. J. Jiang, C. L. Parkinson, and D. D. Pollock
From the Cover: Evidence for an ancient adaptive episode of convergent molecular evolution
PNAS, June 2, 2009; 106(22): 8986 - 8991.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. V. Edwards
Natural selection and phylogenetic analysis
PNAS, June 2, 2009; 106(22): 8799 - 8800.
[Full Text] [PDF]


Home page
Mol Biol EvolHome page
Y. Inagaki, Y. Nakajima, M. Sato, M. Sakaguchi, and T. Hashimoto
Gene Sampling Can Bias Multi-Gene Phylogenetic Inferences: The Relationship between Red Algae and Green Plants as a Case Study
Mol. Biol. Evol., May 1, 2009; 26(5): 1171 - 1178.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
V. Hampl, L. Hug, J. W. Leigh, J. B. Dacks, B. F. Lang, A. G. B. Simpson, and A. J. Roger
Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic "supergroups"
PNAS, March 10, 2009; 106(10): 3859 - 3864.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Si Quang, O. Gascuel, and N. Lartillot
Empirical profile mixture models for phylogenetic reconstruction
Bioinformatics, October 15, 2008; 24(20): 2317 - 2323.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
N. Wahlberg and C. W. Wheat
Genomic Outposts Serve the Phylogenomic Pioneers: Designing Novel Nuclear Markers for Genomic DNA Extractions of Lepidoptera
Syst Biol, April 1, 2008; 57(2): 231 - 242.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. W. Roy and M. Irimia
Rare Genomic Characters Do Not Support Coelomata: Intron Loss/Gain
Mol. Biol. Evol., April 1, 2008; 25(4): 620 - 623.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
C. E. Lane, K. van den Heuvel, C. Kozera, B. A. Curtis, B. J. Parsons, S. Bowman, and J. M. Archibald
Nucleomorph genome of Hemiselmis andersenii reveals complete intron loss and compaction as a driver of protein structure and function
PNAS, December 11, 2007; 104(50): 19908 - 19913.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. Zheng, I. B. Rogozin, E. V. Koonin, and T. M. Przytycka
Support for the Coelomata Clade of Animals from a Rigorous Analysis of the Pattern of Intron Conservation
Mol. Biol. Evol., November 1, 2007; 24(11): 2583 - 2592.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. Grauvogel, H. Brinkmann, and J. Petersen
Evolution of the Glucose-6-Phosphate Isomerase: The Plasticity of Primary Metabolism in Photosynthetic Eukaryotes
Mol. Biol. Evol., August 1, 2007; 24(8): 1611 - 1621.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
N. Rodriguez-Ezpeleta, H. Brinkmann, B. Roure, N. Lartillot, B. F. Lang, and H. Philippe
Detecting and Overcoming Systematic Errors in Genome-Scale Phylogenies
Syst Biol, June 1, 2007; 56(3): 389 - 399.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
R. B. Bevan, D. Bryant, and B. F. Lang
Accounting for Gene Rate Heterogeneity in Phylogenetic Inference
Syst Biol, April 1, 2007; 56(2): 194 - 205.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
I. B. Rogozin, Y. I. Wolf, L. Carmel, and E. V. Koonin
Ecdysozoan Clade Rejected by Genome-Wide Analysis of Rare Amino Acid Replacements
Mol. Biol. Evol., April 1, 2007; 24(4): 1080 - 1090.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
N. Rodriguez-Ezpeleta, H. Philippe, H. Brinkmann, B. Becker, and M. Melkonian
Phylogenetic Analyses of Nuclear, Mitochondrial, and Plastid Multigene Data Sets Support the Placement of Mesostigma in the Streptophyta
Mol. Biol. Evol., March 1, 2007; 24(3): 723 - 731.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
V. Ruano-Rubio and M. A. Fares
Artifactual Phylogenies Caused by Correlated Distribution of Substitution Rates among Sites and Lineages: The Good, the Bad, and the Ugly
Syst Biol, February 1, 2007; 56(1): 68 - 82.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
A. J Roger and L. A Hug
The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation
Phil Trans R Soc B, June 29, 2006; 361(1470): 1039 - 1054.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
N. Lartillot and H. Philippe
Computing Bayes Factors Using Thermodynamic Integration
Syst Biol, April 1, 2006; 55(2): 195 - 207.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.