Systematic Biology 2007 56(1):25-43; doi:10.1080/10635150601156313
© 2007 Society of Systematic Biologists

Conditioned Genome Reconstruction: How to Avoid Choosing the Conditioning Genome

Matthew Spencer1,2,5, David Bryant3,4 and Edward Susko1

1 Department of Mathematics and Statistics, Dalhousie University Halifax, Nova Scotia, B3H 3J5, Canada
2 Department of Biochemistry and Molecular Biology, Dalhousie University Halifax, Nova Scotia, B3H 4H7, Canada
3 Department of Mathematics, University of Auckland Private Bag 92019, Auckland, New Zealand
4 McGill Centre for Bioinformatics, McGill University 3775 University Street, Duff Medical Building, Montreal, Quebec, H3A 2B4, Canada

Associate Editor: Olivier Gascuel


Abstract

Genome phylogenies can be inferred from data on the presence and absence of genes across taxa. Logdet distances may be a good method, because they allow expected genome size to vary across the tree. Recently, Lake and Rivera proposed conditioned genome reconstruction (calculation of logdet distances using only those genes present in a conditioning genome) to deal with unobservable genes that are absent from every taxon of interest. We prove that their method can consistently estimate the topology for almost any choice of conditioning genome. Nevertheless, the choice of conditioning genome is important for small samples. For real bacterial genome data, different choices of conditioning genome can result in strong bootstrap support for different tree topologies. To overcome this problem, we developed supertree methods that combine information from all choices of conditioning genome. One of these methods, based on the BIONJ algorithm, performs well on simulated data and may have applications to other supertree problems. However, an analysis of 40 bacterial genomes using this method supports an incorrect clade of parasites. This is a common feature of model-based gene content methods and is due to parallel gene loss.
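To make the idea of conditioning concrete, here is a minimal sketch of a pairwise logdet-style distance computed from gene presence/absence data, using only the gene families present in a chosen conditioning genome. The function name, the data layout, and the particular normalisation (subtracting half the log of the marginal frequencies, so that a genome has distance zero to itself) are assumptions made for illustration; they are not taken from the article and may differ from the exact formula the authors use.

```python
import math

def conditioned_logdet(presence, i, j, cond):
    """Logdet-style distance between genomes i and j, conditioned on genome cond.

    presence: dict mapping genome name -> list of 0/1 flags, one per gene family
    (hypothetical layout chosen for this sketch).
    """
    # Keep only the gene families that are present in the conditioning genome.
    keep = [k for k, flag in enumerate(presence[cond]) if flag == 1]
    if not keep:
        raise ValueError("conditioning genome has no genes")

    # 2x2 table of joint counts: counts[a][b] = kept genes with state a in i, b in j.
    counts = [[0, 0], [0, 0]]
    for k in keep:
        counts[presence[i][k]][presence[j][k]] += 1

    # Convert counts to joint frequencies.
    n = len(keep)
    f = [[c / n for c in row] for row in counts]

    det = f[0][0] * f[1][1] - f[0][1] * f[1][0]
    row = [f[0][0] + f[0][1], f[1][0] + f[1][1]]  # marginal frequencies for i
    col = [f[0][0] + f[1][0], f[0][1] + f[1][1]]  # marginal frequencies for j

    # The distance is undefined when the determinant is not positive or a
    # marginal is zero (for example, when i or j is the conditioning genome).
    if det <= 0 or min(row + col) == 0:
        raise ValueError("logdet distance undefined for this pair")

    # Assumed normalisation: -ln det F + 0.5 * sum of ln marginals.
    return -math.log(det) + 0.5 * sum(math.log(m) for m in row + col)


# Toy data: three genomes, eight gene families; C is the conditioning genome.
presence = {
    "A": [1, 1, 1, 0, 0, 1, 0, 1],
    "B": [1, 1, 0, 0, 0, 1, 1, 1],
    "C": [1, 1, 1, 1, 1, 1, 0, 0],
}
d_ab = conditioned_logdet(presence, "A", "B", "C")
```

Repeating the calculation with a different conditioning genome changes which gene families are counted and hence the distance matrix, which is why the choice can matter for small samples.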

Keywords: BIONJ; conditioned genome reconstruction; consistency; gene content; logdet; supertrees

Received January 18, 2006; Revised April 7, 2006; Accepted August 10, 2006


5 Current Address: School of Biological Sciences, University of Liverpool, Liverpool, L69 7ZB, UK; E-mail: m.spencer{at}liverpool.ac.uk

