Skip Navigation


Systematic Biology Advance Access originally published online on August 18, 2009
Systematic Biology 2009 58(4):411-424; doi:10.1093/sysbio/syp046
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplementary Data
Right arrow All Versions of this Article:
58/4/411    most recent
syp046v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Sennblad, B.
Right arrow Articles by Lagergren, J.
PubMed
Right arrow Articles by Sennblad, B.
Right arrow Articles by Lagergren, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© Society of Systematic Biologists

Probabilistic Orthology Analysis

Bengt Sennblad1,* and Jens Lagergren2

1 Stockholm Bioinformatics Center, Department of Biochemistry, Stockholm University, AlbaNova, 106 91 Stockholm, Sweden
2 Stockholm Bioinformatics Center and School of Computer Science and Communication, Royal Institute of Technology, AlbaNova 106 91, Stockholm, Sweden

* Correspondence to be sent to: Stockholm Bioinformatics Center, Stockholm University, AlbaNova 106 91, Stockholm; Sweden; E-mail: bengt.sennblad{at}sbc.su.se.


   Abstract

Orthology analysis aims at identifying orthologous genes and gene products from different organisms and, therefore, is a powerful tool in modern computational and experimental biology. Although reconciliation-based orthology methods are generally considered more accurate than distance-based ones, the traditional parsimony-based implementation of reconciliation-based orthology analysis (most parsimonious reconciliation [MPR]) suffers from a number of shortcomings. For example, 1) it is limited to orthology predictions from the reconciliation that minimizes the number of gene duplication and loss events, 2) it cannot evaluate the support of this reconciliation in relation to the other reconciliations, and 3) it cannot make use of prior knowledge (e.g., about species divergence times) that provides auxiliary information for orthology predictions. We present a probabilistic approach to reconciliation-based orthology analysis that addresses all these issues by estimating orthology probabilities. The method is based on the gene evolution model, an explicit evolutionary model for gene duplication and gene loss inside a species tree, that generalizes the standard birth–death process. We describe the probabilistic approach to orthology analysis using 2 experimental data sets and show that the use of orthology probabilities allows a more informative analysis than MPR and, in particular, that it is less sensitive to taxon sampling problems. We generalize these anecdotal observations and show, using data generated under biologically realistic conditions, that MPR give false orthology predictions at a substantial frequency. Last, we provide a new orthology prediction method that allows an orthology and paralogy classification with any chosen sensitivity/specificity combination from the spectra of achievable combinations. We conclude that probabilistic orthology analysis is a strong and more advanced alternative to traditional orthology analysis and that it provides a framework for sophisticated comparative studies of processes in genome evolution.

Keywords: Comparative genomics; gene duplication; gene loss; orthology; paralogy; probabilistic modeling; phylogenetics; tree reconciliation

Received August 20, 2008; Revised December 22, 2008; Accepted July 17, 2009


Associate Editor: Lars Jermiin


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?




Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.