© 2007 Society of Systematic Biologists
Estimation of Phylogeny and Invariant Sites under the General Markov Model of Nucleotide Sequence Evolution
1 Sydney Bioinformatics, University of Sydney NSW 2006, Australia
2 Centre for Mathematical Biology, University of Sydney NSW 2006, Australia
3 School of Mathematics and Statistics, University of Sydney NSW 2006, Australia
4 School of Biological Sciences, University of Sydney NSW 2006, Australia E-mail: lars.jermiin{at}usyd.edu.au
Edited by Mike Steel: Associate Editor
| Abstract |
|---|
The models of nucleotide substitution used by most maximum likelihood–based methods assume that the evolutionary process is stationary, reversible, and homogeneous. We present an extension of the Barry and Hartigan model, which can be used to estimate parameters by maximum likelihood (ML) when the data contain invariant sites and there are violations of the assumptions of stationarity, reversibility, and homogeneity. Unlike most ML methods for estimating invariant sites, we estimate the nucleotide composition of invariant sites separately from that of variable sites. We analyze a bacterial data set where problems due to lack of stationarity and homogeneity have been previously well noted and use the parametric bootstrap to show that the data are consistent with our general Markov model. We also show that estimates of invariant sites obtained using our method are fairly accurate when applied to data simulated under the general Markov model.
Keywords: Invariant sites; maximum likelihood; nonhomogeneous process; nonstationary process; nucleotide substitution; phylogenetics
Received May 2, 2006; Revised July 9, 2006; Accepted September 12, 2006
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
R. G. Beiko, W. F. Doolittle, and R. L. Charlebois The Impact of Reticulate Evolution on Genome Phylogeny Syst Biol, December 1, 2008; 57(6): 844 - 856. [Abstract] [Full Text] [PDF] |
||||
