Skip Navigation

Systematic Biology 2007 56(4):656-672; doi:10.1080/10635150701549672
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Pereira, S. L.
Right arrow Articles by Baker, A. J.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Pereira, S. L.
Right arrow Articles by Baker, A. J.
GeoRef
Right arrow GeoRef Citation
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 Society of Systematic Biologists

Mitochondrial and Nuclear DNA Sequences Support a Cretaceous Origin of Columbiformes and a Dispersal-Driven Radiation in the Paleogene

Edited by Adrian Paterson: Associate Editor

Sergio L. Pereira1, Kevin P. Johnson2, Dale H. Clayton3 and Allan J. Baker1,4

1 Department of Natural History, Royal Ontario Museum Toronto, Ontario, M5S 2C6, Canada E-mail: sergio.pereira{at}utoronto.ca
2 Illinois Natural History Survey Champaign, Illinois, 61820, USA
3 Department of Biology, University of Utah Salt Lake City, Utah, 84112, USA
4 Department of Ecology and Evolutionary Biology, University of Toronto Toronto, Ontario, M5S 3B2, Canada


    Abstract
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Acknowledgments
 References
 
Phylogenetic relationships among genera of pigeons and doves (Aves, Columbiformes) have not been fully resolved because of limited sampling of taxa and characters in previous studies. We therefore sequenced multiple nuclear and mitochondrial DNA genes totaling over 9000 bp from 33 of 41 genera plus 8 outgroup taxa, and, together with sequences from 5 other pigeon genera retrieved from GenBank, recovered a strong phylogenetic hypothesis for the Columbiformes. Three major clades were recovered with the combined data set, comprising the basally branching New World pigeons and allies (clade A) that are sister to Neotropical ground doves (clade B), and the Afro-Eurasian and Australasian taxa (clade C). None of these clades supports the monophyly of current families and subfamilies. The extinct, flightless dodo and solitaires (Raphidae) were embedded within pigeons and doves (Columbidae) in clade C, and monophyly of the subfamily Columbinae was refuted because the remaining subfamilies were nested within it. Divergence times estimated using a Bayesian framework suggest that Columbiformes diverged from outgroups such as Apodiformes and Caprimulgiformes in the Cretaceous before the mass extinction that marks the end of this period. Bayesian and maximum likelihood inferences of ancestral areas, accounting for phylogenetic uncertainty and divergence times, respectively, favor an ancient origin of Columbiformes in the Neotropical portion of what was then Gondwana. The radiation of modern genera of Columbiformes started in the Early Eocene to the Middle Miocene, as previously estimated for other avian groups such as ratites, tinamous, galliform birds, penguins, shorebirds, parrots, passerine birds, and toucans. Multiple dispersals of more derived Columbiformes between Australasian and Afro-Eurasian regions are required to explain current distributions.

Keywords: Columbiformes; dispersal; divergence times; doves; fossil; K/T boundary; molecular phylogeny; pigeons

Received November 7, 2006; Revised February 26, 2007; Accepted May 19, 2007


Strongly supported phylogenies inferred from multiple genes, the fossil record, and modern methods of molecular dating are key elements to interpret patterns of macroevolution and biogeography within a temporal framework. For birds the fossil record has revealed that many lineages became extinct about 65.5 million years ago (Mya) during the transition between the Mesozoic and Cenozoic Periods, and most extant lineages radiated shortly after (Brodkorb, 1964; Chiappe and Dyke, 2002; Olson, 1985; Slack et al., 2006). Conversely, inferences from molecular data have suggested that many orders of extant birds are older than 65.5 Mya (e.g., Cooper and Penny, 1997; Hedges et al., 1996; Pereira and Baker, 2006a; Slack et al., 2006; van Tuinen and Hedges, 2001). Additionally, molecular time estimates place the origin of extant avian families at the end of the Mesozoic, and the radiation of modern genera in the Paleocene and Eocene, at least for ratites, tinamous, galliform birds, penguins, shorebirds, parrots, passerine birds, and toucans (e.g., Baker et al., 2004, 2006; Nahum et al., 2003; Paton et al., 2003; Pereira and Baker, 2006a, 2006b; Pereira et al., 2002; Tavares et al., 2006). For many other avian groups, such as the Columbiformes, the lack of a strong phylogenetic hypothesis at and above the genus level based on multiple genes has precluded the interpretation of their evolution within a timeframe.

Columbiformes is one of most easily recognized avian orders worldwide and is traditionally subdivided in two families, the Columbidae and the Raphidae. The Columbidae is represented today by over 300 living species of pigeons and doves. Members of the Columbidae have sturdy bodies, short necks, short slender bills, and a fleshy cere. They vary greatly in body size, from the sparrow-sized ground doves of the genus Columbina to the fowl-sized crowned pigeons of the genus Goura, as well as in coloration, from dull brown or grey to glossy orange or green plumage, especially around the head and wings. Columbidae is generally subdivided into the subfamilies Columbinae (typical pigeons and doves), Treroninae (fruit and green doves), Gourinae (crowned pigeons), and the monotypic Otidiphabinae (pheasant pigeon) and Didunculinae (tooth-billed pigeon). The Raphidae was endemic to the Mascarene Islands in the Indian Ocean, becoming extinct in the 17th and 18th centuries. The family contained only two species of Raphus and the monotypic Pezophaps, known as dodo and solitaires, respectively. Dodo and solitaires were flightless birds of about one meter high, likely the outcome of adaptation to a terrestrial and predator-free lifestyle that led to little morphological resemblance to extant Columbidae (Gibbs et al., 2001).

Morphological differences among members of these two families have guaranteed separate taxonomic status for the Columbidae and the Raphidae in most historical and current classifications (Gibbs et al., 2001; Goodwin, 1983; Sibley and Ahlquist, 1990). However, the only molecular phylogeny of high-level columbiform taxa to include the Raphidae suggested otherwise (Shapiro et al., 2002). Maximum likelihood and Bayesian analyses of 1.4 kb of mitochondrial cytochrome b and the ribosomal 12S rDNA sequences revealed a close phylogenetic relationship between the Raphidae and the monotypic Caloenas (Columbinae) in a well-supported sister clade to the subfamilies Gourinae and Didunculinae (Shapiro et al., 2002). The more derived position of the Raphidae within the Columbiformes conflicts with suggestions that they became isolated before modern lineages evolved or were descendants of unknown extinct lineages (Gibbs et al., 2001).

Furthermore, the monophyly of subfamilies within Columbidae is also questionable as judged from molecular phylogenies estimated with mitochondrial and nuclear DNA sequences (Johnson, 2004; Johnson and Clayton, 2000; Shapiro et al., 2002). Monophyly of the subfamily Columbinae is usually disrupted by the inclusion of all other subfamilies within it, as previously suggested by an unrooted representation of the relationships of genera (Goodwin, 1983). Likewise, molecular data tend to place the genus Treron apart from other members of the Treroninae, and there is disagreement among phylogenetic hypotheses regarding which other genera are closely related to Treron (Johnson, 2004; Shapiro et al., 2002). Nonetheless, these molecular hypotheses have to be interpreted with caution because they were obtained from analyses using limited character and taxa sampling, and recovered many nodes with low branch support.

The origin of modern Columbiformes is also disputed. The Oriental and Australasia regions harbor the highest diversity of species, and hence these areas would be inferred to be the center of origin and dispersal according to classical biogeographic reasoning (Darwin, 1859; Wallace, 1876). On the other hand, molecular phylogenetic analyses suggested that the Neotropics are likely the center of origin of Columbiformes because Neotropical genera were placed closer to the root of the trees obtained in two independent studies (Johnson and Clayton, 2000; Shapiro et al., 2002). However, the two phylogenetic hypotheses proposed for Columbiformes based on modern cladistic methods disagree as to whether the mourning and quail doves (Johnson and Clayton, 2000) or ground doves and allies (Shapiro et al., 2002) are the sister group to the remaining Columbiformes. It is not clear if the topological differences observed in these two studies are due to the choice of outgroup used (Apodiformes in Johnson and Clayton, 2000, and Charadriiformes in Shapiro et al., 2002), bias in base composition, or conflicting phylogenetic signal among the sequence data used. Additionally, both studies did not find strong support for the sequence of branching events closer to the root of the tree, and thus it has not been possible to test biogeographic hypotheses of the origin and radiation of the Columbiformes.

We inferred the phylogenetic relationships within Columbiformes using multiple approaches to clarify the high-level systematics of pigeons and doves. Our phylogeny is based on the largest data set gathered for this group so far, including 9 kilobase pairs (kb) of aligned mitochondrial and nuclear DNA sequences for most of the extant genera. We use the strongly supported phylogeny to estimate divergence times and infer ancestral areas, and propose a temporal-biogeographic hypothesis for dispersal of the Columbiformes across the globe, which seems to have been facilitated by plate tectonics during the Cenozoic.


    Material and Methods
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Acknowledgments
 References
 
Taxon Sampling
We sampled species from 33 out of 41 Columbiformes genera and eight outgroup species (Table 1). Apodiformes and Caprimulgiformes were chosen as closer outgroups to Columbiformes based on molecular and anatomical evidence (Ericson et al., 2006; Livezey and Zusi, 2007). Anseriformes, Galliformes, and Struthioniformes outgroups were also included in the analysis for calibration purposes (see Temporal Constraints in Molecular Dating). DNA or tissue samples from species from five other genera (the extinct Pezophaps and Raphus; the rare or endemic Alectroenas, Didunculus, and Drepanoptila) were not available to us but we included 1.4 kb of DNA sequences from a previous study (Shapiro et al., 2002) in our analyses. Hence, only specimens from the extinct Microgoura and the monotypic Cryptophaps and Starnoenas were missing from our taxon sampling. We considered the monotypic Nesoenas mayeri a synonym of Streptopelia following Johnson et al. (2001), and the New and Old World Columba pigeons as separate genera (Patagioenas and Columba, respectively), following Banks et al. (2003) and Johnson et al. (2001).


View this table:
[in this window]
[in a new window]

 
Table 1 Species sampled and GenBank accession numbers. Genes sequenced were the mitochondrial small ribosomal subunit (12S rDNA), the ATP synthase F0 subunits 8 and 6 plus part of cytochrome c oxidase subunit III (ATPCO3), NADH dehydrogenase subunit 2 (ND2), cytochrome b (cyt b), cytochrome c oxidase subunit I (COI), the nuclear recombination activating protein (RAG-1), interphotoreceptor retinoid-binding protein (IRBP), and β fibrinogen intron 7 (FIB7).

 
DNA Amplification, Sequencing, and Sequence Alignments
Building on the previously published gene segments of Johnson (2004) and Johnson and Clayton (2000), we selected the following mitochondrial regions for amplification: small ribosomal subunit (12S rDNA), the ATP synthase F0 subunits 8 and 6 plus part of cytochrome c oxidase subunit III (ATPase8/6–COIII), NADH dehydrogenase subunit 2 (ND2), cytochrome b (cyt b) (primers designed by O. Haddrath and described in Pereira and Baker, 2004), and cytochrome c oxidase subunit I (COI) (COIaR: AAC YAA CCA CAA AGA CAT YGG and COIbR: GAN AGG ACA TAG TGG AAG TGG GC; O. Haddrath, personal communication). We chose the following nuclear DNA regions for amplification: recombination activating protein (RAG-1) gene (primers R13, R18, R17, R22, R21, and R2b described in Groth and Barrowclough, 1999), interphotoreceptor retinoid-binding protein (IRBP) gene (IRBP217GG: GAC ATG GCT AAR GTN CTA CTG GAT AAC TAC TG and IRBP1095H: CTC TGC GCA CTG CCA GGA TGT C; O. Haddrath, personal communication), and β fibrinogen intron 7 (FIB7) (primers FIB-BI7U and FIB-BI7L from Prychitko and Moore, 1997). We performed mitochondrial and nuclear DNA amplifications as previously described in Pereira and Baker (2005) and Groth and Barrowclough (1999), respectively. PCR products were recovered from 1% agarose gels, purified by centrifuging each through a filter tip, and cycle-sequenced and run on an ABI 3100 automated DNA sequencer (IRBP and 12S rDNA) or Li-Cor 4200 bidirectional automated DNA sequencer (all other genes) according to the manufacturer's suggested protocols. Both L- and H-strands sequences were checked for ambiguities and the final consensus sequence was created for each fragment in Sequencher 4.1.2 (GeneCodes, Ann Arbor, Michigan). All consensus fragments were aligned visually in MacClade 4.0 (Maddison and Maddison, 2000) for each gene, and then concatenated in a matrix of 9083 base pairs (bp), including gaps. Ambiguously aligned regions (1.1% of total alignment) were excluded from the analysis. All sequences obtained in this study were deposited in GenBank (accession numbers EF373276 [GenBank] to EF373530 [GenBank] ). The final matrix and all trees inferred in this study were deposited in TreeBASE (Study Number S1808).

Data Partition, Model Selection, and Bias in Base Composition
We partitioned the data according to the fragments amplified. Thus, ATPase 8, ATPase 6, and COIII were analyzed as a single fragment. This partition is statistically more plausible because it reduces stochastic error in the estimation of the model parameters if the 168 bp for ATPase 8 and 173 bp for COIII were to be considered separate partitions. Model selection for each partition was made according to the Akaike information criteria (AIC) as implemented in MrModelTest 2.0 (Nylander, 2004). AIC was chosen over hierarchical likelihood ratio tests because AIC is not influenced by the order in which models are compared, and the AIC allows quantification of model uncertainty through Akaike weights. The best fitting model for each partition was used in subsequent Bayesian phylogenetic analyses, with parameters estimated during the analyses. We also performed model selection for all genes combined to carry out a nonpartitioned maximum likelihood analysis. Bias in base composition can negatively influence phylogenetic inference (Haddrath and Baker, 2001) and, hence, we tested it by a chi-square test as implemented in TREEPUZZLE 5.0 (Strimmer and von Haeseler, 1996).

Phylogenetic Bayesian Inference
We applied a Metroplis-coupled Markov chain Monte Carlo (MCMC) approach as implemented in MrBayes 3.1.2 (Ronquist and Huelsenbeck, 2003) to infer Bayesian phylogenetic relationships under two simultaneous independent runs, each starting with a different random tree. We set each run to have one cold and five heated chains to allow better mixing of the MCMC chain and minimize the chance of being trapped in local optima. The addition of more heated chains allows for quicker convergence of large and complex data sets such as ours. We considered runs to have reached convergence when the average standard deviation of the split frequencies between both simultaneous runs was smaller than 0.01. We performed runs assuming the same topology to be shared among all partitions but model parameters (substitution matrix, state frequency, proportion of invariable sites, and shape parameter of the gamma distribution) were unlinked and each partition allowed to have its own model of DNA evolution as chosen by AIC. We also assumed a priori among-partition rate variation. All trees were considered equally likely. The tree was rooted with sequences from the Ostrich (Struthio camelus). Other priors applied for all partitions were unconstrained: exponential (10.0) for branch lengths, flat Dirichlet (1,1,1,1) for stationary base frequencies, flat Dirichlet (1,1,1,1,1,1) for the nucleotide substitution ratio, uniform distribution (0,200) for the shape parameter of the gamma distribution of rate variation, and uniform distribution (0,1) for the proportion of invariable sites. MCMC samples were taken in every 1000th cycle. We plotted the log likelihood of sampled topologies to determine the burn-in period in which the MCMC chain had reached a stationary status. Post–burn-in samples from both simultaneous and independent runs were used to construct a 50% majority-rule consensus tree. The proportion of trees in which nodes were recovered after the burnin period is interpreted as the posterior probability (PP) of that node, or the probability that that node is true (Ronquist and Huelsenbeck, 2003). Nodes receiving PP ≥ 0.95 were considered to be strongly supported.

Maximum Likelihood Analysis
Maximum likelihood (ML) inference of phylogeny was executed in GARLI v 0.942 (Zwickl, 2006), assuming the GTR substitution model with a proportion of invariable sites and four-category gamma-distributed rate variation estimated from the data set for 100 replicates. GARLI uses a genetic algorithm to search for the topology, branch lengths, and the substitution model parameters (rate of DNA substitution, base frequency, proportion of invariable sites, and shape parameter of the gamma distribution) that maximize the likelihood of generating the observed set of DNA sequences. Briefly, the genetic algorithm will create a sample of individual solutions at a given time, each solution with its own topology, branch lengths, and model parameters representing the phylogenetic relationships of the given DNA sequences; next, each individual solution is ascribed a fitness likelihood score, which determines the proportion of "offspring" solutions that each individual solution will contribute to the next generation; the topology, branch lengths, or one of the substitution model parameters of each offspring solution is then changed to a new state and the others are reestimated to maximize the likelihood; these steps are repeated for a number of generations until an arbitrary stop condition is reached. For the Columbiformes data set, the stop condition was set to automatic termination, which is reached when three conditions are met: (1) changes in topology do not significantly increase the offspring fitness; (2) the total increase in the likelihood score over a number of generations is less than a specified amount; and (3) the parameter controlling the degree of branch length optimization has reached its minimum value. Two independent GARLI runs, each starting with a different random tree topology, were performed. Judging from the similar results compared across runs, GARLI is unlikely to have been trapped in local optima.

Maximum Parsimony Analysis
Maximum parsimony (MP) inference of phylogeny was conducted in PAUP 4 beta 10 (Swofford, 2001). Gaps were treated as missing characters and multistate characters treated as uncertain. A heuristic tree search was performed by stepwise addition and sequences were added randomly in 100 replicates. The "MulTrees" option was in effect. Branch support was estimated in PAUP 4 by nonparametric bootstrapping with the same parameters as the heuristic search. In this case, for each bootstrap replicate 100 random sequence additions was in effect.

Tree Selection
To compare different tree topologies obtained by different methods of phylogenetic inference, we employed the approximately unbiased (AU) test (Shimodaira, 2002) as implemented in the program CONSEL version 0.1f (Shimodaira and Hasegawa, 2001). The AU test uses a multiscale bootstrap technique and site log-likelihoods when computing P-values for the topologies being tested. The multiscale bootstrap technique removes the bias associated with more conservative tests that will result in more trees in the confidence set when the number of trees being compared increases. The distribution of posterior probabilities for each alternative topology is obtained by counting the number of times the hypothesis is supported by the replicates.

Exclusion of Taxa with Missing Data
To evaluate whether the inclusion of six genera (Alectroenas, Drepanoptila, Didunculus, Ectopistes, Pezophaps, and Raphus) with about 84% of missing data would affect tree topology we repeated all the phylogenetic analysis as described above excluding these six taxa from the data set. We also performed a Bayesian analysis including all taxa but reducing the data set to 1590 bp for which we had 12S rDNA and cyt b sequences for all taxa. The outgroup Podargus was also excluded from this last analysis because we could not obtain 12S rDNA sequences for this bird.

Molecular Dating
For each data partition used in the Bayesian inference of phylogeny we obtained maximum likelihood estimates of the transition/transversion ratio, and nucleotide frequencies in PAML 3.14 (Yang, 1997) under the F84 model of DNA substitution assuming rate variation across sites to follow a gamma distribution with five discrete rate categories (Hasegawa et al., 1985). These parameters were used to estimate branch lengths for each data partition and their approximate variance-covariance matrix to derived estimates of divergence times and 95% credibility intervals (95% CrI) based on all data partitions in a Bayesian framework (Thorne and Kishino, 2002; Thorne et al., 1998). These methods are implemented in the software ESTBRANCHES and MULTIDIVTIME from the MULTIDISTRIBUTE package, freely available from J. Thorne's website: http://statgen.ncsu.edu/thorne/multidivtime.html. The method requires an outgroup to root the tree and imposes the condition that the rate of change in the rate of DNA substitution at the root node is the same at the beginning and at the end of that branch (Kishino et al., 2001; Thorne et al., 1998).

Bayesian dating was run assuming a burn-in period = 3000, sample frequency = 100, number of samples = 10,000. We set the following gamma priors: expected time between tip and root (rttm) = 122.2 Mya based on the split between Galloanserae versus Neoaves estimated from mitogenomic data and 13 fossil constraints equally spread across the tree (Pereira and Baker, 2006a) with standard deviation (SD) = 20 Mya, rate of the root node (rtrate) and its SD = 0.00306 substitutions per site per million years as estimated from the median of the tip-to-root branch lengths for all genes. However, it seems of little practical importance to specify these priors because they do not appear to have any appreciable effect on the Bayesian posterior distribution of node ages and rates of evolution, and because sequence data and time constraints should determine the overall rate and the age of the root (Yang and Yoder, 2003; Wiegmann et al., 2003). We also set the prior for the rate change between ancestral (brownmean) and descendant nodes = 0.00818 (SD = 0.00818) substitutions per site per million of years, so that rttm x brownmean = 1. This later prior follows the suggestion that this is a meaningful value for real and simulated data sets (Wiegmann et al., 2003). Because a priori information for rate change is unknown, a large SD value was chosen as suggested (Thorne and Kishino, 2002), which allows a gene to have a priori a large variation in rate change over time. We assessed the convergence of the MCMC algorithm by running multiple analyses (each one starting with a different randomly selected initial state) and comparing the posterior distribution of divergence times, branch lengths, and the proportion of successful changes of those parameters along the Markov chain.

Temporal Constraints in Molecular Dating
The fossil record of the Columbiformes is scanty and made up mostly of specimens from the Pliocene and Pleistocene, which have not been placed cladistically within the order. The oldest fossil attributed to the Columbiformes is Gerandia calcaria (Olson, 1985) from the Early Miocene (Aquitanian) of France between 20.4 and 23.0 Mya (Gradstein et al., 2004). Hence, we assumed a conservative minimum age of 20.4 Mya for Columbiformes. The fossil record suggests that Apodiformes and Caprimulgiformes were present in the Early Eocene (Dyke and Van Tuinen, 2004). We assumed a conservative minimum age of 48.6 Mya for the separation between these two orders. In previous work (Pereira and Baker, 2006a), the divergence time between Galloanserae (Galliformes and Anseriformes) and Neoaves (all other Neognathae) was estimated at 122.2 Mya (95% CrI 110, 135 Mya) and between Galliformes and Anseriformes at 101.7 Mya (95% CrI 92, 113 Mya), based on a Bayesian analysis of complete mitochondrial genomes and assuming several independent time constraints from the fossil record. Hence, we set minimum and maximum ages for the separation of Galloanserae and Neoaves and for Galliformes and Anseriformes based on the 95% CrI. Similarly, we assumed a minimum and maximum age for the split between Crax and Gallus at 71 and 114 Mya based on the Bayesian 95% CrI of age inferences for major Galliformes lineages using mitochondrial genes and several independent fossils as time constraints (Pereira and Baker, 2006b).

Reconstruction of Ancestral Area States
To infer ancestral areas where Columbiformes may have originated we applied Bayesian and maximum likelihood approaches. A Bayesian stochastic mapping approach was derived for discrete morphological characters, which accounts for uncertainties in model parameters and assumes that character change follows a continuous-time Markov process (Huelsenbeck et al., 2003; Lewis, 2001). The Bayesian method is implemented in SIMMAP 1.0 Beta 2.0.8 Build 10042006 (Bollback, 2006). Biogeographic regions of distribution of Columbiformes were treated as discrete characters, in an analogous way to the treatment of morphological characters, and specific character histories that are consistent with the observations are sampled according to their probability under the model (Huelsenbeck et al., 2003). We accounted for phylogenetic and branch length uncertainties by using the trees and their respective branch lengths included in the post-burn-in portion of the Bayesian posterior distribution of trees obtained in MrBayes 3.1.2. The maximum likelihood approach for ancestral reconstruction maximizes the probability that the observed discrete states evolved under a stochastic Markov model of evolution (Lewis, 2001). The method is implemented in Mesquite 1.1 (build h61) (Maddison and Maddison, 2005a, 2005b). For each internal node, the state that maximizes the probability of observing the states at the terminal nodes is found, while allowing states at all other nodes to vary independently. We reconstructed ancestral area states based on the consensus Bayesian tree topology assuming branch lengths to be scaled to divergence times as estimated in MULTIDIVTIME.

Both the Bayesian and the maximum likelihood approaches overcome the problems associated with reconstruction of ancestral states by a maximum parsimony approach. The maximum parsimony method does not allow multiple changes at a site to occur along a branch, underestimates the variance in ancestral states assignments by considering only those reconstructions that minimize the number of changes, imposes the same probability of change in both long and short nodes, and does not accommodate phylogenetic and parameter uncertainties.

We categorized the current distribution of genera (Goodwin, 1983) as discrete character states based on biogeographic regions (Sclater, 1858; Wallace, 1876) as: (0) Neotropical, (1) Afrotropical, (2) Oriental, and (3) Australasian. Because both Bayesian and maximum likelihood methods require unique character states to be assigned to taxa, and as Caloenas, Chalcophaps, Columba, Ducula, Gallicolumba, Geopelia, Macropygia, Otidiphaps, Ptilinopus, Streptopelia, and Treron occur in more than one of the defined biogeographic areas, we also defined the following composite area states: (4) Australasian plus Oriental, (5) Afrotropical plus Oriental, and (6) Afrotropical plus Oriental plus Australasian.


    Results
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Acknowledgments
 References
 
Sequence Variability
The combined data set had a total of 2973 parsimony-informative characters, and uncorrected sequence divergence between taxa varied from 4% to 15% (Table 2). IRBP and ND2 have the least and highest number of parsimony-informative characters, respectively (Table 2). IRBP has the narrowest and ND2 the widest range of uncorrected distances among Columbiformes (Table 2). A Bonferroni-corrected chi-square test indicated that no taxon has a significant deviation in base composition. FIB7 for Claravis pretiosa was only 395 bp in length, considerably shorter than the average 1034 bp across Columbiformes (range 880 to 1071 bp).


View this table:
[in this window]
[in a new window]

 
Table 2 Sequence variability and model parameters for each data partition. Gene abbreviations as in Table 3.

 
Model Selection
Table 2 also shows the results of model selection for each data partition. For most partitions, the uncertainty in model selection was very low (Akaike weight for best fitting model > 0.9). For FIB7, the second best model added a proportion of invariable sites to the GTR+g model and had an Akaike weight of 0.245. Together, these two models made up 91.2% of the cumulative Akaike weight for FIB7. For IRBP, the second best model dropped the proportion of invariable sites from the best model, GTR+i+g, and had an Akaike weight of 0.217. The first and second best model made up 78.0% of the cumulative Akaike weight.

Tree Inference
The 50% majority-rule consensus Bayesian (BA) topology was obtained from 24,000 post–burn-in trees sampled from two simultaneous and independent runs (Fig. 1). The Columbiformes was recovered as a monophyletic group, with posterior probability of 1.0. Three highly supported major clades were identified within the Columbiformes and will be referred to as A, B, and C (Fig. 1). Clade A is subdivided into two clades with highly supported relationships; one of the subclades contains genera exclusively distributed in the Americas and the other shows a basal polytomy including Old and New World pigeons and doves, including the extinct passenger pigeon Ectopistesmigratorius. Clade B is well resolved and groups the small New World ground doves together. Clade C includes mostly genera found in Africa, Asia, Australia, the East Indies, and New Zealand. Several internal nodes have low posterior probabilities within Clade C. The posterior distribution of trees contained 8175 alternative topologies that differ mostly in the position of some ingroup taxa with low posterior probability, especially within Clade C. Among these alternative topologies, the three major clades are related to each other in three possible alternative ways: (A(B, C), (A, C)B), and (A, B)C), with cumulative posterior probabilities of 0.83, 0.09, and 0.08, respectively.


Figure 1
View larger version (101K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 Bayesian consensus tree. Major clades within Columbiformes are indicated as A, B, and C. Numbers at nodes are posterior probabilities, which are indicated with an * when = 1.0. Scale bar represents expected number of substitutions per site.

 
The best ML tree obtained in GARLI version 0.942 had a very similar topology to the BA tree, and strongly supported the monophyly of clades A, B, and C (i.e., bootstrap proportion [BP] > 80%) as measured by 100 bootstrap replicates (Fig. 2). The placement of B and C as sister clades received 52% bootstrap support. Relationships within clades A and B are very similar in the ML and the BA topologies, except that Zenaida and Uropelia were placed more basally in clades A and B in the ML tree, respectively, but without strong support. Additionally, many relationships within clade C were more strongly supported by bootstrapping (i.e., BP > 70%) and were similar to those recovered in the BA analysis, except for the placement of Alectroenas, Drepanoptila, Goura, Phapitreron, Treron, and Trugon. These taxa received less than 50% bootstrap support in the ML topology, but high posterior probabilities in the BA topology. The ML analysis failed to recover Caprimulgiformes as a monophyletic assemblage in the best sampled ML tree, but Caprimulgus and Podargus were sister genera in 60% of the bootstrapped trees.


Figure 2
View larger version (65K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2 Maximum likelihood tree. Major clades within Columbiformes are indicated as A, B, and C. Numbers at nodes are bootstrap proportions and are indicated with an * when = 100 and not indicated when < 50%. Scale bar represents expected number of substitutions per site.

 
One most parsimonious (MP) tree of 19,082 steps was recovered (Fig. 3). Monophyly of clades A, B, and C was well supported by MP bootstrap proportions. Most phylogenetic relationships within clade C were not supported by bootstrapping. The most parsimonious tree also differed from the consensus BA and ML topologies by positioning clade B as a sister clade to A and C. However, the AU test rejected the MP topology (P = 0.019) in favor of the consensus BA (P = 0.896) and ML topologies (P = 0.161) as good representations of the phylogenetic relationships among Columbiformes.


Figure 3
View larger version (107K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 3 Maximum parsimony tree. Major clades within Columbiformes are indicated as A, B, and C. Numbers at nodes are bootstrap proportions and are indicated with an * when = 100 and not indicated when < 50%. Scale bar represents expected number of substitutions per site.

 
Exclusion of Taxa with Missing Data
Phylogenetic analyses of the 9.0-kb data set, which excludes six Columbiformes with significant amount of missing data (~ 84%), did not show any appreciable topological changes in any method of tree inference (Fig. 4a, Fig. 4b, Fig. 4c). Furthermore, a Bayesian analysis of 1590 bp of aligned sequences from 12S rDNA and cyt b for all taxa (except Podargus for which we could not obtain 12S rDNA sequences) resulted in a less resolved tree topology (Fig. 4d), but the placement of the six genera with significant missing data (Fig. 4d) was similar to that in the topologies derived from the 9.0 kb data set (Fig. 1 to Fig. 3).


Figure 4
View larger version (369K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 4 Phylogenetic reconstruction obtained when six taxa with significant amount of missing data are excluded from the data set (a, b, and c) and when only 12S rDNA and cyt b sequences were used, including GenBank sequences for genera not sampled in this study (d). Reconstructions were based on Bayesian analysis (a and d), maximum likelihood (b), and maximum parsimony (c). Major clades within Columbiformes are indicated as A, B, and C. Numbers at nodes are Bayesian posterior probabilities (a, d) or bootstrap proportions (b, c), indicated with an * when 1.0 or 100, respectively. Scale bar represents expected number of substitutions per site. (Continued)

 
Molecular Dating
We approximated the prior and posterior distribution of divergence times based on a parametric Bayesian approach (Kishino et al., 2001; Thorne and Kishino, 2002). The size of the 95% CrI of the prior distribution for node ages is considerably larger than the size of the 95% CrI of the posterior distribution (Table 3). These differences in the size of credible intervals between the prior and posterior distribution indicate that the prior specification has little influence on the posterior distribution and that most of the information about divergence time is retrieved from the sequence data (Thorne and Kishino, 2002). The prior distribution ignores the information contained in the sequence data; hence, it is expected that there will be a larger amount of uncertainty in prior divergence time estimates. Based on the 95% CrI of the posterior distribution, Columbiformes diverged from outgroups such as Caprimulgiformes and Apodiformes in the Early to Late Cretaceous between 87 to 110 Mya, and Caprimulgiformes and Apodiformes last shared a common ancestor between 83 to 107 Mya. Radiation of modern Columbiform genera occurred from the Early Eocene to Middle Miocene.


View this table:
[in this window]
[in a new window]

 
Table 3 Prior and posterior distribution of divergence times using a Bayesian multigene approach.

 
Reconstruction of Ancestral Area States
Bayesian reconstruction of ancestral areas implemented in SIMMAP and accounting for phylogenetic uncertainty indicated that the most recent common ancestor of all extant Columbiformes lived in Gondwanaland, most likely in what is now the Neotropical region (posterior probability = 0.977). The Australasian and the composite Australasian plus Oriental areas were the only other regions to receive non-zero but low posterior probabilities (< 0.01 and 0.02, respectively). The Markov-ML reconstruction of ancestral areas implemented in Mesquite and accounting for divergence times provided similar conclusions to those from the Bayesian approach. The proportions of the maximum likelihood attributed to each biogeographic region were Neotropical region = 0.85, Australasian = 0.14, remaining areas and composite areas = 0.002 each. The results of reconstruction of ancestral areas favor an ancestral area of modern Columbiformes in Gondwanaland (Fig. 5), when Africa and India had already drifted apart from the Antarctica–Australia–South America landmass. By counting the number of significant changes in the estimates of ancestral areas (Fig. 5), it seems that members of clade A dispersed at least twice to the Oriental biogeographic zone and at least once to the Afrotropical zone, and members of clade C dispersed at least eight times to the Oriental and four times to the Afrotropical zones. No intercontinental dispersals of members of Clade B seem to have occurred according to the reconstruction of ancestral areas (Fig. 5).


Figure 5
View larger version (52K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 5 Chronogram of Columbiformes diversification and reconstruction of ancestral areas. Major clades within Columbiformes are indicated as A, B, and C. Branch lengths are given in Mya following the posterior distribution of divergence times shown in Table 3. Present biogeographic distribution is indicated as colored circles at the tips, and the proportion of the total likelihood received by each biogeographic region as the ancestral area of a given clade is represented by pie charts at internal nodes. Node numbers as in Table 3.

 

    Discussion
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Acknowledgments
 References
 
High-Level Columbiformes Systematics
We gathered the largest mitochondrial and nuclear DNA data set so far to infer the phylogenetic relationships among Columbiformes. Phylogenetic analyses of these sequences recovered competing phylogenetic trees that confirm the monophyly of Columbiformes (Fig. 1, Fig. 2, Fig. 3) and refute the monophyly of the family Columbidae with the placement of Raphidae within it, and also contest subfamily classifications proposed for Columbiformes (reviewed in Sibley and Ahlquist, 1990). Despite some topological differences, all the alternative methods of phylogenetic reconstruction identified three major clades within the Columbiformes (Fig. 1, Fig. 2, Fig. 3). Clade A comprises Geotrygon, Leptotila, and Zenaida of the New World as a sister group to a clade containing typical New World pigeons, including the extinct passenger pigeon Ectopistes migratorius, and Old World cuckoo and turtle doves plus typical pigeons. Clade B includes the small New World ground doves. Clade C contains genera found in Africa, Asia, Australia, the East Indies, and New Zealand, including mountain pigeons, fruit doves, emerald doves, bronzewings, crowned pigeons, Old World ground doves, and many monotypic forms such as the Nicobar, the thick-billed ground, pheasant, wonga, topknot, and New Zealand pigeons, and also the extinct dodo and solitaires usually placed in their own subfamily, the Raphidae. The internal phylogenetic relationships within clades A and B were well supported despite the method of phylogenetic inference used. Conversely, basal phylogenetic relationships within clade C were difficult to resolve and were not well supported by Bayesian posterior probabilities or bootstrap proportions in maximum parsimony and maximum likelihood, even with the combined analysis of about 9.0 kb of mitochondrial and nuclear sequences and appropriate modeling of the DNA substitution process. Clade C may be approaching a hard polytomy, in which case additional DNA sequences are unlikely to improve resolution because many of its members are endemic forms that may have colonized several islands of the Oriental–Australasian biogeographic regions almost simultaneously.

Maximum parsimony analysis indicated clade B to be sister to A and C, as also found previously with maximum parsimony and maximum likelihood analyses of combined cyt b and FIB7 sequences (Johnson and Clayton, 2000). Conversely, Bayesian and maximum likelihood analysis inferred a sister relationship between clades B and C, and in turn as a sister group to A. By applying a topological test, the former topology was strongly rejected in favor of the consensus Bayesian and maximum likelihood topology. Given the results of the topological test, the Bayesian posterior distribution of the alternative topological placements of clades A, B, and C, and the use of different models of DNA substitution for different gene partitions in the Bayesian analysis to obtain a better fit to the evolutionary process in which these sequences have evolved, we conclude that the Bayesian consensus topology is the best phylogenetic hypothesis among Columbiformes genera and will be the only one considered herein for further discussion.

Our Bayesian consensus phylogenetic hypothesis differs from hypotheses in two other studies that employed limited taxon sampling at the genus level (Johnson and Clayton, 2000; Shapiro et al., 2002). Using cyt b and FIB7 sequences, Johnson and Clayton (2000) recovered congruent maximum parsimony and maximum likelihood trees where clade B (as defined here) was a sister group to clades A and C, and clade C was not monophyletic. The only relationship of genera within a clade that differed in our phylogeny was the placement of Treron as sister to Goura. However, Johnson and Clayton (2000) did not sample Turtur and Chalcophaps, which, together with Oena, were more closely related to Treron than to Goura in our study. Additionally, Johnson and Clayton (2000) recovered a phylogenetic tree that had many nodes with low bootstrap support, especially within clade C and for the sister group relationships among the three major clades.

Although our phylogenetic hypothesis was not based on extensive sampling at the species level, it agrees with previous suggestions based on molecular (Johnson, 2004; Johnson and Clayton, 2000; Johnson et al., 2001) and nonmolecular data (Goodwin, 1983; Peters, 1937) that the New World Patagioenas (previously considered part of Columba) and the Old World Columba are not sister taxa. Columba was recovered in our analyses as sister to Streptopelia in agreement with other molecular phylogenetic analyses (Johnson, 2004; Johnson et al., 2001), but the closest sister genus to Patagioenas is still uncertain (this study; Johnson, 2004; Johnson et al., 2001) because it is part of a polytomous subclade within clade A (Fig. 1 to Fig. 3).

Shapiro et al. (2002) recovered a maximum likelihood tree based on 12s rDNA and cyt b sequences, with a well-supported sister relationship between clades B and C. Clade A was not recovered as a monophyletic assemblage in their analysis and was not well supported either by maximum likelihood bootstrapping or Bayesian posterior probability. Additionally, the differences observed between our phylogenetic hypothesis and that of Shapiro et al. (2002) related to those nodes that received low support in their tree, suggesting that the increased character and taxon sampling in our study largely improved tree resolution as expected (e.g., Pereira et al., 2002; Poe and Swofford, 1999). Furthermore, Shapiro et al. (2002) included three endemic (Alectroenas and the monotypic Drepanoptila and Didunculus) and three extinct genera (Ectopistes, Pezophaps, and Raphus) that we were not able to obtain samples of for further DNA sequencing. However, we included the 12S rDNA and cyt b sequences from Shapiro et al. (2002) in our Bayesian analysis and recovered these genera in similar positions. We also ran a second Bayesian analysis reducing our data set to match in length the available sequences for these five genera. The tree recovered (Fig. 4d) placed Pezophaps and Raphus (posterior probability [PP] = 0.99) as a sister clade to Caloenas (PP = 1.0), as in Shapiro et al. (2002), but our tree recovered Alectroenas as a sister (PP = 1.0) to Drepanoptila and Ptilinopus (PP = 0.90), instead of placing Alectroenas and Drepanoptila as sister genera. These results have to be taken cautiously because we only sampled one species per genus and Shapiro et al. (2002) included three species of Ptilinopus, which were not monophyletic and did not receive strong support from either Bayesian posterior probabilities or maximum likelihood nonparametric bootstrapping in their analysis. Moreover, the Bayesian method of tree inference does not seem to be affected adversely by missing data in the complete data set, as predicted by simulation studies (Wiens, 2005), because most of the relationships within clades A, B, and C are recovered with moderate to high nodal support despite the inclusion of those taxa or not.

None of the several published classifications of Columbiformes based on nonmolecular data (revised in Sibley and Ahlquist, 1990) are congruent with the phylogenetic relationships recovered in our analyses. However, Goodwin (1983) presented a noncladistic hypothesis of relationships among genera based on behavior and plumage patterns, where clades A and C were not depicted as monophyletic assemblages, but some of his presumed genera associations were similar to the relationships recovered in the present study. Behavioral characters seem to be more phylogenetically conserved than morphological and osteological characters (Baker et al., 2006; Pereira and Baker, 2005). Hence, we might predict that that the rare, endemic genera Cryptophaps and Starnoenas and the extinct Microgoura, for which we were not able to obtain samples, are closely related to genera suggested by Goodwin (1983). Future sequencing and phylogenetic analysis will be necessary to formally place them in a phylogenetic framework and evaluate these predictions. At this point, we refrain from proposing a new classification at the family level and below because we feel that it is essential to integrate our data with revised nonmolecular data to forward a new classification for the order Columbiformes.

Divergence Times and Historical Biogeography of Columbiformes
We applied a Bayesian approach to estimate divergence times among Columbiformes genera that allowed data to be partitioned by gene and accounted for uncertainty in branch lengths and time constraints (Kishino et al., 2001; Thorne and Kishino, 2002; Thorne et al., 1998). Based on the 95% credible interval (CrI) of the Bayesian posterior distribution of age estimates, our molecular time estimates add the Columbiformes, Apodiformes, and Caprimulgiformes to the group of avian orders that originated in the Cretaceous and survived the Cretaceous-Tertiary mass extinction (Baker et al., 2006; Cooper and Penny, 1997; Nahum et al., 2003; Pereira and Baker, 2006a). Avian fossil evidence supports these molecular time estimates by clearly indicating that Palaeognathae and Galloanserae (Anseriformes and Galliformes) were already independent lineages in the Cretaceous (e.g., Clarke et al., 2005; Livezey, 1997). Some lineages within Neoaves, which is the sister group to Galloanserae, may also have been present in the Cretaceous, although the fossil record is too scanty and fragmentary to be unambiguously attributed to specific modern Neoaves (Cracraft, 2001; Dyke, 2001; Padian and Chiappe, 1998). Furthermore, the trans-Antarctic distributional pattern of Columbiformes and several other groups of birds (e.g., Neotropical and Australasian parrots, cracids and megapodes, and Apodiformes and Caprimulgiformes) suggest a Cretaceous age for these clades that became isolated in South America and Australia with the breakup of Gondwanaland and glaciation of Antarctica (Cracraft, 2001).

Considering the estimated Cretaceous age for modern Columbiformes lineages and their trans-Antarctic distribution, we tested whether the ancestors of the group may have lived in Gondwana by applying a maximum likelihood and a Bayesian framework to map the present-day distribution of genera onto the well-supported consensus Bayesian tree with time constraints and reconstructing ancestral areas (Fig. 5). Our results strongly support a Gondwanaland origin for Columbiformes (as suggested by Johnson and Clayton, 2000; Shapiro et al., 2002), implying diversification by vicariance due to continental drift and dispersal to the Afrotropical and Oriental biogeographic areas by members of clades A and C. Based on 95% CrI of the Bayesian posterior distribution of divergence times, the generic radiation within clade A, whose most recent common ancestor likely lived in the Neotropics (node 3 in Fig. 5), occurred between 39 and 55 Mya when the lineages leading to Neotropical genera became independent from those lineages that would lead to non-Neotropical genera. The ancestors of clade C were more likely to have been present in the Australasian region (node 14 in Fig. 5) before the group started to radiate around 41 to 57 Mya. Mapping ancestral areas onto the time-constrained Bayesian consensus tree (Fig. 5) suggests that the Oriental biogeographic zone seemed to have been colonized at least once by members of clade A and at least eight times by members of clade C, and the Afrotropical biogeographic zone at least once by members of clade A, and at least four times by members of clade C. It is very likely that the colonization route of both biogeographic zones would have been via Australasia judging from the reconstructed ancestral areas for the last common ancestor of Afrotropical and Oriental taxa (Fig. 5). The hypothesized dispersal events are summarized in Figure 6.


Figure 6
View larger version (65K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 6 Summary of inferred dispersal events of Columbiformes mapped on present-day world. Zoogeographic zones are indicated. The question mark above Gondwanaland indicates the uncertainty about which region of the supercontinent may have harbored the ancestors of Columbiformes. Solid and dashed arrows represent dispersal events by members of clades A and C, respectively. Minimum numbers of independent dispersal events are indicated next to arrows when greater than one.

 
Considering the timing of diversification of Oriental and Afrotropical taxa (50 Mya and younger) and the geological history of the Oriental and Australasian biogeographic zones helps to understand why phylogenetic analyses of over 9.0 kb of mitochondrial and nuclear DNA sequences could not provide stronger support for some of the phylogenetic relationships within clade C as it did for clades A and B. In the Early Eocene, Africa and India were separate landmasses drifting away from the Antarctica–Australia–South America supercontinent (Li and Powell, 2001). This implies that dispersal to Africa and India from Antarctican–Australia–South America would be unlikely because they were separated by large oceans at that time. Conversely, Australia separated from Antarctica around 40 Mya and started to drift north towards southeast Asia (Li and Powell, 2001), facilitating the exchange of biota among the Oriental and Australasian biogeographic zones (Hall, 1998, 2001; Li and Powell, 2001; Moss and Wilson, 1998; Pubellier et al., 2003). The exchange process may have been also favored by a considerable decrease in sea level (Haq et al., 1987) that likely resulted in greater land exposure between southeastern Asia and northern Australia, including several intermittent emergent Pacific Islands (Hall, 1998).

Once the Oriental biogeographic zone has been colonized, further dispersal into the Afrotropical and the Palearctic zones would have been a natural outcome because the physical barriers that delimit these zones (e.g., Sahara desert, Himalayas mountains) did not form until the Neogene (Kroepelin, 2006; Li and Powell, 2001). Additionally, the age of Afrotropical Columbiform taxa falls within the 95% CrI of 41 to 28 Mya for members of clade A (node 6 in Fig. 5) and 53 to 15 Mya for members of clade C (nodes 27 and 29 in Fig. 5). Dispersal into the Afrotropical and Palearctic zones may have been facilitated by the close proximity of the African and Eurasian plates (Bumby and Guiraud, 2005; Hall, 1998) and the beginning of the collision of the Indian subcontinent with Eurasia (Li and Powell, 2001; Zhu et al., 2005). This last event may have caused many islands to emerge between India and Africa during times of sea level fall, facilitating the dispersal of organisms between these landmasses. For example, a "stepping stone" route has been suggested previously for the extinct dodo and solitaires (Raphus and Pezophaps) of the Mauritius and Rodrigues Islands in the Mascarene Plateau (Shapiro et al., 2002). Our estimates for the split of dodos and solitaries from their sister genus Caloenas (posterior mean estimate = 33 Mya; 95% CrI 25 to 42 Mya) was younger than those of Shapiro et al. (2002; 42 Mya; 95% CI 31 to 56 Mya). On the other hand, our age estimates for the split of Raphus and Pezophaps (posterior mean estimate = 23 Mya; 95% CrI 15–31 Mya) is similar to that of Shapiro et al. (2002; 25 Mya; 95% confidence interval [CI] 17 to 36 Mya). Our analyses of nuclear and mitochondrial DNA sequences confirm suggestions that the ancestors of dodo and solitaires may have lived in other now-submerged islands in the Indian Ocean (Shapiro et al., 2002), tens of million years before Mauritius and Rodrigues emerged in the Mascarene Plateau (McDougall and Chamalaun, 1969).

In our reconstruction of ancestral areas, we did not consider Antarctica as a character state because no Columbiformes now occur in this continent. However, we cannot rule out the hypothesis that Antarctica and not the Neotropics was in fact the place where the ancestors of Columbiformes first evolved (Fig. 6). If the ancestors of Columbiformes were in Antarctica, the development of a permanent ice cap in Antarctica in the Oligocene and Late Miocene-to-present (Zachos et al., 2001) may have forced them to disperse northwards to more temperate and tropical climates (e.g., Baker et al., 2006; Tavares et al., 2006). Indeed, our estimates of divergence times indicate that the non-Neotropical genera of clade A last shared a common ancestor with Neotropical taxa about 28 to 41 Mya (node 6 in Fig. 5), coinciding with the beginning of glaciation in Antarctica (Zachos et al., 2001). However, most of the colonization events of the Oriental regions via Australia seemed to have occurred in the Eocene, prior to the Antarctic glaciation periods.

Although the breakup of Gondwanaland itself may not have been the ultimate cause of avian diversification, it created the necessary isolation for the ancestors of modern lineages to start their differentiation because of other reasons (Cracraft, 2001). As in many other groups of birds such as ratites, tinamous, galliform birds, penguins, shorebirds, parrots, passerine birds, toucans, and trogons (e.g., Baker et al., 2004, 2006; Crowe et al., 2006; Moyle, 2005; Nahum et al., 2003; Paton et al., 2003; Pereira and Baker, 2006a, 2006b; Pereira et al., 2002; Tavares et al., 2006), most cladogenetic events that led to extant genera of Columbiformes occurred throughout the Eocene and Oligocene epochs, and only a few events took place in the Miocene. These cladogenetic events were likely influenced by significant changes in Earth physiography that occurred during these epochs, like the formation of major mountain ranges, reshaping of river basins, sea level fluctuations, and climatic changes (Hall, 1998; Haq et al., 1987; Li and Powell, 2001; Lundberg et al., 1998; Zhu et al., 2005). Although it is assumed that these events caused major global changes in floral and faunal composition (MacFadden, 2000; Nores, 2004), it is a hard task to link which paleoevents are associated with particular generic divergences within Columbiformes at this moment. Future phylogenetic analysis and molecular dating at the species level might help refine the historical biogeography of the Columbiformes and evaluate the influence of specific continental paleoevents on their radiation (e.g., Baker et al., 2006; Pereira and Baker, 2004).

Our estimates of divergence time revealed a temporal gap of about 41 to 46 Mya between the origin of Columbiformes in the Late Cretaceous and diversification of genera within the order in the Eocene. Similar gaps of about 30 to 45 Mya over the same geological time span has also been observed for diversification of genera among Neotropical cracids (Pereira et al., 2002), toucans (Nahum et al., 2003), one of two major lineages of Neotropical parrots (Tavares et al., 2006), and penguins (Baker et al., 2006). Although diversification of extant genera in these clades of birds might suggest a major culling following the K/P bolide impact, other explanations are possible. Unfortunately, the fossil record for many avian groups is scanty or fragmentary, and thus inadequate to provide clues about the flowering of genera of most of the above groups of birds during the Late Cretaceous to Eocene. However, the Eocene fossil record of penguins indicates that stem penguins (reviewed in Ksepka et al., 2006) became extinct as extant penguins started to diversify (Baker et al. 2006). Baker et al. (2006) and Tavares et al. (2006) suggested that penguins and parrots dispersed from Antarctica to regions of warmer climates as Antarctica became ice-encrusted, and global climate started to cool down in the Eocene (e.g., Shevenell et al., 2004). Other birds do not show such a temporal gap, such as shorebirds, which are distributed worldwide, and a lineage of Neotropical parrots assumed to have been already in South America in the Early Eocene (Baker et al., 2007; Tavares et al., 2006). Hence, the lack of cladogenesis observed in certain avian groups prior to the Eocene may be more apparent than real, and the temporal gap may in fact represent a reshaping of the biodiversity of these groups during the early days of the Eocene. The diverse temporal pattern of avian diversification unveiled so far suggests that avian groups responded differently to the physiographic and climate changes of the Late Cretaceous and Cenozoic, and faunal turnover was greater in some groups than in others.


    Acknowledgments
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Acknowledgments
 References
 
We are grateful for Robert Blaquière for creating a database to filter and report the posterior probabilities of specific tree topologies among the thousands of alternatives in the Bayesian posterior distribution of trees. The following institutions provided tissues specimens: Australian National Wildlife Collection, Field Museum of Natural History, University of Kansas Museum of Natural History, Louisiana State University Museum of Natural Science, University of Washington Burke Museum, and U.S. National Museum of Natural History. KPJ was supported by National Science Foundation grants DEB-0107891 and DEB-0612938. DHC was supported by National Science Foundation grants DEB-0118794 and DEB-0614565. AJB was supported by an operating grant from the Natural Sciences and Engineering Research Council of Canada, the Royal Ontario Museum Foundation, and the National Science Foundation (Assembling the Tree of Life (AToL) Program—EF-0228693).


    References
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Acknowledgments
 References
 

    Baker A. J., Pereira S. L., Haddrath O. P., Haddrath E. K.-A. Multiple gene evidence for expansion of extant penguins out of Antarctica due to global cooling. Proc. R. Soc. Lond. B (2006) 273:11–17.[Abstract/Free Full Text]

    Baker A. J., Pereira S. L., Paton T. A. Phylogenetic relationships and divergence times of Charadriiformes genera: Multigene evidence for the Cretaceous origin of at least 14 clades of shorebirds. Biol. Lett. (2007) 3:205–209.[Abstract/Free Full Text]

    Banks R. C., Cicero C., Dunn J. L., Kratter A. W., Rasmussen P. C., Remsen J. V. Jr., Rising J. D., Stotz D. F. Forty-fourth supplement to the American Ornithologists' Union check-list of North American birds. Auk (2003) 120:923–931.[CrossRef][Web of Science]

    Barker F. K., Cibois A., Schikler P., Feinstein J., Cracraft J. Phylogeny and diversification of the largest avian radiation. Proc. Natl. Acad. Sci. USA (2004) 101:11040–11045.[Abstract/Free Full Text]

    Bollback J. P. SIMMAP: Stochastic character mapping of discrete traits on phylogenies. BMC Bioinformatics (2006) 7:88.[CrossRef][Medline]

    Brodkorb P. Catalogue of fossil birds, part 2 (Anseriformes through Galliformes). Bull Florida State Mus. Biol. Sci (1964) 8:195–335.

    Bumby A. J., Guiraud R. The geodynamic setting of the Phanerozoic basins of Africa. J. Afr. Earth Sci. (2005) 43:1–12.[CrossRef]

    Chiappe L. M., Dyke G. J. The Mesozoic radiation of birds. Annu. Rev. Ecol. Syst. (2002) 33:91–124.[CrossRef][Web of Science]

    Clarke J. A., Tambussi C. P., Noriega J. I., Erickson G. M., Ketcham R. A. Definitive fossil evidence for the extant avian radiation in the Cretaceous. Nature (2005) 433:305–308.[CrossRef]

    Cooper A., Penny D. Mass survival of birds across the Cretaceous-Tertiary boundary: Molecular evidence. Science (1997) 275:1109–1113.[Abstract/Free Full Text]

    Cracraft J. Avian evolution, Gondwana biogeography and the Cretaceous-Tertiary mass extinction event. Proc. R. Soc. Lond. B (2001) 268:459–469.[Abstract/Free Full Text]

    Crowe T. M., Bowie R. C., Bloomer P., Mandiwana T. G., Hedderson T. A. J., Randi E., Pereira S. L., Wakeling J. Phylogenetics, biogeography and classification of, and character evolution in, gamebirds (Aves: Galliformes): Effects of character exclusion, data partitioning and missing data. Cladistics (2006) 22:495–532.[CrossRef][Web of Science]

    Darwin C. On the origin of species by means of natural selection or the preservation of favoured races on the struggle for life (1859) London: John Murray.

    Dyke G. J. The evolutionary radiation of modern birds: Systematics and patterns of diversification. Geol. J. (2001) 36:305–315.[CrossRef]

    Dyke G. J., Van Tuinen M. The evolutionary radiation of modern birds (Neornithes): Reconciling molecules, morphology and the fossil record. Zool. J. Linn. Soc. (2004) 141:153–177.[CrossRef]

    Ericson P. G., Anderson C. L., Britton T., Elzanowski A., Johansson U. S., Kallersjo M., Ohlson J. I., Parsons T. J., Zuccon D., Mayr G. Diversification of Neoaves: Integration of molecular sequence data and fossils. Biol. Lett. (2006) 2:543–547.[Abstract/Free Full Text]

    Gibbs D., Barnes E., Cox J. Pigeons and Doves. A guide to the pigeons and doves of the world (2001) East Sussex, UK: Pica Press.

    Goodwin D. Pigeons and doves of the world (1983) 3rd. edition. Ithaca, NY: Comstock Publishing Associates.

    Gradstein F. M., Ogg J. G., Smith A. G., Agterberg F. P., Bleeker W., Cooper R. A., Davydov V., Gibbard P., Hinnov L. A., House M. R., Lourens L., Luterbacher H. P., McArthur J., Melchin M. J., Robb L. J., Shergold J., Villeneuve M., Wardlaw B. R., Ali J., Brinkhuis H., Hilgen F. J., Hooker J., Howarth R. J., Knoll A. H., Laskar J., Monechi S., Plumb K. A., Powell J., Raffi I., Röhl U., Sadler P., Sanfilippo A., Schmitz B., Shackleton N. J., Shields G. A., Strauss H., Van Dam J., van Kolfschoten T., Veizer J., Wilson D. A geologic time scale 2004 (2004) Cambridge, UK: Cambridge University Press.

    Groth J. G., Barrowclough G. F. Basal divergences in birds and the phylogenetic utility of the nuclear RAG-1 gene. Mol. Phylogenet. Evol. (1999) 12:115–123.[CrossRef][Web of Science][Medline]

    Guindon S., Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. (2003) 52:696–704.[Abstract/Free Full Text]

    Haddrath O., Baker A. J. Complete mitochondrial DNA genome sequences of extinct birds: Ratite phylogenetics and the vicariance biogeography hypothesis. Proc. R. Soc. Lond. B (2001) 268:939–945.[Abstract/Free Full Text]

    Hall R. The plate tectonics of Cenozoic SE Asia and the distribution of land and sea. In: Biogeography and geological evolution of SE Asia—Hall R., Holloway J. D., eds. (1998) Leiden, The Netherlands: Backhuys Publishers. 99–131.

    Hall R. Cenozoic reconstructions of SE Asia and the SW Pacific: Changing patterns of land and sea. In: Faunal and floral migrations and evolution in SE Asia-Australasia—Metcalfe I., Smith J. M. B., Morwood M., Davidson I. D., eds. (2001) Lisse: Swets & Zeitlinger Publishers. 35–56.

    Haq B. U., Hardenbol J., Vail P. R. Chronology of fluctuating sea levels since the Triassic. Science (1987) 235:1156–1167.[Abstract/Free Full Text]

    Hasegawa M., Kishino H., Yano T. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. (1985) 22:160–174.[CrossRef][Web of Science][Medline]

    Hedges S. B., Parker P. H., Sibley C. G., Kumar S. Continental breakup and the ordinal diversification of birds and mammals. Nature (1996) 381:226–9.[CrossRef]

    Huelsenbeck J. P., Nielsen R., Bollback J. P. Stochastic mapping of morphological characters. Syst. Biol. (2003) 52:131–158.[Abstract/Free Full Text]

    Johnson K. P. Deletion bias in avian introns over evolutionary timescales. Mol. Biol. Evol. (2004) 21:599–602.[Abstract/Free Full Text]

    Johnson K. P., Clayton D. H. Nuclear and mitochondrial genes contain similar phylogenetic signal for pigeons and doves (Aves: Columbiformes). Mol. Phylogenet. Evol. (2000) 14:141–51.[CrossRef][Web of Science][Medline]

    Johnson K. P., de Kort S., Dinwoodey K., Mateman A. C., ten Cate C., Lessells C. M., Clayton D. H. A molecular phylogeny of the dove genera Streptopelia Columba. Auk (2001) 118:874–887.[CrossRef][Web of Science]

    Kishino H., Thorne J. L., Bruno W. J. Performance of a divergence time estimation method under a probabilistic model of rate evolution. Mol. Biol. Evol. (2001) 18:352–361.[Abstract/Free Full Text]

    Kroepelin S. Revisiting the age of the Sahara Desert (letters). Science (2006) 312:1138.[Web of Science][Medline]

    Ksepka D. T., Bertelli S., Giannini N. P. The phylogeny of the living and fossil Sphenisciformes (penguins). Cladistics (2006) 22:412–441.[CrossRef][Web of Science]

    Lewis P. O. A likelihood approach to estimating phylogeny from discrete morphological character data. Syst. Biol. (2001) 50:913–925.[Abstract/Free Full Text]

    Li Z. X., Powell C. M. An outline of the palaeogeographic evolution of the Australasian region since the beginning of the Neoproterozoic. Earth-Sci. Rev. (2001) 53:237–277.[CrossRef]

    Livezey B. C. A phylogenetic analysis of basal Anseriformes, the fossil Presbyornis, and the interordinal relationships of waterfowl. Zool. J. Linn. Soc. (1997) 121:361–428.[CrossRef]

    Livezey B. C., Zusi R. L. Higher-order phylogeny of modern birds (Theropoda, Aves: Neornithes) based on comparative anatomy. II. Analysis and discussion. Zool. J. Linn. Soc. (2007) 149:1–95.[Medline]

    Lundberg J. G., Marshall L. G., Guerrero J., Horton B., Malabarba M. C. S. L., Wesselingh F. The stage for Neotropical fish diversification: A history of tropical South American rivers. In: Phylogeny and classification of neotropical fishes—Malabarba L. R., Reis R. E., Vari R. P., Lucena Z. M. S., Lucena C. A. S., eds. (1998) Porto Alegre, Brazil: Editora Universitaria PUCRS. 13–48.

    MacFadden B. J. Cenozoic mammalian herbivores from the Americas: Reconstructing ancient diets and terrestrial communities. Annu. Rev. Ecol. Syst. (2000) 31:33–59.[CrossRef][Web of Science]

    Maddison D. R., Maddison W. P. MacClade 4.0. (2000) Sunderland, Massachusetts: Sinauer Associates.

    Maddison W. P., Maddison D. R. Mesquite: A modular system for evolutionary analysis, version 1.06 (2005a) http://mesquiteproject.org.

    Maddison W. P., Maddison D. R. Stochar: A package of Mesquite modules for stochastic models of character evolution, version 1.06 (2005b) http://mesquiteproject.org.

    McDougall I., Chamalaun F. H. Isotopic dating and geomagnetic polarity studies on volcanic rocks from Mauritius, Indian Ocean. Geol. Soc. Am. Bull. (1969) 80:419–442.

    Moss S. J., Wilson M. E. J. Biogeographic implications of the Tertiary palaeoeographic evolution of Sulawesi and Borneo. In: Biogeography and geological evolution of SE Asia—Hall R., Holloway J. D., eds. (1998) Leiden, The Netherlands: Backhuys Publishers. 133–163.

    Moyle R. Phylogeny and biogeographical history of Trogoniformes, a pantropical bird order. Biol. J. Linn. Soc. (2005) 84:725–738.[CrossRef][Web of Science]

    Nahum L. A., Pereira S. L., Fernandes F. M. D., Matioli S. R., Wajntal A. Diversification of Ramphastinae (Aves, Ramphastidae) prior to the Cretaceous/Tertiary boundary as shown by molecular clock of mtDNA sequences. Genet. Mol. Biol. (2003) 26:411–418.[Web of Science]

    Nores M. The implications of Tertiary and Quaternary sea level rise events for avian distribution patterns in the lowlands of northern South America. Global Ecol. Biogeogr. (2004) 13:149–161.[CrossRef]

    Nylander J. A. MrModelTest 2.0. Program distributed by the editor (2004) Evolutionary Biology Centre, Uppsala University.

    Olson S. L., ed. The fossil record of birds (1985) New York: Academic Press.

    Padian K., Chiappe L. M. The origin and early evolution of birds. Biol. Rev. (1998) 73:1–42.

    Paton T. A., Baker A. J., Groth J. G., Barrowclough G. F. RAG-1 sequences resolve phylogenetic relationships within Charadriiform birds. Mol. Phylogenet. Evol. (2003) 29:268–278.[CrossRef][Web of Science][Medline]

    Pereira S. L., Baker A. J. Vicariant speciation of curassows (Aves, Cracidae): A hypothesis based on mitochondrial DNA phylogeny. Auk (2004) 121:682–694.[CrossRef][Web of Science]

    Pereira S. L., Baker A. J. Multiple gene evidence for parallel evolution and retention of ancestral morphological states in the shanks (Charadriiformes: Scolopacidae). Condor (2005) 107:514–526.

    Pereira S. L., Baker A. J. A mitogenomics timescale for birds detects variable phylogenetic rates of molecular evolution and refutes the standard molecular clock. Mol. Biol. Evol. (2006a) 23:1731–1740.[Abstract/Free Full Text]

    Pereira S. L., Baker A. J. A molecular timescale for galliform birds accounting for uncertainty in time estimates and heterogeneity of rates of DNA substitutions across lineages and sites. Mol. Phylogenet. Evol. (2006b) 38:499–509.[CrossRef][Web of Science][Medline]

    Pereira S. L., Baker A. J., Wajntal A. Combined nuclear and mitochondrial DNA sequences resolve generic relationships within the Cracidae (Galliformes, Aves). Syst. Biol. (2002) 51:946–958.[Abstract/Free Full Text]

    Peters J. L. Check-list of birds of the world (1937) Cambridge, Massachusetts: Harvard University Press.

    Poe S., Swofford D. L. Taxon sampling revisited. Nature (1999) 398:299–300.

    Prychitko T. M., Moore W. S. The utility of DNA sequences of an intron from the beta-fibrinogen gene in phylogenetic analysis of woodpeckers (Aves: Picidae). Mol. Phylogenet. Evol. (1997) 8:193–204.[CrossRef][Web of Science][Medline]

    Pubellier M., Ali J., Monnier C. Cenozoic plate interaction of the Australia and Philippine sea plates: "Hit-and-run" tectonics. Tectonophysics (2003) 363:181–199.[CrossRef][Web of Science]

    Ronquist F., Huelsenbeck J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics (2003) 19:1572–4.[Abstract/Free Full Text]

    Sclater P. L. On the general geographic distribution of the members of the class Aves. J. Proc. Linn. Soc. Zool. (1858) 2:130–145.[CrossRef]

    Shapiro B., Sibthorpe D., Rambaut A., Austin J., Wragg G. M., Bininda-Emonds O. R., Lee P. L., Cooper A. Flight of the dodo. Science (2002) 295:1683.[Free Full Text]

    Shevenell A. E., Kennett J. P., Lea D. W. Middle Miocene Southern Ocean cooling and Antarctic cryosphere expansion. Science (2004) 305:1766–1770.[Abstract/Free Full Text]

    Shimodaira H. An approximately unbiased test of phylogenetic tree selection. Syst. Biol. (2002) 51:492–508.[Abstract/Free Full Text]

    Shimodaira H., Hasegawa M. CONSEL: For assessing the confidence of phylogenetic tree selection. Bioinformatics (2001) 17:1246–1247.[Abstract/Free Full Text]

    Sibley C. G., Ahlquist J. E. Phylogeny and classification of Birds: A study in molecular evolution (1990) New Haven, Connecticut: Yale University Press.

    Slack K. E., Jones C. M., Ando T., Harrison G. L., Fordyce R. E., Arnason U., Penny D. Early penguin fossils, plus mitochondrial genomes, calibrate avian evolution. Mol. Biol. Evol. (2006) 23:1144–1155.[Abstract/Free Full Text]

    Strimmer K., von Haeseler A. Quartet puzzling: A quartet maximum likelihood method for reconstructing tree topologies. Mol. Biol. Evol. (1996) 13:964–969.[Web of Science]

    Swofford D. L. PAUP*: Phylogenetic analysis using parsimony (*and other methods), version 4.0 (2001) Sunderland, Massachussets: Sinauer Associates.

    Tavares E. S., Baker A. J., Pereira S. L., Miyaki C. Y. Phylogenetic relationships and historical biogeography of neotropical parrots (Psittaciformes: Psittacidae: Arini) inferred from mitochondrial and nuclear DNA sequences. Syst. Biol. (2006) 55:454–70.[Abstract/Free Full Text]

    Thorne J. L., Kishino H. Divergence time and evolutionary rate estimation with multilocus data. Syst. Biol. (2002) 51:689–702.[Abstract/Free Full Text]

    Thorne J. L., Kishino H., Painter I. S. Estimating the rate of evolution of the rate of molecular evolution. Mol. Biol. Evol. (1998) 15:1647–57.[Abstract]

    van Tuinen M., Hedges S. B. Calibration of avian molecular clocks. Mol. Biol. Evol. (2001) 18:206–13.[Abstract/Free Full Text]

    Wallace A. R. The geographic distribution of animals (1876) London: Macmillan.

    Wiegmann B. M., Yeates D. K., Thorne J. L., Kishino H. Time flies, a new molecular time-scale for brachyceran fly evolution without a clock. Syst. Biol. (2003) 52:745–56.[Abstract/Free Full Text]

    Wiens J. J. Can incomplete taxa rescue phylogenetic analyses from long-branch attraction? Syst. Biol. (2005) 54:731–742.[Abstract/Free Full Text]

    Yang Z. PAML: A program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. (1997) 13:555–556.[Free Full Text]

    Yang Z., Yoder A. D. Comparison of likelihood and Bayesian methods for estimating divergence times using multiple gene loci and calibration points, with application to a radiation of cute-looking mouse lemur species. Syst. Biol. (2003) 52:705–716.[Abstract/Free Full Text]

    Zachos J., Pagani M., Sloan L., Thomas E., Billups K. Trends, rhythms, and aberrations in global climate 65 Ma to present. Science (2001) 292:686–693.[Abstract/Free Full Text]

    Zhu B., Kidd W. S. F., Rowley D. B., Currie B. S., Shafique N. Age of initiation of the India-Asia collision in the East-Central Himalaya. J. Geol. (2005) 113:265–285.[CrossRef][Web of Science]

    Zwickl D. J. Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion (2006) The University of Texas at Austin. PhD Dissertation.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Virol.Home page
C. Firth, M. A. Charleston, S. Duffy, B. Shapiro, and E. C. Holmes
Insights into the Evolutionary History of an Emerging Livestock Pathogen: Porcine Circovirus 2
J. Virol., December 15, 2009; 83(24): 12813 - 12821.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Pereira, S. L.
Right arrow Articles by Baker, A. J.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Pereira, S. L.
Right arrow Articles by Baker, A. J.
GeoRef
Right arrow GeoRef Citation
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?