Skip Navigation

Systematic Biology 2006 55(1):122-137; doi:10.1080/10635150500481614
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (26)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Phillips, M. J.
Right arrow Articles by Penny, D.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Phillips, M. J.
Right arrow Articles by Penny, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2006 Society of Systematic Biologists

Combined Mitochondrial and Nuclear DNA Sequences Resolve the Interrelations of the Major Australasian Marsupial Radiations

Edited by Mike Lee: Associate Editor

Matthew J. Phillips1, Patricia A. McLenachan2, Christin Down2, Gillian C. Gibb2 and David Penny2

1 Henry Wellcome Ancient Biomolecules Center, Department of Zoology, Oxford University South Parks Road, Oxford, OX1 3PS, United Kingdom; E-mail: mattphillips73{at}hotmail.com
2 Allan Wilson Center for Molecular Ecology and Evolution, Massey University Palmerston North, New Zealand


    Abstract
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Conclusions and Further Work
 References
 
Australasian marsupials include three major radiations, the insectivorous/carnivorous Dasyuromorphia, the omnivorous bandicoots (Peramelemorphia), and the largely herbivorous diprotodontians. Morphologists have generally considered the bandicoots and diprotodontians to be closely related, most prominently because they are both syndactylous (with the 2nd and 3rd pedal digits being fused). Molecular studies have been unable to confirm or reject this Syndactyla hypothesis. Here we present new mitochondrial (mt) genomes from a spiny bandicoot (Echymipera rufescens) and two dasyurids, a fat-tailed dunnart (Sminthopsis crassicaudata) and a northern quoll (Dasyurus hallucatus). By comparing trees derived from pairwise base-frequency differences between taxa with standard (absolute, uncorrected) distance trees, we infer that composition bias among mt protein-coding and RNA sequences is sufficient to mislead tree reconstruction. This can explain incongruence between trees obtained from mt and nuclear data sets. However, after excluding major sources of compositional heterogeneity, both the "reduced-bias" mt and nuclear data sets clearly favor a bandicoot plus dasyuromorphian association, as well as a grouping of kangaroos and possums (Phalangeriformes) among diprotodontians. Notably, alternatives to these groupings could only be confidently rejected by combining the mt and nuclear data. Elsewhere on the tree, Dromiciops appears to be sister to the monophyletic Australasian marsupials, whereas the placement of the marsupial mole (Notoryctes) remains problematic. More generally, we contend that it is desirable to combine mt genome and nuclear sequences for inferring vertebrate phylogeny, but as separately modeled process partitions. This strategy depends on detecting and excluding (or accounting for) major sources of nonhistorical signal, such as from compositional nonstationarity.

Keywords: Base composition; combined data; marsupial; mitochondrial genome; phylogeny

Received November 5, 2006; Revised January 31, 2005; Accepted August 25, 2005


Recent Australasian marsupials have evolved into a multitude of niches and provide an extraordinary parallel to the evolution of placental mammals. Aplin and Archer (1987) recognize four orders. Dasyuromorphia (approximately 65 species) includes the insectivorous/carnivorous marsupial mice/cats/wolves/ anteater. Peramelemorphia (approximately 20 species) includes the semifossorial and omnivorous bandicoots and bilbies. The most numerous group, the largely herbivorous Diprotodontia (approximately 115 species) includes the possums, kangaroos, koala, and wombats, whereas Notoryctemorphia (marsupial moles) includes just two species in the genus Notoryctes. These groups, together with the monotypic Microbiothera (Dromiciops gliroides) from South America, make up the cohort Australidelphia, which was first recognized on the basis of tarsal morphology (Szalay, 1982).

The monophyletic status of Australidelphia, to the exclusion of the South American Didelphimorphia (opossums) and Paucituberculata (including caenolestids: shrew opossums), is the only interordinal grouping of marsupials with strong support from DNA sequence data (see Phillips et al., 2001; Amrine-Madsen et al., 2003; Nilsson et al., 2003). Our present focus is on determining the interrelations of the three major australidelphian radiations. The possibilities include:

Bandicoots plus diprotodontians (BDi). This grouping is consistent with Syndactyla (Jones, 1923–1925), originally diagnosed by the possession of syndactylous hind feet, in which the 2nd and 3rd digits are reduced, equal in length, and fused (or at least held together basally within a common integument) (see Szalay, 1982; Hall, 1987). Retinal morphology (Chase and Graydon, 1990), upper molars with well-defined dilambdodonty (Marshall et al., 1990), and intracranial extension of the squamosal (see Szalay, 1993) also support Syndactyla. Cladistic analyses of morphological data matrices unanimously favor this relationship (Springer et al., 1997b; Lou et al., 2003; Horovitz and Sánchez-Villagra, 2003). Support for Syndactyla among molecular data, on the other hand, has only been claimed for analyses of the nuclear genes, phosphoglycerate kinase (Pgk-1; Colgan, 1999) and recombination activating gene 1 (RAG1, Baker et al., 2004). However, resolution was poor in both studies and in the former case the bandicoots grouped with the dasyuromorphian, Myrmecobius, in some analyses.

Dasyuromorphians plus diprotodontians (DaDi). This grouping is consistent with Eometatheria, in which bandicoots are excluded from the remainder of Australidelphia. Using DNA hybridization data, Kirsch et al. (1997) argued forcefully for this grouping. Although morphology mostly offers contradiction, Asher et al. (2004) consider the continuous facets of the lower ankle joint of eometatherians to be more derived than either the separate facets of didelphids/caenolestids or the intermediate condition in bandicoots. The strongest support for Eometatheria has come from mitochondrial (mt) protein-coding and RNA genes (see Krajewski et al., 1997; Burk et al., 1999). Signal for Eometatheria from concatenated mt genes was strong enough in the combined analyses of Springer et al. (1998) and Asher et al. (2004) to overwhelm largely contradictory signals (see BDi above and BDa below) from the interphotoreceptor retinoid binding protein (IRBP) gene in the former study and from three nuclear genes (IRBP, protamine P1, and Pgk-1) plus morphological data in the latter study.

Bandicoots plus dasyuromorphians (BDa). A "bandicoot-dasyuromorphian" relationship has long been entertained by mammalogists (e.g., Thomas, 1888; Hughes, 1965). Potentially diagnostic characters for this grouping (see Horovitz and Sánchez-Villagra, 2003), such as the possession of three pairs of lower incisors, a broadened ectotympanic, and a teat count (5–8) intermediate between that of didelphids and diprotodontids, may be plesiomorphic for Australasian marsupials. Accordingly, morphological assessments within cladistic (e.g., Luckett, 1994), and "evolutionary inference" (e.g., Szalay and Sargis, 2001) frameworks have dismissed a bandicoot-dasyuromorphian relationship in favor of Syndactyla. However, with a few exceptions (Retief et al., 1995; Kirsch et al., 1997, Baker et al., 2004), studies of nuclear DNA, or nuclear-encoded proteins, have provided cautious support for a bandicoot-dasyuromorphian grouping. Examples come from serology (Kirsch, 1977), albumin microcomplement fixation (e.g., Baverstock et al., 1987), some DNA hybridization studies (e.g., Westerman, 1991), and IRBP sequences (Springer et al., 1997a; Jansa and Voss, 2000). The most convincing support for grouping dasyuromorphians and bandicoots to the exclusion of diprotodontian comes from the Bayesian inference analysis of Amrine-Madsen et al. (2003) on a concatenation of five nuclear genes:IRBP, von Willebrand factor (vWF), apolipoprotein B (APOB), RAG1, and breast and ovarian cancer susceptibility gene 1 (BRCA1). However, Bayesian posterior probabilities do not adequately account for stochastic error (see Suzuki et al., 2002; Douady et al., 2003; Simmons et al., 2004). Hence, it is notable that neither Eometatheria nor Syndactyla could be rejected even at P ≤ 0.50 in their maximum likelihood (ML) analysis.

A recent study by Nilsson et al. (2004) in which a number of new marsupial mt genomes are presented deserves attention. Their ML results favored a bandicoot-dasyuromorphian relationship, though the maximum parsimony (MP) and distance analyses instead favored a close relationship between bandicoots and diprotodontians. The statistical significance of the ML results are difficult to gauge because full bootstrapping was not used to explore sampling effects and significance testing compared alternative trees of interest, rather than the alternative trees with the highest likelihoods.

It is possible that relationships among the major groups have been blurred by additional stochastic error and conflicting signals associated with the inclusion of Dromiciops and Notoryctes as isolated taxa for which long branches cannot be broken. In order to explore this, we run analyses with and without their inclusion. By implication, the primary hypotheses being tested, bandicoots plus diprotodontians (BDi), dasyuromorphians plus diprotodontians (DaDi), and dasyuromorphians plus bandicoots (BDa) define relationships irrespective of the placements of Dromiciops and Notoryctes. Hence, BDi and DaDi are respectively consistent with, but not equivalent to, Syndactyla (which typically includes Notoryctes, e.g., Szalay, 1993) and Eometatheria (which includes both Dromiciops and Notoryctes).

With the intention of resolving the phylogenetic stalemate on the interrelations of the three major Australasian marsupial groups, we have sequenced three new marsupial mt genomes. These are from a spiny bandicoot (Echymipera rufescens) and two dasyuromorphians, a fat-tailed dunnart (Sminthopsis crassicaudata) and a northern quoll (Dasyurus hallucatus), and are analyzed alongside previously sequenced marsupial mt genomes and a concatenated nuclear protein-coding data set (IRBP, vWF, APOB, RAG1, and BRCA1).

Inferring Marsupial Phylogeny from Mitochondrial Genome Sequences
Phylogenetic analyses of concatenated nuclear gene sequences from placental mammals (e.g., Madsen et al., 2001; Murphy et al., 2001) have, by comparison with mitochondrial efforts (e.g., Lin et al., 2002b; Arnason et al., 2002), cast doubt on the feasibility of reconstructing mammalian interordinal phylogeny from mt genomes. Critical examinations (e.g., Curole and Kocher, 1999; Springer et al., 2001; Corneli, 2002; Schmitz et al., 2002b; Phillips and Penny, 2003) have pointed to saturation-related lack of resolution and nucleotide composition bias as the most serious problems facing mitogenomic phylogenetics. However, an earlier study of marsupial mt genomes (Phillips et al., 2001) showed that phylogenetic signal erosion and composition bias are particularly pronounced at protein 3rd codon positions. This appears to be true of metazoan mt genomes in general and exclusion of 3rd codon positions has increased phylogenetic resolution in numerous studies (e.g., Springer et al., 1999; Lin et al., 2002a; Schwartz et al., 2004). Another approach is to recode purines (A, G) as R and pyrimidines (C, T) as Y. The benefits of RY-coding are twofold. Firstly, it excludes the major sources of mammalian mt genome compositional differences, which relate to biases among purine and particularly among pyrimidine transitions (Gibson et al., 2005). Secondly, RY-coding reduces phylogenetic signal erosion at deep levels by hiding transitions, which saturate rapidly relative to tranversions in mt data particularly (see Phillips and Penny, 2003). In light of these observations, we examine whether excluding or recoding putatively biased data is warranted, given the information loss this incurs, and whether the mt and nuclear data should be analyzed together or separately.

A major aim is to assess the potential for compositional nonstationarity to result in an incorrect topology being inferred. Stochastic tests such as the compositional {chi}2 test are commonly used for this purpose (e.g., Amrine and Springer, 1999; Negrisolo et al., 2004). As explained in Phillips et al. (2004), these tests only indicate the presence of compositional heterogeneity beyond that expected under a specified level of sampling error and are not truly comparable between data sets because their statistical power depends on factors such as the number of variable sites. Others (e.g., Ho and Jermiin, 2004) have articulated the need instead for simple methods for inferring whether any bias is in fact sufficient to mislead tree reconstruction. Herein we build upon previous work by Lockhart et al. (1994) and Phillips and Penny (2003) to use minimum evolution (ME) trees derived from pairwise base-frequency differences among taxa to assess the topological nature and magnitude of compositional bias. Comparing the resulting ME scores with those derived from standard (absolute, uncorrected) distances allows us to infer the potential for signal related to composition bias to overcome the "historical signal" at a particular node and so result in incorrect tree reconstruction.

We argue that composition bias is sufficient to mislead tree reconstruction for both the mt protein-coding and RNA data sets. However, these data sets pass the base-frequency versus standard (BF/S) distance test after a combination of excluding and recoding (Y and RY-coding) the partitions that are the major sources of compositional heterogeneity. We subsequently argue that combining such "reduced-bias" mitochondrial sequences and nuclear sequences is desirable for inferring marsupial phylogeny. In addition, we briefly discuss the implications of finding a bandicoot-dasyuromorphian relationship on our understanding of the initial diversification of Australasian marsupials.


    Material and Methods
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Conclusions and Further Work
 References
 
Mitochondrial Genome Sequencing
Total genomic DNA was extracted from liver tissue for all three marsupials using the High Pure PCR Template Preparation Kit from Roche Diagnostics. Mitochondrial DNA was amplified in fragments longer than 5 kb, using the Expand Long Template PCR Kit (Roche Diagnostics). Long fragments were nebulized and subcloned using the TOPO Shotgun Subcloning Kit from Invitrogen. Clones with inserts greater than 800 bp were sequenced using universal forward and reverse primers and standard protocols. The reactions were run on an ABI 3730 capillary machine at the AWC Genome Sequencing Centre, Massey University, Palmerston North. We used Sequencher (Gene Codes Corporation) for assembly and editing and for cloned sequences we used consensus sequence from at least three clones. Gaps were covered by direct sequencing of short PCR products generated from the original long templates, using primers from our vertebrate database.

The complete mitochondrial genomes of the new marsupials are available on GenBank: northern quoll (Dasyurus hallucatus, AY95973), fat-tailed dunnart (Sminthopsiscrassicaudata, AY95974), and spiny bandicoot (Echymipera rufescens, AY95975). All the new genomes exhibit the gene order and other features typical for marsupials (see Janke et al., 1997; Nilsson et al., 2003); a nonfunctional tRNALys, the use of the anticodon GCC for tRNAAsp, and an unusually long sequence of 82 to 87 bp between tRNATrp and tRNAAsn, which contains the sequence for the origin of replication for the light (L) strand. Two of the genomes contain repeat sequences in the control/noncoding region: the northern quoll sequence has around 170 bp of dimeric repeats (gt interspersed with at in an apparently random fashion); the dunnart has 29 copies of a 12bp repeat (tgcgtgtgtgta).

Data Matrices
We analyzed the mitochondrial rRNA, tRNA, and 12 H-strand encoded protein-coding genes from 14 marsupials. Only taxa for which both mt and nuclear sequences are available were used, to ensure both the combinability of these mt and nuclear sequences for the BF/S distance comparisons and the comparability of these data sets for signal retention and base composition heterogeneity metrics. Mitochondrial phylogenetic studies typically exclude NADH6 (e.g., Janke et al., 1996; Reyes et al., 2004) because of its L-strand encoding, which for substitution models disrupts the relationship between strand-specific mutational pressures and the selection pressures on the protein sequence, relative to the H-strand proteins.

Alongside the three new mitochondrial sequences, we have included previously published marsupial mt sequences from a wallaroo (Macropus robustus), brushtail possum (Trichosurus vulpecula), common wombat (Vombatus ursinus), northern brown bandicoot (Isoodon macrourus), phascogale (Phascogale tapoatafa), marsupial mole (Notoryctes typhlops), monito del monte (Dromicops gliroides), Chilean shrew opossum (Rhyncholestes raphanurus), silky shrew opossum (Caenolestes fuliginosus), Virginia Opossum (Didelphis virginiana), and gray short-tailed opossum (Monodelphis domestica). Employing the latter four "Ameridelphian" taxa as the outgroup better serves the aim of inferring relations among the australidelphians than does inclusion of additional placental and monotreme mammals (which diverged from the ingroup twice as long ago; see Nilsson et al., 2003). Inclusion of nonmarsupials substantially decreased the number of alignable sites. Furthermore, including a marsupials-only outgroup appears to be in the interests of modeling substitution among the ingroup. ML estimates of model parameters are similar when obtained for the the ingroup alone or for the ingroup plus marsupial outgroups, but differ considerably when placentals or monotremes are included (particularly for the rate matrix and rate heterogeneity across sites).

As for the original presentation of the nuclear genes (Amrine-Madsen et al., 2003) and in order to be fully compatible with the mt data, some combined sequences are chimeric. For example, the kangaroo subfamily Macropodinae is represented by both Macropus(mtDNA, vWF, BRCA1) and Dendrolagus (APOB, RAG1, IRBP). GenBank accession details for the mt genomes and nuclear sequences are provided in Table 1.


View this table:
[in this window]
[in a new window]

 
Table 1 GenBank accession numbers. Five combined sequences (Didelphinae, Sminthopsinae, Peramelinae, Phalangeridae, and Macropodinae) are chimeric, using sequences from related genera.

 
Sequences were aligned manually within Se-Al v1.0a1 (Rambaut, 1996). With ambiguous sites excluded, the 12 mt protein-coding genes contribute 10,686 sites, the mt RNA genes contribute 3313 sites, and the five nuclear protein-coding genes contribute 5634 sites, for a combined total of 19,633 sites. The data matrix is available online at http://www.systematicbiology.org. We designate the DNA sequences on the basis of source: Nuc, Mt-ptn, Mt-RNA, and simply Mt for the combined mitochondrial data.

The analyses were performed without gapped-sites, leaving 17,936 sites for a 12-taxon data set and 17,864 sites for a 14-taxon data set, which respectively exclude and include the two phylogenetically isolated taxa (Dromiciops and Notoryctes). So, for example, the 12-taxon mitochondrial RNA data set is designated Mt-RNA(12T), whereas the 14-taxon combined mitochondrial and nuclear data set for all positions is MtNuc(14T). We focus on the nucleotide sequence primarily because so much of the signal (particularly in the nuclear data) occurs as synonymous changes. Furthermore, available models for amino acid evolution (for use in ML significance testing) are based on empirical studies (e.g., Adachi and Hasegawa, 1996) primarily of nonmarsupials, whereas analysis of nucleotides allows for more realistic models of sequence evolution to be inferred from the marsupial data.

Examination of Phylogenetic Signal
The extent of substitution saturation is usually identified with saturation plots, in which ML (corrected) branch-lengths are plotted against uncorrected branch-lengths (or distances). However, metrics for comparing saturation plots are not well developed and their interpretation is questionable (Yang, 1998; Baker et al., 2001). In order to overcome some of the shortcomings of saturation plots and further explore phylogenetic signal retention/erosion, we use a stemminess metric. In addition, we use relative composition variability (RCV; Phillips and Penny 2003) as a measure of the magnitude (as opposed simply to presence) of compositional nonstationarity and provide a simple test (using base-frequency difference trees) of the potential for this to mislead tree reconstruction. These metrics and tests are described below.

Stemminess. Fiala and Sokal (1985) defined stemminess as the proportion of overall tree-length that internal branches contribute. As for Phillips et al. (2001), we employ this definition on minimum evolution (ME) trees derived from uncorrected distances. Stemminess is useful for comparing phylogenetic signal erosion between partitions, because (given the same topology) greater erosion results in shorter internal branches relative to their attendant external branches (hence, lower stemminess).

Relative composition variability. (RCV; Phillips and Penny, 2003) RCV is the average variability in composition between taxa; for nucleotides this is:


Formula

Ai, Ti, Ci, and Gi are the frequencies of each nucleotide for the ith taxon. A*, T*, C*, and G* are averages across the n taxa, and t is the number of sites.

Compositional nonstationarity was also examined with {chi}2 tests, performed within PAUP* 4.0b10 (Swofford, 2002). Parsimony-uninformative sites cannot directly contribute to phylogenetic biases and effectively dilute compositional heterogeneity and were excluded from both chi-square and RCV calculations.

Base-frequency (BF) distance trees. ME trees were constructed from a matrix of pairwise base-frequency distances to examine composition bias with respect to specific nodes and assess the potential for this to affect phylogenetic inference. The basic idea is to compare tree-length differences between alternative topologies for trees reconstructed from both standard (absolute, uncorrected) distances and base-frequency differences. In this way, Phillips and Penny (2003) showed that a bias among pyrimidines (C versus T) could account for incorrect rooting of the mammal tree from mt protein-coding genes. Here we extend this methodology to incorporate overall nucleotide composition bias. Base-frequency distances are half the sum of absolute frequency differences between taxon pairs for each nucleotide category. So the pairwise base-frequency (BF) distance between taxa i and j is:


Formula

Ai, Ti, Ci, and Gi, and Aj, Tj, Cjand Gj are the frequencies of each nucleotide for the ith and jth taxa, respectively. Dividing by two is necessary for ME trees on BF distances to be comparable with ME trees on standard distances. This is because a substitution at a site in taxon i that previously had the same base as for taxon j will result in one unit of standard distance, but two units of base-frequency distance. BF distance matrices were calculated within Microsoft Excel, using base frequency data from PAUP*. Parsimony-uninformative characters were excluded from base-frequency distance calculations, as these cannot explain ME differences between standard distance trees. For all of the ME trees, any negative branch-lengths were treated as absolute values for computing tree-length.

Groupings resolved (in agreement with prior molecular and morphological studies) with maximum 100% BPP, ML, and MP support for all data sets and not being tested for composition bias were fixed for the base-frequency versus standard (BF/S) distance comparisons. These constraints include Didelphidae, Caenolestidae, Peramelidae, Dasyuridae, and Dasyurus plus Phascogale and ensure that the branching patterns adjacent to nodes being tested for composition bias are the same as for our primary phylogenetic analyses. Unless otherwise stated, we focus on the results of our primary (12-taxon) data set, for which artifacts owing to the inclusion of the phylogenetically isolated taxa (Dromiciops and Notoryctes) can be discounted. However, in Figure 1 we see that the BF distance tree for MtNuc(14T) shows that Dromiciops and Notoryctes group closely with diprotodontians and dasyurids with respect to base composition, whereas the bandicoots are similar to the "ameridelphian" didelphids and caenolestids.


Figure 1
View larger version (49K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 MtNuc(14T) base frequency (BF) distance tree. Compositional bias favors Eometatheria (including a dasyuromorphian plus diprotodontian association), which is separated from bandicoots, caenolestids, and didelphids by the longest branch on the tree.

 
Using Base-Frequency Distance Trees to Infer the Potential Influence of Composition Bias
Composition {chi}2 testing (Table 2) reveals highly significant (P < 0.0001) base-frequency heterogeneity among each of the Mt-ptn, Mt-RNA, and Nuc data sets under standard (nt) coding. This only tells of the presence of compositional heterogeneity. BF/S distance comparisons allow us to infer the topological nature and the magnitude of composition bias affecting specific hypotheses—in the present case, BDa versus BDi versus DaDi.


View this table:
[in this window]
[in a new window]

 
Table 2 Statistics indicating phylogenetic signal retention (stemminess) and compositional heterogeneity (composition {chi}2 test P-values and relative composition variability [RCV]) for the 12-taxon data sets. These are shown in a for the original nuclear (Nuc) and mitochondrial (Mt-ptn, Mt-RNA) data sets, which have standard nucleotide coding. In b, sources of potentially misleading composition bias (indicated by BF/S distance comparisons, see Table 3) are excluded, such that RB-Mt-RNA is RY-coded and for RB-Mt-ptn, 3rd codon positions are excluded, and 1st codon positions are Y-coded.

 
In order for composition bias to result in an incorrect topology being reconstructed on standard distances (or other methods), we generally expect that the BF distance tree also favors that putatively incorrect topology (condition 1). In addition, support for that topology on BF distance would have to be of greater magnitude than support on standard distance (condition 2). This second condition arises because at least a small proportion of the BF signal will be attributable to sites that are uninformative with respect to the node of interest, even though sites that are uninformative for the tree as a whole are excluded. In Table 3, condition 1 is met where both the standard tree and BF tree favor the same topology and condition 2 is met where the BF/S ratio is > 1.00. The BF/S ratio is the ME score difference between the favored (f) and alternative (a) trees on BF distance, divided by the ME score difference between these trees on standard (absolute, uncorrected) distances:


Formula


View this table:
[in this window]
[in a new window]

 
Table 3 Potential influence of composition bias among the 12-taxon nuclear (a), mt protein (b–d), and mt RNA (e–f) data sets on tree selection between either bandicoot-diprotodontian (BDi), dasyuromorphian-diprotodontian (DaDi), or bandicoots-dasyuromorphian (BDa) associations. Which of these trees' minimum evolution (ME) favors is shown for both standard, uncorrected distances (Std. tree) and base-frequency distances (BF tree)? When both favor the same topology, if the ME difference between that topology (f) and an alternative (a) is greater on BF distances than on Std. distances (i.e., the BF/S ratio = ME(BF)f-a/ME(Std.)f-a. > 1.00), then compositional heterogeneity alone may be sufficient to mislead tree selection. As an example, for data set b (nt-coded Mt-ptn), comparisons of the BF tree (tree-f) with both alternatives (BDa and BDi) produce BF/S ratios > 1.00, so we recommend further bias reduction. Inclusion (coding) indicates the data included, for which numbers refer to codon positions, and whether the data is standard (nt), Y-coded (Y), or RY-coded (RY).

 
In the present context, the BF/S ratio is relevant only when condition 1 is met. In Table 3 we see that base-frequency trees favor DaDi for each of the nt-coded data sets. Standard distance supports a conflicting topology for the nuclear data set. Hence, we can be confident that the topology favored (BDa) on standard distance is not an artifact of the base-frequency bias (which favors DaDi). In contrast, we need to be aware that for both of the full nt-coded mitochondrial data sets, Mt-ptn (b) and Mt-RNA (e), the inferred interrelations of the major Australasian orders may be an artifact of composition bias. In each case DaDi is favored by both BF and standard distances (meeting condition 1) and the BF/S ratio is well above 1.00 in comparing the ME score for this topology with that of at least one of the alternatives (meeting condition 2).

Among marsupial (Phillips et al. 2001) and other mammalian (e.g., Gibson et al., 2005) mt genomes, base-frequency biases are most pronounced among pyrimidines, then among purines, and decrease from 3rd to 1st to 2nd positions for protein-coding sequences. Hence, as a strategy for reducing bias among Mt-ptn and Mt-RNA, we consider in successive steps, Y-coding (C, T lumped as Y) as advocated by Gibson et al. (2005), RY-coding (C, T lumped as Y and A, G lumped as R), and data exclusion. The procedure continues until either condition 1 or 2 is no longer met.

In Table 3c we see that even with 3rd codon positions excluded from Mt-ptn, the base-frequency bias may be strong enough to result in the incorrect tree being inferred on standard distances. Here, ME(BF)DaDi - BDa and ME(BF)DaDi - BDi are similar, 82.93 and 101.09, respectively. However, it is differences on standard distances that account for the disparity in BF/S ratios between DaDi and the two alternatives. ME(Std.) for DaDi is only lower than for BDa by 0.01988 (giving a BF/S ratio of 4171.53), whereas ME(Std.)DaDi - BDi is 31.97 (giving a BF/S ratio of 3.16).

In addition to excluding 3rd codon positions, to be confident that phylogenetic analysis of the Mt-ptn data do not simply reflect base-frequency biases, it was also necessary to Y-code the 1st codon positions (Table 3d). Furthermore, Table 3(f), (g) shows that RY-coding is warranted for Mt-RNA as purine transitions provide a strong enough composition bias signal to result in an incorrect topology being inferred. Although both BF and standard distances favor BDi when the RNA data are RY-coded, condition 2 is not met because the BF/S ratios are < 1.00. These reduced-bias data sets are henceforth referred to as RB-Mt-ptn and RB-Mt-RNA. The coding of the nine data sets used for phylogenetic analysis is described in Table 4.


View this table:
[in this window]
[in a new window]

 
Table 4 Partition inclusion and coding regimes for the data sets used in phylogenetic analyses. Bayesian inference and ML significance testing models are partitioned, ML bootstrap analyses are not. Standard (AGCT) coding is assumed unless otherwise stated. Y–coding lumps pyrimidines (C, T) and RY–coding lumps purines (A, G) and pyrimidines (C, T).

 
Phylogenetic Analysis
Following the examination of saturation and base composition, we inferred the marsupial phylogeny from the original and reduced-bias data sets. Maximum parsimony (MP), minimum evolution (ME), and maximum likelihood (ML) analyses were performed within PAUP*4.0b10. Bootstrapping (1000 replicates) applied heuristic searches from random starting trees. Substitution was modeled for the ML bootstrap analyses using GTR+I+{Gamma}4, the model category favored by AIC scores derived for each of the concatenated (nonpartitioned) data sets using ModelTest 3.06 (Posada and Crandall, 1998). ModelTest parameter estimates (which are based on an NJ tree) were used in heuristic searches to find an ML tree on which parameters were reestimated for ML bootstrapping. Bayesian inference (MrBayes 3.0b4; Huelsenbeck and Ronquist 2001) also employed GTR+I+{Gamma}4 models, except for the RY-coded partitions (F81+I+{Gamma}4), which only have one substitution type. Four chains were run for 3,000,000 (6,000,000 for 14-taxa) generations with trees being sampled every 200 generations. The burn-in for each MrBayes run was set at 500,000 (2,000,000 for 14-taxa) generations, after having checked for convergence and examined estimated sample sizes for the parameter estimates (using Tracer v1.0 Rambaut and Drummond, 2003).

Additional MP analyses were performed within PAUP* for examining congruence between the Mt and Nuc data sets by partition homogeneity testing (ILD test of Farris et al., 1994). Each partition homogeneity test employed 1000 heuristic replicates.

MP, ML, and Bayesian inference on each of the 12- and 14-taxon MtNuc and RB-MtNuc data sets resolve for all but between one and five nodes with 100% bootstrap (and Bayesian posterior probability, BPP) support. Support among alternative topologies was further examined with KH (Kishino and Hasegawa, 1989) and approximately unbiased (AU; Shimodaira, 2002) tests, using the RELL method (100,000 replications) within CONSEL (Shimodaira and Hasegawa, 2001). The AU test is related to the SH test (Shimodaira and Hasegawa, 1999) and has been developed in order to overcome tree selection biases that affect the latter test, particularly when many topologies are being simultaneously compared. The ML significance tests were applied to the separate and combined RB-Mt and Nuc data sets, with all parameters ML-optimized for each tree hypothesis.

For ML significance testing (as for Bayesian inference) we partitioned on the basis of source (mt, nuclear) as well as codon position (1, 2, 3) for proteins and secondary structure (stems, loops) for RNA. The number of (fully independent) partitions and their coding is shown in Table 4 for each data set. Because all of the sites are not evolving under the same processes, partitioning the data for ML analysis allows for more accurate modeling (e.g., Yang, 1996; Caterino et al., 2001). In these analyses, standard nt-coded partitions were modeled under GTR+I+{Gamma}4, whereas Y-coded and RY-coded partitions were modeled under 3 (A-G, A-Y, G-Y) and 1 (R-Y) substitution-type subsets of GTR+I+{Gamma}4, respectively.

The benefits of partitioning into more homogenously evolving subsets can be offset by estimating parameters from fewer sites, reducing the signal to variance ratio (DeBry, 1999; Pupko et al., 2002). For RB-MtNuc(14T), the additional 192 branch-length and substitution process parameters that partitioning incurs is easily justified under the Akaike Information Criterion (AIC; –2lnL + 2 x no. of free parameters) and improves –lnL by 2096.02. This translates into a likelihood "partitioning advantage" (the difference between partitioned and unpartitioned –lnL, divided by the unpartitioned –lnL) of 4.1%. This increases to 7.3% for MtNuc(14T).


    Results
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Conclusions and Further Work
 References
 
Preliminary Data Examination
The potential for base frequency heterogeneity to influence tree selection depends on both the magnitude of the composition bias and the degree to which substitution saturation has eroded "historical" phylogenetic signal. We have employed relative composition variability (RCV) as a measure of the magnitude of composition bias and stemminess as a measure of phylogenetic signal retention. These measures (see Table 2a) help to explain why Nuc appears (from the BF/S distance comparisons) to be less susceptible to composition artifacts than the nt-coded mt data sets. Nuc has a lower RCV and more than double the stemminess of either Mt-ptn or Mt-RNA.

In Table 2b we see that excluding 3rd codon positions, Y-coding 1st positions, and RY-coding RNA results in substantial increases in stemminess and decreases in RCV for both bias-reduced mt data sets. Indeed, for RCV/stemminess, RB-Mt-ptn (0.2467) and RB-Mt-RNA (0.2523) are almost as low as for Nuc (0.2276). This combined factor proxy for the potential influence of composition bias on tree selection is considerably higher for original Mt-ptn (1.448) and Mt-RNA (0.5403).

Incongruence testing provides further evidence for the bias reduction strategy enhancing the fidelity of the mt data. ILD testing with the 12-taxon data sets find the original Mt data to be incongruent with Nuc (P = 0.007). Given the dominance of the Mt data in terms of informative sites in this analysis, the significance levels may even be conservative (see Dowton and Austin, 2002). In contrast, the ILD test indicates that the RB-Mt and Nuc data are fully congruent (P = 1.000).

The situation is less clear for the 14-taxon data sets. The significance level at which congruence between RB-Mt and Nuc remains high (P = 0.609), but is similar to that for Mt and Nuc (P = 0.728). The latter result is curious because the inclusion of Dromiciops and Notoryctes in fact tends to increase support both for DaDi from Mt (not shown) and for BDa from Nuc (as will be seen later). It is likely that the 14-taxon partition homogeneity results are explained by the additional instability or "noise" these taxa lend the trees, which reduces the power of the ILD test to reject congruence.

Phylogenetic Reconstruction
The primary (12-taxon combined mt and nuclear) data sets for examining the interrelations of the Australasian marsupial orders resolve seven of the nine groupings with maximum BPP and bootstrap support (Fig. 2). Each of these, Didelphidae, Caenolestidae, Australidelphia, Diprotodontia, Peramelidae, and Phascogalinae plus Dasyurinae, is well corroborated in previous studies (e.g., Amrine-Madsen et al., 2003; Cardillo et al., 2004). The influence of the efforts to reduce composition bias is most prominent in resolving groupings that have been more elusive. ML and MP bootstrap support for a macropodid plus phalangerid grouping (Phalangerifomes) increases from 84% and 85%, respectively, for MtNuc to 97% and 100% for RB-MtNuc. The most profound impact of the bias reduction is a shift in support for DaDi (MP = 87%, BPP = 0.95; Fig. 2a) to support for BDa (MP = 99%, BPP = 1.00; Fig. 2b). Notably, ML bootstrap support favors BDa even for MtNuc at 60%, though this rises to 93% for RB-MtNuc.


Figure 2
View larger version (44K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2 Marsupial phylogenies inferred from the MtNuc(12T) original (a) and reduced-bias (b) data sets. Bootstrap supports are shown for ML (top) and MP (middle), with BPP values (bottom). Values are not shown for nodes that receive maximum support in all analyses.

 
The results for the individual data sets are teased apart in Table 5. DaDi is favored in analyses of both of the original Mt-ptn and Mt-RNA data sets. However, as a consequence of the bias reduction, ML and BPP support for BDa increase from 15% and 0.00 to 79% and 1.00, respectively, for the mt proteins and from 14% and 0.00 to 52% and 0.55 for the RNA. MP analyses tell essentially the same story (Appendix 1). The upshot of excluding and recoding data on the basis of potential for tree selection artifacts resulting from composition bias is that each of the 12-taxon Nuc, RB-Mt-ptn, and RB-Mt-RNA data sets favor the same tree.


View this table:
[in this window]
[in a new window]

 
Table 5 Maximum-likelihood bootstrap support (and Bayesian posterior probabilities) from the 12-taxon mitochondrial protein, mitochondrial RNA, and nuclear data sets for marsupial relationships, including for bandicoot-dasyuromorphian (BDa), bandicoot-diprotodontian (BDi), and dasyuromorphian-diprotodontian (DaDi) associations.

 
The 14-taxon RB-MtNuc (Fig. 3) and MtNuc (not shown) data sets favor the same relationships as for their respective 12-taxon counterparts (Fig. 2). More specifically, the bias-reduced and original data sets respectively favor BDa and DaDi among the major Australasian groups. However, ML and MP bootstrap support is much reduced in the 14-taxon cases. This may be misleading and was predicted earlier. That the low resolution results from the instability of Dromiciops and Notoryctes on the trees can be shown by constraining their placements in accordance with the tree in Figure 3. With Dromiciops and Notoryctes constrained respectively to fall outside and within the Australasian taxa, BDa (including or excluding Notoryctes) receives 98% and 99% ML and MP bootstrap support for RB-MtNuc(14T)—higher even than for the 12-taxon trees.


Figure 3
View larger version (49K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 3 Marsupial phylogeny inferred from the reduced-bias 14-taxon combined data set, RB-MtNuc(14T). Bootstrap supports are shown for ML (top) and MP (middle), with BPP values (bottom). These are not shown for nodes that receive maximum support in all analyses. All resolve for an monophyletic Australasian grouping.

 
Phylogenetic Hypothesis Testing
Our ML hypothesis testing analyses have two key advantages. Firstly, in contrast to the bootstrap analyses, ML models are partitioned on the basis of codon positions and RNA secondary structure, for which substitution processes differ markedly. Secondly, type 2 error rates associated with Kishino-Hasegawa (KH) and approximately unbiased (AU) hypothesis tests can provide a closer reflection of sampling error than do bootstrap values (Shimodaira and Hasegawa, 1999), which in turn are far more faithful than typically overconfident BPP values (Suzuki et al., 2002; Gontcharov et al., 2004).

Nine 12-taxon trees attracted ≥1% bootstrap (or BPP) support in at least one of the Nuc, Mt, or RB-Mt analyses (either MP, ML, or Bayesian). These include the nine combinations of the three possible sister groupings among the diprotodontians (Phalangeridae, Macropodinae, and Vombatus) with each of BDi, DaDi, and BDa. The results in Table 6a reveal that the same topology (as for Fig. 2b) is favored for each of the data sets RB-Mt(12T), Nuc(12T), and the combined RB-MtNuc(12T). Log likelihood differences from that best tree and significance (P) values are also given in Table 6. Although the KH and AU tests provide similar results, the AU P-values are expected to be more reliable (Shimodaira, 2002) and were usually more conservative when significance was borderline. Hence, unless otherwise stated, we refer to the AU test P-values.


View this table:
[in this window]
[in a new window]

 
Table 6 Log-likelihood differences between trees and their statistical significance under KH and AU tests for the reduced-bias mt (RB-Mt), nuclear (Nuc), and combined (RB-MtNuc) data sets. ML models are partitioned for proteins (by codon) and RNA (stems, loops). Twelve-taxon analyses (a. trees 1–9) compare affinities among the major Australasian orders and within Diprotodontia. Fourteen-taxon analyses (b: trees 10–15) compare australidelphian interordinal affinities. Each data set favors BDa (± Notoryctes). Among the 105 possibilities, the three highest –lnL trees are shown, as well as the best BDi and DaDi trees. P-values > 0.05 are bold. Taxon abbreviations: B, bandicoots (Peramelemorphia); Da, Dasyuromorphia; Di, Diprotodontia; Dr, Dromiciops; M, Macropodinae; P, Phalangeridae; N, Notoryctes; V, Vombatus.

 
Although each of the 12-taxon analyses favors both a bandicoot-dasyuromorphian (BDa) relationship as well as Phalangeriformes, alternatives to both of these cannot be rejected at P ≤ 0.05 for either the mitochondrial or nuclear data alone. Nonrejected alternatives are shown (in bold) in Table 6a for RB-Mt(12T) and Nuc(12T). DaDi is the strongest alternative to BDa for both data sets. Combining the mt and nuclear data provides clear statistical advantages. Individually, both data sets allow alternatives to Phalangeriformes and BDa to be rejected at P {approx} 0.075 and P {approx} 0.275, respectively. On the combined 12-taxon data, Phalangeriformes and BDa alternatives are rejected at P = 0.023 and P = 0.127, respectively.

Table 6b shows the three trees (10, 11, 12) with the highest likelihoods for the 14-taxon mt, nuclear, and combined analyses. Each is consistent with a bandicoot-dasyuromorphian association among the major groups and places Dromiciops outside of the Australasian radiation. Trees with the highest likelihoods that are consistent with BDi (tree 14) and DaDi (tree 15) are also shown. The best BDi tree, which is 20.9 –lnL units inferior to the favored BDa tree, is in fact a variant of Syndactyla, in this case with bandicoots and Notoryctes grouping together. Note that the best DaDi tree (21.2 –lnL units inferior to the favored BDa tree) is not a variant of Eometatheria (Dasyuromorphia, Diprotodontia, Dromiciops, and Notoryctes), which fares worse still, 22.0 –lnL units inferior to BDa.

The most striking effect of including Dromiciops and Notoryctes (Table 6b) is that –lnL support for BDa increases further, such that the BDi and DaDi alternatives can be strongly rejected with the combined mt and nuclear data, at P = 0.033 (tree 14) and P = 0.023 (tree 15). Alternatives to Phalangeriformes among the 14-taxon trees are also rejected at P < 0.05 (not shown). Of the monogeneric orders, Notoryctes is sister to bandicoots plus dasyuromorphians and Dromiciops is sister to the Australasian marsupials. However, as foreshadowed by the bootstrap results (Fig. 3), these placements are weakly supported. Indeed, among the separate data sets, RB-Mt(14T) instead places Notoryctes as sister to the dasyuromorphians, whereas Dromiciops groups with Diprotodontia in the Nuc(14T) analysis.

Clearly, the exact placements of Dromiciops and Notoryctes within Australidelphia must still be considered as uncertain. However, finding that bandicoots and dasyuromorphians associate together among the major orders is a significant result and has important implications for understanding the morphological evolution and initial diversification of Australasian marsupials.


    Discussion
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Conclusions and Further Work
 References
 
We have presented three new mt genomes, one for a bandicoot (Echymipera rufescens) as well as two for Dasyuromorphia (Dasyurus hallucatus and Sminthopsis crassicaudata). These data have helped us resolve for both a bandicoot-dasyuromorphian association among the major Australian marsupial orders and a grouping of kangaroos and possums (Phalangeriformes) among the diprotodontians. We have achieved this by differentiating between sources of compositional nonstationarity that provide either sufficient or insufficient signal to mislead tree reconstruction and by combining only the putatively unbiased mt and nuclear data.

Is Composition Bias Misleading Tree Reconstruction?
The qualitative similarity between MP, ML, and Bayesian inference for the shift in support from DaDi on the original data set (Fig. 2a) to BDa on the reduced-bias data set (Fig. 2b, see also Table 5) suggests a mechanism is at work that can similarly compromise MP and highly parametric (ML and Bayesian) approaches. Compositional nonstationarity fits this description (Galtier and Gouy, 1998; Foster, 2004) and is a usual suspect for phylogenetic bias among animal mtDNA studies. The nonhistorical signals this induces have been shown to mislead tree reconstruction for invertebrates (Jermiin and Crozier, 1994; Delsuc et al., 2003) and basal vertebrates (Naylor and Brown, 1998), through to birds (Paton et al., 2002) and mammals (Schmitz et al., 2002a; Phillips and Penny, 2003). Hence, our first step in assessing the reliability of the different DNA sequence data sets was to examine the potential for nucleotide composition bias to mislead tree reconstruction.

In large data sets, the reduced importance of stochastic error leads to compositional homogeneity being rejected by the composition {chi}2 test even when the signal for a bias is very small relative to historical signals (Phillips et al., 2004). In the present case, excluding data on the basis of {chi}2 testing rejecting homogeneity (P ≤ 0.05) would leave us without either the mitochondrial or nuclear data (see Table 2). Importantly, though, the base-frequency versus standard (BF/S) distance comparisons reveal that composition bias is only of sufficient magnitude to mislead phylogeny reconstruction if Mt-RNA transitions, Mt-ptn 3rd positions, and 1st position pyrimidine transitions are included. That is, where the BF distances favor the same tree as standard distances, but with substantially greater ME support on the BF distances than on standard distances (BF/S ratios > 1.00; see Table 3). The latter condition is necessitated by some of the BF signal being attributable to sites that are uninformative for the node of interest. Excluding these sources of compositional heterogeneity leaves reduced-bias mt data sets (RB-Mt-ptn and RB-Mt-RNA) from which to infer phylogeny alongside the nuclear sequences.

Presently the BF/S distance comparisons are not an explicit test of compositional stationarity. However, data sets could be bootstrapped to incorporate stochastic error. The presence/absence of highly supported nodes on BF distance trees run on the pseudoreplicates would provide a more powerful test of compositional homogeneity than does the {chi}2. This is because the former test is tree-based and factors out compositional correlation due to shared phylogenetic history. Moreover, BF/S distance comparisons gain considerable advantage in explanatory power over {chi}2 testing by indicating whether or not the magnitude of composition bias at specific nodes may be sufficient to account for the apparent phylogenetic signal.

If compositional heterogeneity is the major source of nonhistorical signal in the data and enough sites are sampled to overcome stochastic error, then we would expect BF/S distance comparisons and BF distance trees to predict patterns of incongruence and clade support between partitions. This expectation is borne out remarkably well. ILD test congruence between Nuc(12T) and RB-Mt(12T), but incongruence (P = 0.007) between Nuc(12T) and Mt(12T) is consistent with the BF/S distance comparisons, indicating that composition bias is sufficient to potentially mislead tree reconstruction only for the original Mt-ptn and Mt-RNA data. More specifically, the two differences in the MtNuc tree (Fig. 2a) from the RB-MtNuc tree (Fig. 2b) are the DaDi grouping and reduced support for Phalangeriformes (concurrent with increased support for a kangaroo plus wombat grouping). The BF distance tree in Figure 1 shows that both results are consistent with support from composition bias.

It is particularly interesting that for RB-MtNuc, the inclusion of Dromiciops and Notoryctes increases the –lnL advantage of BDa over DaDi from 12.8 to 21.2 (Table 6). This is unlikely to be due to the phylogenetically isolated taxa effectively reducing stochastic error for inferring the interrelations of the major orders. Despite the –lnL advantage of BDa over BDi also increasing slightly (from 18.7 to 20.9) for the 14-taxon analysis, stochastic effects are in fact greater, such that the AU-test P-value for rejecting BDi actually increases (from 0.014 to 0.037). Again, however, examination of BF distances can explain reduced support for DaDi relative to BDa.

Although the BF/S distance comparisons (Table 3) suggest that composition bias alone is not sufficient to result in incorrect tree selection for the reduced-bias data sets, levels of tree support may still be considerably affected by remnant compositional heterogeneity. ME (BF distance) analysis finds that this remnant heterogeneity favors DaDi in both RB-MtNuc(12T) and RB-MtNuc(14T). With 12 taxa, BDa and BDi require 154.0 and 153.0 changes more than DaDi. With the addition of Dromiciops and Notoryctes, this reduces to 17.3 and 9.3 changes, respectively. Confidence in BDa increases with this decrease in conflict from compositional bias and the significance level at which AU testing rejects DaDi falls from P = 0.127 to P = 0.028.

As for other studies of composition bias among mammalian mt genomes (e.g., Reyes et al., 1998; Gibson et al., 2005), it is among-pyrimidine bias that is the major source of the compositional signal for DaDi (see Fig. 1). High average thymine:cytosine ratios at informative sites among the nt-coded data that were excluded or recoded are found among the bandicoots (1.340) and outgroup "ameridelphians" (1.102), so these tend to attract relative to the lower thymine dasyuromophians (0.939) and diprotodontians (0.601). The strong compositional bias on these mt data also provides a likely explanation for earlier studies of one or a few mt genes (e.g., Krajewski et al., 1997; Burk et al., 1999) favoring Eometatheria (including DaDi). Furthermore, these mt data will have provided the majority of signal to the total evidence analysis of Asher et al. (2004) and the supertree of Cardillo et al. (2004), which again both favored Eometatheria.

The conclusion that the BF/S distance comparisons warrant the alterations made to the original mt data is further supported by the consequent increase in phylogenetic signal retention (high stemminess; Table 2) and congruence with the nuclear data. We also looked at the possibility of branch-length effects (e.g., long-branch attraction; see Felsenstein, 1978; Hendy and Penny, 1989) misleading tree reconstruction before considering phylogenetic analysis of the remaining RB-MtNuc to be reliable.

ML models that incorporate among-site rate heterogeneity are usually expected to perform far better than MP when rates vary considerably across the tree (Sullivan and Swofford, 2001). Hence, similar trends in bootstrap support under MP and ML implies that branch-length biases are not the source of signal for BDa. In fact, inspection of the unrooted RB-MtNuc ML tree (Fig. 4) shows that the "ameridelphian" and dasyurid branches are long, relative to the diprotodontian and bandicoot branches, such that long branch-length attraction would tend to favor a bandicoot-diprotodontian (BDi) association among the major groups. We provisionally consider RB-MtNuc to be unbiased for inferring the present marsupial tree. This is not to say that these data do not contain nonhistorical signals, but rather that finding a bandicoot-dasyuromorphian association may reflect historical signal but is not an artifact of either compositional nonstationarity or the distribution of branch-lengths.


Figure 4
View larger version (38K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 4 Maximum likelihood (GTR+I+{Gamma}4) tree for the 14-taxon reduced-bias mt and nuclear combined data set, RB-MtNuc(14T).

 
Combining Nuclear and Mitochondrial Data
The mt protein, mt RNA, and nuclear protein data sets are clearly different process partitions in the sense of Bull et al. (1993). Before even considering the empirical data on differences in substitution processes (e.g., Lin and Danforth, 2004), we know they differ in fundamental aspects of replication, exposure to and buffering of oxidative damage, error correction, and mode of inheritance (see Ballard and Whitlock, 2004). Given the differences in evolutionary processes, some proponents of taxonomic congruence (e.g., Miyamoto and Fitch, 1995) would encourage separate analysis of these partitions. However, the primary arguments for separate analysis are to aid in identifying incongruence (Chen et al., 2003) and to account for differences in substitution models (Buckley et al., 2002). Neither argument applies in the present situation. Firstly, the nuclear and reduced-bias mt data sets display high ILD-test congruence and, secondly, in our combined analysis we have allowed ML and Bayesian models to be optimized for each partition. The important question here is whether combining RB-Mt with Nuc reduces the overall influence of sampling error and increases our confidence in the tree or, alternatively, whether including the "noisier" (lower stemminess; Table 2) mt data effectively pollutes the signal from the nuclear data.

AU significance testing reveals that combining the mt and nuclear data does increase the prominence of apparent phylogenetic signal relative to the influence of sampling error. Alternatives to Phalangeriformes (Table 6a) and to BDa (even for the less composition-biased 14-taxon analysis; Table 6b) can only be rejected at P = 0.079 and P = 0.051, respectively, with the nuclear sequences alone. These P-values fall to 0.023 and 0.033 with the inclusion of the mt data. Such increases in resolution upon combining data that are not significantly incongruent are essentially what has been sought by advocates of conditional data concatenation (e.g., Bull et al., 1993; Huelsenbeck et al., 1996).

The present study represents a partial unification of the ideals of both total evidence (Kluge, 1989) and taxonomic congruence (Mickevich, 1978), with a number of major concerns of both approaches being met. Even among advocates of total evidence there is room for excluding possible heritable characters from phylogenetic inference. Kluge (1989), for example, wrote that "including all relevant evidence can be seen as a harmless activity, unless one is prepared to argue a priori that certain evidence will confound the analysis and must therefore be eliminated." To this end we have inferred the marsupial phylogeny from all the putatively unbiased evidence from among the mt and nuclear data. With respect to taxonomic congruence, incongruence among partitions has been explained and its source excluded, whereas partitioned ML allowed differing patterns of substitution between the Nuc and Mt-ptn codon positions and Mt-RNA loops and stems to be modeled separately. Combined, partitioned analysis also avoids many of the vagaries of weighting (or not) separately analyzed partitions (Hillis, 1987; and see Levasseur and Lapointe, 2001).

Our method only differs from conditional data concatenation in our emphasis on partitioning the ML analysis. A further amendment worth consideration is whether or not to include data that are putatively unbiased and congruent with other partitions, but retain so little signal relative to "noise" that they only serve to increase variance and so reduce the power to reject hypotheses. A number of possible solutions to such data quality versus quantity problems have been proposed (e.g., Mindell and Thacker, 1996; Cunningham, 1997). In practice, though, and certainly in the present case, such a decision will often not be required with our strategy of only including putatively unbiased data. This is because BF/S distance comparisons will often identify noisy data as being potentially misleading for tree reconstruction, because composition bias tends to be accentuated when saturation has eroded much of the ancestral signal. Hence, the partition would be excluded before concerns are raised over the affect of saturation on statistical power.

Inference on the Evolution of the Major Radiations of Australasian Marsupials
We can now be confident that among the three major lineages of Australasian marsupials, it is the primarily herbivorous diprotodontians that diverged first, with the semifossorial/omnivorous bandicoots and insectivorous/carnivorous dasyuromorphians diverging from each other shortly after. This phylogeny can be used to guide future efforts for reconstructing the ecological niche and morphology of the last common ancestor (LCA) of these groups and for determining which traits were retained in the earliest stem diprotodontians, dasyuromorphians, and bandicoots.

Presently, though, we warn of the possibility of overinterpretation. Absence of, or incorrect resolution in rooting the Australasian marsupials in morphological cladistic analyses (e.g., Springer et al., 1997a; Horovitz and Sánchez-Villagra 2003), implies either the presence of major sources of nonhistorical signal, or that few synapomorphies of the BDa stem lineage have been retained. Hence, it may be difficult to accurately track transformations in character-mapping studies. Indeed, given the relative shortness of those internal branch-lengths on the RB-MtNuc(14T)phylogeny (Fig. 4), the morphology of the LCA of bandicoots and dasyuromorphians may have changed little from that of the LCA of all Australasian marsupials.

The near-absence of an Australian terrestrial vertebrate fossil record from the latest Oligocene ({approx} 25 Mya) back to the Early Cretaceous ({approx} 105 Mya) is the biggest stumbling block to inferring the nature of the initial diversification of Australasian marsupials. Thus it is critical to determine the relationships of Paleocene and Eocene fossils with alleged close affinities to the Australasian crown group, such as Andinodelphys (Marshall et al., 1990) from Bolivia and the fragmentary specimens from Murgon, Australia (Godthelp et al., 1999; also see Woodburne and Case, 1996). Hopefully, using the molecular tree as a scaffold will remove some of the uncertainty involved in placing fossil taxa on the tree.

Beyond the interrelations of the major Australasian orders, we can dismiss two key hypotheses, Syndactyla, a variant of BDi that typically includes Notoryctes, and Eometatheria, a variant of DaDi that also includes Dromiciops and Notoryctes. The latter hypothesis influenced Kirsch et al. (1997) to flag the possibility that bandicoots and other australidelphians were independently derived from among South American marsupials. Our well-corroborated finding of a bandicoot-dasyuromorphian association instead implies that bandicoot origins lie firmly within the Australasian radiation. Hence, the osseous patella, "mortise and tenon" upper ankle joint, chorioallantoic placenta, and extended corpus luteum life of bandicoots must be considered to have evolved in convergence with placental mammals. The alternative, retention from early therian ancestry, requires at least five independent losses among marsupials for each of these characters if Dromiciops is basal among australidelphians and "Ameridelphia" is paraphyletic (e.g., Amrine-Madsen et al., 2003). It would be interesting to explore whether all four of these traits, which appear to be adaptive for either a cursorial (running) lifestyle (Muizon, 1998; Szalay and Sargis, 2001) or rapid reproduction (Tyndale-Biscoe and Renfree, 1987) are evolutionarily linked.

A further conclusion is that syndactyly of the 2nd and 3rd pedal digits is not a synapomorphy of bandicoots and diprotodontians, as has often been advocated (see Szalay, 1993; Hall, 1987; Luckett, 1994). It has already been pointed out that basic aspects of syndactyly are under simple genetic control (Muragaki et al., 1996) and that syndactyly has evolved independently in placentals, including in gibbons and otter shrews (Nowak, 1991). Indeed, syndactyly appears to be incipient in some didelphimorph marsupials (Bensley, 1903; Kirsch, 1977), whereas Sargeant and Thulborn (1986) even attribute syndactylous footprints from the Cretaceous to an early marsupial.

The upshot of syndactyly being homoplastic among marsupials is that it can be added to the growing list of complex characters that were considered as robust support for mammal clades that have since been shown to be incorrect. Other examples include tribosphenic molars (which evolved independently in Gondwanan and Laurasian mammals; see Lou et al., 2001) and the double-trochlea astragali (ankle bones) of artiodactyls, which are now known to be paraphyletic (Gatesy and O'Leary, 2001). We need to be very careful about giving any character special phylogenetic status, no matter how complex it appears to be, especially if the facets of that complexity may be functionally, genetically, or developmentally linked.


    Conclusions and Further Work
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Conclusions and Further Work
 References
 
We have employed a simple strategy for resolving incongruence between mitochondrial and nuclear DNA process partitions for marsupial interrelations. The key was to examine compositional nonstationarity using base-frequency versus standard (BF/S) distance comparisons. Unlike stochastic tests (e.g., composition {chi}2), BF/S distance comparisons are not affected by phylogenetic correlation, they allow biases to be explored across specific nodes and, most importantly, indicate whether sources of bias are of sufficient magnitude to potentially mislead tree reconstruction.

The BF/S distance comparisons show impressive explanatory power, allowing us to predict patterns of incongruence and clade support among partitions. One drawback, however, is that only standard, uncorrected distances (for inferring tree support) are directly comparable with the BF distances. The difficulty is in appropriately weighting the BF distances such that they are comparable with more realistic corrected distance models. This problem deserves further attention, although ML/Bayesian approaches for exploring the contribution of composition bias to relative support among competing hypotheses would be more desirable and may even be more tractable.

In the first part of our strategy, the BF/S distance comparisons show signal among Mt-ptn and Mt-RNA to be potentially misleading. In the second, composition bias was reduced by excluding Mt-ptn 3rd positions, Y-coding Mt-ptn 1st positions, and RY-coding Mt-RNA, with the resulting reduced bias data sets combined with the nuclear sequences. Consequently, we have for the first time been able to confidently determine affinities among the three major radiations of Australasian marsupials ((Diprotodontia, (Peramelemorphia, Dasyuromorphia)).

Further resolving the interordinal relations of marsupials requires the monogeneric Dromiciops and Notoryctes to be placed with respect to the short internal branches of the current tree. It appears likely that Dromiciops is basal among Australidelphia (leaving a monophyletic Australasian clade; Fig. 3), with AU-testing on the reduced-bias mt data set rejecting alternatives at P < 0.1 (e.g., tree 13 in Table 6). Although the present analysis of the nuclear sequences marginally favors Dromiciops as sister to Diprotodontia, with increased taxon sampling, Bayesian inference on these sequences (Amrine-Madsen et al., 2003) also favored a basal placement for Dromiciops among Australidelphia. Notoryctes is more problematic still, with little separating alternative sister relations (with either Dasyuromorphia, bandicoots, or a bandicoot-dasyuromorphian clade) for either the mt or nuclear analyses.

Confidently resolving the affinities of Dromiciops and Notoryctes may require data from transposable elements or genomic rearrangements. Nevertheless, the present results could aid these efforts and morphological investigations by providing a phylogenetic scaffold for the three major radiations. Furthermore, as shown in Table 2, there is considerable compositional heterogeneity even among nuclear data. As such, sequence analyses focusing on family-level or deeper marsupial phylogeny should routinely employ objective means for identifying specific branches that may result from composition bias. We further advise that although sequencing effort should concentrate on the generally higher quality nuclear data, the mitochondrial data can offer valuable independent confirmation and should not be ignored.


View this table:
[in this window]
[in a new window]

 
Appendix 1 Maximum-parsimony bootstrap support from the 12-taxon mitochondrial protein, mitochondrial RNA and nuclear data sets for marsupial relationships, including for bandicoot-dasyuromorphian (BDa), bandicoot-diprotodontian (BDi), and dasyuromorphian-diprotodontian (DaDi) associations.

 

    Acknowledgements
 
We thank Don Colgan and the Australian Museum (Sydney) for the dasyurid and bandicoot tissue samples. Olga Kardailsky and Charmaine Carlisle made valuable contributions to the laboratory work. We also thank Mike Westerman, Cary Krajewski, Mike Lee, and Peter Foster for helpful comments on the manuscript. This project is supported by the New Zealand Marsden Fund.


    References
 Top
 Abstract
 Material and Methods
 Results
 Discussion
 Conclusions and Further Work
 References
 

    Adachi J., Hasegawa M. Model of amino acid substitution in proteins encoded by mitochondrial DNA. J. Mol. Evol. (1996) 42:459–468.[Web of Science][Medline]

    Amrine H. M., Springer M. S. Maximum likelihood analysis of the tethythere hypothesis based on a multigene data set and a comparison of different models of sequence evolution. J. Mamm. Evol. (1999) 6:161–176.[CrossRef]

    Amrine-Madsen H., Scally M., Westerman M., Stanhope M. J., Krajewski C., Springer M. S. Nuclear gene sequences provide evidence for the monophyly of australidelphian marsupials. Mol. Phyl. Evol. (2003) 28:186–196.[CrossRef][Web of Science][Medline]

    Aplin K. P., Archer M. Recent advances in marsupial systematics with a new syncretic classification. In: Possums and opossums: Studies in evolution—Archer M., ed. (1987) Chipping Norton, New South Wales, Australia: Surrey Beatty and Sons. xv–lxxii.

    Arnason U., Adegoke J. A., Bodin K., Born E. W., Esa Y. B., Gullberg A., Nilsson M., Short R. V., Xu X., Janke A. Mammalian mitogenomic relationships and the root of the eutherian tree. Proc. Natl. Acad. Sci. USA (2002) 99:8151–8156.[Abstract/Free Full Text]

    Asher R. J., Horovitz I., Sá-nchez-Villagra M. R. First combined cladistic analysis of marsupial mammal interrelationships. Mol. Phyl. Evol. (2004) 33:240–250.[CrossRef][Web of Science][Medline]

    Baker M. L., Wares J. P., Harrison G. A., Miller R. D. Relationships among the families and orders of marsupials and the major mammalian lineages based on recombination activating gene-1. J. Mamm. Evol. (2004) 11:1–16.[CrossRef]

    Baker R. H., Wilkinson G. S., DeSalle R. Phylogenetic utility of different types of molecular data used to infer evolutionary relationships among stalk-eyed flies (Diopsidae). Syst. Biol. (2001) 50:87–105.[Abstract/Free Full Text]

    Ballard J. W. O., Whitlock M. C. The incomplete natural history of mitochondria. Mol. Ecol. (2004) 13:729–744.[CrossRef][Medline]

    Baverstock P. R., Birrell J., Krieg M. Albumin immunologic relationships among Australian possums: a progress report. In: Possums and opossums: Studies in evolution—Archer M., ed. (1987) Chipping Norton, New South Wales, Australia: Surrey Beatty and Sons. 229–234.

    Bensley B. A. On the evolution of the Australian Marsupialia: With remarks on the relationships of the marsupials in general. Trans. Linn. Soc. Lond. (1903) 9:83–217.

    Buckley T. R., Arensburger P., Simon C. Combined data, bayesian phylogenetics, and the origin of the New Zealand cicada genera. Syst. Biol. (2002) 51:4–18.[Abstract/Free Full Text]

    Bull J. J., Huelsenbeck J. P., Cunningham C. W., Swofford D. L., Waddell P. J. Partitioning and combining data in phylogenetic analysis. Syst. Biol. (1993) 42:384–397.[Free Full Text]

    Burk A., Westerman M., Kao D. J., Kavanagh J. R., Springer M. S. An analysis of marsupial interordinal relationships based on 12S rRNA, tRNA valine, 16S rRNA, and cytochrome b sequences. J. Mamm. Evol. (1999) 6:317–334.[CrossRef]

    Cardillo M. O. R., Bininda-Emonds P., Boakes E., Purvis A. A species-level phylogenetic supertree of marsupials. J. Zool. (2004) 264:11–31.[CrossRef][Web of Science]

    Caterino M. S., Reed R. D., Kuo M. M., Sperling F. A. H. A partitioned likelihood analysis of swallowtail butterfly phylogeny (Lepidoptera: Papilionidae). Syst. Biol. (2001) 50:106–127.[Abstract/Free Full Text]

    Chase J., Graydon M. L. The eye of the northern brown bandicoot, Isoodon macrourus. In: Bandicoots and Bilbies—Seebeck J. H., Brown R. R., Wallis R. L., Kemper C. M., eds. (1990) Chipping Norton, New South Wales, Australia: Surrey Beatty and Sons. 117–122.

    Chen W.-J., Bonillo C., Lecointre G. Repeatability of clades as a criterion of reliability: A case study for molecular phylogeny of Acanthomorpha (Teleostei) with larger number of taxa. Mol. Phyl. Evol. (2003) 26:262–288.[CrossRef][Web of Science][Medline]

    Colgan D. J. Phylogenetic studies of marsupials based on phosphoglycerate kinase DNA sequences. Mol. Phyl. Evol. (1999) 11:13–26.[CrossRef][Web of Science][Medline]

    Corneli P. S. Complete mitochondrial genomes and eutherian evolution. J. Mammal. Evol. (2002) 9:281–305.[CrossRef]

    Cunningham C. W. Is congruence between data partitions a reliable predictor of phylogenetic accuracy? Empirically testing an iterative procedure for choosing among phylogenetic methods. Syst. Biol. (1997) 46:464–478.[Abstract/Free Full Text]

    Curole J. P., Kocher T. D. Mitogenomics: Digging deeper with complete mitochondrial genomes. Trends Ecol. Evol. (1999) 14:394–398.[CrossRef][Medline]

    DeBry R. W. Maximum likelihood analysis of gene-based and structure-based process partitions, using mammalian mitochondrial genomes. Syst. Biol. (1999) 48:286–299.[Abstract/Free Full Text]

    Delsuc F., Phillips M. J., Penny D. Comment on "Hexapod origins: monophyletic or paraphyletic?" Science (2003) 301:1887–1889.

    Douady C. J., Delsuc F., Boucher Y., Doolittle W. F., Douzery E. J. Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability. Mol. Biol. Evol. (2003) 20:248–254.[Abstract/Free Full Text]

    Dowton M., Austin A. D. Increased congruence does not necessarily indicate increased phylogenetic accuracy—the behavior of the incongruence length difference test in mixed-model analyses. Syst. Biol. (2002) 51:19–31.[Abstract/Free Full Text]

    Farris J. S., Källersjö M., Kluge A. G., Bult C. Testing significance of incongruence. Cladistics (1994) 10:315–319.[CrossRef][Web of Science]

    Felsenstein J. Cases in which parsimony or compatibility methods will be positively misleading. Syst. Zool. (1978) 27:401–410.[Abstract/Free Full Text]

    Fiala K. L., Sokal R. R. Factors determining the accuracy of cladogram estimation: evaluation using computer simulation. Evolution (1985) 39:609–622.[CrossRef][Web of Science]

    Foster P. G. Modeling compositional heterogeneity. Syst. Biol. (2004) 53:485–495.[Abstract/Free Full Text]

    Galtier N., Gouy M. Inferring pattern and process: Maximum likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis. Mol. Biol. Evol. (1998) 15:871–879.[Abstract]

    Gatesy J., O'Leary M. A. Deciphering whale origins with molecules and fossils. Trends Ecol. Evol. (2001) 16:562–570.[CrossRef]

    Gibson A., Gowri-Shankar V., Higgs P. G., Rattray M. A comprehensive analysis of mammalian mitochondrial genome base composition and improved phylogenetic methods. Mol. Biol. Evol. (2005) 22:251–264.[Abstract/Free Full Text]

    Godthelp H., Wroe S., Archer M. A new marsupial from the Early Eocene Tingamarra local fauna of Murgon, southeastern Queensland: A prototypical Australian marsupial? J. Mamm. Evol. (1999) 6:289–313.[CrossRef]

    Gontcharov A. A., Marin B., Melkonian M. Are combined analyses better than single gene phylogenies? A case study using SSU rDNA and rbcL sequence comparisons in the Zygnematophyceae (Streptophyta). Mol. Biol. Evol. (2004) 21:612–624.[Abstract/Free Full Text]

    Hall L. S. Syndactyly in marsupials—problems and prophecies. In: Possums and opossums: Studies in evolution—Archer M., ed. (1987) Chipping Norton, New South Wales, Australia: Surrey Beatty and Sons. 245–255.

    Hendy M., Penny D. A framework for the quantitative study of phylogenetic data. Syst. Zool. (1989) 38:297–309.[Abstract/Free Full Text]

    Hillis D. M. Molecular versus morphological approaches to systematics. Annu. Rev. Ecol. Syst. (1987) 18:23–42.[CrossRef][Web of Science]

    Ho S. Y. W., Jermiin L. S. Tracing the decay of the historical signal in biological sequence data. Syst. Biol. (2004) 53:623–637.[Abstract/Free Full Text]

    Horovitz I., Sánchez-Villagra M. R. A morphological analysis of marsupial mammal higher-level phylogenetic relationships. Cladistics (2003) 19:181–212.[CrossRef][Web of Science]

    Huelsenbeck J., Bull J. J., Cunningham C. W. Combining data in phylogenetic analysis. Trends Ecol. Evol. (1996) 11:152–158.[CrossRef]

    Huelsenbeck J., Ronquist F. MrBayes: Bayesian inference on phylogenetic trees. Bioinformatics (2001) 17:754–755.[Abstract/Free Full Text]

    Hughes R. L. Comparative morphology of spermatozoa from five families. Aust. J. Zool. (1965) 13:533–543.[CrossRef]

    Janke A., Gemmell N. J., Feldmaier-Fuchs G., von Haeseler A., Pääbo S. The mitochondrial genome of a monotreme—the platypus (Ornithorhynchus anatinus). J. Mol. Evol. (1996) 42:153–159.[CrossRef][Web of Science][Medline]

    Janke A., Xu X., Arnason U. The complete mitochondrial genome of the wallaroo (Macropus robustus) and the phylogenetic relationship of among Monotremata, Marsupialia and Eutheria. Proc. Nat. Acad. Sci. USA (1997) 94:1276–1281.[Abstract/Free Full Text]

    Jansa S. A., Voss R. S. Phylogenetic studies of didelphid marsupials I. Introduction and preliminary results from nuclear IRBP gene sequences. J. Mammal. Evol. (2000) 7:43–77.

    Jermiin L. S., Crozier R. H. The cytochrome b region in the mitochondrial DNA of the ant Tetraponera rufoniger: Sequence divergence in Hymenoptera may be associated with nucleotide content. J. Mol. Evol. (1994) 38:282–294.[CrossRef][Web of Science][Medline]

    Jones F. W. The Mammals of South Australia (1923–1925) Adelaide: Government Printers.

    Kirsch J. A. W. The comparative serology of Marsupialia, and the classification of marsupials. Aust. J. Zool. (Suppl.) (1977) 38:1–152.

    Kirsch J. A. W., Lapointe F.-J., Springer M. S. DNA-hybridisation studies of marsupials and their implications for metatherian classification. Aust. J. Zool. (1997) 45:211–280.[CrossRef][Web of Science]

    Kishino H., Hasegawa M. Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data and the branching order Hominidae. J. Mol. Evol. (1989) 29:170–179.[CrossRef][Web of Science][Medline]

    Kluge A. G. A concern for evidence and a phylogenetic hypothesis of relationships among Epicrates (Boidae, Serpentes). Syst. Zool (1989) 38:7–25.[Abstract]

    Krajewski C., Buckley L., Westerman M. DNA phylogeny of the marsupial wolf resolved. Proc. Roy. Soc. Lond. B (1997) 264:911–917.[CrossRef][Medline]

    Levasseur C., Lapointe F.-J. War and peace in phylogenetics: A rejoinder on total evidence and consensus. Syst. Biol. (2001) 50:881–891.[Abstract/Free Full Text]

    Lin C.-P., Danforth B. N. How do insect nuclear and mitochondrial gene substitution patterns differ? Insights from Bayesian analyses of combined data sets. Mol. Phyl. Evol. (2004) 30:686–702.[CrossRef][Web of Science][Medline]

    Lin Y.-H., McLenachan P. A., Gore A. R., Phillips M. J., Ota R., Hendy M., Penny D. Four new mitochondrial genomes, and the increased stability of evolutionary trees of mammals from improved taxon sampling. Mol. Biol. Evol. (2002a) 19:2060–2070.[Abstract/Free Full Text]

    Lin Y.-H., Waddell P. J., Penny D. Pika and vole mitochondrial genomes increase support for both rodent monophyly and glires. Gene (2002b) 294:119–124.[CrossRef][Web of Science][Medline]

    Lockhart P. J., Steel M., Hendy M., Penny D. Recovering evolutionary trees under a more realistic model of sequence evolution. Mol. Biol. Evol. (1994) 11:605–612.[Web of Science][Medline]

    Lou Z.-X., Cifelli R., Kielan-Jaworowska Z. Dual origin of tribosphenic mammals. Nature (2001) 409:53–57.[CrossRef][Medline]

    Lou Z.-X., Ji Q., Wible J. R., Yuan C.-X. An Early Cretaceous tribosphenic mammal and metatherian evolution. Science (2003) 302:1934–1940.[Abstract/Free Full Text]

    Luckett W. P. Suprafamilial relationships within Marsupialia: Resolution and discordance from multidisciplinary data. J. Mammal. Evol. (1994) 2:255–288.[CrossRef]

    Madsen O., Scally M., Douady C. J., Kao D., DeBry R. W., Adkins R., Amrine H. M., Stanhope M. J., de Jong W. W., Springer M. S. Parallel adaptive radiations in two major clades of placental mammals. Nature (2001) 409:610–614.[CrossRef][Medline]

    Marshall L. G., Case J. A., Woodburne M. O. Phylogenetic relationships of the families of marsupials. In: Current mammalogy, Volume 2—Genoways H. H., ed. (1990) New York: Plenum Press. 433–505.

    Mickevich M. F. Taxonomic congruence. Syst. Zool. (1978) 27:143–158.[Abstract/Free Full Text]

    Mindell D. P., Thacker C. E. Rates of molecular evolution: Phylogenetic issues and applications. Annu. Rev. Ecol. Syst. (1996) 27:279–303.[CrossRef][Web of Science]

    Miyamoto M. M., Fitch W. M. Testing species phylogenies and phylogenetic methods with congruences. Syst. Biol. (1995) 44:64–76.[Abstract/Free Full Text]

    Muizon de C. Mayulestes ferox, a borhyaenoid (Metatheria, Mammalia) from the early Palaeocene of Bolivia. Phylogenetic and palaeobiologic implications. Geodiversitas (1998) 20:19–142.

    Muragaki Y., Mundios S., Upton J., Olsen B. R. Altered growth and branching patterns in synpolydactyly caused by mutations in HOXD13. Science (1996) 272:548–551.[Abstract]

    Murphy W. J., Eizirik E., Johnson W. E., Zhang Y. P., Ryder O. A., O'Brien S. J. Molecular phylogenetics and the origins of placental mammals. Nature (2001) 409:614–618.[CrossRef][Medline]

    Naylor G. J. P., Brown W. M. Amphioxus mitochondrial DNA, Chordate phylogeny, and the limits of inference based on comparisons of sequences. Syst. Biol. (1998) 47:61–76.[Abstract/Free Full Text]

    Negrisolo E., Minelli A., Valle G. The mitochondrial genome of the house centipede Scutigera and the monophyly versus paraphyly of myriapods. Mol. Biol. Evol. (2004) 21:770–780.[Abstract/Free Full Text]

    Nilsson M., Arnason U., Spencer P. B. S., Janke A. Marsupial relationships and a timeline for marsupial radiation in South Gondwana. Gene (2004) 340:189–196.[CrossRef][Web of Science][Medline]

    Nilsson M., Gullberg A., Spotorno A. E., Arnason U., Janke A. Radiation of extant marsupials after the K/T boundary: Evidence from complete mitochondrial genomes. J. Mol. Evol. (2003) 57 (Suppl.):S3–S12.[CrossRef][Web of Science][Medline]

    Nowak R. M. Walker's Mammals of the World, 5th edition (1991) Baltimore: Johns Hopkins University Press.

    Paton T., Haddrath O., Baker A. J. Complete mitochondrial DNA genome sequences show that modern birds are not descended from transitional shorebirds. Proc. Roy. Soc. Lond. B (2002) 269:839–846.[Medline]

    Phillips M. J., Delsuc F., Penny D. Genome-scale phylogeny and the detection of systematic biases. Mol. Biol. Evol. (2004) 21:1455–1458.[Abstract/Free Full Text]

    Phillips M. J., Lin Y.-H., Harrison G. L., Penny D. Mitochondrial genomes of a bandicoot and a brushtail possum confirm the monophyly of Australiadelphian marsupials. Proc. Roy. Soc. Lond. B (2001) 268:1533–1538.[Medline]

    Phillips M. J., Penny D. The root of the mammalian tree inferred from whole mitochondrial genomes. Mol. Phyl. Evol. (2003) 28:171–185.[CrossRef][Web of Science][Medline]

    Posada D., Crandall K. A. Model Test: Testing the model of DNA substitution. Bioinformatics (1998) 14:817–818.[Abstract/Free Full Text]

    Pupko T., Huchon D., Cao Y., Okada N., Hasegawa M. Combining multiple data sets in a likelihood analysis: which models are the best? Mol. Biol. Evol. (2002) 19:2294–2307.[Abstract/Free Full Text]

    Rambaut A. Sequence Alignment Editor, version v1.0 alpha1 (1996).

    Rambaut A., Drummond A. J. (2003) Tracer v1.0. Available at http://evolve.zoo.ox.ac.uk.

    Retief J. D., Krajewski C., Westerman M., Winkfein R. J., Dixon G. H. Molecular phylogeny and evolution of marsupial protamine P1 genes. Proc. Roy. Soc. Lond. B (1995) 259:7–14.[Medline]

    Reyes A., Gissi C., Catzeflis F., Nevo E., Pesole G., Saccone C. Congruent mammalian trees from mitochondrial and nuclear genes using Bayesian methods. Mol. Biol. Evol. (2004) 21:397–403.[Abstract/Free Full Text]

    Reyes A., Gissi C., Pesole G., Saccone C. Asymmetrical directional mutation pressure in the mitochondrial genome of mammals. Mol. Biol. Evol. (1998) 15:957–966.[Abstract]

    Sarjeant W. A. S., Thulborn R. A. Probable marsupial footprints from the Cretaceous sediments of British Columbia. Can. J. Earth Sci. (1986) 23:1223–1227.[CrossRef]

    Schmitz J., Ohme M., Suryobroto B., Zischler H. The colugo (Cynocephalus variegatus, Dermoptera): the primates' gliding sister? Mol. Biol. Evol. (2002a) 19:2308–2312.[Abstract/Free Full Text]

    Schmitz J., Ohme M., Zischler H. The complete mitochondrial sequence of Tarsius bancanus: evidence for extensive nucleotide compositional plasticity of primate mitochondrial DNA. Mol. Biol. Evol. (2002b) 19:544–553.[Abstract/Free Full Text]

    Schwarz M. P., Tierney S. M., Cooper S. J. B., Bull N. J. Molecular phylogenetics of the allodapine bee genus Braunsapis: A-T bias and heterogeneous substitution parameters. Mol. Phyl. Evol. (2004) 32:110–122.[CrossRef][Web of Science][Medline]

    Shimodaira H. An approximately unbiased test of phylogenetic tree selection. Syst. Biol. (2002) 51:492–508.[Abstract/Free Full Text]

    Shimodaira H., Hasegawa M. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. (1999) 16:1114–1116.[Web of Science]

    Shimodaira H., Hasegawa M. CONSEL: For assessing the confidence of phylogenetic tree selection. Bioinformatics (2001) 17:1246–1247.[Abstract/Free Full Text]

    Simmons M. P., Pickett K. M., Miya M. How meaningful are Bayesian support values? Mol. Biol. Evol. (2004) 21:188–199.[Abstract/Free Full Text]

    Springer M. S., Amrine H. M., Burk A., Stanhope M. J. Additional support for Afrotheria and Paenungulata, the performance of mitochondrial versus nuclear genes, and the impact of data partitions with heterogeneous base composition. Syst. Biol. (1999) 48:65–75.[Abstract/Free Full Text]

    Springer M. S., Burk A., Kavanagh J. R., Waddell V. G., Stanhope M. J. The interphotoreceptor retinoid binding protein gene in therian mammals: implications for higher level relationships and evidence for loss of function in the marsupial mole. Proc. Natl. Acad. of Sci. USA (1997a) 94:13754–13759.[Abstract/Free Full Text]

    Springer M. S., DeBry R. W., Douady C. J., Amrine H. M., Madsen O., de Jong W. W., Stanhope M. J. Mitochondrial versus nuclear gene sequences in deep-level phylogeny reconstruction. Mol. Biol. Evol. (2001) 18:132–143.[Abstract/Free Full Text]

    Springer M. S., Kirsch J. A. W., Case J. A. The chronicle of marsupial evolution. In: Molecular evolution and adaptive radiation—Givnish T. J., Sytsma K. J., eds. (1997b) New York: Cambridge University Press. 129–161.

    Springer M. S., Westerman M., Kavanagh J. R., Burk A., Woodburne M., Kao D. J., Krajewski C. The origin of the Australasian marsupial fauna and the phylogenetic affinities of the enigmatic monito del monte and the marsupial mole. Proc. Roy. Soc. Lond. B (1998) 265:2381–2386.[Medline]

    Sullivan J., Swofford D. L. Should we use model-based methods for phylogenetic inference when we know that assumptions about among-site rate variation and nucleotide substitution pattern are violated. Syst. Biol. (2001) 50:723–729.[Free Full Text]

    Suzuki Y., Glazko G. V., Nei M. Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics. Proc. Natl Acad. Sci. USA (2002) 99:16138–16143.[Abstract/Free Full Text]

    Swofford D. L. PAUP*. Phylogenetic analysis using parsimony (*and other methods), version 4.0b10(PPC) (2002) Sunderland, Massachusetts: Sinaur Associates.

    Szalay F. S. A new appraisal of marsupial phylogeny and classification. In: Carnivorous Marsupials—Archer M., ed. (1982) Sydney: Royal Zoological Society of New South Wales. 621–640.

    Szalay F. S. Metatherian taxon phylogeny: Evidence and interpretation from the cranioskeletal system. In: Mammal phylogeny: Mesozoic differentiation, multituberculates, monotremes, early eutherians, and marsupials—Szalay F. S., Novacek M. J., McKenna M. C., eds. (1993) New York: Springer-Verlag. 216–241.

    Szalay F. S., Sargis E. J. Model-based analysis of postcranial osteology of marsupials from the Palaeocene of Itaboraí (Brazil) and the phylogenetics and biogeography of Metatheria. Geodiversitas (2001) 23:139–302.

    Thomas O. Catalogue of the Marsupialia and Monotremata in the Collection of the British Museum (Natural History) (1888) London: British Museum.

    Tyndale-Biscoe C. H., Renfree M. B. Reproductive physiology of marsupials (1987) Cambridge: Cambridge University Press.

    Westerman M. Phylogenetic relations of the marsupial mole, Notoryctes typhlops (Marsupialia: Notoryctidae). Aust. J. Zool (1991) 39:529–537.[CrossRef][Web of Science]

    Woodburne M. O., Case J. A. Dispersal, vicariance, and the Late Cretaceous to Early Tertiary land mammal biogeography from South America to Australia. J. Mammal. Evol. (1996) 3:121–161.[CrossRef]

    Yang Z. Maximum-likelihood models for combined analyses of multiple sequence data. J. Mol. Evol (1996) 42:587–596.[CrossRef][Web of Science][Medline]

    Yang Z. On the best evolutionary rate for phylogenetic analysis. Syst. Biol. (1998) 47:125–133.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
W. Miller, D. I. Drautz, J. E. Janecka, A. M. Lesk, A. Ratan, L. P. Tomsho, M. Packard, Y. Zhang, L. R. McClellan, J. Qi, et al.
The mitochondrial genome sequence of the Tasmanian tiger (Thylacinus cynocephalus)
Genome Res., February 1, 2009; 19(2): 213 - 220.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. C. Pratt, G. C. Gibb, M. Morgan-Richards, M. J. Phillips, M. D. Hendy, and D. Penny
Toward Resolving Deep Neoaves Phylogeny: Data, Signal Enhancement, and Priors
Mol. Biol. Evol., February 1, 2009; 26(2): 313 - 326.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (26)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Phillips, M. J.
Right arrow Articles by Penny, D.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Phillips, M. J.
Right arrow Articles by Penny, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?