Skip Navigation

Systematic Biology 2007 56(6):896-906; doi:10.1080/10635150701772563
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Shaffer, H. B.
Right arrow Articles by Thomson, R. C.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Shaffer, H. B.
Right arrow Articles by Thomson, R. C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 Society of Systematic Biologists

Delimiting Species in Recent Radiations

Edited by John Weins: Guest Associate Editor

H. Bradley Shaffer and Robert C. Thomson

1 Section of Evolution and Ecology, and Center for Population Biology, University of California Davis, CA, 95616, USA E-mail: hbshaffer{at}ucdavis.edu (H.B.S.) rcthomson{at}ucdavis.edu (R.C.T.)


    Abstract
 Top
 Abstract
 Species Boundaries in Recent...
 Developing Markers in Nonmodel...
 Species Delimitation
 Conclusions
 Acknowledgments
 References
 
Despite considerable effort from the systematics community, delimiting species boundaries in recent radiations remains a daunting challenge. We argue that genealogical approaches, although sometimes useful, may not solve this important problem, because recently derived species often have not had sufficient time to achieve monophyly. Instead, we suggest that population genetic approaches that rely on large sets of informative markers like single nucleotide polymorphisms (SNPs) provide an alternative framework for delimiting very recently derived species. We address two major challenges in applying such markers to species delimitation: discovering markers in nonmodel systems and using them to delimit recently derived species. Using turtles as a test case, we explore the utility of a single, relatively low-coverage genomic resource as an aid in gene and marker discovery. We exploit an end-sequenced bacterial artificial chromosome (BAC) library from an individual painted turtle (Chrysemys picta) and outline a novel protocol that efficiently identifies primer pairs that amplify homologous sequences across the tree of living turtles. Preliminary data using this library to discover SNPs in Emydura macquarii, a species that diverged from C. picta ~ 210 million years ago, indicate that sequences identified from the Chrysemys BAC library provide useful SNPs even in this very distantly related taxon. Several recent methods in wide use in the population genetics literature allow one to discover potential species, or test existing species hypotheses, with SNP data and may be particularly informative for very recently derived species. As BAC and other genomic resources become increasingly available for scattered taxa across the tree of life, we are optimistic that these resources will provide abundant, inexpensive markers that will help delimit boundaries in problematic, recent species radiations.

Keywords: BAC end sequence; Chrysemys; comparative genomics; Emydura; recent speciation; species delimitation; turtle

Received December 15, 2006; Revised March 19, 2007; Accepted August 14, 2007


Species criteria, definitions, and delimitations occupy one of the most contentious and least resolvable debates in systematics. It seems fair to say that many systematists agree that species are real, important, and sometimes extremely difficult to identify (de Queiroz, 1998; Hey et al., 2003). There is also a growing sense that the virtually limitless empirical data available from emerging genomic databases may help solve the problem of delimiting difficult, recently derived species (Shaw and Danley, 2003; Liti et al., 2006). Presumably, it must be the case that more data are better than less, and that information from multiple, independent genetic loci should help in delimiting species, even if those species are very recently formed and therefore have few or no unique features. However, it is also true that more data do not always help to resolve difficult problems in historical biology. Identifying species trees among rapidly speciating taxa is a case in point; situations exist where the most probable gene tree does not match the actual species tree, and adding additional data does not help (Degnan and Rosenberg, 2006; Kubatko and Degnan, 2007).

One point that seems incontrovertible is that it is getting easier to acquire genomic-level data for an ever-widening set of taxa. The NCBI Entrez Genome Project Web page currently lists 236 eukaryote genomes and some 1100 prokaryotic genomes that are completed or in progress and many more partial genomic resources, including end-sequenced bacterial artificial chromosome (BAC) libraries and expressed sequence tag (EST) databases. These resources are produced for single species (often for single individuals), but they generally include hundreds of thousands to millions of base pairs of sequence data for that individual. For these taxa, and presumably many of their relatives, the availability of such genomic tools implies that the information contained in very large quantities of DNA sequence data can now be brought to bear on species delimitation problems. Although these tools may not solve every problem in species identification and delimitation, they should represent a major step forward over the previous generation of genetic analyses. In addition, these multigene data sets should allow systematists to test species lineages that have been proposed based on single gene analyses, including mitochondrial DNA (Avise and Walker, 1999).

In this paper, we focus on two important issues in the use of genomic data in recent species delimitation. First, can genomic-level tools, which will always be available for only a small fraction of the total diversity of life, be applied to related species that lack such tools? More specifically, how can we find hundreds of markers quickly and efficiently in nonmodel systems? Second, how can we use those markers to delimit species in a rigorous, satisfying way? For this second question, we consider several population genetically oriented approaches that may be used in conjunction with more phylogenetically oriented strategies to identify and delimit recently derived species.

In pursuing these questions, we do not discuss the vast literature on species concepts except to say that we fundamentally agree with Mayden (1997) and de Queiroz (1998, 2005) that species are segments of evolutionary lineages. Under this view, which goes back at least to Simpson (1951), species delimitation boils down to the identification of "metapopulation lineages" (de Queiroz, 2005), with the most difficult problems often represented by the youngest species pairs (Fig. 1). As species persist over time, the attributes that they take on, and therefore the ease with which lineages can be discovered and delimited, changes (de Queiroz, 1998). In practice, older species have often had time to accumulate a range of features, including reproductive isolation, fixed apomorphies, and gene-tree monophyly, that allow for straightforward lineage (= species) delimitation. However, very young taxa that are early in the process of speciation often lack these features, making them particularly challenging for systematists (Fig. 1). We therefore focus our attention on the value of genomic data for delimiting young species early in the process of diversification.


Figure 1
View larger version (35K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 A schematic diagram of the divergence of species through time. We focus our attention on the early part of the process, since that is when species delimitation is most difficult. Figure modified from de Queiroz (1998).

 
Sites and Marshall (2003, 2004) provide recent summaries of advances in the actual criteria for delimiting species boundaries. These authors place species delimitation methods into two broad categories: tree-based and non–tree-based. Tree-based methods tend to require demonstrable monophyly, whereas non–tree-based methods more often focus on the discovery of differentiation associated with the earliest stages of lineage diversification. Both are clearly valuable in species delimitation research, particularly when sufficient time has elapsed for monophyly to evolve (Hudson and Coyne, 2002). Maddison and Knowles recently examined the related problem of phylogeny reconstruction among known species that have not had time to sort to monophyly, and point out that some limited phylogenetic information still resides in these incompletely sorted gene trees (Maddison and Knowles, 2006). Presumably, the same is true for species delimitation, and methods developed to extract this "fuzzy" information using tree-based approaches are being developed. Here, we concentrate our discussion on approaches derived from population genetics rather than those that require monophyly for species identification. In doing so, we fully recognize that tree-based methods (Baum and Shaw, 1995; Wiens and Penkrot, 2002) may also identify recently derived species; our hope is that both approaches can be brought to bear on difficult but often critically important recent species radiations.


    Species Boundaries in Recent Radiations
 Top
 Abstract
 Species Boundaries in Recent...
 Developing Markers in Nonmodel...
 Species Delimitation
 Conclusions
 Acknowledgments
 References
 
One of the more disturbing realizations to emerge from the speciation literature is that many of the most widely cited and widely studied examples of recent species radiations lack clear evidence on actual species boundaries. Consider the following textbook examples of recent species radiations:

Darwin's Finches
Petren et al. (2005) present the most complete summary to date of species boundaries in several groups of Darwin's finches, including data on 1428 birds from 74 populations for 16 microsatellite loci for the cactus finches (Geospiza scandens and conirostris), the sharp-beaked ground finch (G. difficilis), and the warbler finches (Certhidea olivacea and fusca). Although these and other species in the group have been extensively analyzed with mitochondrial and nuclear markers (Freeland and Boag, 1999), species boundaries remain fuzzy and ill-defined for many taxa, making these animals "the poster child for foggy species boundaries" (Petren, personal communication, 2006). The most probable explanations for this lack of resolution include hybridization (Grant et al., 2005) and recency of speciation coupled with rapid morphological divergence (Petren et al., 2005). In resolving species boundaries, these authors conclude that additional individual sampling will not help, but additional nuclear genetic data may help clarify these difficult species boundaries.

Cichlid Fishes
Cichlid fishes constitute one of the most famous and widely publicized cases of adaptive speciation and extreme rates of lineage diversification on earth. Yet, cichlid species boundaries have proven remarkably difficult to verify with molecular tools. Recent analyses of mitochondrial and nuclear DNA sequences (Seehausen, 2004) and SINES (Takahashi et al., 2001) as well as older mtDNA analyses (Moran and Kornfield, 1993) indicate that morphologically and ecologically defensible species are often not monophyletic and are frequently characterized by extremely recent divergence. Although hybridization may be further clouding the historical reconstruction of species boundaries (Seehausen, 2004), it remains the case that molecular analyses have yet to clarify species boundaries in these rapidly diversifying fishes.

Galápagos Tortoises
As an icon for speciation, adaptation, and conservation, the Galápagos tortoises are among the most intensively studied species on earth. With 1611 sequences currently lodged in GenBank (release 161.0, release date 15 August, 2007) for the species Geochelone nigra and its 14 subspecies, one might think that the debate over species delimitation would finally be resolved for this endangered tortoise complex. Although currently considered a single species (Pritchard, 1996), recent work utilizing mitochondrial DNA and microsatellites for hundreds of tortoises from across the archipelago (Caccone et al., 2002) and from more restricted sets of localities (Beheregaray et al., 2003; Ciofi et al., 2006) indicated that additional evolutionary lineages may exist. Even after this enormous effort by a talented team of investigators, species boundaries among Galápagos tortoise populations remain controversial and unsettled (Russello et al., 2005).

Tiger Salamanders
Taking an example from our own work, the tiger salamander complex stands out as a recent radiation (estimates range from 3.5 to 5 million years old) with important conservation (Shaffer et al., 2004) and phylogeographic (Shaffer and McKnight, 1996) implications. The ~ 15-species complex is of interest as a focus of conservation (IUCN, 2006), speciation biology, and because it contains the axolotl (Ambystoma mexicanum, see http://salamander.uky.edu/), an important model system in vertebrate developmental biology (Shaffer, 1993). A concerted effort with allozymes (Shaffer, 1984a), morphometrics (Shaffer, 1984b; Irschick and Shaffer, 1997), and mitochondrial DNA sequences (Shaffer and McKnight, 1996) has revealed a complex pattern of extremely rapid speciation events characterized by short internodes and poorly resolved species boundaries. Our most recent analysis of eight nuclear loci derived from an extensive expressed sequence tag (EST) library (Weisrock et al., 2006) demonstrated that whereas some taxa exhibit monophyly for a majority of genes, others fail to do so, leaving species boundaries still very much up in the air.

These and many other examples demonstrate a simple point. Frequently, the molecular data and analytical approaches that have been used to delimit species boundaries in recently radiating lineages have highlighted the difficulty in recovering what appear to be good species, rather than providing unambiguous guidelines for species delimitation. Although this empirical result may stem from several sources, ranging from incorrectly identified species to hybridization to an inappropriate choice of molecular markers, we feel that two issues may plague many studies in the recent literature. First, the reliance on phylogenetic tree–oriented approaches and species concepts (Cracraft, 1989; Shaw, 2001) often requires gene-tree monophyly and complete lineage sorting to recognize species (Baum and Donoghue, 1995). However, recent theoretical work has demonstrated that reciprocal monophyly for a pair of species under the neutral coalescent takes from 4 to 7 Negenerations for 50% of nuclear genes sampled (where Neis the historically effective population size of each descendant taxon) to 9 to 12Negenerations for 95% of sampled genes (Hudson and Coyne, 2002). Although historical and current population sizes are seldom known, it seems likely that reciprocal monophyly for most genes will require tens of thousands of generations (and millions may be more common), implying a long waiting time for monophyly to evolve even in completely isolated lineages. A recent review suggested that this long waiting time may be causing a lack of species monophyly even for the rapidly evolving metazoan mitochondrial DNA (Funk and Omland, 2003), and it will be a much more severe problem for nuclear genes because of their lower levels of variation coupled with their larger effective population size (Hudson and Coyne, 2002). Second, most studies have relied on one or a few genes or, in studies that use mitochondrial DNA only, a single linkage block. Given the extremely idiosyncratic nature of gene tree histories (Rosenberg and Nordborg, 2002; Maddison and Knowles, 2006), particularly when populations are structured (Wakeley, 2000; Irwin, 2002), accurate pictures of recent speciation histories will only be possible when integrating over the historical information available from multiple, independent gene histories (Jennings and Edwards, 2005).

Recent advances in population-genetically oriented methodologies combined with the increasingly wide availability of genomic resources including expressed sequence tag (EST), end-sequenced bacterial artificial chromosome (BAC) libraries, and low-to-moderate coverage genome sequences now afford the opportunity to collect large amounts of nucleotide variation data that can be effectively aimed at species delimitation problems across a broad spectrum of species. Obviously, some groups have better developed genomic resources than others, and lineages with few resources may require considerable marker development. However, our own empirical work in developing PCR primers, DNA sequences, and single nucleotide polymorphisms (SNPs) suggests that relatively modest genomic resources may provide a rich source of data, even for taxa that are distantly related to a model species.

Why SNPs?
Single nucleotide polymorphisms are a promising marker-type for species delimitation. SNPs have low rates of substitution (10– 8 to 10– 9) and therefore extremely low levels of homoplasy relative to microsatellites (which have mutation rates on the order of 10– 4) at similar levels of divergence (Brumfield et al., 2003). In addition, the mutation rate of microsatellite loci varies across alleles within loci and across loci to a much higher degree than is likely to be observed in SNP loci (Macaubas et al., 1997), which can make it difficult to fit models of molecular evolution to microsatellite data. One limitation of SNPs compared to microsatellites is that they are generally biallelic and therefore contain less information per locus than most microsatellites (Morin et al., 2004). However, this limitation is somewhat offset by the relative ease of scoring many loci for SNPs relative to microsatellites.

SNPs are similar to amplified fragment length polymorphisms (AFLPs) and related marker-types in that they are usually biallelic. However, SNPs are codominant markers whereas AFLPs are dominant (homozygous and heterozygous individuals cannot be distinguished), rendering SNPs more informative per locus, on average, than dominant markers. Most biallelic marker types, aside from SNPs, are anonymous, which makes homoplasy difficult to assess, as loci and alleles are often difficult to distinguish. Some estimates of levels of homoplasy have been made in AFLP data sets. These estimates range from 5% in potatoes (Solanum) (vander Voort et al., 1997) to 100% among species from different subtribes of Carduinae thistles (O'Hanlon and Peakall, 2000), with a range of estimates falling between these extremes (O'Hanlon and Peakall, 2000; Mechanda et al., 2004). Finally, reproducibility has been a concern with certain anonymous marker types, particularly randomly amplified polymorphic DNA (RAPD) and, to a lesser extent, AFLPs (Jones et al., 1997). Our feeling is that the relative ease and certainty with which SNPs can be developed, genotyped, and analyzed, coupled with their low cost and low levels of homoplasy, make them an ideal marker choice for species boundary research.


    Developing Markers in Nonmodel Systems
 Top
 Abstract
 Species Boundaries in Recent...
 Developing Markers in Nonmodel...
 Species Delimitation
 Conclusions
 Acknowledgments
 References
 
The traditional strategy for DNA marker discovery in nonmodel systems has been to use the power of comparative analysis to predict primer sequences in taxa for which there are no available sequence data. This "universal primer" approach (Kocher et al., 1989; Lyons et al., 1997) has been widely applied across the tree of life. In theory, the approach is straightforward—if two or more taxa share a homologous DNA sequence for a 20– to 30–base pair fragment, then the most parsimonious prediction is that their most recent common ancestor, and all descendants of that MRCA, will also share that sequence. When a pair of such conserved sequences is identified, they then comprise a candidate pair of primers that should amplify a homologous DNA fragment for all descendants of that MRCA, and potentially for additional related taxa. This universal primer strategy has been a powerful one for primer discovery for organellar genes, particularly for mitochondrial DNA in metazoans (Simon et al., 1994), chloroplast genes (Taberlet et al., 1991; Demesure et al., 1995), and, to a lesser extent, mitochondrial genes in plants (Demesure et al., 1995; Dumolin-Lapegue et al., 1997).

The universal primer strategy has also been used for nuclear genes, particularly in efforts to locate putative exon-primed, intron-crossing (EPIC) primers. Introduced more than a decade ago (Palumbi and Baker, 1994), and developed for a variety of organisms (Hassan et al., 2002; Creer et al., 2005), the universal EPIC approach has been heralded by some as a strategy that may yield usable sequence across both narrow and broad phylogenetic distances (Aitken et al., 2004), although others have felt that such nuclear primers are likely to be taxonomically restricted and "may not be applicable even in closely related species" (Zhang and Hewitt, 2003).

Here, we explore a somewhat different approach to primer design for gene and marker discovery. One of the limitations of the universal primer approach is that it requires having at least two taxa that span the target species and that have both been sequenced for a homologous region of the genome. In many groups, such comparative data simply do not exist across enough of the genome to allow for the development of many independent markers. As an alternative approach, we explore a strategy of developing primers directly from a single species, rather than identifying conserved primer sequences across divergent taxa. We then ask how those primers perform in increasingly divergent taxa sampled from a known tree. This very direct approach has not been systematically explored, at least to our knowledge, for any clade. We explore this strategy for two reasons. First, it is a simple, straightforward approach to primer discovery that can be accomplished with a single genomic resource for a single species. Second, the possibility exists that primers and the sequences that they span that were discovered based on a universal primer approach may be biased toward relatively conserved sequences simply because locus selection proceeds by picking parts of the genome (primers) that contain conserved sequences. If primers that have long been conserved tend to span genomic regions that are also relatively conservative, this could mean that universally primed sequences are on average somewhat less variable than might be otherwise expected. Because highly variable sites and sequences should be the most useful for delimiting very recent speciation events, we reasoned that primers derived from a single species might avoid such a low-variation bias if it exists.

An Example with Turtles
Turtles are a reasonable case study for marker development with limited genomic information. As a clade, the crown group of living turtles is low-diversity (about 320 living species), deeply divergent (living members span a phylogenetic split of approximately 210 million years (Near et al., 2005)), and has limited genomic resources available. A recent literature review (Engstrom et al., in press) identified 202 mitochondrial primers that span the turtle mitochondrial genome and an additional 181 microsatellite primers but primers for only 11 nuclear loci. In addition, for problems at the within/between species boundary, many of these 11 nuclear loci have been found to be largely uninformative (Caccone et al., 2004; Spinks and Shaffer, 2005).

Currently, a single genome-level resource exists for turtles. Initially constructed as part of the "100 BAC" NSF initiative (http://www.nsf.gov/bio/pubs/awards/bachome.htm), this end-sequenced BAC library consists of 3461 end sequences, each approximately 500 to 800 base pairs in length, for a single, field-collected, western painted turtle (Chysemys picta) from the State of Washington. In total, the end-sequenced resource yielded 2,432,811 base pairs of usable sequence information for this individual turtle. Such a resource base may be paltry by the standards of many model organisms, but it represents a tremendous amount of genomic information for phylogenetic analysis (2.5 megabases of data for one individual). Because the major features of turtle phylogeny are becoming reasonably well known (Krenz et al., 2005; Near et al., 2005; Parham et al., 2006), we were able to choose representative taxa that span recent-to-ancient divergence times with respect to C. picta and ask how well primers derived from Chrysemys sequence alone work as one moves increasingly large phylogenetic distances from this single species.

We break the problem of going from this BAC resource to a panel of usable markers into the following steps: (1) primer discovery, (2) SNP discovery, and (3) genotyping. Finally, we demonstrate the utility of our strategy by developing a small data set for SNP discovery with an application in a distantly related turtle species.

Primer discovery
Using the Chrysemys BAC end-sequences, we sought to develop a set of 96 potential primer pairs that would provide orthologous DNA sequence information for species delimitation studies across turtles. Each of the 3416 end-sequences was screened for repeat sequences (e.g., known short interspersedelements [SINEs], long interspersed elements [LINEs], and simple repeats) with RepeatMasker (Smit et al., 2004), using Gallus gallus as the query species (RepBase update 10.04) and sensitivity/processing time set to "slow" in order to maximize the number of repeats identified. Regions of sequence that were identified as repetitive were masked with Ns and were subsequently removed using a Perl script (available from R.C.T). The aim of this screening step was to isolate sequences that were single copy and thus would allow us to prime orthologous regions of the genome. The remaining end-sequences were selected at random and compared to existing GenBank sequences using BLASTX (http://www.ncbi.nlm.nih.gov/BLAST/). From this group, we chose 48 sequences that showed high similarity (defined as e-value < 10– 5) over at least ~ 350 bp with existing GenBank sequences, 24 that showed low similarity (e-value > 10– 5), and 24 that showed no similarity (no BLASTX matches), resulting in 96 sequences for primer design. We designed a single primer pair for each of these end-sequences using the program Primer3, setting the optimal primer size to 20 bp and optimal annealing temp to 60°C (Rosen and Skaletsky, 2000). In addition, the "product size ranges" setting was set to favor the largest possible primed region, with the constraint that the entire primed region (including both primer sequences) had to fall within the region of similarity revealed in the BLAST search. All primer sets were designed under the same conditions so that subsequent PCRs could be run as high-throughput batches. Thus, although we used comparative BLASTX results to organize end-sequence products into high, low, and no similarity categories, all actual primers were perfect matches to the C. picta BAC turtle. We screened all 96 primer pairs in 25-µ L PCR reactions using standard PCR conditions, AmpliTaq DNA polymerase, and an annealing temperature of 60°C across a panel of five species that provide phylogenetic coverage across the turtle tree of life (Fig. 2). These five taxa included the original painted turtle (C. picta) from which the BAC library was constructed (serving as a positive control, and hereafter referred to as the BAC turtle), a southern painted turtle (variously considered a subspecies of Chrysemys picta, C. p. dorsalis, or a full species, C. dorsalis), a western pond turtle (Emys [previously Clemmys, now referred to Actinemys by some authors] marmorata), a snapping turtle (Chelydra serpentina), and an Australian side-necked turtle (Emydura macquarii) (Turtle Taxonomy Working Group, in press). Based on a reasonably well-understood phylogeny of the major clades of turtles (Krenz et al., 2005; Parham et al., 2006), these taxa span phylogenetic nodes at roughly 2-, 50-, 100-, and 200-million-year intervals with respect to the BAC turtle (Fig. 2). Chrysemys picta and C. dorsalis have been variously considered conspecific subspecies or distinct species and are estimated to have diverged approximately 2 to 3.5 million years ago (Starkey et al., 2003); other divergence dates are derived from Near et al. (2005).


Figure 2
View larger version (62K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2 A phylogeny of 14 families (23 species) representing the major clades of living turtles. The phylogenetic placement of the BAC turtle, Chrysemys picta (Emydidae), and the families to which it is compared in Figure 3 are shown. Figure modified from Near et al. (2005).

 
Each PCR reaction was run once, visualized on 1% agarose gels, and scored as amplifying a single product of the predicted size, multiple products one of which was the predicted size, or no products (Fig. 3). Although limited to a single specimen per species and a single PCR with no optimization, the results suggest that primers chosen from a single BAC individual routinely amplify the predicted product across turtles (Table 1). For 48 "high-similarity" sequences, there was a gradual fall-off from roughly 80% of the primers producing a single amplicon within Chrysemys to 50% in Emydura. Most of the remaining high similarity primers produced multiple bands, rather than no bands, suggesting that many more sequences could be recovered with additional optimization. For the 24 "low-similarity" sequences, the falloff from Chrysemys dorsalis (96%) to E. macquarii (21%) for primers that yielded single bands was considerably steeper, with what appears to be a qualitative shift in efficiency between the Emys marmorata (a confamilial emydid with Chrysemys) and Chelydra serpentina (family Chelydridae). The "no-similarity" primers yielded a slightly smaller set of successful amplifications from Chrysemys (79%) to Emydura (17%). In addition, the fraction of PCR experiments that yielded no amplification increased dramatically for the no-similarity primers as a function of phylogenetic divergence (Fig. 3, Table 1).


Figure 3
View larger version (119K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 3 Results of 480 PCR reactions for high, low, and no similarity sequences to a phylogenetic range of turtle taxa. Primers were all derived from the BAC turtle, and all amplifications were tried only once at 60 °C. Amplifications that yielded a single band of the predicted size are shown in white, amplifications that yielded multiple bands are shown in black, and failed amplifications yielding no visible bands are shown in gray.

 


View this table:
[in this window]
[in a new window]

 
Table 1. The percentage of primers designed from a single-specimen BAC library that amplify across the turtle tree of life. Entries for time are the estimated age of the most recent common ancestor of the species and the BAC turtle (Near et al., 2005). Table entries are the percentage of primers, developed from the BAC turtle, that yielded single, multiple, or no amplicons across species. For definitions of high, low, and no similarity, see text.

 
In summary, these experiments suggest that primers developed from a single-species genomic resource like a BAC, EST, or whole genomic library can be a rich source of comparative data across large phylogenetic distances. Perhaps not surprisingly, this appears to be a particularly strong result for sequences with high sequence similarity to outgroup taxa. However, even sequences with no BLAST hits to Genbank, which presumably represent either noncoding or unique turtle sequences, produced a single amplicon in 50% or more of primer pairs out to the snapping turtle, suggesting that this is also a reasonably efficient strategy for marker discovery in less conserved genomic regions.

SNP discovery
To convert primers that amplify at the PCR level into SNP markers that may be useful for species delimitation, we employed a discovery panel approach in which SNP loci are found by resequencing a subset of the total number of individuals in the study, followed by sequence alignment and "tagging" (identification) of polymorphic sites. The makeup of a discovery panel will vary from study to study and depends on the specific questions being asked. If a study seeks to test the concordance of species taxa with real lineages (Hey and Machado, 2003), then the discovery panel should include individuals broadly representative of the current species taxonomy (at least one member of every named species, subspecies, or other hypothesis being tested). Alternatively, for studies that seek to delimit species without respect to current taxonomy (or when no species level taxonomy exists), we favor discovery panels composed of natural sampling units (e.g., individuals from all allopatric parts of the range) that are also geographically representative of the overall range of the populations of interest. Although increasing the number of individuals included in the discovery panel will increase the number of polymorphic sites that are identified, the cost savings provided by the SNP approach (relative to sequencing all individuals) will be reduced. Also, because multiple SNPs derived from the same sequence are expected to be tightly linked, generally one need only include enough individuals on the discovery panel to discover one SNP per sequence locus when methods call for unlinked marker data. The number of individuals required to accomplish this will vary with the level of variation present in the sequence data. When multiple SNPs are identified in a single sequence, we favor choosing one at random to reduce the bias that can result if, for example, maximally differentiated SNPs are chosen instead.

Several software packages exist that can assist in the tagging of polymorphic sites in sequence data. Given the challenge of distinguishing true heterozygotes from ambiguous/poor quality reads, this step is actually more challenging than one might expect. PolyPhred is one package that integrates with the Phred/Phrap/Consed genome assembly package and that can recognize the heterozygous positions that are routinely encountered when sequencing diploid loci (Gordon et al., 1998; Stephens et al., 2006). This software can also rank heterozygous sites according to their "quality," as measured by the fit of putatively heterozygous sites to the ideal pattern for a SNP in that sequence data. By using these and similar tools, researchers can quickly locate the highest quality (and therefore, most likely) polymorphic positions in their sequence data, as well as judge whether they have included enough individuals in their discovery panel to confidently call heterozygous positions.

Genotyping
Genotyping strategies tend to center on two issues: accuracy and cost. At one extreme, one can simply sequence each individual and treat haplotypes, or contained polymorphisms, as characters for species delimitation. Sequences tend to be accurate but expensive (currently averaging several dollars per sequence at most facilities). At the other extreme are tiling microarrays (Mockler and Ecker, 2005), where tens of thousands of SNPs can be scored from an individual for pennies per genotype, assuming that the genomic resources are available for the target species (unlikely for most species delimitation problems). In between are strategies ranging from low-throughput, relatively high-cost approaches like single-base extension of individual SNPs from PCR products (Hsu et al., 2001) to the Illumina Beadstation that can provide > 1000 simultaneous SNP genotypes/individual for a fraction of the cost. Our goal is not to review these platforms but to simply point out that a range of options, costs, and levels of automation exist that should allow most research groups to accurately genotype large numbers of individuals for multiple SNPs at a reasonable cost. Even the conversion of SNPs into restriction fragment length polymorphisms has its place as a low-tech genotyping strategy (Fitzpatrick and Shaffer, 2004), although it is relatively inefficient compared to most other approaches.

SNP discovery in a distantly related clade
As a test of our BAC-derived primer design strategy, we amplified, sequenced, and identified SNPs for four loci, across two populations, for the distantly related pleurodire Emydura macquarii (family Chelidae). This species complex occurs throughout eastern, central, and northern Australia and southern New Guinea and presents a challenging case for species delimitation. Two recent analyses of the Australian populations have recognized a single species (Georges and Adams, 1996) or a set of at least seven species and numerous additional subspecies (Cann, 1998). Because the primers in Figure 3 were designed from the BAC Chrysemys picta sequence data, one might expect them to effectively identify polymorphic markers for this and closely related species. Our interest here was to ask whether PCR primers derived from a single species yield informative markers for species delimitation even in distantly related taxa.

Our sequencing panel included five individuals from each of two adjacent river drainages in central Australia, the Cooper and Murray-Darling. Emydura from these two drainages are currently referred to the subspecies E. macquarii emmotti and E. m. krefftii, respectively, although considerable uncertainty over species delimitation exists (Turtle Taxonomy Working Group, in press). We attempted to sequence these 10 individuals for four loci that had yielded single bands at the PCR level. Of these four loci, three yielded useable sequence, each of which contained a SNP every ~ 250 bp (Table 2). Although clearly limited and very preliminary, these results demonstrate that our strategy appears to effectively identify loci, even in distantly related taxa, that can provide useful variation for defining species boundaries. For example, locus TB73 has a potentially fixed difference between Emydura from these two river systems at positions 112 and 481, and potential SNP frequency differences and/or river-specific alleles exist at most other variable positions.


View this table:
[in this window]
[in a new window]

 
Table 2 SNP loci identified by resequencing five Emydura macquarii from each of two river systems. Locus refers to the primer pairs used for sequencing and position refers to the base pair position of each SNP locus within that sequencing read. Alternative bases (e.g., A/G) refer to heterozygous individuals and question marks denote missing data for one individual due to low sequence quality.

 

    Species Delimitation
 Top
 Abstract
 Species Boundaries in Recent...
 Developing Markers in Nonmodel...
 Species Delimitation
 Conclusions
 Acknowledgments
 References
 
Even if researchers agree on a species concept, there invariably are multiple strategies for delimiting species using the same data sets (Wiens and Penkrot, 2002; Sites and Marshall, 2003, 2004), and these strategies may lead different researchers to different conclusions (Marshall et al., 2006). Our own position is that when species are old and well-differentiated (the "easy" part of Fig. 1), most methods and most data will lead most researchers to the same species delimitation conclusions. When species are young, delimitation is more difficult, both conceptually and empirically. For this part of the empirical spectrum, Hey et al. (2003) summarized the situation nicely. "In short, species entities are very difficult to study, for they are evolutionarily and demographically dynamic. They will often not be very distinct, and the degree to which they are distinct can change over time..." (Hey et al., 2003: 599). How, then, can one use SNPs or other genotypic data to delimit recently derived, closely related species?

We follow de Queiroz (1998, 2005) in defining species as separately evolving metapopulation lineages. That is, a species is a demographically and genetically interconnected set of populations (metapopulation) that has continuity through time (Simpson, 1951; Mayden, 1997). Although this definition does not provide an unambiguous cutoff for when speciation has occurred, it emphasizes that the primary goal of species delimitation research is lineage discovery and delimitation. Such lineage discovery is neither straightforward nor precise in the earliest stages of speciation, and we must expect some fuzziness and uncertainty in delimiting young species in the early stages of differentiation (de Queiroz, 1998; Hey et al., 2003). Under the neutral coalescent the expected time to monophyly for even a single nuclear gene is often long (Hudson and Coyne, 2002), suggesting that many real species exist that have not yet achieved monophyly for even a few genes.

The metapopulation lineage species definition naturally leads to operational species delimitation approaches that recognize, across multiple loci, sets of populations that freely exchange genes in nature, and have severely restricted or no gene exchange with other sets of populations. Whether these conditions exist due to intrinsic features of organisms (behavioral or physiological reproductive isolating mechanisms) or extrinsic features of their distribution across the landscape (an island population separated from its mainland counterparts) is immaterial, since both produce distinct lineages. Rather, the critical point is that evidential support exists for species, and (in our opinion) that there is enough redundancy in that evidence to ensure that metapopulation lineages, rather than allele frequencies shaped by natural selection gradients or idiosyncratic histories of individual loci, are the best explanation of the data in hand. If two or more loci indicate that lineages are distinct from each other, those metapopulation lineages become candidates for species recognition. A single locus, even if fixed for different states, provides a testable hypothesis regarding species distinction but not, in our minds, unambiguous evidence for species recognition. This reliance on multiple markers implies that species in their very earliest stages will probably not be recognized, but it is the price one pays for the taxonomic stability that follows from only recognizing comparatively well-differentiated species.

Two fundamentally different strategies, aimed at very different research agendas, exist within what we refer to as species delimitation research. The first is the relatively exploratory phase where one asks whether a set of samples comprise one or many species, with no a priori hypothesis of what those species might be. Examining all possible species among a set of individuals is complex combinatorial problem, and we think that a satisfying and successful approach is implemented in STRUCTURE (Pritchard et al., 2000; Falush et al., 2003). STRUCTURE seeks to cluster individuals without regard to their population-of-origin based on rough conformity to Hardy-Weinberg genetic expectations. The strategy employed by STRUCTURE is straightforward and matches reasonably well the properties of metapopulation lineages. The primary assumptions of the model are Hardy-Weinberg equilibrium within populations (or metapopulations) and linkage equilibrium between loci, although the latter assumption has been relaxed (Falush et al., 2003). The program then seeks sets of individuals that are in approximate multilocus Hardy-Weinberg equilibrium, searching parameter space sequentially across an arbitrary number of K taxa. A Markov chain Monte Carlo (MCMC) search strategy simultaneously estimates allele frequencies and population of origin for each potential K (Pritchard et al., 2000; Falush et al., 2003), and the method can identify admixed (that is, mixed-origin) individuals between otherwise differentiated populations (Falush et al., 2003). Although originally developed for population identification, STRUCTURE has performed well in the identification of domestic strains of chickens, suggesting that it may serve as a useful tool for the initial stages of lineage diversification (Rosenberg et al., 2001). Our sense is that STRUCTURE may be most useful in determining the lower bound of potential species; if units are not identified by this approach, they lack the evidential support to be even candidates for species recognition. Additional programs that are relevant to identifying the early phases of lineage diversification continue to be developed, and should provide additional insights into the initial diversification of species (Corander et al., 2004; Guillot et al., 2005).

Alternatively, a priori named species (species taxa) may be tested with genotypic data (Wiens and Penkrot, 2002; Hey et al., 2003; Sites and Marshall, 2004). Here, the delimitation problem is much more prescribed, since one is asking whether two or more sets of individuals should be considered separate species. When species are very young, the hybrid zone barrier (HZB) approach (Porter, 1990) is appealing in its simplicity—essentially it calculates Wright's Fstvalues between sets of species taxa, converts them to Nm values, and argues from population genetic theory that when 0.0 < Nm < 0.5, two populations are strong potential candidates for species recognition. Like STRUCTURE, the HZB approach may recognize taxa that are insufficiently differentiated for many practitioners to feel that they warrant species recognition, and the suggested cutoff for Nm is somewhat arbitrary. The extent to which STRUCTURE and HZB tend to identify the same sets of potential species has not been tested, although such a test would be very interesting.

Among the most recently proposed methods, we feel that the emerging parametric approaches that simultaneously estimate ancestral population size, migration rate and divergence times are particularly appealing for testing species taxa. The isolation-with-migration (Nielsen and Wakeley, 2001; Hey and Nielsen, 2004) model considers an important set of conditions for the earliest phases of the species delimitation problem—populations that have diverged but may still incur limited, ongoing gene flow (or gene flow in the recent past). Based on MCMC sampling of parameter space, IM works with multilocus data (sequences and microsatellites) from a pair of species taxa and provides important insights into the earliest stages of differentiation and speciation. This approach attempts to fit a six-parameter model of the population divergence process to the data and can provide information about time since divergence, ongoing migration, and population size. Limitations of this parameter-rich model include the requirement of a large amount of data in order to obtain robust parameter estimates. Additionally, run times can be extremely long (on the order of months) and the current software implementation of the IM model does not include a model of molecular evolution for SNPs. Although it provides no prescribed cutoff value for species recognition, Hey and Nielsen (2004) provide an interesting example of how IM can be used to gain deeper insights into the history of speciation in Drosophila pseudoobscura and D. persimilis. We view this approach as an important melding of population genetics and speciation biology that is central to the metapopulation lineage concept.


    Conclusions
 Top
 Abstract
 Species Boundaries in Recent...
 Developing Markers in Nonmodel...
 Species Delimitation
 Conclusions
 Acknowledgments
 References
 
Although the empirical study of species boundaries is enjoying renewed interest in the systematics community (Wiens and Penkrot, 2002; Sites and Marshall, 2004), significant challenges remain in the empirical delimitation of species boundaries in recent radiations of closely related species (Hey et al., 2003). This stems from several sources, including the historical reliance on a few genomic regions, the limited number of available markers for most taxa, and a paucity of appropriate methods for the analysis of these data. The approach that we have outlined appears to be an effective strategy for quickly identifying many markers, at least for clades where some genomic resources are available. Both tree-based and non–tree-based methods of analysis can and should contribute to further empirical progress in delimitation of recent species, and we discuss several population genetic techniques that can help identify species lineages that have not had time to sort into monophyletic groups. As genomic resources continue to become available for diverse taxa drawn from the tree of life, we look forward to finally bringing the appropriate data to bear on "the species problem" in recent species radiations that have long fascinated, and frustrated, evolutionary biologists.


Figure 4
View larger version (45K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
 

    Acknowledgments
 Top
 Abstract
 Species Boundaries in Recent...
 Developing Markers in Nonmodel...
 Species Delimitation
 Conclusions
 Acknowledgments
 References
 
We thank Scott Edwards and Andy Shedlock for access to the Chrysemys BAC end-sequence data; Bob Macey and the JGI for DNA from the BAC turtle; Phil Spinks for help with sequencing and troubleshooting; Arthur Georges, Louise Whitaker, and Rob Shaffer for field assistance in Australia; and John Wiens for organizing the symposium and critically reading the paper. This material is based upon work supported in part by the NSF, the University of California Agricultural Experiment Station, and the UC Davis Center for Population Biology.


    References
 Top
 Abstract
 Species Boundaries in Recent...
 Developing Markers in Nonmodel...
 Species Delimitation
 Conclusions
 Acknowledgments
 References
 

    Aitken N., Smith S., Schwarz C., Morin P. A. Single nucleotide polymorphism (SNP) discovery in mammals: A targeted-gene approach. Mol. Ecol. (2004) 13:1423–1431.[CrossRef][Medline]

    Avise J. C., Walker D. E. Species realities and numbers in sexual vertebrates: Perspectives from an asexually transmitted genome. Proc. Natl. Acad. Sci. USA (1999) 96:992–995.[Abstract/Free Full Text]

    Baum D. A., Donoghue M. J. Choosing among alternative "phylogenetic" species concepts. Syst. Bot. (1995) 20:560–573.[CrossRef]

    Baum D. A., Shaw K. L. Genealogical perspectives on the species problem. In: Monographs in systematic botany; Experimental and molecular approaches to plant biosystematics, 53—Hoch P. C., Stephenson A. G., eds. (1995) St. Louis, Missouri: Missouri Botanical Garden. 289–303.

    Beheregaray L. B., Ciofi C., Caccone A., Gibbs J. P., Powell J. R. Genetic divergence, phylogeography and conservation units of giant tortoises from Santa Cruz and Pinzon, Galapagos Islands. Conserv. Genet. (2003) 4:31–46.[CrossRef]

    Brumfield R. T., Beerli P., Nickerson D. A., Edwards S. V. The utility of single nucleotide polymorphisms in inferences of population history. Trends Ecol. Evol. (2003) 18:249–256.[CrossRef]

    Caccone A., Gentile G., Burns C. E., Sezzi E., Bergman W., Ruelle M., Saltonstall K., Powell J. R. Extreme difference in rate of mitochondrial and nuclear DNA evolution in a large ectotherm, Galapagos tortoises. Mol. Phylogen. Evol. (2004) 31:794–798.[CrossRef][Web of Science][Medline]

    Caccone A., Gentile G., Gibbs J. P., Fritts T. H., Snell H. L., Betts J., Powell J. R. Phylogeography and history of giant Galapagos tortoises. Evolution (2002) 56:2052–2066.[CrossRef][Web of Science][Medline]

    Cann J. Australian freshwater turtles (1998) Singapore: Beaumont Publishing.

    Ciofi C., Wilson G. A., Beheregaray L. B., Marquez C., Gibbs J. P., Tapia W., Snell H. L., Caccone A., Powell J. R. Phylogeographic history and gene flow among giant Galapagos tortoises on southern Isabela Island. Genetics (2006) 172:1727–1744.[Abstract/Free Full Text]

    Corander J., Waldmann P., Marttinen P., Sillanpaa M. J. BAPS 2: Enhanced possibilities for the analysis of genetic population structure. Bioinformatics (2004) 20:2363–2369.[Abstract/Free Full Text]

    Cracraft J. Speciation and its ontology: The empirical consequences of alternative species concepts for understanding patterns and processes of differentiation. In: Speciation and its consequences—Otte D., Endler J. A., eds. (1989) Sunderland, Massachusetts: Sinauer Associates. 28–59.

    Creer S., Malhotra A., Thorpe R. S., Pook C. E. Targeting optimal introns for phylogenetic analyses in non-model taxa: Experimental results in Asian pitvipers. Cladistics (2005) 21:390–395.[CrossRef][Web of Science]

    de Queiroz K. The general lineage concept of species. In: Endless forms: Species and speciation—Howard D. J., Berlocher S. H., eds. (1998) New York: Oxford University Press. 57–75.

    de Queiroz K. Ernst Mayr and the modern concept of species. Proc. Natl. Acad. Sci. USA (2005) 102:6600–6607.[Abstract/Free Full Text]

    Degnan J. H., Rosenberg N. A. Discordance of species trees with their most likely gene trees. PLoS Genet. (2006) 2:762–768.[Web of Science]

    Demesure B., Sodzi N., Petit R. J. A set of universal primers for amplification of polymorphic noncoding regions of mitochondrial and chloroplast DNA in plants. Mol. Ecol. (1995) 4:129–131.[Medline]

    Dumolin-Lapegue S., Pemonge M. H., Petit R. J. An enlarged set of consensus primers for the study of organelle DNA in plants. Mol. Ecol. (1997) 6:393–397.[CrossRef][Medline]

    Engstrom T. N., Edwards T., Osentoski M. F., Myers E. M. A compendium of PCR primers for mtDNA, microsatellite and other nuclear loci for freshwater turtles and tortoises. In: Defining turtle diversity: Proceedings of a workshop on genetics, ethics, and taxonomy of freshwater turtles and tortoises—Shaffer H. B., Georges A., FitzSimmons N. N., Rhodin A. G. J., eds. (2007) Lunenberg, Massachusetts: Chelonian Research Foundation.

    Falush D., Stephens M., Pritchard J. K. Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies. Genetics (2003) 164:1567–1587.[Abstract/Free Full Text]

    Fitzpatrick B. M., Shaffer H. B. Environment-dependent admixture dynamics in a tiger salamander hybrid zone. Evolution (2004) 58:1282–1293.[CrossRef][Web of Science][Medline]

    Freeland J. R., Boag P. T. The mitochondrial and nuclear genetic homogeneity of the phenotypically diverse Darwin's ground finches. Evolution (1999) 53:1553–1563.[CrossRef][Web of Science]

    Funk D. J., Omland K. E. Species-level paraphyly and polyphyly: Frequency, causes, and consequences, with insights from animal mitochondrial DNA. Annu. Rev. Ecol. Evol. Syst. (2003) 34:397–423.[CrossRef]

    Georges A., Adams M. Electrophoretic delineation of species boundaries within the short-necked freshwater turtles of Australia (Testudines: Chelidae). Zool. J. Linn. Soc. (1996) 118:241–260.[CrossRef]

    Gordon D., Abajian C., Green P. Consed: A graphical tool for sequence finishing. Genome Res. (1998) 8:195–202.[Abstract/Free Full Text]

    Grant P. R., Grant B. R., Petren K. Hybridization in the recent past. Am. Nat. (2005) 166:56–67.[CrossRef][Web of Science][Medline]

    Guillot G., Mortier F., Estoup A. GENELAND: A computer package for landscape genetics. Mol. Ecol. Notes (2005) 5:712–715.[CrossRef][Web of Science]

    Hassan M., Lemaire C., Fauvelot C., Bonhomme F. Seventeen new exon-primed intron-crossing polymerase chain reaction amplifiable introns in fish. Mol. Ecol. Notes (2002) 2:334–340.[CrossRef][Web of Science]

    Hey J., Machado C. A. The study of structured populations—New hope for a difficult and divided science. Nat. Rev. Genet. (2003) 4:535–543.[CrossRef][Web of Science][Medline]

    Hey J., Nielsen R. Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura D. persimilis. Genetics (2004) 167:747–760.[Abstract/Free Full Text]

    Hey J., Waples R. S., Arnold M. L., Butlin R. K., Harrison R. G. Understanding and confronting species uncertainty in biology and conservation. Trends Ecol. Evol. (2003) 18:597–603.[CrossRef]

    Hsu T. M., Chen X., Duan S., Miller R. D., Kwok P. Y. Universal SNP genotyping assay with fluorescence polarization detection. BioTechniques (2001) 31:560–568.[Web of Science][Medline]

    Hudson R. R., Coyne J. A. Mathematical consequences of the genealogical species concept. Evolution (2002) 56:1557–1565.[CrossRef][Web of Science][Medline]

    Irschick D. J., Shaffer H. B. The polytypic species revisited: Morphological differentiation among tiger salamanders (Ambystoma tigrinum) (Amphibia: Caudata). Herpetologica (1997) 53:30–49.[Web of Science]

    Irwin D. E. Phylogeographic breaks without geographic barriers to gene flow. Evolution (2002) 56:2383–2394.[CrossRef][Web of Science][Medline]

    IUCN. IUCN Red List of Threatened Species (2006) www.iucnredlist.org.

    Jennings W. B., Edwards S. V. Speciational history of Australian grass finches (Poephila) inferred from thirty gene trees. Evolution (2005) 59:2033–2047.[Web of Science][Medline]

    Jones C. J., Edwards K. J., Castaglione S., Winfield M. O., Sala F., vandeWiel C., Bredemeijer G., Vosman B., Matthes M., Daly A., Brettschneider R., Bettini P., Buiatti M., Maestri E., Malcevschi A., Marmiroli N., Aert R., Volckaert G., Rueda J., Linacero R., Vazquez A., Karp A. Reproducibility testing of RAPD, AFLP and SSR markers in plants by a network of European laboratories. Mol. Breed. (1997) 3:381–390.[CrossRef]

    Kocher T. D., Thomas W. K., Meyer A., Edwards S. V., Paabo S., Villablanca F. X., Wilson A. C. Dynamics of mitochondrial-DNA evolution in animals—Amplification and sequencing with conserved primers. Proc. Natl. Acad. Sci. USA (1989) 86:6196–6200.[Abstract/Free Full Text]

    Krenz J. G., Naylor G. J. P., Shaffer H. B., Janzen F. J. Molecular phylogenetics and evolution of turtles. Mol. Phylogen. Evol. (2005) 37:178–191.[CrossRef][Web of Science][Medline]

    Kubatko L. S., Degnan J. H. Inconsistency of phylogenetic estimates from concatenated data under coalescence. Syst. Biol. (2007) 56:17–24.[Abstract/Free Full Text]

    Liti G., Barton D. B. H., Louis E. J. Sequence diversity, reproductive isolation and species concepts in Saccharomyces. Genetics (2006) 174:839–850.[Abstract/Free Full Text]

    Lyons L. A., Laughlin T. F., Copeland N. G., Jenkins N. A., Womack J. E., Obrien S. J. Comparative anchor tagged sequences (CATS) for integrative mapping of mammalian genomes. Nat. Genet. (1997) 15:47–56.[CrossRef][Web of Science][Medline]

    Macaubas C., Jin L., Hallmayer J., Kimura A., Mignot E. The complex mutation pattern of a microsatellite. Genome Res. (1997) 7:635–641.[Abstract/Free Full Text]

    Maddison W. P., Knowles L. L. Inferring phylogeny despite incomplete lineage sorting. Syst. Biol. (2006) 55:21–30.[Abstract/Free Full Text]

    Marshall J. C., Arevalo E., Benavides E., Sites J. L., Sites J. W. Delimiting species: Comparing methods for mendelian characters using lizards of the Sceloporus grammicus (Squamata: Phrynosomatidae) complex. Evolution (2006) 60:1050–1065.[Web of Science][Medline]

    Mayden R. L. A hierarchy of species concepts: The denouement in the saga of the species problem. In: Species—Clardge M. F., Dawah H. A., Wilson M. R., eds. (1997) London: Chapman and Hall. 381–424.

    Mechanda S. M., Baum B. R., Johnson D. A., Arnason J. T. Sequence assessment of comigrating AFLP (TM) bands in Echinacea—Implications for comparative biological studies. Genome (2004) 47:15–25.[Medline]

    Mockler T. C., Ecker J. R. Applications of DNA tiling arrays for whole-genome analysis. Genomics (2005) 85:1–15.[CrossRef][Web of Science][Medline]

    Moran P., Kornfield I. Retention of an ancestral polymorphism in the Mbuna species flock (Teleostei, Cichlidae) of Lake Malawi. Mol. Biol. Evol. (1993) 10:1015–1029.[Web of Science]

    Morin P. A., Luikart G., Wayne R. K. SNPs in ecology, evolution and conservation. Trends Ecol. Evol. (2004) 19:208–216.[CrossRef]

    Near T. J., Meylan P. A., Shaffer H. B. Assessing concordance of fossil calibration points in molecular clock studies: An example using turtles. Am. Nat. (2005) 165:137–146.[CrossRef][Web of Science][Medline]

    Nielsen R., Wakeley J. Distinguishing migration from isolation: A Markov chain Monte Carlo approach. Genetics (2001) 158:885–896.[Abstract/Free Full Text]

    O'Hanlon P. C., Peakall R. A simple method for the detection of size homoplasy among amplified fragment length polymorphism fragments. Mol. Ecol. (2000) 9:815–816.[CrossRef][Medline]

    Palumbi S. R., Baker C. S. Contrasting population structure from nuclear intron sequences and mtDNA of humpback whales. Mol. Biol. Evol. (1994) 11:426–435.[Abstract]

    Parham J. F., Feldman C. R., Boore J. L. The complete mitochondrial genome of the enigmatic bigheaded turtle (Platysternon): Description of unusual genomic features and the reconciliation of phylogenetic hypotheses based on mitochondrial and nuclear DNA. BMC Evol. Biol. (2006) 6.

    Petren K., Grant P. R., Grant B. R., Keller L. F. Comparative landscape genetics and the adaptive radiation of Darwin's finches: The role of peripheral isolation. Mol. Ecol. (2005) 14:2943–2957.[CrossRef][Medline]

    Porter A. H. Testing nominal species boundaries using gene flow statistics—The taxonomy of 2 hybridizing Admiral butterflies (Limenitis, Nymphalidae). Syst. Zool. (1990) 39:131–147.[Abstract/Free Full Text]

    Pritchard J. K., Stephens M., Donnelly P. Inference of population structure using multilocus genotype data. Genetics (2000) 155:945–959.[Abstract/Free Full Text]

    Pritchard P. C. A. The Galapagos tortoises: Nomenclatural and survival status. Chelonian Res. Monogr. (1996) 1:1–85.

    Rosen S., Skaletsky H. Primer3 on the WWW for general users and for biologist programmers. In: Bioinformatics methods and protocols: Methods in molecular biology—Krawetz S., Misener S., eds. (2000) Totowa, New Jersey: Humana Press. 365–386.

    Rosenberg N. A., Burke T., Elo K., Feldmann M. W., Freidlin P. J., Groenen M. A. M., Hillel J., Maki-Tanila A., Tixier-Boichard M., Vignal A., Wimmers K., Weigend S. Empirical evaluation of genetic clustering methods using multilocus genotypes from 20 chicken breeds. Genetics (2001) 159:699–713.[Abstract/Free Full Text]

    Rosenberg N. A., Nordborg M. Genealogical trees, coalescent theory and the analysis of genetic polymorphisms. Nat. Rev. Genet. (2002) 3:380–390.[CrossRef][Web of Science][Medline]

    Russello M. A., Glaberman S., Gibbs J. P., Marquez C., Powell J. R., Caccone A. A cryptic taxon of Galapagos tortoise in conservation peril. Biol. Lett. (2005) 1:287–290.[Abstract/Free Full Text]

    Seehausen O. Hybridization and adaptive radiation. Trends Ecol. Evol. (2004) 19:198–207.[CrossRef][Medline]

    Shaffer H. B. Evolution in a paedomorphic lineage. I. An electrophoretic analysis of the Mexican ambystomatid salamanders. Evolution (1984a) 38:1194–1206.[CrossRef][Web of Science]

    Shaffer H. B. Evolution in a paedomorphic lineage. II. Allometry and form in the Mexican ambystomatid salamanders. Evolution (1984b) 38:1207–1218.[CrossRef][Web of Science]

    Shaffer H. B. Phylogenetics of model organisms: The laboratory axolotl, Ambystoma mexicanum. Syst. Biol. (1993) 42:508–522.[Abstract/Free Full Text]

    Shaffer H. B., McKnight M. L. The polytypic species revisited: Genetic differentiation and molecular phylogenetics of the tiger salamander Ambystoma tigrinum (Amphibia: Caudata) complex. Evolution (1996) 50:417–433.[CrossRef][Web of Science]

    Shaffer H. B., Pauly G. B., Oliver J. C., Trenham P. C. The molecular phylogenetics of endangerment: cryptic variation and historical phylogeography of the California tiger salamander, Ambystoma californiense. Mol. Ecol. (2004) 13:3033–3049.[CrossRef][Medline]

    Shaw K. L. The genealogical view of speciation—Commentary. J. Evol. Biol. (2001) 14:880–882.[CrossRef][Web of Science]

    Shaw K. L., Danley P. D. Behavioral genomics and the study of speciation at a porous species boundary. Zoology (2003) 106:261–273.[CrossRef][Web of Science][Medline]

    Simon C., Frati F., Beckenbach A., Crespi B., Liu H., Flook P. Evolution, weighting, and phylogenetic utility of mitochondrial gene-sequences and a compilation of conserved polymerase chain-reaction primers. Ann. Entomol. Soc. Am. (1994) 87:651–701.[Web of Science]

    Simpson G. G. The species concept. Evolution (1951) 5:285–298.[CrossRef][Web of Science]

    Sites J. W., Marshall J. C. Delimiting species: A Renaissance issue in systematic biology. Trends Ecol. Evol. (2003) 18:462–470.[CrossRef]

    Sites J. W., Marshall J. C. Operational criteria for delimiting species. Annu. Rev. Ecol. Evol. Syst. (2004) 35:199–227.[CrossRef]

    Smit A. F. A., Hubley R., Green P. (2004) RepeatMasker Open-3.0.

    Spinks P. Q., Shaffer H. B. Range-wide molecular analysis of the western pond turtle (Emys marmorata): Cryptic variation, isolation by distance, and their conservation implications. Mol. Ecol. (2005) 14:2047–2064.[CrossRef][Medline]

    Starkey D. E., Shaffer H. B., Burke R. L., Forstner M. R. J., Iverson J. B., Janzen F. J., Rhodin A. G. J., Ultsch G. R. Molecular systematics, phylogeography, and the effects of Pleistocene glaciation in the painted turtle (Chrysemys picta) complex. Evolution (2003) 57:119–128.[CrossRef][Web of Science][Medline]

    Stephens M., Sloan J. S., Robertson P. D., Scheet P., Nickerson D. A. Automating sequence-based detection and genotyping of SNPs from diploid samples. Nat. Genet. (2006) 38:375–381.[CrossRef][Web of Science][Medline]

    Taberlet P., Gielly L., Pautou G., Bouvet J. Universal primers for amplification of 3 noncoding regions of chloroplast DNA. Plant Mol. Biol. (1991) 17:1105–1109.[CrossRef][Web of Science][Medline]

    Takahashi K., Terai Y., Nishida M., Okada N. Phylogenetic relationships and ancient incomplete lineage sorting among cichlid fishes in Lake Tanganyika as revealed by analysis of the insertion of retroposons. Mol. Biol. Evol. (2001) 18:2057–2066.[Abstract/Free Full Text]

    Turtle Taxonomy Working Group. Bickham J. W., Iverson J. B., Parham J. F., Philippen H. D., Rhodin A. G. J., Shaffer H. B., Spinks P. Q., van Dijk P. P. An annotated list of modern turtle terminal taxa, with comments on areas of taxonomic instability and recent change. In: Defining turtle diversity: Proceedings of a workshop on genetics, ethics, and taxonomy of freshwater turtles and tortoises—Shaffer H. B., Georges A., FitzSimmons N., Rhodin A. G. J., eds. (2007) Lunenberg, Massachusetts: Chelonian Research Foundation.

    vanderVoort J., vanZandvoort P., vanEck H. J., Folkertsma R. T., Hutten R. C. B., Draaistra J., Gommers F. J., Jacobsen E., Helder J., Bakker J. Use of allele specificity of comigrating AFLP markers to align genetic maps from different potato genotypes. Mol. Gen. Genet. (1997) 255:438–447.[CrossRef][Web of Science][Medline]

    Wakeley J. The effects of subdivision on the genetic divergence of populations and species. Evolution (2000) 54:1092–1101.[CrossRef][Web of Science][Medline]

    Weisrock D. W., Shaffer H. B., Storz B. L., Storz S. R., Voss S. R. Multiple nuclear gene sequences identify phylogenetic species boundaries in the rapidly radiating clade of Mexican ambystomatid salamanders. Mol. Ecol. (2006) 15:2489–2503.[CrossRef][Medline]

    Wiens J. J., Penkrot T. A. Delimiting species using DNA and morphological variation and discordant species limits in spiny lizards (Sceloporus). Syst. Biol. (2002) 51:69–91.[Abstract/Free Full Text]

    Zhang D. X., Hewitt G. M. Nuclear DNA analyses in genetic studies of populations: Practice, problems and prospects. Mol. Ecol. (2003) 12:563–584.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Syst BiolHome page
B. C. O'Meara
New Heuristic Methods for Joint Species Delimitation and Species Tree Inference
Syst Biol, November 10, 2009; (2009) syp077v1.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
M. T. Monaghan, R. Wild, M. Elliot, T. Fujisawa, M. Balke, D. J.G. Inward, D. C. Lees, R. Ranaivosolo, P. Eggleton, T. G. Barraclough, et al.
Accelerated Species Inventory on Madagascar Using Coalescent-Based Models of Species Delineation
Syst Biol, July 1, 2009; (2009) syp027v1.
[Abstract] [Full Text] [PDF]


Home page
CSH ProtocolsHome page
N. Valenzuela
The Painted Turtle, Chrysemys picta: A Model System for Vertebrate Evolution, Ecology, and Human Health
CSH Protocols, July 1, 2009; 2009(7): pdb.emo124 - pdb.emo124.
[Abstract] [Full Text]


Home page
Syst BiolHome page
P. Q. Spinks and H. B. Shaffer
Conflicting Mitochondrial and Nuclear Phylogenies for the Widely Disjunct Emys (Testudines: Emydidae) Species Complex, and What They Tell Us about Biogeography and Hybridization
Syst Biol, May 28, 2009; (2009) syp005v1.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Shaffer, H. B.
Right arrow Articles by Thomson, R. C.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Shaffer, H. B.
Right arrow Articles by Thomson, R. C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?