Skip Navigation

Systematic Biology 2006 55(6):928-935; doi:10.1080/10635150600865419
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (13)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Ray, D. A.
Right arrow Articles by Batzer, M. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Ray, D. A.
Right arrow Articles by Batzer, M. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2006 Society of Systematic Biologists

SINEs of a Nearly Perfect Character

Edited by Andrew Shedlock: Associate Editor

David A. Ray1, Jinchuan Xing2, Abdel-Halim Salem3 and Mark A. Batzer2

1 Department of Biology, West Virginia University PO Box 6057, Morgantown, West Virginia 26506, USA
2 Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Multiscale Systems, Louisiana State University 202 Life Sciences Building, Baton Rouge, Louisiana 70803, USA E-mail: mbatzer{at}lsu.edu (M.A.B.)
3 Department of Anatomy, Faculty of Medicine, Suez Canal University Ismailia, Egypt


    Abstract
 Top
 Abstract
 Acknowledgments
 References
 
Mobile elements have been recognized as powerful tools for phylogenetic and population-level analyses. However, issues regarding potential sources of homoplasy and other misleading events have been raised. We have collected available data for all phylogenetic and population level studies of primates utilizing Alu insertion data and examined them for potentially homoplasious and other misleading events. Very low levels of each potential confounding factor in a phylogenetic or population analysis (i.e., lineage sorting, parallel insertions, and precise excision) were found. Although taxa known to be subject to high levels of these types of events may indeed be subject to problems when using SINE analysis, we propose that most taxa will respond as the order Primates has—by the resolution of several long-standing problems observed using sequence-based methods.

Keywords: Alu; mobile element; phylogenetics; retrotransposon; SINE

Received November 4, 2005; Revised December 31, 2005; Accepted March 15, 2006


Although implemented previously in smaller studies (Minghetti and Dugaiczyk, 1993; Murata et al., 1993, 1996, 1998; Ryan and Dugaiczyk, 1989), the SINE method of phylogenetic reconstruction began receiving increased attention when Norihiro Okada and colleagues published their investigation into the relationship between whales and artiodactyls (Nikaido et al., 1999; Shimamura et al., 1997). The potential for a close relationship among artiodactyls and cetaceans had been proposed and investigated in earlier studies (Flower, 1883; Gatesy et al., 1999; Sarich, 1985) but had never been so clearly and simply elucidated. These authors pronounced that the SINE method represented a revolution in phylogenetic inference because of its apparent freedom from the problems associated with homoplasy.

SINE insertions in a genome offer two important advantages over other markers used for systematic and population genetic studies. First, the presence of an element in an individual is presumed to represent identity by descent (Batzer and Deininger, 2002). Polymorphic mobile element insertions will thus reflect relationships more accurately than many other markers (i.e., sequence data, restriction fragment length polymorphisms [RFLPs], and microsatellites) that may only reflect identity by state (i.e., homoplasy) (Batzer et al., 1994). A second advantage of these genetic markers is that the ancestral state of an insertion polymorphism is known to be the absence of the element at a particular genomic location (Batzer et al., 1994; Perna et al., 1992). Precise knowledge of the ancestral state of a genomic polymorphism allows us to draw trees of population relationships without making unnecessary assumptions (Batzer et al., 1994; Perna et al., 1992).

Hillis (1999), however, offered several cautious notes about hailing SINEs as the answer to all of our phylogenetic woes. He correctly observed that almost every time a new methodology is developed, it is heralded, at least in the short term, as superior to all methods that have come before. In particular, Hillis worried that aspects of SINE analysis that may contribute to homoplasy had not been sufficiently investigated. Analysis of phylogenies using the SINE method may be adversely affected by various events that could distort the true evolutionary history of species. This has been a problem for every method yet devised to examine evolutionary relationships and SINEs are indeed not immune. They are, however, thought to be less susceptible to these problems.

There are four potential sources for confusion in SINE analysis of phylogenetics (Fig. 1): lineage sorting, parallel insertions (including precise- and near-parallel insertions), precise excision, and paralogous insertions. Lineage sorting is caused by the presence of a polymorphic insertion in a common ancestor that alternatively becomes fixed or extinct in the genomes of daughter species. Parallel insertions include the insertion of distinct elements at or near the same location (within the polymerase chain reaction [PCR] amplicon) in the genomes of different taxa under study. Precise SINE excision was until very recently not thought to be an issue as there was no known mechanism for these sorts of events to occur. Paralogous insertions include duplicated regions of the genome at which an insertion may have occurred at one of the duplicates but not the other.


Figure 1
View larger version (25K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 Schematic of the types of potentially misleading events encountered during a SINE-based analysis of phylogeny. (A) Lineage sorting due to retention of an ancestral polymorphism and subsequent random fixation or loss in daughter lineages; (B) a potentially confusing pattern due to a near-parallel insertion; (C) insertion homoplasy due to a precise-parallel insertion; and (D) mobile element excision. Straight arrows along branches indicate mobile element insertion events and the arrow head indicates that orientation of the inserted element. The curved arrow indicates the precise removal of an element from the genome.

 
Shedlock and Okada produced two reviews of the issues associated with the SINE method (Shedlock and Okada, 2000; Shedlock et al., 2004), focusing on the problems of lineage sorting. In particular, they utilized the relatively large data sets that have been collected over the years on cichlid phylogeny and population biology. Since then, however, there have been numerous published studies using SINEs as phylogenetic and population genetic characters in a wider variety of taxa (Bamshad et al., 2003; Batzer and Deininger, 2002; Carroll et al., 2001; Churakov et al., 2005; Cotrim et al., 2004; Kawai et al., 2002; Nasidze et al., 2001; Nishihara et al., 2002; Ray et al., 2005; Roos et al., 2004; Roy-Engel et al., 2002; Salem et al., 2003; Sasaki et al., 2004; Schmitz et al., 2001, 2005; Schmitz and Zischler, 2003; Singer et al., 2003; Takahashi et al., 2001b; Terai et al., 2003, 2004; Watkins et al., 2003; Xing et al., 2005; Zampicinini et al., 2004). These studies have proven very successful and have served to confirm many of the positive aspects of SINE analysis of phylogenies and population dynamics.

The most comprehensive work has been performed in primate taxa using Alu elements. In fact, nearly the entire order has been investigated using the SINE method (see Table 1). One of the reasons that Alu elements have been so successfully used in investigations of primate phylogenetics is the existence of the human genome draft sequence. This has been an invaluable tool for determining the flanking sequences of Alu insertions during the process of primer design (Ray et al., 2005; Salem et al., 2003; Schmitz et al., 2001; Xing et al., 2005). In addition, the presence of the draft sequence has also allowed us to ascertain ~ 90% of the Alu elements in the human genome. These elements have been divided into subfamilies, each of which began expanding in the primate genome at different times. For example, the Ye family of Alu elements first arose in the common ancestor of all hominoid primates (Salem et al., 2005), whereas the Yb8 subfamily is restricted to gorillas, chimpanzees, and humans (Han et al., 2005).


View this table:
[in this window]
[in a new window]

 
Table 1 Instances of misleading and homoplasy inducing events observed during studies of primate phylogeny and population biology using SINEs.

 
In the current work, we have taken the presence/absence data sets currently available for analyses of primate phylogenetic and population genetic analysis (as well as a few other appropriate sources) to determine the levels of potentially homoplasious events and additional misleading patterns on the study of primate relationships. Although other taxa that have experienced rapid divergences may in fact be subject to more substantial problems resulting from lineage sorting in particular (Takahashi et al., 2001a, 2001b), we suspect that most phylogenetic groups will respond to SINE analysis much the way primates have—by allowing for the resolution of several previously unresolved relationships. A combination of careful data analysis and adequate sampling of informative loci and taxa should minimize most of the issues raised by the specter of homoplasy.

Lineage Sorting
Lineage sorting is a form of homoplasy introduced by the random fixation or extinction of alternative alleles in distinct lineages. The potential for lineage sorting is an issue whenever researchers deal with species groups that may have had a large ancestral effective population size and/or experienced rapid speciation (Nei, 1987; Pamilo and Nei, 1988; Takahata, 1989). A classic example from the application of SINEs to phylogeny is found in cichlids (Takahashi et al., 2001b) in which 14 of 38 loci appeared to be subject to lineage sorting including an event thought to have occurred as long as 14 million years ago. However, as we will discuss, this high level of lineage sorting is probably not typical of SINE studies, as shown in the studies in primates and other organisms.

The studies of mobile element insertions in primates include several large-scale studies of Alu insertion patterns from the hominid (Salem et al., 2003), cercopithecid (Xing et al., 2005), platyrrhine (Ray et al., 2005; Singer et al., 2003), tarsier (Schmitz et al., 2001), and strepsirrhine (Roos et al., 2004) lineages. Of the 131 insertions characterized in the hominid lineage, only one locus could be interpreted as being the product of lineage sorting. Xing et al.'s study of 285 Alu insertion loci in Cercopithecidae recovered four putative lineage sorting events. There were no examples found in examinations of 190 and 74 New World monkey insertions, 118 Alu insertions used to determine the affiliations of tarsiers, and 61 insertions from an investigation into strepsirrhine (prosimian) phylogeny.

Several other large-scale analyses not directly aimed at phylogenetic analysis can also help us assess the extent of SINE insertion homoplasy due to lineage sorting. Although these studies were aimed at characterizing specific subfamilies of Alu elements, the work did involve amplification of loci isolated from the human genome in various nonhuman primates. For example, one study involved the Alu Yd3 subfamily (Xing et al., 2003). These authors examined 133 loci in lineages ranging from human to owl monkey and found no evidence of lineage sorting. Two additional similar studies of the Alu Ya (Otieno et al., 2004) and Yb (Carter et al., 2004) lineages analyzed another 2672 individual loci and again found no evidence of lineage sorting.

These projects, in conjunction with several others summarized in Table 1, comprise a total of over 11,000 individual Alu loci. Of those, only seven suspected cases of lineage sorting were discovered. Thus, it appears that the overall frequency of lineage sorting events in the primate order is small, ~0.0006 events/insertion. We understand that this collection of studies is biased toward the human lineage, and that this may skew that data. We would argue, however, that the relatively rapid divergence between Homo, Pan, and Gorilla (1 to 3 million years between the Gorilla divergence and the divergence of Homo and Pan) suggested by several studies (Goodman et al., 1998; O'Huigin et al., 2002; Satta et al., 2000; Sibley and Ahlquist, 1984) would promote the occurrence of lineage sorting events and the observation that so few were recovered suggests a low rate of occurrence. Thus, as long as sufficient sampling is present at each node and the species being studied do not belong to a group that is prone to lineage sorting, these types of events should not be a serious problem. In fact, the paucity of these events in some taxa and their higher frequency in others can be an indication of speciation dynamics of a lineage as it evolved; see Shedlock et al. (2004) for a recent discussion.

Parallel Insertions
The insertion of SINEs into regions occupied by similar elements in other taxa is another source of potential confusion. In fact, given the observation that some mobile element families appear to accumulate in particular regions of the genome (Greally, 2002; Jurka et al., 2005) as well as exhibit a preference for certain target sequences (Gentles et al., 2005; Jurka, 1997), these sorts of events are not unexpected. Other factors that can influence their rate of occurrence would include the relative divergence time between taxa and relative rates of retrotransposition (Hillis, 1999). Including a wide range of taxa in any application of SINE data to phylogenetic analysis would also be expected to increase the likelihood of observing these events. Parallel insertions of SINE characters can be separated into two types: near-parallel insertions and precise-parallel insertions.

Near-Parallel Insertions
The first and most common type have been termed near-parallel insertions. In these cases, a secondary insertion has occurred near the insertion originally being studied (usually within a 200- to 600-bp amplicon). Given this definition, these sorts of events are technically not instances of homoplasy. However, preliminary analysis using agarose gel electrophoresis of loci at which these events have occurred can be interpreted as homoplasy if more detailed sequence analysis of the loci is not applied. Because the vast majority of the loci in a SINE analysis will not contradict the final version of the tree, the patterns present on a developing cladogram often begin to become clear relatively early in an analysis. Thus, these events are usually easy to detect and sequencing of the locus will resolve the issue. Therefore, if an anomalous pattern is apparent when compared to the overall tree, it becomes clear that this locus should be investigated more closely.

One clear example comes from the work on platyrrhine primate phylogeny (Ray et al., 2005). Of the 190 loci investigated, 11 contained multiple insertions in the amplified regions. Thus, the vast majority of the loci were clearly consistent with the final tree. At one locus, the original Alu insertion under investigation was ascertained from the genome of Saguinus labiatus, a New World primate. Primers were designed to amplify a ~ 300-bp empty site (i.e., without an Alu insertion) based on comparisons to the orthologous locus in Homo sapiens. The filled site (containing an Alu insertion) in taxa sharing the insertion through identity-by-descent should have been ~500 to 600 bp depending on the length of the poly-A tail of the Alu element. Examination of the raw data from the agarose gel electrophoresis suggested that members of four genera share the Alu insertion—Macaca, Chlorocebus, Callithrix, Saguinus. A fifth taxon, Aotus, exhibits an anomalous pattern and the remaining taxa in the panel exhibit either the expected empty site or no amplification at all.

This pattern suggests an insertion shared by two catarrhine (Macaca and Chlorocebus) and two platyrrhine (Callithrix and Saguinus) primates and should therefore raise eyebrows. It has been well established by morphological and previous DNA sequence–based studies that there is a clear division between Old and New World taxa. Thus, familiarity with basic primate phylogeny or with the patterns already apparent on the developing SINE-derived tree would suggest that this locus needs to be investigated further via sequence analysis of the locus for all taxa. In fact, sequence analysis subsequently revealed that four independent Alu insertion events had occurred within the ancestral locus.

Despite the possible confusion generated by loci such as this one, the problems arising from near-parallel insertions should not be a major issue for the observant researcher. The final results of our data compilation indicate that near-parallel insertion events occur at only ~ 0.0004 events/insertion when considering the 11,000+ loci collected. All of these events are easily resolved by automated DNA sequence analysis, thus they do not contribute negatively to the phylogeny reconstruction. These observations suggest that a basic knowledge of the taxa involved and a sense of the overall picture generated by the majority of loci in a phylogenetic study will lead to a "red flag" at any problematic locus. In other words, as is most often the case, the anomalous amplification pattern is so different from the overall picture that further investigation is the natural next step.

In cases where near-parallel insertions are discovered, how should each insertion be handled with regard to analysis? Because the presence of each Alu element represents a unique event in the evolution of the genomes under study, it is clear that each insertion should be treated as an independent bit of information. Thus, we would even argue that when near-parallel insertions are discovered, it is often a fortuitous event because one now has two or more potentially informative insertions found within a single amplicon.

Precise-Parallel Insertions
Confusion may also be introduced into a phylogenetic analysis by the very rare precise-parallel insertions. These events may in fact mimic the earlier case of lineage sorting. Such an event occurs when a second element has inserted into exactly the same target site in a separate taxon, producing a duplication of the accessed target sequence. This event makes it appear that an insertion is shared when in actuality it represents two independent insertion events. There are only a few cases where such events can be clearly delineated in all of the published mobile element literature. In the first instance, an SVA element and an Alu element inserted into the same target site within exon 9 of the human BTK gene (Conley et al., 2005). Cantrell et al. (2001) found two loci within a Sigmodontine retrotransposon that were targets for precise-parallel insertions. The third example comes from an analysis of the Felid Y-chromosome (Slattery et al., 2000). In rats, an independent precise-parallel insertion of ID elements has been identified (Rothenburg et al., 2002). Mus musculus and M. pahari also share parallel insertions of B1 elements into the same target site (Kass et al., 2000). However, sequence analysis of these loci easily shows that these loci have been subjected to multiple parallel insertions and thus they do not contribute erroneous information for phylogeny reconstruction. Finally, a report by van de Lagemaat et al. (2005) detailing a mechanism for the precise deletion of mobile elements reported three potential instances of precise parallel insertion when comparing humans and chimpanzees.

Our examination of the current primate data revealed only five precise-parallel Alu insertions. The results indicate that of the 11,000+ loci examined, precise parallel insertions are exceedingly rare—occurring at only about 0.0005 events/insertion of the examined loci. The issue, therefore, is not to determine if these types of loci are a difficulty to overcome in phylogenetic analysis but rather to distinguish them from lineage sorting and excision events. In both cases, expanding the number of taxa and/or sequencing of the loci provide resolution of the issue. For example, in many cases, the secondary mobile element insertion belongs either to a different subfamily of the same mobile element family as the original insertion or may belong to a completely different family of repetitive elements.

In fact, sequence characteristics exist that distinguish different types of mobile elements in a majority of the instances listed above. Thus, in cases like these, the homoplasy is only apparent. The presence of differing mobile element families or distinguishing features such as characteristic truncations, deletions, or subfamily diagnostic sites make recognizing parallel insertions relatively simple. This is well illustrated by one large-scale study in which over 4800 mobile element loci were examined to elucidate mammalian phylogeny (Bashir et al., 2005). Nearly 2500 of these loci were Alu insertions. The authors suggested that 23 instances of insertion homoplasy (precise-parallel insertions) existed among these Alu insertion loci. We have manually reanalyzed a portion of Bashir et al.'s raw data and found that the true number is actually 13 precise-parallel insertion events—0.005 events/insertion. The discrepancy between the two sets of results was due to several loci being counted multiple times and a few instances of near-parallel insertions being characterized as precise. Regardless of this problem, the authors were correct in recognizing that many of the precise-parallel insertions discovered were distinguishable based on the Alu insertions belonging to distinct subfamilies.

Mobile Element Excision
The potential for the precise removal of mobile elements from the genome has long been contemplated. However, it was previously thought that no mechanism existed to remove these elements in such a way that a site identical to the pre-integration sequence was obtained. Two recent manuscripts have provided evidence that mobile element excision may play a larger role than expected in the evolution of some genomes. In plants, Lenoir et al. (2005) described a rapid turnover of SINEs in Arabidopsis. This suggests a limitation to the SINE method for phylogenetic analysis in these organisms. However, it appeared that the excision of particular elements typically proceeded over several steps and often provided a signature of the removal that could be detected using sequence analysis or high-resolution electrophoresis. A potentially more significant problem was introduced by van de Lagamaat et al. (2005). They proposed a theoretical model for the precise excision of Alu elements from human and chimpanzee genomes. Their model suggests that the target site duplications (TSDs) generated upon the insertion of the Alu elements act as locations for illegitimate recombination. This development may seem to be a setback for proponents of the idea that SINEs are essentially homoplasy-free markers. However, results of the human-chimpanzee comparisons reveal an extremely low rate of complete precise mobile element excisions, about 0.5% of length polymorphisms (van de Lagemaat et al., 2005). In addition, the model described depends on recombination between the direct repeats that flank Alu insertions. After short evolutionary times, the condition described will likely no longer exist. This may help to explain the relative rarity of the observation of precise excisions and its minor impact on phylogenetic studies using SINEs.

One of the major issues involved with these types of events is distinguishing precise deletion events from other occurrences that can mimic them: for example, lineage sorting and precise-parallel insertions. Several examples exist in the data sets from Old World primates (Xing et al., 2005) and human-chimpanzee-gorilla comparisons (Hedges et al., 2004; Salem et al., 2003). Although lineage sorting can be invoked to explain anomalous SINE patterns in taxa that either recently shared a common ancestor (Salem et al., 2003) or shared a common ancestor that was thought to have undergone a rapid speciation in the past (Takahashi et al., 2001b), it is unlikely that the polymorphism would have been retained through several successive speciation events over evolutionarily long periods of time. The most consistent interpretation in such events might be that the mobile element has been precisely deleted in some taxa. When making decisions as to the most likely scenario (deletion or lineage sorting), information on relative divergence times between taxa will have to be employed, but we may never know the correct mechanism. However, even if all of the examples currently reported as lineage sorting events are actually instances of precise excision, the total number of events would still remain low enough that their impact on phylogeny reconstruction would be minimal and, unlike most other genetic systems, it would also be well defined/quantified in a SINE-based analysis.

Paralogous Loci
One final occurrence is important enough to be included in any discussion of potentially misleading events in a SINE-based analysis. Paralogous loci include duplicated regions of the genome at which an insertion may have occurred at one of the duplicates but not the other. Given the relatively high occurrence of segmental duplications in primate and other genomes (influenced in part by the presence of the SINEs themselves (Bailey et al., 2003)), one might expect this to be an issue in SINE analysis. Ruling out potential paralogs in a phylogenetic analysis requires careful comparison. One potential indicator of paralog amplification is the consistent amplification of two bands in all samples from a single species. A single individual with a filled and empty site could be interpreted as a heterozygote, but if all individuals show the same pattern, it should be seen as a potential case of paralogous loci. If there are functional versus nonfunctional open reading frames (ORFs) in the suspect sequences, the paralogs can also be relatively easily detected. To date, only two problematic paralogs have been identified (Luis et al., 2003). Unfortunately, this aspect of SINE analysis has not been adequately addressed and may be a fertile area for future investigations.

Dealing with Potentially Misleading Events
As described above, it is clear that in most cases the vast majority of the SINE data collected for phylogeny reconstruction will be internally consistent. Thus, instead of focusing on whether or not SINE analysis is effectively free of homoplasy, the real task is to determine how best to identify and deal with potentially confounding events. Identification of the events is the most problematic issue. In small studies (< 300 loci examined), identifying potential homoplasy is relatively easy; however, at larger scales the number of dates to be examined becomes problematic. For example, several authors have developed purely computational approaches to identify informative insertion patterns for phylogeny construction (Schwartz et al., 2003; Thomas et al., 2003). We believe this is an excellent idea given the large amount of sequence data currently and soon to be available. However, using these approaches it is important to incorporate methods to identify potentially misleading events. In one example, a study incorporated the data available from the NISC Comparative Vertebrate Sequencing program to elucidate mammalian phylogeny (Bashir et al., 2005). Although the final topology is undoubtedly correct, several problems in the data analysis need to be addressed in future studies using this type of approach. For example, in some nonprimate comparisons, the authors did not realize that they compared DNA transposons with RNA transposons at (nearly) the same loci as shown in their figure 3. Furthermore, as discussed above, several loci were counted multiple times, potentially inflating the support for the affected nodes. Such incomplete evaluations make it very difficult to convince skeptics of the advantages of presence/absence data. It also serves to emphasize the need for careful interpretation of the data and for sequencing of SINE insertion loci in a phylogenetic analysis.

Once potentially misleading events have been identified, the most important next step is to obtain enough loci to support each node so that these events can be either ruled out or recognized for what they are—the occasional rare confounding factor. Waddel et al. (2001) proposed a likelihood method for evaluating support for nodes defined by SINE data. Their results suggest that the minimum number of non-contradictory loci required for a significant level of support (P = 0.037) at any node is three. Even if there is an anomalous locus that is not easily explained by near-parallel insertion or precise-parallel insertion of a different mobile element family, additional loci at the node can produce significant levels of support. For example, if one locus appears to have been subjected to a precise deletion, four noncontradictory loci will still produce a P-value of 0.045. Therefore, although distinguishing between the three potentially homoplasy-inducing events (e.g., lineage sorting, precise mobile element deletion, and precise-parallel insertion) that can most closely resemble one another is one of the more difficult problems at present, their overall effects are minimal as long as taxonomic sampling is adequate and careful consideration of the developing data is used.

Despite the possibility of several confounding events that may occur to disrupt the interpretation of a SINE-derived cladogram, it remains clear that SINE-based markers are some of the most powerful phylogenetic tools available. In the data we examined, none of the events that would be considered troublesome occurred at a high enough rate to be considered a significant problem. Near-parallel insertion events, which are the most easily resolved by sequencing efforts, were the most common but only noted in 41 of 11,000+ loci. The other potentially confounding events are more difficult to resolve but are also less common and therefore less likely to be encountered.

In conclusion, basic cautions including careful analysis and interpretation of the data can limit the potential impact of misleading events. In addition, as with any study of phylogeny, a basic knowledge of the taxa under investigation is a must in order to avoid being confused by the rare confounding locus. Complete taxon sampling and collection of sufficient numbers of informative insertion events are also basic requirements for any SINE based phylogenetic reconstruction project. We believe that most researchers follow these guidelines as a normal course of events. It is our hope that the information presented here will encourage researchers to consider the utility of the SINE method of phylogeny reconstruction for their organisms.


    Acknowledgments
 Top
 Abstract
 Acknowledgments
 References
 
We would like to thank L. van de Lagemaat and D. L. Mager for their communication clarifying the details of their mobile element deletion data. We would also like to thank A. Bashir and V. Bafna for providing us with their raw data. This research was supported by National Science Foundation BCS-0218338 (M.A.B.) and EPS-0346411 (M.A.B.); Louisiana Board of Regents Millennium Trust Health Excellence Fund HEF (2000-05)-05 (M.A.B.), (2000-05)-01 (M.A.B.), and (2001-06)-02 (M.A.B.); National Institutes of Health R01 GM59290 (M.A.B.); and the State of Louisiana Board of Regents Support Fund (M.A.B.). Funding was also provided by the Eberly College of Arts and Sciences at West Virginia University (D.A.R) and by the West Virginia University Research Corporation (D.A.R).


    References
 Top
 Abstract
 Acknowledgments
 References
 

    Bailey J. A., Liu G., Eichler E. E. An Alu transposition model for the origin and expansion of human segmental duplications. Am. J. Hum. Genet. (2003) 73:823–834.[CrossRef][Web of Science][Medline]

    Bamshad M. J., Wooding S., Watkins W. S., Ostler C. T., Batzer M. A., Jorde L. B. Human population genetic structure and inference of group membership. A. J. Hum. Genet. (2003) 72:578–589.[CrossRef]

    Bashir A., Ye C., Price A. L., Bafna V. Orthologous repeats and mammalian phylogenetic inference. Genome Res. (2005) 15:998–1006.[Abstract/Free Full Text]

    Batzer M. A., Deininger P. L. Alu repeats and human genomic diversity. Nat. Rev. Genet. (2002) 3:370–379.[CrossRef][Web of Science][Medline]

    Batzer M. A., Stoneking M., Alegria-Hartman M., Bazan H., Kass D. H., Shaikh T. H., Novick G. E., Ioannou P. A., Scheer W. D., Herrera R. J., et al. African origin of human-specific polymorphic Alu insertions. Proc. Natl. Acad. Sci. USA (1994) 91:12288–12292.[Abstract/Free Full Text]

    Cantrell M. A., Filanoski B. J., Ingermann A. R., Olsson K., DiLuglio N., Lister Z., Wichman H. A. An ancient retrovirus-like element contains hot spots for SINE insertion. Genetics (2001) 158:769–777.[Abstract/Free Full Text]

    Carroll M. L., Roy-Engel A. M., Nguyen S. V., Salem A. H., Vogel E., Vincent B., Myers J., Ahmad Z., Nguyen L., Sammarco M., Watkins W. S., Henke J., Makalowski W., Jorde L. B., Deininger P. L., Batzer M. A. Large-scale analysis of the Alu Ya5 and Yb8 subfamilies and their contribution to human genomic diversity. J. Mol. Biol. (2001) 311:17–40.[CrossRef][Web of Science][Medline]

    Carter A. B., Salem A. H., Hedges D. J., Keegan C. N., Kimball B., Walker J. A., Watkins W. S., Jorde L. B., Batzer M. A. Genome-wide analysis of the human Alu Yb-lineage. Hum. Genomics (2004) 1:167–178.[Medline]

    Churakov G., Smit A. F. A., Brosius J., Schmitz J. A novel abundant family of retroposed elements (DAS-SINEs) in the nine-banded Armadillo (Dasypus novemcinctus). Mol. Biol. Evol. (2005) 22:886–893.[Abstract/Free Full Text]

    Conley M. E., Partain J. D., Norland S. M., Shurtleff S. A., Kazazian H. H. Jr. Two independent retrotransposon insertions at the same site within the coding region of BTK. Hum. Mutat. (2005) 25:324–325.[Medline]

    Cotrim N. H., Auricchio M. T., Vicente J. P., Otto P. A., Mingroni-Netto R. C. Polymorphic Alu insertions in six Brazilian African-derived populations. Am. J. Hum. Genet. (2004) 16:264–277.

    Flower W. H. On the arrangement of the orders and families of existing Mammalia. Proc. Zool. Soc. Lond. (1883) 1883:178–186.

    Gatesy J., Milinkovitch M., Waddell V., Stanhope M. Stability of cladistic relationships between Cetacea and higher-level artiodactyl taxa. Syst. Biol. (1999) 48:6–20.[Abstract/Free Full Text]

    Gentles A. J., Kohany O., Jurka J. Evolutionary diversity and potential recombinogenic role of integration targets of non-LTR retrotransposons. Mol. Biol. Evol. (2005) 22:1983–1991.[Abstract/Free Full Text]

    Goodman M., Porter C. A., Czelusniak J., Page S. L., Schneider H., Shoshani J., Gunnell G., Groves C. P. Toward a phylogenetic classification of Primates based on DNA evidence complemented by fossil evidence. Mol. Phylogenet. Evol. (1998) 9:585–598.[CrossRef][Web of Science][Medline]

    Greally J. M. Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome. Proc. Nati. Acad. Sci. USA (2002) 99:327–332.[Abstract/Free Full Text]

    Han K., Xing J., Wang H., Hedges D. J., Garber R. K., Cordaux R., Batzer M. A. Under the genomic radar: The stealth model of Alu amplification. Genome Res. (2005) 15:655–664.[Abstract/Free Full Text]

    Hedges D. J., Callinan P. A., Cordaux R., Xing J., Barnes E., Batzer M. A. Differential Alu mobilization and polymorphism among the human and chimpanzee lineages. Genome Res. (2004) 14:1068–1075.[Abstract/Free Full Text]

    Hillis D. M. SINEs of the perfect character. Proc. Nat. Acad. Sci. USA (1999) 96:9979–9981.[Free Full Text]

    Jurka J. Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons. Proc. Nat. Acad. Sci. USA (1997) 94:1872–1877.[Abstract/Free Full Text]

    Jurka J., Kohany O., Pavlicek A., Kapitonov V. V., Jurka M. V. Clustering, duplication and chromosomal distribution of mouse SINE retrotransposons. Cytogenet. Genome Res. (2005) 110:117–123.[CrossRef][Web of Science][Medline]

    Kass D. H., Raynor M. E., Williams T. M. Evolutionary history of B1 retroposons in the genus Mus. J. Mol. Evol. (2000) 51:256–264.[Web of Science][Medline]

    Kawai K., Nikaido M., Harada M., Matsumura S., Lin L. K., Wu Y., Hasegawa M., Okada N. Intra- and interfamily relationships of Vespertilionidae inferred by various molecular markers including SINE insertion data. J. Mol. Evol. (2002) 55:284–301.[CrossRef][Web of Science][Medline]

    Lenoir A., Pelissier T., Bousquet-Antonelli C., Deragon J. M. Comparative evolution history of SINEs in Arabidopsis thaliana and Brassica oleracea: Evidence for a high rate of SINE loss. Cytogenet. Genome Res. (2005) 110:441–447.[CrossRef][Web of Science][Medline]

    Luis J. R., Terreros M. C., Martinez L., Rojas D., Herrera R. J. Two problematic human polymorphic Alu insertions. Electrophoresis (2003) 24:2290–2294.[CrossRef][Web of Science][Medline]

    Minghetti P. P., Dugaiczyk A. The emergence of new DNA repeats and the divergence of primates. Proc. Nat. Acad. Sci. USA (1993) 90:1872–1876.[Abstract/Free Full Text]

    Murata S., Takasaki N., Okazaki T., Kobayashi T., Numachi K., Chang K.-H., Okada N. Molecular evidence from short interspersed elements (SINEs) that Onchorhynchus masou (cherry salmon) is monophyletic. Can. J. Fish. Aqua. Sci. (1998) 55:1864–1870.[CrossRef]

    Murata S., Takasaki N., Saitoh M., Okada N. Determination of the phylogenetic relationships among Pacific salmonids by using short interspersed elements (SINEs) as temporal landmarks of evolution. Proc. Nat. Acad. Sci. USA (1993) 90:6995–6999.[Abstract/Free Full Text]

    Murata S., Takasaki N., Saitoh M., Tachida H., Okada N. Details of retropositional genome dynamics that provide a rationale for a generic division: The distinct branching of all the pacific salmon and trout (Oncorhynchus) from the Atlantic salmon and trout (Salmo). Genetics (1996) 142:915–926.[Abstract]

    Nasidze I., Risch G. M., Robichaux M., Sherry S. T., Batzer M. A., Stoneking M. Alu insertion polymorphisms and the genetic structure of human populations from the Caucasus. Eur. J. Hum. Genet. (2001) 9:267–272.[CrossRef][Web of Science][Medline]

    Nei M. Molecular evolutionary genetics (1987) New York: Columbia University Press.

    Nikaido M., Rooney A. P., Okada N. Phylogenetic relationships among cetartiodactyls based on insertions of short and long interspersed elements: Hippopotamuses are the closest extant relatives of whales. Proc. Nat. Acad. Sci. USA (1999) 96:10261–10266.[Abstract/Free Full Text]

    Nishihara H., Terai Y., Okada N. Characterization of novel Alu- and tRNA-related SINEs from the tree shrew and evolutionary implications of their origins. Mol. Biol. Evol. (2002) 19:1964–1972.[Abstract/Free Full Text]

    O'Huigin C., Satta Y., Takahata N., Klein J. Contribution of homoplasy and of ancestral polymorphism to the evolution of genes in anthropoid primates. Mol. Biol. Evol. (2002) 19:1501–1513.[Abstract/Free Full Text]

    Otieno A. C., Carter A. B., Hedges D. J., Walker J. A., Ray D. A., Garber R. K., Anders B. A., Stoilova N., Laborde M. E., Fowlkes J. D., Huang C. H., Perodeau B., Batzer M. A. Analysis of the human Alu Ya-lineage. J. Mol. Biol. (2004) 342:109–118.[CrossRef][Web of Science][Medline]

    Pamilo P., Nei M. Relationships between gene trees and species trees. Mol. Biol. Evol. (1988) 5:568–583.[Abstract]

    Perna N. T., Batzer M. A., Deininger P. L., Stoneking M. Alu insertion polymorphism: A new type of marker for human population studies. Hum. Biol. (1992) 64:641–648.[Web of Science][Medline]

    Ray D. A., Xing J., Hedges D. J., Hall M. A., Laborde M. E., Anders B. A., White B. R., Stoilova N., Fowlkes J. D., Landry K. E., Chemnick L. G., Ryder O. A., Batzer M. A. Alu insertion loci and platyrrhine primate phylogeny. Mol. Phylogenet. Evol. (2005) 35:117–126.[CrossRef][Web of Science][Medline]

    Roos C., Schmitz J., Zischler H. Primate jumping genes elucidate strepsirrhine phylogeny. Proc. Natl. Acad. Sci. USA (2004) 101:10650–10654.[Abstract/Free Full Text]

    Rothenburg S., Eiben M., Koch-Nolte F., Haag F. Independent integration of rodent identifier (ID) elements into orthologous sites of some RT6 alleles of Rattus norvegicus and Rattus rattus. J. Mol. Evol. (2002) 55:251–259.[Web of Science][Medline]

    Roy-Engel A. M., Carroll M. L., El-Sawy M., Salem A. H., Garber R. K., Nguyen S. V., Deininger P. L., Batzer M. A. Non-traditional Alu evolution and primate genomic diversity. J. Mol. Biol. (2002) 316:1033–1040.[CrossRef][Web of Science][Medline]

    Ryan S. C., Dugaiczyk A. Newly arisen DNA repeats in primate phylogeny. Proc. Nat. Acad. Sci. USA (1989) 86:9360–9364.[Abstract/Free Full Text]

    Salem A. H., Ray D. A., Hedges D. J., Jurka J., Batzer M. A. Analysis of the human Alu Ye lineage. BMC Evol. Biol. (2005) 5:18.[CrossRef][Medline]

    Salem A. H., Ray D. A., Xing J., Callinan P. A., Myers J. S., Hedges D. J., Garber R. K., Witherspoon D. J., Jorde L. B., Batzer M. A. Alu elements and hominid phylogenetics. Proc. Nat. Acad. Sci. USA (2003) 100:12787–12791.[Abstract/Free Full Text]

    Sarich V. M. Rodent macromolecular systematics. In: Evolutionary relationships among rodents: A multidisciplinary analysis—Luckett W. P., Hartenberger J.-L., eds. (1985) Berlin: Springer-Verlag. 423–452.

    Sasaki T., Takahashi K., Nikaido M., Miura S., Yasukawa Y., Okada N. First application of the SINE (short interspersed repetitive element) method to infer phylogenetic relationships in reptiles: An example from the turtle superfamily Testudinoidea. Mol. Biol. Evol. (2004) 21:705–715.[Abstract/Free Full Text]

    Satta Y., Klein J., Takahata N. DNA archives and our nearest relative: The trichotomy problem revisited. Mol. Phylogenet. Evol. (2000) 14:259–275.[CrossRef][Web of Science][Medline]

    Schmitz J., Ohme M., Zischler H. SINE insertions in cladistic analyses and the phylogenetic affiliations of Tarsius bancanus to other primates. Genetics (2001) 157:777–784.[Abstract/Free Full Text]

    Schmitz J., Roos C., Zischler H. Primate phylogeny: Molecular evidence from retroposons. Cytogenet. Genome Res. (2005) 108:26–37.[CrossRef][Web of Science][Medline]

    Schmitz J., Zischler H. A novel family of tRNA-derived SINEs in the colugo and two new retrotransposable markers separating Dermopterans from Primates. Mol. Phylogenet. Evol. (2003) 28:341–349.[CrossRef][Web of Science][Medline]

    Schwartz S., Elnitski L., Li M., Weirauch M., Riemer C., Smit A., Green E. D., Hardison R. C., Miller W. MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res. (2003) 31:3518–3524.[Abstract/Free Full Text]

    Shedlock A. M., Okada N. SINE insertions: Powerful tools for molecular systematics. Bioessays (2000) 22:148–160.[CrossRef][Web of Science][Medline]

    Shedlock A. M., Takahashi K., Okada N. SINEs of speciation: Tracking lineages with retroposons. Trends Ecol. Evol. (2004) 19:545–553.[CrossRef][Medline]

    Shimamura M., Yasue H., Ohshima K., Abe H., Kato H., Kishiro T., Goto M., Munechika I., Okada N. Molecular evidence from retroposons that whales form a clade within even-toed ungulates. Nature (1997) 388:666–670.[CrossRef][Medline]

    Sibley C. G., Ahlquist J. E. The phylogeny of the hominoid primates, as indicated by DNA-DNA hybridization. J. Mol. Evol. (1984) 20:2–15.[CrossRef][Web of Science][Medline]

    Singer S. S., Schmitz J., Schwiegk C., Zischler H. Molecular cladistic markers in New World monkey phylogeny (Platyrrhini, Primates). Mol. Phylogenet. Evol. (2003) 26:490–501.[CrossRef][Web of Science][Medline]

    Slattery J. P., Murphy W. J., O'Brien S. J. Patterns of diversity among SINE elements isolated from three Y-chromosome genes in carnivores. Mol. Biol. Evol. (2000) 17:825–829.[Free Full Text]

    Takahashi K., Nishida M., Yuma M., Okada N. Retroposition of the AFC family of SINEs (short interspersed repetitive elements) before and during the adaptive radiation of cichlid fishes in Lake Malawi and related inferences about phylogeny. J. Mol. Evol. (2001a) 53:496–507.[CrossRef][Web of Science][Medline]

    Takahashi K., Terai Y., Nishida M., Okada N. Phylogenetic relationships and ancient incomplete lineage sorting among cichlid fishes in Lake Tanganyika as revealed by analysis of the insertion of retroposons. Mol. Biol. and Evol. (2001b) 18:2057–2066.

    Takahata N. Gene genealogy in three related populations: Consistency probability between gene and population trees. Genetics (1989) 122:957–966.[Abstract/Free Full Text]

    Terai Y., Takahashi K., Nishida M., Sato T., Okada N. Using SINEs to probe ancient explosive speciation: "Hidden" radiation of African cichlids? Mol. Biol. Evol. (2003) 20:924–930.[Abstract/Free Full Text]

    Terai Y., Takezaki N., Mayer W. E., Tichy H., Takahata N., Klein J., Okada N. Phylogenetic relationships among East African haplochromine fish as revealed by short interspersed elements (SINEs). J. Mol. Evol. (2004) 58:64–78.[CrossRef][Web of Science][Medline]

    Thomas J. W., Touchman J. W., Blakesley R. W., Bouffard G. G., Beckstrom-Sternberg S. M., Margulies E. H., Blanchette M., Siepel A. C., Thomas P. J., McDowell J. C., Maskeri B., Hansen N. F., Schwartz M. S., Weber R. J., Kent W. J., Karolchik D., Bruen T. C., Bevan R., Cutler D. J., Schwartz S., Elnitski L., Idol J. R., Prasad A. B., Lee-Lin S. Q., Maduro V. V., Summers T. J., Portnoy M. E., Dietrich N. L., Akhter N., Ayele K., Benjamin B., Cariaga K., Brinkley C. P., Brooks S. Y., Granite S., Guan X., Gupta J., Haghighi P., Ho S. L., Huang M. C., Karlins E., Laric P. L., Legaspi R., Lim M. J., Maduro Q. L., Masiello C. A., Mastrian S. D., McCloskey J. C., Pearson R., Stantripop S., Tiongson E. E., Tran J. T., Tsurgeon C., Vogt J. L., Walker M. A., Wetherby K. D., Wiggins L. S., Young A. C., Zhang L. H., Osoegawa K., Zhu B., Zhao B., Shu C. L., De Jong P. J., Lawrence C. E., Smit A. F., Chakravarti A., Haussler D., Green P., Miller W., Green E. D. Comparative analyses of multi-species sequences from targeted genomic regions. Nature (2003) 424:788–793.[CrossRef][Medline]

    van de Lagemaat L. N., Gagnier L., Medstrand P., Mager D. L. Genomic deletions and precise removal of transposable elements mediated by short identical DNA segments in primates. Genome Res. (2005) 15:1243–1249.[Abstract/Free Full Text]

    Waddell P. J., Kishino H., Ota R. A phylogenetic foundation for comparative mammalian genomics. Genome Inform. Ser. (2001) 12:141–154.

    Watkins W. S., Rogers A. R., Ostler C. T., Wooding S., Bamshad M. J., Brassington A. M., Carroll M. L., Nguyen S. V., Walker J. A., Prasad B. V., Reddy P. G., Das P. K., Batzer M. A., Jorde L. B. Genetic variation among world populations: Inferences from 100 Alu insertion polymorphisms. Genome Res. (2003) 13:1607–1618.[Abstract/Free Full Text]

    Xing J., Salem A. H., Hedges D. J., Kilroy G. E., Watkins W. S., Schienman J. E., Stewart C. B., Jurka J., Jorde L. B., Batzer M. A. Comprehensive analysis of two Alu Yd subfamilies. J. Mol. Evol. (2003) 57:S76–S89.[CrossRef][Web of Science][Medline]

    Xing J., Wang H., Han K., Ray D. A., Huang C. H., Chemnick L. G., Stewart C. B., Disotell T. R., Ryder O. A., Batzer M. A. A mobile element based phylogeny of Old World monkeys. Mol. Phylogenet. Evol. (2005) 37:872–880.[CrossRef][Web of Science][Medline]

    Zampicinini G., Blinov A., Cervella P., Guryev V., Sella G. Insertional polymorphism of a non-LTR mobile element (NLRCth1) in European populations of Chironomus riparius (Diptera, Chironomidae) as detected by transposon insertion display. Genome (2004) 47:1154–1163.[Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
E. Hazkani-Covo
Mitochondrial Insertions into Primate Nuclear Genomes Suggest the Use of numts as a Tool for Phylogeny
Mol. Biol. Evol., October 1, 2009; 26(10): 2175 - 2179.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
G. Churakov, J. O. Kriegs, R. Baertsch, A. Zemann, J. Brosius, and J. Schmitz
Mosaic retroposon insertion patterns in placental mammals
Genome Res., May 1, 2009; 19(5): 868 - 875.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. J. Robinson, A. Ruiz-Herrera, and J. C. Avise
Hemiplasy and homoplasy in the karyotypic phylogenies of mammals
PNAS, September 23, 2008; 105(38): 14477 - 14481.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Moller-Krull, F. Delsuc, G. Churakov, C. Marker, M. Superina, J. Brosius, E. J. P. Douzery, and J. Schmitz
Retroposed Elements and Their Flanking Regions Resolve the Evolutionary History of Xenarthran Mammals (Armadillos, Anteaters, and Sloths)
Mol. Biol. Evol., November 1, 2007; 24(11): 2573 - 2582.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
K. Han, M. K. Konkel, J. Xing, H. Wang, J. Lee, T. J. Meyer, C. T. Huang, E. Sandifer, K. Hebert, E. W. Barnes, et al.
Mobile DNA in Old World Monkeys: A Glimpse Through the Rhesus Macaque Genome
Science, April 13, 2007; 316(5822): 238 - 240.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (13)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Ray, D. A.
Right arrow Articles by Batzer, M. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Ray, D. A.
Right arrow Articles by Batzer, M. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?