Is RAD-seq suitable for phylogenetic inference? An in silico assessment and optimization

Marie Cariou, Laurent Duret, Sylvain Charlat

Research output: Contribution to journalArticlepeer-review


INFERRING PHYLOGENETIC RELATIONSHIPS BETWEEN CLOSELY RELATED TAXA CAN BE HINDERED BY THREE FACTORS: (1) the lack of informative molecular variation at short evolutionary timescale; (2) the lack of established markers in poorly studied taxa; and (3) the potential phylogenetic conflicts among different genomic regions due to incomplete lineage sorting or introgression. In this context, Restriction site Associated DNA sequencing (RAD-seq) seems promising as this technique can generate sequence data from numerous DNA fragments scattered throughout the genome, from a large number of samples, and without preliminary knowledge on the taxa under study. However, divergence beyond the within-species level will necessarily reduce the number of conserved and non-duplicated restriction sites, and therefore the number of loci usable for phylogenetic inference. Here, we assess the suitability of RAD-seq for phylogeny using a simulated experiment on the 12 Drosophila genomes, with divergence times ranging from 5 to 63 million years. These simulations show that RAD-seq allows the recovery of the known Drosophila phylogeny with strong statistical support, even for relatively ancient nodes. Notably, this conclusion is robust to the potentially confounding effects of sequencing errors, heterozygosity, and low coverage. We further show that clustering RAD-seq data using the BLASTN and SiLiX programs significantly improves the recovery of orthologous RAD loci compared with previously proposed approaches, especially for distantly related species. This study therefore validates the view that RAD sequencing is a powerful tool for phylogenetic inference.

Original languageEnglish
Pages (from-to)846-52
Number of pages7
JournalEcology and Evolution
Issue number4
Publication statusPublished - 2013
Externally publishedYes


  • Journal Article


Dive into the research topics of 'Is RAD-seq suitable for phylogenetic inference? An in silico assessment and optimization'. Together they form a unique fingerprint.

Cite this