Published July 15, 2016 | Version v1
Dataset Open

Data from: RADcap: sequence capture of dual-digest RADseq libraries with identifiable duplicates and reduced missing data

  • 1. University of Georgia
  • 2. University of Illinois System
  • 3. Louisiana State University of Alexandria

Description

Molecular ecologists seek to genotype hundreds to thousands of loci from hundreds to thousands of individuals at minimal cost per sample. Current methods, such as restriction site associated DNA sequencing (RADseq) and sequence capture, are constrained by costs associated with inefficient use of sequencing data and sample preparation. Here, we introduce RADcap, an approach that combines the major benefits of RADseq (low cost with specific start positions) with those of sequence capture (repeatable sequencing of specific loci) to significantly increase efficiency and reduce costs relative to current approaches. RADcap uses a new version of dual-digest RADseq (3RAD) to identify candidate SNP loci for capture bait design, and subsequently uses custom sequence capture baits to consistently enrich candidate SNP loci across many individuals. We combined this approach with a new library preparation method for identifying and removing PCR duplicates from 3RAD libraries, which allows researchers to process RADseq data using traditional pipelines, and we tested the RADcap method by genotyping sets of 96 to 384 Wisteria plants. Our results demonstrate that our RADcap method: (1) methodologically reduces (to <5%) and allows computational removal of PCR duplicate reads from data; (2) achieves 80-90% reads-on-target in 11 of 12 enrichments; (3) returns consistent coverage (≥4x) across >90% of individuals at up to 99.8% of the targeted loci; (4) produces consistently high occupancy matrices of genotypes across hundreds of individuals; and (5) costs significantly less than current approaches.

Notes

Files

Passing_and_filtered_SNP_VCF_files.zip

Files (7.6 GB)

Name Size Download all
md5:29b1a0572cb378532b530c107c4bb70a
741.4 MB Download
md5:2f7a433c01edb5017d5b803833f715bf
796.4 MB Download
md5:3ca71d7d020075c50733dfee4e6552cd
72.1 MB Preview Download
md5:fc6d0c959c1caa60b69223bbabe92e2c
382.5 MB Download
md5:3c334e135afda57f742c63731e00497c
389.0 MB Download
md5:67fce643314ee0a4ff819728070b3c1a
302.7 MB Download
md5:8170361871c4a8850668b5ca4c1bc942
310.6 MB Download
md5:76c18cb2fc42270888bc24d2ed727a8c
316.7 MB Download
md5:7f12a6d3c9a4a31533723d4bc739b945
322.2 MB Download
md5:59060658ede72698f867c034702b12a6
358.8 MB Download
md5:42f10c4eb3be39719efce83298c58adb
364.5 MB Download
md5:3d389aac7e69249ff16a2f820f61a41a
369.2 MB Download
md5:cd68426a524ad9429c5ebf8e9d23609a
379.4 MB Download
md5:bbdac62cfd0513ca952ea8315df0f419
369.6 MB Download
md5:e402ae4e8b8f98ad1c1d3aa0ef7d3241
377.1 MB Download
md5:3a6dcd7c71c5efe95232e71e62f811f2
49.5 MB Download
md5:5c85af5a34b405a846e2523ac9a234ac
50.8 MB Download
md5:1606593b967f724787661cc630ccd729
377.5 MB Download
md5:2dddcc39338cb892176ea92f9491adea
385.2 MB Download
md5:3c8a7dff0c8998c3313b2a1f95ea2f82
65.2 MB Download
md5:726f63cdc95af553621b6912631b3a53
68.1 MB Download
md5:e39a5e4292ef8b9a37f93ec1b2597f01
366.3 MB Download
md5:aa42e134563113b0289cc7c9fceaab51
392.7 MB Download

Additional details

Related works

Is cited by
10.1111/1755-0998.12566 (DOI)