Published November 14, 2017 | Version v1
Dataset Open

Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference

  • 1. University of California System

Description

The accelerating rate at which DNA sequence data is now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short-read sequences that are provided by high-throughput sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine "microhaplotypes" and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi-allelic marker data from 96 such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false positive rates by several orders of magnitude, relative to calling bi-allelic SNPs, for two challenging analytical procedures, full sibling and single parent-offspring pair identification. The advent of phased short-read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.

Notes

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: 1260693

Files

Genotypes_144kelpRF.csv

Files (2.2 GB)

Name Size Download all
md5:df2635f65b233c0feb079a71bab0debf
25.7 kB Download
md5:eed1f4651e69eb50ef862cce87f6270b
794.1 kB Preview Download
md5:b167d09719807a7b734c187216225399
1.5 GB Preview Download
md5:acd8d6c29976b6376491af8ee50e0662
703.9 MB Preview Download
md5:195019210669f97f66391a1093bc9cd2
48.5 kB Download
md5:ee9b4fc066656056c523c35f2cd0f2c7
47.7 MB Preview Download
md5:56a043b4950a06780732afde686607a4
4.0 MB Download
md5:162a5fa247be0910c99b65412757a572
3.0 MB Download

Additional details

Related works

Is cited by
10.1111/1755-0998.12737 (DOI)