Published November 6, 2014 | Version v1
Dataset Open

Data from: Optimization of the genotyping-by-sequencing strategy for population genomic analysis in conifers

  • 1. Umeå University
  • 2. Chinese Academy of Sciences
  • 3. Beijing Forestry University

Description

Flexibility and low cost make genotyping-by-sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI-MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference-free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000–11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking.

Notes

Files

9py-reference.txt

Files (15.8 MB)

Name Size Download all
md5:cdcaf9d5f5a5b78a2dba52cc81f5436e
1.5 MB Download
md5:39aed72c1073b879fefa07ca69de97e6
3.0 MB Download
md5:93f285e617bd69a76b0b8e48e6ddb80d
1.8 MB Download
md5:7feed09882fec8742455b595d411d1c5
1.6 MB Download
md5:11b98a5eb28ad3a8d55d75a8584c2d4a
2.5 MB Download
md5:fd09da3db4df8133863f53d0b2a0f10e
1.9 MB Preview Download
md5:aef10c1162a9cdc2cc5e0e6f76729a76
1.8 MB Download
md5:929325ebe868ce96100b688494473aac
2.2 kB Download
md5:af8f3abb9abc69f24bc24c35e2e93d2b
3.9 kB Download
md5:02ceef313a095f80a28eb76cc575ae8f
550 Bytes Download
md5:fac2ae2296d9ca22ac541dcb4e5398c5
823 Bytes Download
md5:69077198040a877c4268ac37df1e4b50
23.7 kB Download
md5:0fee03548d16efad41b4597d014a4b7a
1.8 kB Download
md5:ffe9030455b04928c541c4dbaeecc61a
1.8 MB Download
md5:b2d22780d2018a31fd4301dcecf15473
2.1 kB Download
md5:6df746161b0a276b7514c1e6a561a0eb
1.3 kB Download
md5:93223b16dbefc1bc84e3d12ab991606e
777 Bytes Download
md5:591faffa6ba08f5b3470d6f70e3f12a0
5.8 kB Download

Additional details

Related works

Is cited by
10.1111/1755-0998.12342 (DOI)