Dataset Open Access

Structural variant discovery and genotyping in next-generation sequencing data

Gilks, William

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Gilks, William</dc:creator>
  <dc:description>Code, logs, data, and summaries for detection and genotyping of genomic structural variants in the D.melanogaster Sussex LHM hemiclones (and one in-house reference line individual), using Genomestrip/2.0

The unfiltered CNV pipleline results are lhm_gs.cnvs.raw.vcf.gz

Filtered CNV results (including removal of bad samples) are filtered.goodS.lhm_gs.cnvs.raw.vcf.gz

The file uploaded to NCBI dbVAR (which comprises of the filtered CNVs and indels &gt;50bp from the HaplotypeCaller method) is lhm_sx16.dbVAR.vcf.gz

The NCBI dbVAR accession number is nstd134. Code, logs and summary data are in the zipped archives, named accordingly. The archive contains additional input files required for Genomestrip, including a shell script for making some of them. The file gstrip_lhm_RG_bams.list is also an input for Genomestrip, indicating bam file names and paths.

The pre-print manuscript for this data is available on biorxiv: "Whole genome resequencing of a laboratory-adapted Drosophila melanogaster population sample" doi:

  <dc:title>Structural variant discovery and genotyping in next-generation sequencing data</dc:title>
All versions This version
Views 483483
Downloads 211211
Data volume 13.2 GB13.2 GB
Unique views 435435
Unique downloads 9292


Cite as