There is a newer version of the record available.

Published August 8, 2019 | Version v1.0
Dataset Open

SDG Paper Datasets

Description

This resource contains dataset used in the paper describing the SDG software framework: "A Sequence Distance Graph framework for genome assembly and analysis".

Specifically for the section of the paper "Hybrid assembly of short and long reads", the folder "ecoli" contains the Illumina Miseq 2x300bp reads that were used.

Specifically for the section of the paper "Analysing a simulation of heterozygous parent-child trio with short reads", the "trio" folder contains the simulated genomes of the trio's 2 parental individuals and 1 offspring individual, as well as simulated reads produced from those genomes.

Notes

This work was strategically funded by the BBSRC Core Strategic Programme Grant (BBS/E/T/000PR9818). Work by GGA and BJC was also partially funded by the BBSRC grant "OctoSeq: Sequencing the octoploid strawberry" (BB/N009819/1).

Files

sgd_data.zip

Files (622.5 MB)

Name Size Download all
md5:7f648b0cddd742f812a07353ea3aaa92
622.5 MB Preview Download