Published September 29, 2021 | Version v1.0.0
Dataset Open

Synthetic Escherichia coli mixture samples with variable coverage

Authors/Creators

  • 1. Helsinki Institute for Information Technology, Department of Computer Science, University of Helsinki

Description

This dataset contains the synthetic mixture samples and reference sequences - as well as the appropriate metadata - that were originally used in the 2021 revision of the mSWEEP manuscript.

There are 87 samples in total, each containing 100bp paired-end Illumina sequencing reads from 10 different Escherichia coli strains from 10 different lineages. The number of reads is set so that the sequencing coverage of the individual strains varies between 50x and 0.10x and sums up to 100x.

Files

Files (49.5 GB)

Name Size Download all
md5:9d9802bc29f62a080ff18b4c9e0d43e9
49.5 GB Download

Additional details

Related works

Is supplement to
Journal article: 10.12688/wellcomeopenres.15639.1 (DOI)