Published May 29, 2023 | Version 1.1
Dataset Open

Methylation-free E.coli nanopore sequencing (ONT R9.4.1) data set

  • 1. KTH
  • 2. SciLife Lab

Description

The data set consists of fast5 files divided into 5 zip files (fast5_[1-5].zip), a genome record (Ecoli_K12_MG1655.fasta), a draft genome assembly built from Illumina reads (illumina_contigs.fasta) and a fastq file basecalled with Guppy 5 (guppy_basecalled.fastq.gz). We sequenced non-methylated E. coli genomic DNA (D5016, Zymo Research) with an ONT MinION device. The sequencing libraries were prepared by fragmenting the genomic DNA with a Covaris g-TUBE and using the Ligation Sequencing Kit (SQK-LSK109, Oxford Nanopore), and were run with flow cell chemistry R9.4.1. We also performed short-read Illumina sequencing on the same sample, using TruSeq PCR-free library preparation on the MiSeq sequencing platform (Illumina, USA), and constructed a draft assembly from the Illumina reads with SPAdes v3.6.0. We also include a reference genome obtained directly from the website of the E. coli sample producer.

In addition, the data set contains two fastq files produced by the Lokatt basecaller (lokatt_basecalled.fasta.gz) and a locally trained Bonito basecaller (bonito_local_basecalled.fastq.gz), respectively. These are used for benchmarking in the Lokatt basecaller paper.
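
The basecalled read files named above with a .fastq.gz extension can be inspected without unpacking them to disk. The following is a minimal sketch using only the Python standard library. It assumes plain four-line FASTQ records (no wrapped sequence lines), which this record does not state explicitly, and the function names iter_fastq and summarize_fastq are illustrative rather than part of any tool mentioned here.

```python
import gzip


def iter_fastq(path):
    """Yield (read_id, sequence, quality) tuples from a gzip-compressed FASTQ file.

    Assumes each record occupies exactly four lines:
    '@' header, sequence, '+' separator, quality string.
    """
    with gzip.open(path, "rt") as handle:
        while True:
            header = handle.readline().rstrip("\n")
            if not header:
                break  # end of file (or a trailing blank line)
            sequence = handle.readline().rstrip("\n")
            handle.readline()  # skip the '+' separator line
            quality = handle.readline().rstrip("\n")
            # The read identifier is the first token after the leading '@'.
            yield header[1:].split()[0], sequence, quality


def summarize_fastq(path):
    """Return (number_of_reads, total_bases) for a gzip-compressed FASTQ file."""
    n_reads = 0
    total_bases = 0
    for _read_id, sequence, _quality in iter_fastq(path):
        n_reads += 1
        total_bases += len(sequence)
    return n_reads, total_bases


# Example call, assuming the file has been downloaded to the working directory:
# n_reads, total_bases = summarize_fastq("guppy_basecalled.fastq.gz")
```

Reading the stream record by record keeps memory use flat, which matters for files of several gigabytes such as those in this data set.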

Notes

This work has been supported by the Swedish Research Council Research Environment Grant QuantumSense [VR 2018-06169]; and the Swedish Foundation for Strategic Research (SSF) grant ASSEMBLE [RIT15-0012].

Files

Files (188.6 GB)

MD5 checksum                            Size
md5:dfdb53dd9af8dd04eae841d11733860d    5.2 GB
md5:182c4c48740aa62f1b6ca3c03afd663a    4.7 MB
md5:b4defd9449891a9edb46fb33ca4c2ceb    33.0 GB
md5:49b2bef1330aa2ab5a77561b87853a91    31.4 GB
md5:b278272c95c866195c435eec696d1bbf    31.4 GB
md5:c55b4acfd014b05f962f8df59eb8b460    31.5 GB
md5:48fda2360b0c1cedeb4a2b8d297d36a5    30.8 GB
md5:59a7a1a8803cca6da8daa8da72747397    19.8 GB
md5:50901984297553943c8ff959c3a523bb    4.6 MB
md5:6ea9b56355f1b02b975fb4c238fd97ff    5.4 GB
md5:823a65031be457dd66734b00972bfd9d    3.8 kB
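
Given the size of these downloads, it may be worth checking each file against its listed MD5 checksum before use. The following is a minimal sketch using Python's hashlib. The helper names md5_of_file and matches_listed_checksum are illustrative, and the pairing of file and checksum in the example call is hypothetical, since the listing above does not say which checksum belongs to which file.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reading avoids loading a multi-gigabyte file into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file with a checksum written as 'md5:<hex>' or plain '<hex>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Example call with a hypothetical file/checksum pairing:
# ok = matches_listed_checksum("fast5_1.zip", "md5:<checksum listed for that file>")
```

A mismatch usually indicates an incomplete or corrupted download, in which case the file should be fetched again.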

Additional details

Related works

Is part of
Preprint: 10.1101/2022.07.13.499873 (DOI)