Published May 25, 2022 | Version v1
Dataset Open

3682 E. coli assemblies from NCBI

  • 1. University of Helsinki

Description

This is a dataset of 3682 E. coli assemblies downloaded from NCBI circa 2020 aiming to replicate the E. coli dataset in the paper "Succinct colored de Bruijn graphs" by Muggli et al. https://doi.org/10.1093/bioinformatics/btx067. The data is in 3682 FASTA files, one for each assembly. The uncompressed size is 18GB.

Files

Files (5.8 GB)

Name Size Download all
md5:244cde0be80717d2a7a0af0c47eb9f86
5.8 GB Download