Published May 25, 2022
| Version v1
Dataset
Open
3682 E. coli assemblies from NCBI
Description
This is a dataset of 3682 E. coli assemblies downloaded from NCBI circa 2020 aiming to replicate the E. coli dataset in the paper "Succinct colored de Bruijn graphs" by Muggli et al. https://doi.org/10.1093/bioinformatics/btx067. The data is in 3682 FASTA files, one for each assembly. The uncompressed size is 18GB.
Files
Files
(5.8 GB)
Name | Size | Download all |
---|---|---|
md5:244cde0be80717d2a7a0af0c47eb9f86
|
5.8 GB | Download |