Published September 2, 2022 | Version 1.1
Dataset Open

Host removal database: Homo sapiens, Sars-Cov-2, PhiX174

Authors/Creators

  • 1. Quadram Institute Bioscience

Description

💾 cleanup-db

Kraken2 database, built upon a viral sequence masked human reference from:

  • Handley, Scott A. (2020). Virus+ Sequence Masked Human Reference Genome (hg19) (1.0) [Data set]. Zenodo. [10.5281/zenodo.4116107]

but separating chromosomes as artificial taxa to allow for QC, and includes Sars-Cov-2 and PhiX 174

💾 gutcheck-db

A very small DB containg some common gut bacteria and Human and Murine mitochondrial genome:

  • Akkermansia muciniphila
  • Bacteroides fragilis
  • Bifidobacterium longum
  • Blautia obeum strain
  • Escherichia coli
  • Enterococcus faecium
  • Prevotella copri

 

See: https://github.com/telatin/cleanup

Files

cleanup-db.zip

Files (4.9 GB)

Name Size Download all
md5:227aef00b16c2bc0381096f08b9f85d5
4.9 GB Preview Download
md5:c8e0e230adf059c4f5a0db5a454f66d3
33.6 MB Preview Download

Additional details

References

  • Handley, Scott A. (2020). Virus+ Sequence Masked Human Reference Genome (hg19) (1.0) [Data set]. Zenodo. [10.5281/zenodo.4116107]