Published June 25, 2024
| Version v1
Dataset
Open
Metabuli GTDB database part 1
Description
Metabuli GTDB Database (Version 214.1) PART 1
The Metabuli GTDB database is a pre-built database constructed from Genome Taxonomy Database (GTDB) genomes and taxonomy. Because the database is larger than 50 GB, it is split into two parts; please download PART 2 into the same directory.
PART 2: 10.5281/zenodo.12207188
Database Specifications
- GTDB Version: 214.1
- Genome Quality Control (QC):
  - Assembly Level: Complete Genome or Chromosome level assemblies
  - Completeness: CheckM completeness > 90%
  - Contamination: CheckM contamination < 5%
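The three QC criteria above can be read as a single filter over genome records. The following is a minimal sketch of that filter, not the authors' actual pipeline: the dictionary keys (`assembly_level`, `completeness`, `contamination`) are hypothetical names chosen for illustration, and the real GTDB metadata columns may be named differently.

```python
# Assumption: each genome is a dict with these hypothetical keys.
# The actual GTDB metadata column names may differ.
ACCEPTED_LEVELS = {"Complete Genome", "Chromosome"}


def passes_qc(genome: dict) -> bool:
    """Return True if a genome record meets all three listed criteria."""
    return (
        genome["assembly_level"] in ACCEPTED_LEVELS
        and genome["completeness"] > 90.0   # strictly greater than 90%
        and genome["contamination"] < 5.0   # strictly less than 5%
    )


if __name__ == "__main__":
    example = {
        "assembly_level": "Complete Genome",
        "completeness": 98.7,
        "contamination": 0.4,
    }
    print(passes_qc(example))
```

Both thresholds are strict inequalities, as stated in the specification, so a genome with exactly 90% completeness or exactly 5% contamination is excluded in this sketch.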
Usage
- Put the files downloaded from PART 1 and PART 2 in the same directory.
- Decompress `info.gz` (from PART 1) and `diffIdx.gz` (from PART 2) with `gzip -d`.
- The directory can now be used as DBDIR for the `classify` module.
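For users who prefer to script the decompression step, the sketch below mirrors what `gzip -d` does for the two files named above. It assumes that `info.gz` and `diffIdx.gz` already sit in the same directory; the function name `decompress_in_place` is hypothetical and is not part of Metabuli.

```python
import gzip
import shutil
from pathlib import Path


def decompress_in_place(dbdir, names=("info.gz", "diffIdx.gz")):
    """Decompress each NAME.gz in dbdir to NAME and delete the archive,
    which is the default behaviour of `gzip -d`."""
    dbdir = Path(dbdir)
    outputs = []
    for name in names:
        src = dbdir / name
        dst = src.with_suffix("")  # info.gz -> info, diffIdx.gz -> diffIdx
        # Stream the data so large files are never held in memory at once.
        with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
            shutil.copyfileobj(fin, fout)
        src.unlink()  # remove the .gz file once the output is written
        outputs.append(dst)
    return outputs
```

Calling `decompress_in_place("path/to/dbdir")` leaves `info` and `diffIdx` in that directory, which is then the DBDIR for the `classify` module.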
Files
(36.2 GB)
Size | Checksum |
---|---|
36.2 GB | md5:cf6d355948e0ea288880016c1e92614d |
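Given the file size, it may be worth checking the download against the MD5 checksum listed above before decompressing anything. The sketch below is one way to do that with the Python standard library; the function names and the chunk size are illustrative choices, and the path of the downloaded file is left to the caller because the record does not show the file name here.

```python
import hashlib

# MD5 checksum as listed on this record for the PART 1 file.
EXPECTED_MD5 = "cf6d355948e0ea288880016c1e92614d"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

A `False` result from `verify_download` suggests an incomplete or corrupted download, in which case fetching the file again is the usual remedy.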
Additional details
Related works
- References
- Journal article: 10.1038/s41592-024-02273-y (DOI)
Dates
- Available
- 2024-06-22 (Uploaded date)
Software
- Repository URL
- https://github.com/steineggerlab/Metabuli
- Programming language
- C++
- Development Status
- Active