Published June 25, 2024 | Version v1
Dataset Open

Metabuli GTDB database part 1

  • 1. ROR icon Seoul National University

Contributors

Researcher:

  • 1. ROR icon Seoul National University

Description

Metabuli GTDB Database (Version 214.1) PART 1

The Metabuli GTDB database is a pre-built database based on the Genome Taxonomy Database (GTDB) genomes and taxonomy. This database is divided into two parts due to its large size (greater than 50 GB), so please download PART 2 in the same directory.

PART 2: 10.5281/zenodo.12207188

Database Specifications

  • GTDB Version: 214.1
  • Genome Quality Control (QC):
    • Assembly Level: Complete Genome or Chromosome level assemblies
    • Completeness: CheckM completeness > 90%
    • Contamination: CheckM. contamination < 5% 

Usage

  • Put the files downloaded from PART 1 and PART 2 in the same directory.
  • `gzip -d` for `info.gz` from PART1 and `diffIdx.gz` from PART2.
  • The directory is now DBDIR for `classify` module.

Files

Files (36.2 GB)

Name Size Download all
md5:cf6d355948e0ea288880016c1e92614d
36.2 GB Download

Additional details

Related works

References
Journal article: 10.1038/s41592-024-02273-y (DOI)

Dates

Available
2024-06-22
Uploaded date

Software

Repository URL
https://github.com/steineggerlab/Metabuli
Programming language
C++
Development Status
Active