There is a newer version of the record available.

Published November 20, 2020 | Version 1.0
Dataset Open

Bakta database

Authors/Creators

  • 1. Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen, 35392, Germany; Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen, 35392, Germany; German Centre for Infection Research (DZIF), partner site Giessen-Marburg-Langen, Giessen, Germany

Description

This data repository contains the mandatory DB for Bakta (db.tar.gz).

Bakta is a tool for the rapid & standardized local annotation of bacterial genomes & plasmids. It provides dbxref-rich and sORF-including annotations in machine-readble JSON & bioinformatics standard file formats for automatic downstream analysis: https://github.com/oschwengers/bakta

This db provides protein sequence hash digests and lengths of UniProt's UniRef100/UniRef90 clusters for ultra-fast identification & lookups. It has been pre-annotated with several specialized db and enriched with Dbxrefs. All conducted pre-annotations are logged and provided in the db.log.gz file.

External db used & versions:

  • Rfam: 14.2
  • UniProtKB/Swiss-Prot: 2020_04
  • RefSeq: r202
  • NCBI AMRFinderPlus: 2020-09-22.2
  • COG: 2014
  • Mob-suite: 2.0
  • DoriC: 10
     

Files

versions.json

Files (42.9 GB)

Name Size Download all
md5:4ac3b2100ba8e2427d25439637193f39
18.0 GB Download
md5:cdaa2f6661211ab6a429e58d0bcecb6f
24.8 GB Download
md5:d826e9ece7794981215ca6e568fd17b2
125 Bytes Preview Download

Additional details

Related works

Is required by
Software: https://github.com/oschwengers/bakta (URL)