Bakta database
Authors/Creators
- 1. Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen, 35392, Germany; Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen, 35392, Germany; German Centre for Infection Research (DZIF), partner site Giessen-Marburg-Langen, Giessen, Germany
Description
This data repository contains the mandatory DB for Bakta (db.tar.gz).
Bakta is a tool for the rapid & standardized local annotation of bacterial genomes & plasmids. It provides dbxref-rich and sORF-including annotations in machine-readable JSON & bioinformatics standard file formats for automatic downstream analysis: https://github.com/oschwengers/bakta
This database provides protein sequence hash digests and lengths of UniProt's UniRef100/UniRef90 clusters for ultra-fast identification & lookups. It has been pre-annotated with several specialized databases and enriched with dbxrefs. All conducted pre-annotations are logged and provided in the db.log.gz file.
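The idea of identifying a protein by its hash digest and length can be illustrated with a minimal sketch. It is not Bakta's actual implementation or schema: the choice of MD5 as the digest, the normalization steps, the in-memory dictionary, and the cluster identifier `hypothetical-cluster-1` are all assumptions made for illustration.

```python
import hashlib


def sequence_key(protein_seq):
    """Return a (digest, length) key for a protein sequence.

    Assumption: MD5 over the upper-cased sequence without a trailing
    stop symbol. The database description only says "hash digests".
    """
    seq = protein_seq.strip().upper().rstrip("*")
    digest = hashlib.md5(seq.encode("ascii")).hexdigest()
    return digest, len(seq)


def lookup(protein_seq, index):
    """Exact-match lookup: no alignment, only one hash computation."""
    return index.get(sequence_key(protein_seq))


# Toy in-memory index standing in for the real database (hypothetical ID).
index = {sequence_key("MKTAYIAKQR"): "hypothetical-cluster-1"}

print(lookup("mktayiakqr*", index))  # same sequence after normalization
print(lookup("MKTAYIAKQ", index))    # one residue shorter, so no hit
```

Storing the length next to the digest gives a cheap extra check against hash collisions, and the lookup cost does not depend on the sequence content beyond computing a single digest.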
External DB versions:
- NCBI AMRFinderPlus: 2021-03-01
- COG: 2020
- DoriC: 10
- ISFinder: 2019-09-25
- Mob-suite: 2.0
- Pfam: 34
- RefSeq: r205
- Rfam: 14.5
- UniProtKB/Swiss-Prot: 2021_01
- VFDB: 2021-04-05
Files
(41.8 GB)
| MD5 checksum | Size |
|---|---|
| md5:dfa0abf61c35f0fffb40c690e7a976ca | 14.9 GB |
| md5:9839b8c1dfbc596c403d8e81542a7a0a | 26.8 GB |
| md5:296ba36a32a17afb30e6d2bb6f4c0fda | 372 Bytes |
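Because the record lists an MD5 checksum for each file, a download can be verified before use. The following is a minimal sketch using only the Python standard library; the function names and the local path in the usage comment are hypothetical, and which checksum belongs to which file must be taken from the record itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or '<hex>'."""
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected.lower()


# Usage (hypothetical local path; pair it with the checksum listed for that file):
# ok = matches_checksum("downloads/db.tar.gz", "md5:<checksum from the record>")
```

A mismatch usually points to an incomplete or corrupted download, in which case the file should be fetched again before extracting it.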
Additional details
Related works
- Is required by
- Software: https://github.com/oschwengers/bakta (URL)