Published March 10, 2022
| Version v4
Dataset
Open
microbetag : building a thorough database of genome-scale KO annotations
Description
In this repository we keep internal data for the microbetag microbial co-occurrence network annotator.
microbetag makes use of 2-column files for each genome, indicating the KO term found and a KEGG module in which this terms takes part into. As a single KO term might participates in more than one KEGG modules, the same KO might be more than once in an annotation file.
- gtdb_kofam_scan_per_module.tar.gz: all representative genomes of GTDB (v.202) were parsed and their corresponding `.faa` files were retrieved from the NCBI FTP. Then the kofam_scan tool was used to annotate them and finally a manual script was used to keep KOs of each genome per module.
- gtdb_modelseed_gems.zip: for all the GTDB genomes their corresponding PATRIC annotations were gathered. Then, using modelseedpy we constructed their genome scale metabolic reconstructions