There is a newer version of the record available.

Published March 10, 2022 | Version v4
Dataset Open

microbetag : building a thorough database of genome-scale KO annotations

Authors/Creators

  • 1. IMBBC - HCMR

Description

In this repository we keep internal data for the microbetag microbial co-occurrence network annotator.

microbetag makes use of 2-column files for each genome, indicating the KO term found and a KEGG module in which this terms takes part into. As a single KO term might participates in more than one KEGG modules, the same KO might be more than once in an annotation file. 

  • gtdb_kofam_scan_per_module.tar.gz: all representative genomes of GTDB (v.202) were parsed and their corresponding `.faa` files were retrieved from the NCBI FTP. Then the kofam_scan tool was used to annotate them and finally a manual script was used to keep KOs of each genome per module. 
  • gtdb_modelseed_gems.zipfor all the GTDB genomes their corresponding PATRIC annotations were gathered. Then, using modelseedpy we constructed their genome scale metabolic reconstructions

 

Files

gtdb_modelseed_gems.zip

Files (6.3 GB)

Name Size Download all
md5:cbcc9aa1a28a5bd5f6661f832d27bcbf
307.3 MB Download
md5:e3e62b305e64b27da7b80655d7f92f2c
5.6 GB Preview Download
md5:62a98c1e62065bca5a1ab1f650fbf186
72.7 MB Download
md5:1ece66230000e19a54c46e4579a4fa5e
282.3 MB Download