Published September 16, 2025 | Version v3
Dataset Open

Fermented Foods Microbial Genomes Species-Representatives Functional Annotations

Description

This repository contains raw files and results for functional annotations from 1,236 species-representative bacterial genomes assembled from diverse fermented foods. The raw genome FASTA files and corresponding metadata area available in the Zenodo repository Fermented Foods Microbial Genomes Database. These files and results were produced as part of a collaboration with Tatta Bio to make these annotations available through their platform. This repository contains the following ZIP files and structures: 

  • species-reps-predicted-orfs.zip: For each genome is a .ffn, .faa, .gbk, and .gff file
  • species-reps-main-results.zip: Output of the bac-mining workflow including summaries of each molecule type and counts across all genomes
  • species-reps-summaries.zip: Individual annotation summary TSVs per genome
  • merged-species-reps-annotation-summaries.tsv: Same format as the individual annotation summary TSVs, collated all together for all 1,236 genomes

This repository is slightly different from the data repository Predicted Biosynthetic Gene Clusters and Peptides from Fermented Food Microbial Genomes but contains some overlapping data. This repository contains all raw files and functional annotation information and summaries for 1236 species-representative genomes. These files were formatted specifically for the Tatta Bio platform. The other repository contains annotated summary files for cleavage peptides, smORFs, and BGCs predicted with antiSMASH for ~11,500 HQ genomes. 

Files

species-reps-main-results.zip

Files (5.0 GB)

Name Size Download all
md5:4026b136fad4f80748c12246df28c28d
439.6 MB Download
md5:6f2ccfefa5368562c5a1cbb56ac3c9d9
98.5 MB Preview Download
md5:dace705038325017894328ac8eb9efb0
4.4 GB Preview Download
md5:575bc320541c9e349f6464e06fdf1a89
71.9 MB Preview Download