Published March 31, 2026
| Version v1
Dataset
Open
Baktfold Manuscript Supplementary Files Too Large For Github
Authors/Creators
Description
Contains all supplementary files that go with the `baktfold-analysis` repository (https://github.com/gbouras13/baktfold-analysis) that are too large for GitHub.
File list:
- baktfold-benchmark.tar.gz - Bakta manuscript benchmark genomes - relevantly, contains genbank and mag dataset genomes
- combined_plasmid_annotations.tsv.gz - IMG/PR annotations for Bakta + Baktfold
- all_chunks_with_go.tsv.gz - all GlobDB per protein Baktfold annotations with mapped GO Terms for all Swiss-Prot hits
- protist_baktfold_jsons.tar - all Ensembl protists Baktfold JSON annotation files
- smag_combined_baktfold_with_eggnog.tsv.gz - SMAG dataset protein eggnog-Mapper (from original Delmont et al publication) + baktfold annotations
- updated_arc_protein.trimmed.faa.gz - 1,993,306 custom archaeal protein database raw FASTA
- updated_arc_protein.headers.tsv.gz - 1,993,306 custom archaeal protein database 2 column TSV for use with baktfold's custom DB --custom-annotations parameter
- updated_arc_protein.trimmed.fs.db.tar.gz - 1,993,306 custom archaeal protein database Foldseek database for use with --custom-db
- genbank_predictions_esm.tar genbank_hypotheticals_structures.tar mag_predictions_esm.tar mag_hypotheticals_structures.tar - ESMFold and ColabFold predictions for hypothetical proteins for mag and genbank benchmarking datasets
Files
Files
(29.5 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:7ce6112abe0f5ee90fad68863c284c4f
|
8.0 GB | Download |
|
md5:91742b404b664482fe0d6b15710a0b16
|
623.4 MB | Download |
|
md5:5e492106aa41a51d1d01d7a17dd3e2aa
|
182.0 MB | Download |
|
md5:4ffbb7e316461aced3f9e76684256681
|
3.3 GB | Download |
|
md5:0b27b2fca9d3feb862d579bfcca5fd08
|
3.6 GB | Download |
|
md5:ff109211e7459d796eb7fbdc9b77ffc2
|
2.6 GB | Download |
|
md5:6a5a9b7e4c75b1039be5d056e40fd1bf
|
2.7 GB | Download |
|
md5:5d19ad58679197d66b814f389159ee83
|
7.1 GB | Download |
|
md5:99b3ed5dbd1f71aabd7bd1dc7a156035
|
349.6 MB | Download |
|
md5:f35af5477e116127f744634a68ac3953
|
22.1 MB | Download |
|
md5:6e1f80a5d00acdb5279cca4bfa97d7ba
|
386.2 MB | Download |
|
md5:a750afe3549ca5b911efbe6af3b4c26c
|
674.7 MB | Download |