Published August 14, 2025 | Version v1.0.0
Dataset Open

Dataset for: Predicting Experimental Success in De Novo Binder Design: A Meta-Analysis of 3,766 Experimentally Characterised Binders

Description

Benchmarking dataset for the publication: Predicting Experimental Success in De Novo Binder Design: A Meta-Analysis of 3,766 Experimentally Characterised Binders
Includes the following data:

  • final_data.csv: Dataset of all collected binder features, with the following relevant columns:
    • binder_id: name of binder
    • target_id: name of target
    • binder: binary experimental binding label
    • source: source publication of the binder
    • All other columns are the features described in the publication
  • input_pdbs.tar.zst: Input .pdb files used for analysis, where file names correspond to binder_id in final_data.csv; sourced from publicly available de novo binder design campaigns
  • AF2_initial_guess_outputs.tar.zstd: Output .pdb files generated with the initial guess AF2 implementation (https://github.com/nrbennet/dl_binder_design)
  • AF3_outputs.tar.zstd: Compressed output from AF3 including structure (.cif) and confidence files (.json) with one subfolder per binder_id
  • Boltz1_outputs.tar.zstd: Compressed output from Boltz-1 including structure (.cif), summary confidence (.json) and pae/plddt (.npz) files with one subfolder per binder_id
  • ColabFold_outputs.tar.zstd: Compressed output from ColabFold including structure (.pdb) and confidence files (.json)
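
As an illustration of how the columns listed above might be used, the following minimal sketch computes the experimental success rate per target from the CSV. It relies only on the binder_id, target_id, binder and source columns named in the description. The encoding of the binder column (0/1 or True/False) is an assumption, as are the function names and the default file path, which may need to be changed to final_dataset.csv depending on the name of the downloaded file.

```python
import csv
from collections import defaultdict

# Assumed encodings of a positive binding label; adjust after inspecting the file.
TRUTHY = {"1", "1.0", "true", "yes"}


def success_rate_by_target(csv_file):
    """Return {target_id: (n_binders, n_designs, fraction)} from an open CSV file."""
    counts = defaultdict(lambda: [0, 0])  # target_id -> [hits, total]
    for row in csv.DictReader(csv_file):
        hit = row["binder"].strip().lower() in TRUTHY
        counts[row["target_id"]][0] += int(hit)
        counts[row["target_id"]][1] += 1
    return {t: (h, n, h / n) for t, (h, n) in counts.items()}


def main(path="final_data.csv"):  # hypothetical local path
    with open(path, newline="") as fh:
        rates = success_rate_by_target(fh)
    for target, (hits, total, frac) in sorted(rates.items()):
        print(f"{target}: {hits}/{total} binders ({frac:.1%})")
```

Calling main() with the path of the downloaded CSV prints one line per target. Grouping by the source column instead of target_id would give the success rate per publication.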

Files

final_dataset.csv

Files (9.3 GB)

md5:d5af7a8dc87f614f88b24a629c351c6b 256.6 MB
md5:6430cbacd4ea54191e71583db5faa90d 2.0 GB
md5:55478634159352ecf9f7d2dfb32144e8 3.8 GB
md5:402f362c10b5d53d2e91b41b82d7d067 2.9 GB
md5:3a69ee9b0fecf53924a8c6479bac146e 82.0 MB
md5:18a1e6c7ed2460a96011b3860143c3dc 244.4 MB
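
The MD5 checksums listed above can be used to check that a download completed without corruption. The sketch below is one way to do this with the Python standard library; the function names are illustrative, and the mapping from each checksum to its file has to be taken from the record page, since it is not stated in this listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """Return True if the file's MD5 matches `expected` (with or without 'md5:')."""
    expected = expected.strip().lower().removeprefix("md5:")
    return md5_of_file(path) == expected
```

For example, verify("AF3_outputs.tar.zstd", "md5:...") returns True when the digest of the local file equals the checksum shown for that archive. Reading in chunks keeps memory use constant even for the multi-gigabyte archives.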