Published August 14, 2025
| Version v1.0.0
Dataset
Open
Dataset for: Predicting Experimental Success in De Novo Binder Design: A Meta-Analysis of 3,766 Experimentally Characterised Binders
Creators
Description
Benchmarking dataset for the publication: Predicting Experimental Success in De Novo Binder Design: A Meta-Analysis of 3,766 Experimentally Characterised Binders
Inlcludes the following data:
- final_data.csv: Dataset with all collected binder features with the following relevant columns
- binder_id: name of binder
- target_id: name of target
- binder: binary experimental binding info
- source: source publication of the binder
- All other columns are the features described in the publication
- input_pdbs.tar.zst: Input .pdb files used for analysis where file names correspond to binder_id in the final_data.csv; sourced from public available de novo binder design campains
- AF2_initial_guess_outputs.tar.zstd: Output .pdb files generated with the initial guess AF2 implementation (https://github.com/nrbennet/dl_binder_design)
- AF3_outputs.tar.zstd: Compressed output from AF3 including structure (.cif) and confidence files (.json) with one subfolder per binder_id
- Boltz1_outputs.tar.zstd: Compressed output from Boltz-1 including structure (.cif), summary confidence (.json) and pae/plddt (.npz) files with one subfolder per binder_id
- ColabFold_outputs.tar.zstd: Compressed output from ColabFold including structure (.pdb) and confidedence files (.json)
Files
final_dataset.csv
Files
(9.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:d5af7a8dc87f614f88b24a629c351c6b
|
256.6 MB | Download |
|
md5:6430cbacd4ea54191e71583db5faa90d
|
2.0 GB | Download |
|
md5:55478634159352ecf9f7d2dfb32144e8
|
3.8 GB | Download |
|
md5:402f362c10b5d53d2e91b41b82d7d067
|
2.9 GB | Download |
|
md5:3a69ee9b0fecf53924a8c6479bac146e
|
82.0 MB | Preview Download |
|
md5:18a1e6c7ed2460a96011b3860143c3dc
|
244.4 MB | Download |