Published September 24, 2025
| Version v3
Dataset
Open
GEMS: Resolving Data Bias Improves Generalization in Binding Affinity Prediction
Authors/Creators
Description
For fast reproduction of our results, we provide PyTorch datasets of precomputed interaction graphs for the entire PDBbind database on Zenodo. To enable quick establishment of leakage-free evaluation setups with PDBbind, we also provide pairwise similarity matrices for the entire PDBbind dataset on Zenodo.
Version 2 - Updated to improve the accuracy of Tanimoto Scores in the pairwise similarity matrices, which also caused minor changes in the composition of PDBbind CleanSplit.
Version 3 - Including pairwise similarity matrix for sequence identity (from TM-align)
Files
pairwise_similarity_complexes.json
Additional details
Dates
- Updated
-
2025-09-24Version 2 - Including pairwise similarity matrix for sequence identity (from TM-align)
Software
- Repository URL
- https://github.com/camlab-ethz/GEMS