DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
Description
Code: https://github.com/gcorso/DiffDock
DiffDock (https://arxiv.org/abs/2210.01776) is a deep-learning diffusion generative model that predicts the 3D structure in which a small molecule binds to a protein structure without prior knowledge of the binding pocket. This blind docking task is often evaluated with holo-protein structures, which means that the methods receive the bound protein structure as input - the coordinates it will take on when the small molecule is already bound to it. In most use cases, this structure is not available.
To evaluate how able docking methods are to dock to computationally generated protein structures, we provide this dataset. It contains the time-split based test set used in DiffDock and prior work such as EquiBind (https://arxiv.org/abs/2202.05146), but with the protein structures generated by ESMFold: https://github.com/facebookresearch/esm.
The generated proteins have their pockets RMSD/Kabsch aligned to the original protein structure as described in our paper. Code is provided in the GitHub repository. The RMSDs after alignment are provided as well.
Files
alignment_rmsds.csv
Files
(76.6 MB)
Name | Size | Download all |
---|---|---|
md5:5659108005bd37bb791d113fea9595f1
|
15.1 kB | Preview Download |
md5:485b325b2fbc05fc86c845a9836d1023
|
76.6 MB | Preview Download |
Additional details
Related works
- Cites
- https://github.com/gcorso/DiffDock (URL)
- https://arxiv.org/abs/2210.01776 (URL)