DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

Gabriele Corso; Hannes Stärk; Bowen Jing; Regina Barzilay; Tommi Jaakkola

doi:10.48550/arXiv.2210.01776

Published March 28, 2023 | Version v1

Conference paper Open

DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

1. MIT

Code: https://github.com/gcorso/DiffDock
DiffDock (https://arxiv.org/abs/2210.01776) is a deep-learning diffusion generative model that predicts the 3D structure in which a small molecule binds to a protein structure without prior knowledge of the binding pocket. This blind docking task is often evaluated with holo-protein structures, which means that the methods receive the bound protein structure as input - the coordinates it will take on when the small molecule is already bound to it. In most use cases, this structure is not available.

To evaluate how able docking methods are to dock to computationally generated protein structures, we provide this dataset. It contains the time-split based test set used in DiffDock and prior work such as EquiBind (https://arxiv.org/abs/2202.05146), but with the protein structures generated by ESMFold: https://github.com/facebookresearch/esm.

The generated proteins have their pockets RMSD/Kabsch aligned to the original protein structure as described in our paper. Code is provided in the GitHub repository. The RMSDs after alignment are provided as well.

Files

alignment_rmsds.csv

Files (76.6 MB)

Name	Size	Download all
alignment_rmsds.csv md5:5659108005bd37bb791d113fea9595f1	15.1 kB	Preview Download
timesplit_testset_zenodo.zip md5:485b325b2fbc05fc86c845a9836d1023	76.6 MB	Preview Download

Additional details

Cites: https://github.com/gcorso/DiffDock (URL); https://arxiv.org/abs/2210.01776 (URL)

	All versions	This version
Views	133	132
Downloads	100	99
Data volume	2.2 GB	2.1 GB

DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking

Creators

Description

Files

alignment_rmsds.csv

Files (76.6 MB)

Additional details

Related works