Published March 12, 2024 | Version v6
Publication Open

Data for Umol

Description

posebusters_benchmark_set.tar.zst - files for prediction (input features for Umol/Umol-pocket) and scoring of Umol/Umol-pocket on the PoseBusters benchmark.

posebusters_pred_native.tar.zst - pdb and sdf files of proteins and ligands. Includes native structures, predicted structures, and relaxed predicted structures with plDDT in the B-factor column. Refers to Umol-pocket.

posebusters_pred_structures_no_pocket.tar.zst - pdb and sdf files of proteins and ligands. Includes predicted structures. Refers to Umol without pocket information.

posebusters_scores.csv - contains ligand RMSD and other metrics for the unrelaxed structures predicted with Umol-pocket (40000 steps).

posebusters_no_pocket_unrelaxed.csv - contains ligand RMSD and other metrics for the unrelaxed structures predicted with Umol (60000 steps).

binding_db_results.csv - results for BindingDB predicted with Umol.

binding_db_above_85.tar.zst - structures and results from BindingDB with ligand plDDT > 85

params40000.npy - the parameters used in Umol-pocket (best checkpoint)

valid_meta.csv - IDs and metadata for validation set complexes

valid_scores.tar.zst - scores (ligand RMSD) on the validation set for both Umol and Umol-pocket

calibration_meta.csv - structural clusters for affinity set

calibration_set_affinities.csv - affinity values (only values with Kd < 1000 nM were used in the paper)

train_losses.csv - losses for training Umol-pocket

train_meta.csv - IDs and metadata for train set complexes

params60000.npy - the parameters used in Umol (best checkpoint)

cross_dataset_tanimoto_similarity.csv - Tanimoto similarity of training set to PoseBusters test set

af_pred.tar.zst - AlphaFold2 predictions for the PoseBusters test set used with DiffDock.

diffdock_af_pred.tar.zst - pdb and sdf files of proteins and ligands for DiffDock+AF on the PoseBusters test set. Also includes a csv with ligand RMSD scores.

neuralplexer.tar.zst - pdb and sdf files of proteins and ligands for NeuralPlexer on the PoseBusters test set. Also includes a csv with ligand RMSD scores.

rfaa.tar.zst - pdb files of proteins and ligands for RFAA w/o template information on the PoseBusters test set. Also includes a csv with ligand RMSD scores.

hhblits.tar.zst - PoseBusters hhblits MSAs used to predict the AF, Umol and RFAA protein structures.
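
Several of the archives above store plDDT in the B-factor column of the pdb files, and binding_db_above_85.tar.zst is selected on ligand plDDT > 85. The following is a minimal sketch of how that column could be read back with plain Python. The function names, the toy records and the idea that ligand atoms appear as HETATM records are assumptions made for illustration; they are not taken from the Umol code or from the archived files.

```python
def pdb_b_factors(pdb_text, records=("ATOM", "HETATM")):
    """Return B-factor values (PDB columns 61-66) for the selected record types."""
    values = []
    for line in pdb_text.splitlines():
        if line[:6].strip() in records and len(line) >= 66:
            values.append(float(line[60:66]))
    return values


def mean_b_factor(pdb_text, records=("ATOM", "HETATM")):
    """Average the B-factor column; returns None when no record matches."""
    values = pdb_b_factors(pdb_text, records)
    return sum(values) / len(values) if values else None


def atom_line(record, serial, name, res, chain, resseq, x, y, z, b_factor):
    """Build a fixed-width PDB atom record for the toy example below."""
    return (f"{record:<6}{serial:>5} {name:<4} {res:>3} {chain}{resseq:>4}    "
            f"{x:8.3f}{y:8.3f}{z:8.3f}{1.0:6.2f}{b_factor:6.2f}")


# Toy records (hypothetical values, not taken from the archives).
toy_pdb = "\n".join([
    atom_line("ATOM", 1, "CA", "ALA", "A", 1, 11.104, 6.134, -6.504, 92.0),
    atom_line("HETATM", 2, "C1", "LIG", "B", 1, 2.0, 3.0, 4.0, 90.0),
    atom_line("HETATM", 3, "O1", "LIG", "B", 1, 2.5, 3.5, 4.5, 82.0),
])

# Assumption: ligand atoms are written as HETATM records.
ligand_plddt = mean_b_factor(toy_pdb, records=("HETATM",))
print(ligand_plddt, ligand_plddt > 85)
```

For a real file, the text would come from reading an extracted pdb file; the record types to average over should be checked against the actual file contents first.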

Files (4.0 GB)

Checksum  Size
md5:8b43a483aa09debb475d9cf0cb699727  19.7 MB
md5:919138db3a0a1606d2861d43ef4ef394  11.1 MB
md5:abf4c0fdb7870be08bd87177ffd06cb7  13.1 MB
md5:67c2d787db4f469448bd9e00a5c406a5  10.3 kB
md5:2f79b45af0dde9399ec8e050c2dfe48d  14.5 kB
md5:139f0e87d6ec45971afca37348977c35  150.8 MB
md5:40ddce5ea5386f1d1f700db1b3a7b794  19.6 MB
md5:6f6097d4e9415461edb3cc423e75c3e3  1.4 GB
md5:246bdad87f0a0fef029f62f88283323f  17.4 MB
md5:fc3bc73e2b1c43dcdcf3a02e88eb4d11  371.6 MB
md5:8a73a58da548ca57f7caa5ad9a2e9b83  371.6 MB
md5:4de64509a8ff09bd95fc1aede5074e4b  1.4 GB
md5:0cd31c91e65aff100415bc07cfcb9e0d  64.0 kB
md5:7aac693f318a0e3beffd2ccdc6be60c4  85.3 MB
md5:375c6e6523b5233de1420ce1fc385868  19.6 MB
md5:137d3cca23b6bebd58e2d6b7d85ce036  84.4 kB
md5:5463d1922d8ff130d98df38e2edea359  34.8 MB
md5:8d3119c38594a23a8d74619e41ee0cbe  6.3 MB
md5:d56599a4134320c1757971d5c1150a1a  253.5 kB
md5:e7a1b6cf2416c4bdec53cc33e29bb124  12.8 kB
md5:8f09ae9c6a97987999796bf8be77d21a  719.5 kB
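
The md5 values listed for the files can be used to check that a download completed without corruption. The sketch below shows one way to do that with Python's standard `hashlib` module. The helper names are hypothetical, and because this listing does not pair checksums with file names, the caller has to supply the path and the expected value for a given file.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.strip().lower().split(":", 1)[-1]
    return md5_of_file(path) == expected_hex
```

Reading in chunks keeps memory use flat, which matters for the larger archives (up to 1.4 GB). A call would look like `matches_checksum("some_download.tar.zst", "md5:<value from the listing>")`, where the file name is a placeholder.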