There is a newer version of the record available.

Published April 25, 2023 | Version 1.0
Dataset Open

Synthesizing maximal unbiased benchmarking data for virtual screening via deep reinforcement learning

  • 1. Chinese Academy of Medical Sciences & Peking Union Medical College Institute of Materia Medica

Description

This compressed file contains all datasets made for the validation of MUBDsyn.

  • datasets_int_val: 17 cases in this folder are derived from MUBD for GPCRs (https://github.com/jwxia2014/ULS-UDS). MUBDreal was made by MUBD-DecoyMaker2.0 and MUBDsyn was made by MUBD-DecoyMakersyn.
  • datasets_ext_val_classical_VS: Five cases in this folder are derived from the shared cases of MUV and DUD-E. The active sets of MUV were taken as the input to make corresponding MUBD datasets. Files in SBVS are raw molecular docking results by smina.
  • datasets_ext_val_ML_VS: Ten cases in this folder are derived from NRLiSt-BDB. Corresponding MUBD datasets were made as described above. 

All these datasets can be used for the reproduction of validation performed in the manuscript or to benchmark various virtual screening methods. 

Files

Files (522.4 MB)

Name Size Download all
md5:3b46c21c6f8e37c7651459df74dc1abb
522.4 MB Download