Published 2025 | Version v2
Publication Open

ArtiDock: accurate Machine Learning approach to protein-ligand docking optimized for high-throughput virtual screening

  • 1. Receptor.AI Inc., 20-22 Wenlock Road, London N1 7GU, United Kingdom
  • 2. Department of Physics of Biological Systems, Institute of Physics of The National Academy of Sciences of Ukraine, 46 Nauky Ave., 03038, Kyiv, Ukraine
  • 3. Department of Biophysics and Medical Informatics, Educational and Scientific Centre "Іnstitute of Biology and Medicine", Taras Shevchenko Kyiv National University, 64 Volodymyrska Str., 01601, Kyiv, Ukraine
  • 4. Department of Cellular, Computational and Integrative Biology, The University of Trento, Via Sommarive 9, 38123 Povo (Trento), Italy
  • 5. Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, CZ-166 10 Prague 6, Czech Republic
  • 6. Department of Physical Chemistry, Faculty of Science, Palacký University Olomouc, 17. listopadu 12, 771 46 Olomouc, Czech Republic

Description

This repository contains:

  • plinder-v2-eval.tar.gz: the preprocessed PLINDER test subsets;
  • docking-results.tar.gz: predicted ligand poses from all compared tools and tables with per-pose ligand scores;
  • posex-results.tar.gz: predicted ligand poses for PoseX benchmark.

Paper Abstract:

Classical protein-ligand docking has been a cornerstone technique in computational drug discovery for decades, but has reached an accuracy and performance plateau. Recently introduced Machine Learning (ML) based docking methods offer a promising paradigm shift, but their practical adoption is hampered by accuracy-to-speed trade-offs, inadequate benchmarking standards, and questionable chemical validity of predicted poses. In this study, we introduce ArtiDock - an ML-based docking technique optimized for high-throughput virtual screening applications. To evaluate ArtiDock, we developed a dedicated performance and accuracy benchmark for pocket-specific rigid protein-ligand docking, which mimics realistic industrial drug discovery scenarios and is based on the novel PLINDER dataset. We demonstrate that ArtiDock is 29-38% more accurate in comparison to leading open-source and commercial classical docking techniques such as AutoDock, Vina, and Glide, while providing a low computational cost. ArtiDock notably excels in challenging docking scenarios involving unbound protein structures and binding sites containing ions and structured water molecules. Additionally, we demonstrated competitive accuracy of our approach at considerably higher throughput compared to a wide range of AI docking and AI co-folding methods using the PoseX benchmark. Our results show that ArtiDock could be considered as a method of choice in high-throughput virtual screening scenarios.

Files

NOTICE-plinder.txt

Files (182.6 MB)

Name Size Download all
md5:115ddef3778431bc583bb9300c618e7d
21.8 MB Download
md5:5100dbbe594490330d91ed4ae619cd52
927 Bytes Preview Download
md5:31b08c3012377a5c5cc30c6527f1fa5a
154.0 MB Download
md5:677cad66c5dc133bc8ebdc371f9ba79b
6.8 MB Download

Additional details

Software

Repository URL
https://github.com/receptor-ai/dock-eval.git
Programming language
Python