Decoding the limits of deep learning in molecular docking for drug discovery

Yue Li, Jiacai Yi, Hui Li, Kun Li, Fenghua Kang, Youchao Deng, Chengkun Wu, Xiangzheng Fu, Dejun Jiang*, Dongsheng Cao*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Structure-based molecular docking, a cornerstone of computational drug design, is undergoing a paradigm shift fueled by deep learning (DL) innovations. However, the rapid proliferation of DL-driven docking methods has created uncharted challenges in translating in silico predictions to biomedical reality. Here, we delve into the performance and prospects of traditional methods and state-of-the-art DL docking paradigms—encompassing generative diffusion models, regression-based architectures, and hybrid frameworks—across five critical dimensions: pose prediction accuracy, physical plausibility, interaction recovery, virtual screening (VS) efficacy, and generalization across diverse protein–ligand landscapes. We reveal that generative diffusion models achieve superior pose accuracy, while hybrid methods offer the best balance. Regression models, however, often fail to product physically valid poses, and most DL methods exhibit high steric tolerance. Furthermore, our analysis reveals significant challenges in generalization, particularly when encountering novel protein binding pockets, limiting the current applicability of DL methods. Finally, we explore failure mechanisms from a model perspective and propose optimization strategies, offering actionable insights to guide docking tool selection and advance robust, generalizable DL frameworks for molecular docking.

Original languageEnglish
Pages (from-to)17374-17390
Number of pages17
JournalChemical Science
Volume16
Issue number37
Early online date19 Aug 2025
DOIs
Publication statusPublished - 7 Oct 2025

Fingerprint

Dive into the research topics of 'Decoding the limits of deep learning in molecular docking for drug discovery'. Together they form a unique fingerprint.

Cite this