TY - JOUR
T1 - Decoding the limits of deep learning in molecular docking for drug discovery
AU - Li, Yue
AU - Yi, Jiacai
AU - Li, Hui
AU - Li, Kun
AU - Kang, Fenghua
AU - Deng, Youchao
AU - Wu, Chengkun
AU - Fu, Xiangzheng
AU - Jiang, Dejun
AU - Cao, Dongsheng
N1 - Publisher Copyright:
© 2025 The Author(s). Published by the Royal Society of Chemistry.
Funding Information:
This work was supported by National Natural Science Foundation of China (22173118, 22307112, 82304316), Young Scientists Fund of Natural Science Foundation of Hunan Province of China (2025JJ60651), Noncommunicable Chronic Diseases-National Science and Technology Major Project [2023ZD0507104]. We acknowledge Haikun Xu, and the High-Performance Computing Center of Central South University for support.
PY - 2025/10/7
Y1 - 2025/10/7
N2 - Structure-based molecular docking, a cornerstone of computational drug design, is undergoing a paradigm shift fueled by deep learning (DL) innovations. However, the rapid proliferation of DL-driven docking methods has created uncharted challenges in translating in silico predictions to biomedical reality. Here, we delve into the performance and prospects of traditional methods and state-of-the-art DL docking paradigms—encompassing generative diffusion models, regression-based architectures, and hybrid frameworks—across five critical dimensions: pose prediction accuracy, physical plausibility, interaction recovery, virtual screening (VS) efficacy, and generalization across diverse protein–ligand landscapes. We reveal that generative diffusion models achieve superior pose accuracy, while hybrid methods offer the best balance. Regression models, however, often fail to product physically valid poses, and most DL methods exhibit high steric tolerance. Furthermore, our analysis reveals significant challenges in generalization, particularly when encountering novel protein binding pockets, limiting the current applicability of DL methods. Finally, we explore failure mechanisms from a model perspective and propose optimization strategies, offering actionable insights to guide docking tool selection and advance robust, generalizable DL frameworks for molecular docking.
AB - Structure-based molecular docking, a cornerstone of computational drug design, is undergoing a paradigm shift fueled by deep learning (DL) innovations. However, the rapid proliferation of DL-driven docking methods has created uncharted challenges in translating in silico predictions to biomedical reality. Here, we delve into the performance and prospects of traditional methods and state-of-the-art DL docking paradigms—encompassing generative diffusion models, regression-based architectures, and hybrid frameworks—across five critical dimensions: pose prediction accuracy, physical plausibility, interaction recovery, virtual screening (VS) efficacy, and generalization across diverse protein–ligand landscapes. We reveal that generative diffusion models achieve superior pose accuracy, while hybrid methods offer the best balance. Regression models, however, often fail to product physically valid poses, and most DL methods exhibit high steric tolerance. Furthermore, our analysis reveals significant challenges in generalization, particularly when encountering novel protein binding pockets, limiting the current applicability of DL methods. Finally, we explore failure mechanisms from a model perspective and propose optimization strategies, offering actionable insights to guide docking tool selection and advance robust, generalizable DL frameworks for molecular docking.
UR - http://www.scopus.com/inward/record.url?scp=105016847383&partnerID=8YFLogxK
U2 - 10.1039/d5sc05395a
DO - 10.1039/d5sc05395a
M3 - Journal article
AN - SCOPUS:105016847383
SN - 2041-6520
VL - 16
SP - 17374
EP - 17390
JO - Chemical Science
JF - Chemical Science
IS - 37
ER -