TY - JOUR
T1 - An image-based protein-ligand binding representation learning framework via multi-level flexible dynamics trajectory pre-training
AU - Xiang, Hongxin
AU - Liu, Mingquan
AU - Hou, Linlin
AU - Jin, Shuting
AU - Wang, Jianmin
AU - Xia, Jun
AU - Du, Wenjie
AU - Yuan, Sisi
AU - Fu, Xiangzheng
AU - Yang, Xinyu
AU - Zeng, Li
AU - Xu, Lei
N1 - The work was supported by the National Natural Science Foundation of China [62422113], Shenzhen Science and Technology Program [20231129091450002], Shenzhen Polytechnic University Research Fund [6025310047K], and Key Field of Department of Education of Guangdong Province [2022ZDZX2082].
Publisher Copyright:
© 2025 The Author(s). Published by Oxford University Press.
PY - 2025/10/1
Y1 - 2025/10/1
N2 - Motivation Accurate prediction of protein-ligand binding (PLB) relationships plays a crucial role in drug discovery, which helps identify drugs that modulate the activity of specific targets. Traditional biological assays for measuring PLB relationships are time consuming and costly. In addition, models for predicting PLB relationships have been developed and widely used in drug discovery tasks. However, learning more accurate PLB representations is essential to meet the stringent standards required for drug discovery. Results We propose an image-based PLB representation learning framework, called ImagePLB, which equips ligand representation learner (LRL) and protein representation learner (PRL) to accept 3D multi-view ligand images and protein graphs as input, respectively, and learns rich interaction information between ligand and protein through a binding representation learner (BRL). Considering the scarcity of protein-ligand pairs, we further propose a multi-level next trajectory prediction (MLNTP) task to pre-train ImagePLB on the 4D flexible dynamics trajectory of 16 972 complexes, including ligand level, protein level, and complex level, to learn information related to trajectories. Besides, by introducing trajectory regularization (TR), we effectively alleviate the problem of high (even almost identical) feature similarity caused by adjacent trajectories. Compared with the current state-of-the-art methods, ImagePLB has achieved competitive improvements on PLB-related prediction tasks, including protein-ligand affinity and efficacy prediction tasks. This study opens the door to the image-based PLB learning paradigm. Availability and implementation All data and implementation details of code can be obtained from https://github.com/HongxinXiang/ImagePLB.
AB - Motivation Accurate prediction of protein-ligand binding (PLB) relationships plays a crucial role in drug discovery, which helps identify drugs that modulate the activity of specific targets. Traditional biological assays for measuring PLB relationships are time consuming and costly. In addition, models for predicting PLB relationships have been developed and widely used in drug discovery tasks. However, learning more accurate PLB representations is essential to meet the stringent standards required for drug discovery. Results We propose an image-based PLB representation learning framework, called ImagePLB, which equips ligand representation learner (LRL) and protein representation learner (PRL) to accept 3D multi-view ligand images and protein graphs as input, respectively, and learns rich interaction information between ligand and protein through a binding representation learner (BRL). Considering the scarcity of protein-ligand pairs, we further propose a multi-level next trajectory prediction (MLNTP) task to pre-train ImagePLB on the 4D flexible dynamics trajectory of 16 972 complexes, including ligand level, protein level, and complex level, to learn information related to trajectories. Besides, by introducing trajectory regularization (TR), we effectively alleviate the problem of high (even almost identical) feature similarity caused by adjacent trajectories. Compared with the current state-of-the-art methods, ImagePLB has achieved competitive improvements on PLB-related prediction tasks, including protein-ligand affinity and efficacy prediction tasks. This study opens the door to the image-based PLB learning paradigm. Availability and implementation All data and implementation details of code can be obtained from https://github.com/HongxinXiang/ImagePLB.
UR - http://www.scopus.com/inward/record.url?scp=105018022404&partnerID=8YFLogxK
UR - https://academic.oup.com/bioinformatics/article/41/10/btaf535/8262955?login=true
U2 - 10.1093/bioinformatics/btaf535
DO - 10.1093/bioinformatics/btaf535
M3 - Journal article
C2 - 40991325
AN - SCOPUS:105018022404
SN - 1367-4803
VL - 41
JO - Bioinformatics
JF - Bioinformatics
IS - 10
M1 - btaf535
ER -