TY - JOUR
T1 - Improving docking-based virtual screening ability by integrating multiple energy auxiliary terms from molecular docking scoring
AU - Ye, Wen Ling
AU - Shen, Chao
AU - Xiong, Guo Li
AU - Ding, Jun Jie
AU - Lu, Ai Ping
AU - Hou, Ting Jun
AU - Cao, Dong Sheng
N1 - Funding Information:
This work is financially supported by the National Key Basic Research Program (2015CB910700), the National Science Foundation of China (81603031, 21575128, 81773632), the Ministry of Science and Technology of China (2016YFA0501700), and HKBU Strategic Development Fund project (SDF19-0402-P02). The studies meet with the approval of the university’s review board.
Publisher copyright:
© 2020 American Chemical Society
PY - 2020/9/28
Y1 - 2020/9/28
N2 - Virtual Screening (VS) based on molecular docking is an efficient method used for retrieving novel hit compounds in drug discovery. However, the accuracy of the current docking scoring function (SF) is usually insufficient. In this study, in order to improve the screening power of SF, a novel approach named EAT-Score was proposed by directly utilizing the energy auxiliary terms (EAT) provided by molecular docking scoring through eXtreme Gradient Boosting (XGBoost). Here, EAT specifically refers to the output of the Molecular Operating Environment (MOE) scoring, including the energy scores of five different classical SFs and the Protein−Ligand Interaction Fingerprint (PLIF) terms. The performance of EAT-Score to discriminate actives from decoys was strictly validated on the DUD-E diverse subset by using different performance metrics. The results showed that EAT-Score performed much better than classical SFs in VS, with its AUC values exhibiting an improvement of around 0.3. Meanwhile, EAT-Score could achieve comparable even better prediction performance compared with other state-of-the-art VS methods, such as some machine learning (ML)-based SFs and classical SFs implemented in docking programs, in terms of AUC, LogAUC, or BEDROC. Furthermore, the EAT-Score model can capture important binding pattern information from protein−ligand complexes by Shapley additive explanations (SHAP) analysis, which may be very helpful in interpreting the ligand binding mechanism for a certain target and thereby guiding drug design.
AB - Virtual Screening (VS) based on molecular docking is an efficient method used for retrieving novel hit compounds in drug discovery. However, the accuracy of the current docking scoring function (SF) is usually insufficient. In this study, in order to improve the screening power of SF, a novel approach named EAT-Score was proposed by directly utilizing the energy auxiliary terms (EAT) provided by molecular docking scoring through eXtreme Gradient Boosting (XGBoost). Here, EAT specifically refers to the output of the Molecular Operating Environment (MOE) scoring, including the energy scores of five different classical SFs and the Protein−Ligand Interaction Fingerprint (PLIF) terms. The performance of EAT-Score to discriminate actives from decoys was strictly validated on the DUD-E diverse subset by using different performance metrics. The results showed that EAT-Score performed much better than classical SFs in VS, with its AUC values exhibiting an improvement of around 0.3. Meanwhile, EAT-Score could achieve comparable even better prediction performance compared with other state-of-the-art VS methods, such as some machine learning (ML)-based SFs and classical SFs implemented in docking programs, in terms of AUC, LogAUC, or BEDROC. Furthermore, the EAT-Score model can capture important binding pattern information from protein−ligand complexes by Shapley additive explanations (SHAP) analysis, which may be very helpful in interpreting the ligand binding mechanism for a certain target and thereby guiding drug design.
UR - http://www.scopus.com/inward/record.url?scp=85091807222&partnerID=8YFLogxK
U2 - 10.1021/acs.jcim.9b00977
DO - 10.1021/acs.jcim.9b00977
M3 - Journal article
C2 - 32352294
AN - SCOPUS:85091807222
SN - 1549-9596
VL - 60
SP - 4216
EP - 4230
JO - Journal of Chemical Information and Modeling
JF - Journal of Chemical Information and Modeling
IS - 9
ER -