ChemFH: an integrated tool for screening frequent false positives in chemical biology and drug discovery

Shaohua Shi, Li Fu, Jiacai Yi, Ziyi Yang, Xiaochen Zhang, Youchao Deng, Wenxuan Wang, Chengkun Wu, Wentao Zhao, Tingjun Hou, Xiangxiang Zeng*, Aiping Lyu*, Dongsheng Cao*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

High-throughput screening rapidly tests an extensive array of chemical compounds to identify hit compounds for specific biological targets in drug discovery. However, false-positive results disrupt hit compound screening, leading to wastage of time and resources. To address this, we propose ChemFH, an integrated online platform facilitating rapid virtual evaluation of potential false positives, including colloidal aggregators, spectroscopic interference compounds, firefly luciferase inhibitors, chemical reactive compounds, promiscuous compounds, and other assay interferences. By leveraging a dataset containing 823 391 compounds, we constructed high-quality prediction models using multi-task directed message-passing network (DMPNN) architectures combining uncertainty estimation, yielding an average AUC value of 0.91. Furthermore, ChemFH incorporated 1441 representative alert substructures derived from the collected data and ten commonly used frequent hitter screening rules. ChemFH was validated with an external set of 75 compounds. Subsequently, the virtual screening capability of ChemFH was successfully confirmed through its application to five virtual screening libraries. Furthermore, ChemFH underwent additional validation on two natural products and FDA-approved drugs, yielding reliable and accurate results. ChemFH is a comprehensive, reliable, and computationally efficient screening pipeline that facilitates the identification of true positive results in assays, contributing to enhanced efficiency and success rates in drug discovery. ChemFH is freely available via https://chemfh.scbdd.com/.
Original languageEnglish
Pages (from-to)W439–W449
Number of pages11
JournalNucleic Acids Research
Volume52
Issue numberW1
Early online date23 May 2024
DOIs
Publication statusPublished - 5 Jul 2024

Scopus Subject Areas

  • Genetics

Fingerprint

Dive into the research topics of 'ChemFH: an integrated tool for screening frequent false positives in chemical biology and drug discovery'. Together they form a unique fingerprint.

Cite this