TY - JOUR
T1 - ADME Properties Evaluation in Drug Discovery
T2 - Prediction of Caco-2 Cell Permeability Using a Combination of NSGA-II and Boosting
AU - Wang, Ning Ning
AU - Dong, Jie
AU - Deng, Yin Hua
AU - Zhu, Min Feng
AU - Wen, Ming
AU - Yao, Zhi Jiang
AU - LYU, Aiping
AU - Wang, Jian Bing
AU - Cao, Dong Sheng
N1 - Funding Information:
We thank three anonymous referees and the editor for their constructive comments, which greatly helped improve upon the original version of the paper. This work is financially supported by grants from the Project of Innovation-driven Plan in Central South University, the National Natural Science Foundation of China (Grant No. 81402853), and the Postdoctoral Science Foundation of Central South University, the Chinese Postdoctoral Science Foundation (2014T70794, 2014M562142). The studies meet with the approval of the university's review board.
PY - 2016/4/25
Y1 - 2016/4/25
N2 - The Caco-2 cell monolayer model is a popular surrogate in predicting the in vitro human intestinal permeability of a drug due to its morphological and functional similarity with human enterocytes. A quantitative structure-property relationship (QSPR) study was carried out to predict Caco-2 cell permeability of a large data set consisting of 1272 compounds. Four different methods including multivariate linear regression (MLR), partial least-squares (PLS), support vector machine (SVM) regression and Boosting were employed to build prediction models with 30 molecular descriptors selected by nondominated sorting genetic algorithm-II (NSGA-II). The best Boosting model was obtained finally with R2 = 0.97, RMSEF = 0.12, Q2 = 0.83, RMSECV = 0.31 for the training set and RT2 = 0.81, RMSET = 0.31 for the test set. A series of validation methods were used to assess the robustness and predictive ability of our model according to the OECD principles and then define its applicability domain. Compared with the reported QSAR/QSPR models about Caco-2 cell permeability, our model exhibits certain advantage in database size and prediction accuracy to some extent. Finally, we found that the polar volume, the hydrogen bond donor, the surface area and some other descriptors can influence the Caco-2 permeability to some extent. These results suggest that the proposed model is a good tool for predicting the permeability of drug candidates and to perform virtual screening in the early stage of drug development.
AB - The Caco-2 cell monolayer model is a popular surrogate in predicting the in vitro human intestinal permeability of a drug due to its morphological and functional similarity with human enterocytes. A quantitative structure-property relationship (QSPR) study was carried out to predict Caco-2 cell permeability of a large data set consisting of 1272 compounds. Four different methods including multivariate linear regression (MLR), partial least-squares (PLS), support vector machine (SVM) regression and Boosting were employed to build prediction models with 30 molecular descriptors selected by nondominated sorting genetic algorithm-II (NSGA-II). The best Boosting model was obtained finally with R2 = 0.97, RMSEF = 0.12, Q2 = 0.83, RMSECV = 0.31 for the training set and RT2 = 0.81, RMSET = 0.31 for the test set. A series of validation methods were used to assess the robustness and predictive ability of our model according to the OECD principles and then define its applicability domain. Compared with the reported QSAR/QSPR models about Caco-2 cell permeability, our model exhibits certain advantage in database size and prediction accuracy to some extent. Finally, we found that the polar volume, the hydrogen bond donor, the surface area and some other descriptors can influence the Caco-2 permeability to some extent. These results suggest that the proposed model is a good tool for predicting the permeability of drug candidates and to perform virtual screening in the early stage of drug development.
UR - http://www.scopus.com/inward/record.url?scp=84969277735&partnerID=8YFLogxK
U2 - 10.1021/acs.jcim.5b00642
DO - 10.1021/acs.jcim.5b00642
M3 - Journal article
C2 - 27018227
AN - SCOPUS:84969277735
SN - 1549-9596
VL - 56
SP - 763
EP - 773
JO - Journal of Chemical Information and Modeling
JF - Journal of Chemical Information and Modeling
IS - 4
ER -