TY - JOUR
T1 - Accelerating the Discovery of Hybrid Perovskites with Targeted Band Gaps via Interpretable Machine Learning
AU - Yang, Chao
AU - Chong, Xiaoyu
AU - Hu, Mingyu
AU - Yu, Wei
AU - He, Jingjin
AU - Zhang, Yalan
AU - Feng, Jing
AU - Zhou, Yuanyuan
AU - Wang, Lin Wang
N1 - This work was supported by the National Key R&D Program of China (no. 2019YFB1503200).
Publisher Copyright:
© 2023 American Chemical Society.
PY - 2023/8/30
Y1 - 2023/8/30
N2 - The band gap of hybrid organic-inorganic perovskites (HOIPs) is a key factor affecting the light absorption characteristics and thus the performance of perovskite solar cells (PSCs). However, band gap engineering, using experimental trial and error and high-throughput density functional theory calculations, is blind and costly. Therefore, it is critical to statistically identify the multiple factors influencing band gaps and to rationally design perovskites with targeted band gaps. This study combined feature engineering, the gradient-boosted regression tree (GBRT) algorithm, and the genetic algorithm-based symbolic regression (GASR) algorithm to develop an interpretable machine learning (ML) strategy for predicting the band gap of HOIPs accurately and quantitatively interpreting the factors affecting the band gap. Seven best physical features were selected to construct a GBRT model with a root-mean-square error of less than 0.060 eV, and the most important feature is the electronegativity difference between the B-site and the X-site (χB-X). Further, a mathematical formula (Eg = χB-X2 + 0.881χB-X) was constructed with GASR for a quantitative interpretation of the band gap influence patterns. According to the ML model, the HOIP MA0.23FA0.02Cs0.75Pb0.59Sn0.41Br0.24I2.76 was obtained, with a suitable band gap of 1.39 eV. Our proposed interpretable ML strategy provides an effective approach for developing HOIP structures with targeted band gaps, which can also be applied to other material fields.
AB - The band gap of hybrid organic-inorganic perovskites (HOIPs) is a key factor affecting the light absorption characteristics and thus the performance of perovskite solar cells (PSCs). However, band gap engineering, using experimental trial and error and high-throughput density functional theory calculations, is blind and costly. Therefore, it is critical to statistically identify the multiple factors influencing band gaps and to rationally design perovskites with targeted band gaps. This study combined feature engineering, the gradient-boosted regression tree (GBRT) algorithm, and the genetic algorithm-based symbolic regression (GASR) algorithm to develop an interpretable machine learning (ML) strategy for predicting the band gap of HOIPs accurately and quantitatively interpreting the factors affecting the band gap. Seven best physical features were selected to construct a GBRT model with a root-mean-square error of less than 0.060 eV, and the most important feature is the electronegativity difference between the B-site and the X-site (χB-X). Further, a mathematical formula (Eg = χB-X2 + 0.881χB-X) was constructed with GASR for a quantitative interpretation of the band gap influence patterns. According to the ML model, the HOIP MA0.23FA0.02Cs0.75Pb0.59Sn0.41Br0.24I2.76 was obtained, with a suitable band gap of 1.39 eV. Our proposed interpretable ML strategy provides an effective approach for developing HOIP structures with targeted band gaps, which can also be applied to other material fields.
KW - band gap
KW - GBRT
KW - machine learning
KW - perovskite solar cell
KW - symbolic regression
UR - http://www.scopus.com/inward/record.url?scp=85169026789&partnerID=8YFLogxK
U2 - 10.1021/acsami.3c06392
DO - 10.1021/acsami.3c06392
M3 - Journal article
C2 - 37594363
AN - SCOPUS:85169026789
SN - 1944-8244
VL - 15
SP - 40419
EP - 40427
JO - ACS Applied Materials and Interfaces
JF - ACS Applied Materials and Interfaces
IS - 34
ER -