TY - JOUR
T1 - MicroEnvPPI
T2 - Microenvironment-Aware Optimization Enables Generalizable Protein–Protein Interaction Prediction
AU - Yang, Kun
AU - Chen, Yifan
AU - Wei, Yanshi
AU - Xiang, Mingrong
AU - Zhuo, Linlin
AU - Fu, Xiangzheng
AU - Cao, Dongsheng
AU - Zhang, Wenqian
N1 - The work was supported by the National Natural Science Foundation of China (Nos. 62302339 and 62372158).
Publisher Copyright:
© 2025 American Chemical Society
PY - 2025/11/10
Y1 - 2025/11/10
N2 - Protein–protein interactions (PPIs) play a fundamental role in shaping cellular functional networks and guiding therapeutic target discovery. Although models such as AlphaFold have achieved impressive results in protein structure prediction and PPI inference, they tend to overlook the structural and contextual importance of residue-level microenvironments, which limits their predictive capacity. Here, we present MicroEnvPPI, a microenvironment-aware optimization framework designed to improve the accuracy and generalizability of PPI prediction. MicroEnvPPI integrates residue-level physicochemical features and contextual embeddings derived from the ESM-2 language model with structural information predicted by AlphaFold, enabling a comprehensive characterization of residue microenvironments. Additionally, auxiliary tasks that incorporate graph contrastive learning and masking mechanisms optimize the residue microenvironment representation, enhancing both its quality and the model’s generalization ability. Finally, MicroEnvPPI strengthens its advantage in PPI prediction by jointly training global PPI and microenvironment optimization tasks. Notably, MicroEnvPPI achieves strong performance under challenging data partition schemes, such as DFS and BFS, indicating its ability to generalize to previously unseen interactions. These findings underscore the potential of MicroEnvPPI to advance our understanding of protein interaction networks.
AB - Protein–protein interactions (PPIs) play a fundamental role in shaping cellular functional networks and guiding therapeutic target discovery. Although models such as AlphaFold have achieved impressive results in protein structure prediction and PPI inference, they tend to overlook the structural and contextual importance of residue-level microenvironments, which limits their predictive capacity. Here, we present MicroEnvPPI, a microenvironment-aware optimization framework designed to improve the accuracy and generalizability of PPI prediction. MicroEnvPPI integrates residue-level physicochemical features and contextual embeddings derived from the ESM-2 language model with structural information predicted by AlphaFold, enabling a comprehensive characterization of residue microenvironments. Additionally, auxiliary tasks that incorporate graph contrastive learning and masking mechanisms optimize the residue microenvironment representation, enhancing both its quality and the model’s generalization ability. Finally, MicroEnvPPI strengthens its advantage in PPI prediction by jointly training global PPI and microenvironment optimization tasks. Notably, MicroEnvPPI achieves strong performance under challenging data partition schemes, such as DFS and BFS, indicating its ability to generalize to previously unseen interactions. These findings underscore the potential of MicroEnvPPI to advance our understanding of protein interaction networks.
UR - http://www.scopus.com/inward/record.url?scp=105021089654&partnerID=8YFLogxK
U2 - 10.1021/acs.jcim.5c01907
DO - 10.1021/acs.jcim.5c01907
M3 - Journal article
C2 - 41134148
AN - SCOPUS:105021089654
SN - 1549-9596
VL - 65
SP - 11860
EP - 11877
JO - Journal of Chemical Information and Modeling
JF - Journal of Chemical Information and Modeling
IS - 21
ER -