TY - JOUR
T1 - Predicting protein conformational motions using energetic frustration analysis and AlphaFold2
AU - Guan, Xingyue
AU - Tang, Qian Yuan
AU - Ren, Weitong
AU - Chen, Mingchen
AU - Wang, Wei
AU - Wolynes, Peter G.
AU - Li, Wenfei
N1 - This work was supported by the National Natural Science Foundation of China (Grant Nos. 11974173, 11934008, and 12305052), the grant of Wenzhou Institute, University of Chinese Academy of Sciences (WIUCASQD2021010, WIUCASQD2023015), and Hong Kong Research Grant Council (No. 22302723). P.G.W. acknowledges support for the Center for Theoretical Biological Physics from NSF grant PHY-2019745 and the support of the Bullard-Welch Chair from Grant C-0016 from the Welch Foundation to Rice. We also thank the support of High Performance Computing Center of Nanjing University and High Performance Computing Center of Wenzhou Institute, University of Chinese Academy of Sciences.
Publisher Copyright:
© 2024 the Author(s).
PY - 2024/8/27
Y1 - 2024/8/27
N2 - Proteins perform their biological functions through motion. Although high throughput prediction of the three-dimensional static structures of proteins has proved feasible using deep-learning-based methods, predicting the conformational motions remains a challenge. Purely data-driven machine learning methods encounter difficulty for addressing such motions because available laboratory data on conformational motions are still limited. In this work, we develop a method for generating protein allosteric motions by integrating physical energy landscape information into deep-learning-based methods. We show that local energetic frustration, which represents a quantification of the local features of the energy landscape governing protein allosteric dynamics, can be utilized to empower AlphaFold2 (AF2) to predict protein conformational motions. Starting from ground state static structures, this integrative method generates alternative structures as well as pathways of protein conformational motions, using a progressive enhancement of the energetic frustration features in the input multiple sequence alignment sequences. For a model protein adenylate kinase, we show that the generated conformational motions are consistent with available experimental and molecular dynamics simulation data. Applying the method to another two proteins KaiB and ribose-binding protein, which involve large-amplitude conformational changes, can also successfully generate the alternative conformations. We also show how to extract overall features of the AF2 energy landscape topography, which has been considered by many to be black box. Incorporating physical knowledge into deep-learning-based structure prediction algorithms provides a useful strategy to address the challenges of dynamic structure prediction of allosteric proteins.
AB - Proteins perform their biological functions through motion. Although high throughput prediction of the three-dimensional static structures of proteins has proved feasible using deep-learning-based methods, predicting the conformational motions remains a challenge. Purely data-driven machine learning methods encounter difficulty for addressing such motions because available laboratory data on conformational motions are still limited. In this work, we develop a method for generating protein allosteric motions by integrating physical energy landscape information into deep-learning-based methods. We show that local energetic frustration, which represents a quantification of the local features of the energy landscape governing protein allosteric dynamics, can be utilized to empower AlphaFold2 (AF2) to predict protein conformational motions. Starting from ground state static structures, this integrative method generates alternative structures as well as pathways of protein conformational motions, using a progressive enhancement of the energetic frustration features in the input multiple sequence alignment sequences. For a model protein adenylate kinase, we show that the generated conformational motions are consistent with available experimental and molecular dynamics simulation data. Applying the method to another two proteins KaiB and ribose-binding protein, which involve large-amplitude conformational changes, can also successfully generate the alternative conformations. We also show how to extract overall features of the AF2 energy landscape topography, which has been considered by many to be black box. Incorporating physical knowledge into deep-learning-based structure prediction algorithms provides a useful strategy to address the challenges of dynamic structure prediction of allosteric proteins.
KW - deep-learning
KW - energy landscapes
KW - multiple sequence alignment
KW - protein folding
KW - structure prediction
UR - http://www.scopus.com/inward/record.url?scp=85201851908&partnerID=8YFLogxK
U2 - 10.1073/pnas.2410662121
DO - 10.1073/pnas.2410662121
M3 - Journal article
C2 - 39163334
AN - SCOPUS:85201851908
SN - 0027-8424
VL - 121
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 35
M1 - e2410662121
ER -