TY - JOUR
T1 - Mendelian randomization for causal inference accounting for pleiotropy and sample structure using genome-wide summary statistics
AU - Hu, Xianghong
AU - Zhao, Jia
AU - Lin, Zhixiang
AU - Wang, Yang
AU - Peng, Heng
AU - Zhao, Hongyu
AU - Wan, Xiang
AU - Yang, Can
N1 - ACKNOWLEDGMENTS. We thank the editor and two anonymous reviewers for their very detailed and constructive comments, which have greatly helped to im-proveourmanuscript.WealsothankProf.LinS.Chen,Prof.LanWang,Prof.Baolin Wu, Prof. Zhigang Bao, and Prof. Dong Xia for their helpful comments and insightful discussions. This work is supported in part by Chinese Key-Area Research and Development Program of Guangdong Province Grant 2020B0101350001; Hong Kong Research Grant Council Grants 16307818, 16301419, 16308120, 12303618, 24301419, 14301120, and 16307221; Hong Kong Innovation and Technology Fund Grant PRP/029/19FX; Hong Kong University of Science and Technology Startup Grants R9405 and Z0428 from the Big Data Institute; Chinese University of Hong Kong Direct Grants 4053360 and 4053423; Chinese University of Hong Kong Startup Grant 4930181; the Chinese University of Hong Kong’s Project Impact Enhancement Fund and Science Faculty’s Collaborative Research Impact Matching Scheme; National Science Foundation of China Grant 12026610; Open Research Fund from Shenzhen Research Institute of Big Data Grant 2019ORF01004; and the Guangdong Provincial Key Laboratory of Big Data Computing, The Chinese University of Hong Kong, Shenzhen. The computational task for this work was performed by using the X-GPU cluster supported by the Research Grants Council Collaborative Research Fund Grant C6021-19EF.
Publisher Copyright:
Copyright © 2022 the Author(s). Published by PNAS. This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).
PY - 2022/7/12
Y1 - 2022/7/12
N2 - Mendelian randomization (MR) is a valuable tool for inferring causal relationships among a wide range of traits using summary statistics from genome-wide association studies (GWASs). Existing summary-level MR methods often rely on strong assumptions, resulting in many false-positive findings. To relax MR assumptions, ongoing research has been primarily focused on accounting for confounding due to pleiotropy. Here, we show that sample structure is another major confounding factor, including population stratification, cryptic relatedness, and sample overlap. We propose a unified MR approach, MR-APSS, which 1) accounts for pleiotropy and sample structure simultaneously by leveraging genome-wide information; and 2) allows the inclusion of more genetic variants with moderate effects as instrument variables (IVs) to improve statistical power without inflating type I errors. We first evaluated MR-APSS using comprehensive simulations and negative controls and then applied MR-APSS to study the causal relationships among a collection of diverse complex traits. The results suggest that MR-APSS can better identify plausible causal relationships with high reliability. In particular, MR-APSS can perform well for highly polygenic traits, where the IV strengths tend to be relatively weak and existing summary-level MR methods for causal inference are vulnerable to confounding effects.
AB - Mendelian randomization (MR) is a valuable tool for inferring causal relationships among a wide range of traits using summary statistics from genome-wide association studies (GWASs). Existing summary-level MR methods often rely on strong assumptions, resulting in many false-positive findings. To relax MR assumptions, ongoing research has been primarily focused on accounting for confounding due to pleiotropy. Here, we show that sample structure is another major confounding factor, including population stratification, cryptic relatedness, and sample overlap. We propose a unified MR approach, MR-APSS, which 1) accounts for pleiotropy and sample structure simultaneously by leveraging genome-wide information; and 2) allows the inclusion of more genetic variants with moderate effects as instrument variables (IVs) to improve statistical power without inflating type I errors. We first evaluated MR-APSS using comprehensive simulations and negative controls and then applied MR-APSS to study the causal relationships among a collection of diverse complex traits. The results suggest that MR-APSS can better identify plausible causal relationships with high reliability. In particular, MR-APSS can perform well for highly polygenic traits, where the IV strengths tend to be relatively weak and existing summary-level MR methods for causal inference are vulnerable to confounding effects.
KW - causal inference
KW - Mendelian randomization
KW - pleiotropy
KW - sample structure
KW - selection bias
UR - http://www.scopus.com/inward/record.url?scp=85133263532&partnerID=8YFLogxK
U2 - 10.1073/pnas.2106858119
DO - 10.1073/pnas.2106858119
M3 - Journal article
C2 - 35787050
AN - SCOPUS:85133263532
SN - 0027-8424
VL - 119
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 28
M1 - e2106858119
ER -