DBNX: A Machine Learning Method for Ensembling Polygenic Risk Scores and Non-Genetic Factors

Xiangzhe Yuan, Chonghao Wang, Shuqin Zhu, Lu Zhang*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Polygenic risk scoring (PRS) holds promise for improving disease prediction and medical treatments by evaluating an individual's genetic susceptibility through multiple genetic variants. However, current PRS calculation methods often excel only in specific diseases and populations, with no single approach consistently outperforming others across all contexts. Furthermore, these methods frequently overlook non-genetic factors, such as lifestyle, that also impact disease risk.We introduce an unsupervised Deep Belief Network (DBN) to aggregate PRS generated by various methods, achieving performance comparable to the Super Learner method'a supervised ensemble approach that combines predictions from multiple methods to improve outcomes. Unlike supervised methods, the DBN does not require training data and can directly ensemble the available PRS. Remarkably, on small-scale datasets, the DBN outperforms the Super Learner. Additionally, we present the DBNX model, which integrates PRS with non-genetic factors using a combination of DBN and XGBoost. DBNX produces a Composite Risk Score (CRS) that incorporates information from both PRS and non-genetic factors. In our experiments using the UK Biobank (UKBB) dataset across four diseases, DBNX demonstrated superior performance compared to other commonly used ensemble methods. The codes for DBNX are available at: https://github.com/Hangzen/DBNX
Original languageEnglish
Number of pages11
JournalIEEE Transactions on Computational Biology and Bioinformatics
DOIs
Publication statusE-pub ahead of print - 16 Jun 2025

User-Defined Keywords

  • DBN
  • Polygenic Risk Scores
  • Ensemble Learning
  • Machine Learning

Fingerprint

Dive into the research topics of 'DBNX: A Machine Learning Method for Ensembling Polygenic Risk Scores and Non-Genetic Factors'. Together they form a unique fingerprint.

Cite this