PATNet: Propensity-Adjusted Temporal Network for Joint Imputation and Prediction Using Binary EHRs With Observation Bias

Kejing Yin*, Dong Qian, William K. Cheung

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Predictive analysis of electronic health records (EHR) is a fundamental task that could provide actionable insights to help clinicians improve the efficiency and quality of care. EHR are commonly recorded in binary format and contain inevitable missing data. The nature of missingness may vary by patients, clinical features, and time, which incurs observation bias. It is essential to account for the binary missingness and observation bias or the predictive performance could be substantially compromised. In this paper, we develop a propensity-adjusted temporal network (PATNet) to conduct data imputation and predictive analysis simultaneously. PATNet contains three subnetworks: 1) an imputation subnetwork that generates the initial imputation based on historical observations, 2) a propensity subnetwork that infers the patient-, feature-, and time-dependent propensity scores, and 3) a prediction subnetwork that produces the missing-informative prediction using the propensity-adjusted imputations and the missing probabilities. To allow the propensity scores to be inferred from data, we use the expectation-maximization (EM) algorithm to learn the imputation and propensity subnetworks and incorporate a low-rank constraint via PARAFAC2 approximation. Extensive evaluation using the MIMIC-III and eICU datasets demonstrates that PATNet outperforms the state-of-the-art methods in terms of binary data imputation, disease progression modeling, and mortality prediction tasks.
Original languageEnglish
Article number10285044
Pages (from-to)2600-2613
Number of pages14
JournalIEEE Transactions on Knowledge and Data Engineering
Volume36
Issue number6
Early online date13 Oct 2023
DOIs
Publication statusE-pub ahead of print - 13 Oct 2023

Scopus Subject Areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

User-Defined Keywords

  • Binary data imputation
  • Data models
  • Diseases
  • Medical diagnostic imaging
  • Predictive analytics
  • Predictive models
  • Task analysis
  • Time series analysis
  • clinical risk prediction
  • disease progression modeling
  • electronic health records
  • missing at random
  • missing data
  • propensity score

Fingerprint

Dive into the research topics of 'PATNet: Propensity-Adjusted Temporal Network for Joint Imputation and Prediction Using Binary EHRs With Observation Bias'. Together they form a unique fingerprint.

Cite this