Robust Remote Photoplethysmography Estimation With Environmental Noise Disentanglement

Si Qi Liu, Pong C. Yuen*

*Corresponding author for this work

Research output: Contribution to journal › Journal article › peer-review

4 Citations (Scopus)

Abstract

Remote photoplethysmography (rPPG) has attracted increasing attention because of its potential in a wide range of application scenarios, such as physical training, clinical monitoring, and face anti-spoofing. Beyond conventional solutions, deep-learning approaches have begun to dominate rPPG estimation and achieve top-level performance. However, most of them try to integrate preprocessing steps such as ROI selection into an end-to-end network, which may divert attention and also limit generalization to other scenarios with different input skin regions. In this work, we focus on learning the intrinsic rPPG feature and design a lightweight but effective rPPG estimation network based on spatiotemporal convolution. To further improve robustness, on top of this basic design we propose the Noise-Disentangled DeeprPPG (ND-DeeprPPG), which disentangles the environmental noise from the raw rPPG feature with an adversarial canonical correlation analysis learning strategy. Background regions are employed as a reference to guide the noise disentangling in a self-supervised manner. Extensive experiments show that our ND-DeeprPPG not only outperforms the state of the art on heart rate estimation but also exhibits promising robustness in cross-skin-region and cross-dataset scenarios and in other rPPG-based tasks.
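For readers unfamiliar with the heart rate estimation task mentioned in the abstract, the following is a minimal sketch of one common way to turn an already-extracted pulse waveform into a beats-per-minute figure: pick the strongest spectral peak inside a plausible heart-rate band. It is not the ND-DeeprPPG network and is not taken from the paper. The function name `estimate_heart_rate_bpm`, the 0.7 to 4.0 Hz band, the 30 fps frame rate, and the synthetic test signal are all assumptions made for illustration.

```python
import numpy as np


def estimate_heart_rate_bpm(signal, fps, low_hz=0.7, high_hz=4.0):
    """Return the dominant frequency of `signal` inside [low_hz, high_hz], in bpm.

    Hypothetical helper for illustration only; the band limits are an
    assumption, not values from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    signal = signal - signal.mean()               # remove the DC component
    windowed = signal * np.hanning(signal.size)   # reduce spectral leakage
    power = np.abs(np.fft.rfft(windowed)) ** 2    # one-sided power spectrum
    freqs = np.fft.rfftfreq(signal.size, d=1.0 / fps)
    band = (freqs >= low_hz) & (freqs <= high_hz)
    if not band.any():
        raise ValueError("signal too short to resolve the heart-rate band")
    peak_hz = freqs[band][np.argmax(power[band])]
    return 60.0 * peak_hz


# Synthetic stand-in for an rPPG waveform: a 1.2 Hz pulse plus Gaussian noise,
# sampled at an assumed 30 frames per second for 20 seconds.
fps = 30.0
t = np.arange(600) / fps
rng = np.random.default_rng(0)
pulse = np.sin(2 * np.pi * 1.2 * t) + 0.3 * rng.standard_normal(t.size)

bpm = estimate_heart_rate_bpm(pulse, fps)
print(f"estimated heart rate: {bpm:.1f} bpm")
```

With a 20-second window the frequency resolution is 0.05 Hz (3 bpm), so the estimate for this synthetic signal should land near 72 bpm. Real rPPG signals are far noisier than this example, which is the robustness problem the abstract addresses.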

Original language: English
Pages (from-to): 27-41
Number of pages: 15
Journal: IEEE Transactions on Image Processing
Volume: 33
Early online date: 9 Nov 2023
DOIs
Publication status: Published - Jan 2024

Scopus Subject Areas

  • Software
  • Computer Graphics and Computer-Aided Design

User-Defined Keywords

  • remote heart rate estimation
  • Remote photoplethysmography (rPPG)
