Projection Valued-based Quantum Machine Learning Adapting to Differential Privacy Algorithm for Word-level Lipreading

Hang Chen, Chang Wang, Jun Du*, Chao-Han Huck Yang, Jun Qi

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Deep neural network (DNN)-based lipreading models have achieved excellent recognition accuracy but are currently facing challenges related to user privacy. To address this, we propose a novel hybrid quantum-classical neural network (HQCNN) for lipreading that balances superior performance with enhanced privacy protection. The HQCNN-based lipreading model features an innovative variational quantum circuit (VQC) back-end, which transforms the output of the DNN front-end into quantum representations and predicts the posterior probability of each word. Furthermore, we introduce projection-valued encoding (PVE) and projection-valued measurement (PVM), enabling the VQC to handle inputs and outputs of dimensions that scale exponentially with the number of qubits, thereby substantially increasing its expressive power. Additionally, we explore the privacy-preserving properties of the HQCNN-based lipreading model by integrating differentially private stochastic gradient descent (DP-SGD). Experiments conducted on the LRW dataset demonstrate the model’s exceptional recognition accuracy and privacy-preserving capabilities.
Original languageEnglish
Title of host publicationProceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
EditorsBhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
PublisherIEEE
Number of pages5
ISBN (Electronic)9798350368741
ISBN (Print)9798350368758
DOIs
Publication statusPublished - 6 Apr 2025
Event2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 - Hyderabad, India
Duration: 6 Apr 202511 Apr 2025
https://ieeexplore.ieee.org/xpl/conhome/10887540/proceeding

Publication series

NameProceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PublisherIEEE

Conference

Conference2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025
Country/TerritoryIndia
CityHyderabad
Period6/04/2511/04/25
Internet address

User-Defined Keywords

  • Quantum machine learning
  • visual speech recognition
  • quantum multi-classifier
  • differential privacy

Fingerprint

Dive into the research topics of 'Projection Valued-based Quantum Machine Learning Adapting to Differential Privacy Algorithm for Word-level Lipreading'. Together they form a unique fingerprint.

Cite this