Abstract
Deep neural network (DNN)-based lipreading models have achieved excellent recognition accuracy but are currently facing challenges related to user privacy. To address this, we propose a novel hybrid quantum-classical neural network (HQCNN) for lipreading that balances superior performance with enhanced privacy protection. The HQCNN-based lipreading model features an innovative variational quantum circuit (VQC) back-end, which transforms the output of the DNN front-end into quantum representations and predicts the posterior probability of each word. Furthermore, we introduce projection-valued encoding (PVE) and projection-valued measurement (PVM), enabling the VQC to handle inputs and outputs of dimensions that scale exponentially with the number of qubits, thereby substantially increasing its expressive power. Additionally, we explore the privacy-preserving properties of the HQCNN-based lipreading model by integrating differentially private stochastic gradient descent (DP-SGD). Experiments conducted on the LRW dataset demonstrate the model’s exceptional recognition accuracy and privacy-preserving capabilities.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Editors | Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta |
Publisher | IEEE |
Number of pages | 5 |
ISBN (Electronic) | 9798350368741 |
ISBN (Print) | 9798350368758 |
DOIs | |
Publication status | Published - 6 Apr 2025 |
Event | 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 - Hyderabad, India Duration: 6 Apr 2025 → 11 Apr 2025 https://ieeexplore.ieee.org/xpl/conhome/10887540/proceeding |
Publication series
Name | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
---|---|
Publisher | IEEE |
Conference
Conference | 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 |
---|---|
Country/Territory | India |
City | Hyderabad |
Period | 6/04/25 → 11/04/25 |
Internet address |
User-Defined Keywords
- Quantum machine learning
- visual speech recognition
- quantum multi-classifier
- differential privacy