TY - JOUR
T1 - Facial Expression Guided Diagnosis of Parkinson's Disease Via High-Quality Data Augmentation
AU - Huang, Wei
AU - Zhou, Yintao
AU - Cheung, Yiu-ming
AU - Zhang, Peng
AU - Zha, Yufei
AU - Pang, Meng
N1 - Wei Huang (email: [email protected]) and Yintao Zhou (email: [email protected]) are with the School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China. Wei Huang, Peng Zhang, and Yufei Zha were supported in part by grants 62271239, 61862043, and 61971352 approved by National Natural Science Foundation of China, as well as grants from Natural Science Foundation of Ningbo (2021J048, 2021J049).
Meng Pang is with the School of Mathematics and Computer Sciences and the Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China, email: [email protected]. Meng Pang is the corresponding author.
Yiu-ming Cheung is with the Department of Computer Science, Hong Kong Baptist University, Hong Kong, China, email: [email protected]. Yiu-ming Cheung was supported in part by NSFC / Research Grants Council (RGC) Joint Research Scheme under Grant N HKBU214/21, General Research Fund of RGC under Grant 12201321, and Hong Kong Baptist University (HKBU) under Grant RC-FNRA-IG/18-19/SCI/03.
Peng Zhang and Yufei Zha are with School of Computer Science, North-western Polytechnical University, Xi’an, China and with Ningbo Institute of Northwestern Polytechnical University, Ningbo, China.
Publisher Copyright:
© 2022 IEEE.
PY - 2023/1
Y1 - 2023/1
N2 - Parkinson's disease (PD) is a neurodegenerative disease which is prevalent among the elder population and severely affects the life quality of patients and their families. Therefore, it is important to conduct an early diagnosis for potential patients with PD, so as to promote prompt treatment and avoid the aggravation of the disease. Recently, the in-vitro PD diagnosis based on facial expressions has received increasing attention because of its distinguishability (i.e., PD patients always possess the characteristics of “masked face”) and affordability. However, the performance of the existing facial expression-based PD diagnosis approaches is limited by: 1) the small-scale training data on PD patients' facial expressions, and 2) the weak prediction model. To address these two problems, we propose a new facial expression guided PD diagnosis method based on high-quality training data augmentation and deep neural network prediction. Specifically, the proposed method consists of three stages: Firstly, we synthesize virtual facial expression images with 6 basic emotions (i.e., anger, disgust, fear, happiness, sadness, and surprise) based on multi-domain adversarial learning to approximate the premorbid expressions of PD patients. Secondly, we introduce three facial image quality assessment (FIQA) criteria to measure the quality of these synthesized facial expression images and design a fusion screening strategy that shortlists the high-quality ones to augment the training data. Finally, we train a deep neural network prediction model based on the original and synthesized high-quality facial expression images for PD diagnosis. To show real-world impacts and evaluate the proposed method under different facial expressions, we also create a (currently largest) multiple facial expressions-based PD face dataset in collaboration with a hospital. Extensive experiments are performed to demonstrate the effectiveness of the multi-domain adversarial learning-based facial expression synthesis and the fusion screening strategy, particularly the superior performance of the proposed method for PD diagnosis.
AB - Parkinson's disease (PD) is a neurodegenerative disease which is prevalent among the elder population and severely affects the life quality of patients and their families. Therefore, it is important to conduct an early diagnosis for potential patients with PD, so as to promote prompt treatment and avoid the aggravation of the disease. Recently, the in-vitro PD diagnosis based on facial expressions has received increasing attention because of its distinguishability (i.e., PD patients always possess the characteristics of “masked face”) and affordability. However, the performance of the existing facial expression-based PD diagnosis approaches is limited by: 1) the small-scale training data on PD patients' facial expressions, and 2) the weak prediction model. To address these two problems, we propose a new facial expression guided PD diagnosis method based on high-quality training data augmentation and deep neural network prediction. Specifically, the proposed method consists of three stages: Firstly, we synthesize virtual facial expression images with 6 basic emotions (i.e., anger, disgust, fear, happiness, sadness, and surprise) based on multi-domain adversarial learning to approximate the premorbid expressions of PD patients. Secondly, we introduce three facial image quality assessment (FIQA) criteria to measure the quality of these synthesized facial expression images and design a fusion screening strategy that shortlists the high-quality ones to augment the training data. Finally, we train a deep neural network prediction model based on the original and synthesized high-quality facial expression images for PD diagnosis. To show real-world impacts and evaluate the proposed method under different facial expressions, we also create a (currently largest) multiple facial expressions-based PD face dataset in collaboration with a hospital. Extensive experiments are performed to demonstrate the effectiveness of the multi-domain adversarial learning-based facial expression synthesis and the fusion screening strategy, particularly the superior performance of the proposed method for PD diagnosis.
KW - Parkinson's disease diagnosis
KW - data augmentation
KW - deep learning
KW - multi-domain adversarial learning
KW - Data augmentation
UR - http://www.scopus.com/inward/record.url?scp=85141487392&partnerID=8YFLogxK
U2 - 10.1109/TMM.2022.3216961
DO - 10.1109/TMM.2022.3216961
M3 - Journal article
SN - 1520-9210
VL - 25
SP - 7037
EP - 7050
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
ER -