TY - JOUR
T1 - Heterogeneous Prototype Learning From Contaminated Faces Across Domains via Disentangling Latent Factors
AU - Pang, Meng
AU - Wang, Binghui
AU - Ye, Mang
AU - Cheung, Yiu Ming
AU - Zhou, Yintao
AU - Huang, Wei
AU - Wen, Bihan
N1 - This work was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 62271239 and Grant 62361166629; in part by the NSFC/Research Grants Council (RGC) Joint Research Scheme under Grant N_HKBU214/21; in part by the General Research Fund of RGC under Grant 12201321, Grant 12202622, and Grant 12201323; in part by the RGC Senior Research Fellow Scheme under Grant SRFS2324-2S02; in part by the Natural Science Foundation of Jiangxi Province under Grant 20232BAB212025; in part by the High-level and Urgently Needed Overseas Talent Programs of Jiangxi Province under Grant 20232BCJ25024; in part by the Jiangxi Double Thou- sand Plan under Grant JXSQ2023201022; and in part by the Ministry of Education, Republic of Singapore, through its Start-Up Grant and Aca- demic Research Fund Tier 1 under Grant RG61/22.
PY - 2025/4
Y1 - 2025/4
N2 - This article studies an emerging practical problem called heterogeneous prototype learning (HPL). Unlike the conventional heterogeneous face synthesis (HFS) problem that focuses on precisely translating a face image from a source domain to another target one without removing facial variations, HPL aims at learning the variation-free prototype of an image in the target domain while preserving the identity characteristics. HPL is a compounded problem involving two cross-coupled subproblems, that is, domain transfer and prototype learning (PL), thus making most of the existing HFS methods that simply transfer the domain style of images unsuitable for HPL. To tackle HPL, we advocate disentangling the prototype and domain factors in their respective latent feature spaces and then replacing the source domain with the target one for generating a new heterogeneous prototype. In doing so, the two subproblems in HPL can be solved jointly in a unified manner. Based on this, we propose a disentangled HPL framework, dubbed DisHPL, which is composed of one encoder–decoder generator and two discriminators. The generator and discriminators play adversarial games such that the generator embeds contaminated images into a prototype feature space only capturing identity information and a domain-specific feature space, while generating realistic-looking heterogeneous prototypes. Experiments on various heterogeneous datasets with diverse variations validate the superiority of DisHPL.
AB - This article studies an emerging practical problem called heterogeneous prototype learning (HPL). Unlike the conventional heterogeneous face synthesis (HFS) problem that focuses on precisely translating a face image from a source domain to another target one without removing facial variations, HPL aims at learning the variation-free prototype of an image in the target domain while preserving the identity characteristics. HPL is a compounded problem involving two cross-coupled subproblems, that is, domain transfer and prototype learning (PL), thus making most of the existing HFS methods that simply transfer the domain style of images unsuitable for HPL. To tackle HPL, we advocate disentangling the prototype and domain factors in their respective latent feature spaces and then replacing the source domain with the target one for generating a new heterogeneous prototype. In doing so, the two subproblems in HPL can be solved jointly in a unified manner. Based on this, we propose a disentangled HPL framework, dubbed DisHPL, which is composed of one encoder–decoder generator and two discriminators. The generator and discriminators play adversarial games such that the generator embeds contaminated images into a prototype feature space only capturing identity information and a domain-specific feature space, while generating realistic-looking heterogeneous prototypes. Experiments on various heterogeneous datasets with diverse variations validate the superiority of DisHPL.
KW - Disentangled representation learning (DRL)
KW - generative adversarial learning
KW - heterogeneous face synthesis (HFS)
KW - heterogeneous prototype learning (HPL)
UR - http://www.scopus.com/inward/record.url?scp=85192179147&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2024.3393072
DO - 10.1109/TNNLS.2024.3393072
M3 - Journal article
AN - SCOPUS:85192179147
SN - 2162-237X
VL - 34
SP - 7169
EP - 7183
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 6
ER -