TY - GEN
T1 - Facial Structure Guided GAN for Identity-preserved Face Image De-occlusion
AU - Cheung, Yiu Ming
AU - Li, Mengke
AU - Zou, Rong
N1 - Funding Information:
This work was supported by ITF grant: ITS/339/18 and HKBU grant: RC-FNRA-IG/18-19/SCI/03.
Publisher Copyright:
© 2021 ACM.
PY - 2021/8
Y1 - 2021/8
N2 - In some practical scenarios, such as video surveillance and personal identification, we often have to address the recognition problem of occluded faces, where content replacement by serious occlusion with non-face objects always produces partial appearance and ambiguous representation. Under the circumstances, the performance of face recognition algorithms will often deteriorate to a certain degree. In this paper, we therefore address this problem by removing occlusions on face images and present a new two-stage Facial Structure Guided Generative Adversarial Network (FSG-GAN). In Stage I of the FSG-GAN, the variational auto-encoder is used to predict the facial structure. In Stage II, the predicted facial structure and the occluded image are concatenated and fed into a generative adversarial network (GAN) based model to synthesize the de-occlusion face image. In this way, the facial structure knowledge can be transferred to the synthesis network. Especially, in order to enable the occluded face image to be perceived well, the generator in the GAN based synthesis network utilizes the hybrid dilated convolution modules to extend the receptive field. Furthermore, aiming at further eliminating the appearance ambiguity as well as unnatural texture, a multi-receptive fields discriminator is proposed to utilize the features from different levels. Experiments on the benchmark datasets show the efficacy of the proposed FSG-GAN.
AB - In some practical scenarios, such as video surveillance and personal identification, we often have to address the recognition problem of occluded faces, where content replacement by serious occlusion with non-face objects always produces partial appearance and ambiguous representation. Under the circumstances, the performance of face recognition algorithms will often deteriorate to a certain degree. In this paper, we therefore address this problem by removing occlusions on face images and present a new two-stage Facial Structure Guided Generative Adversarial Network (FSG-GAN). In Stage I of the FSG-GAN, the variational auto-encoder is used to predict the facial structure. In Stage II, the predicted facial structure and the occluded image are concatenated and fed into a generative adversarial network (GAN) based model to synthesize the de-occlusion face image. In this way, the facial structure knowledge can be transferred to the synthesis network. Especially, in order to enable the occluded face image to be perceived well, the generator in the GAN based synthesis network utilizes the hybrid dilated convolution modules to extend the receptive field. Furthermore, aiming at further eliminating the appearance ambiguity as well as unnatural texture, a multi-receptive fields discriminator is proposed to utilize the features from different levels. Experiments on the benchmark datasets show the efficacy of the proposed FSG-GAN.
KW - Face de-occlusion
KW - Generative adversarial networks
KW - Partial face recognition
UR - http://www.scopus.com/inward/record.url?scp=85114880203&partnerID=8YFLogxK
U2 - 10.1145/3460426.3463642
DO - 10.1145/3460426.3463642
M3 - Conference proceeding
AN - SCOPUS:85114880203
T3 - Proceedings of the International Conference on Multimedia Retrieval
SP - 46
EP - 54
BT - ICMR '21: Proceedings of the 2021 International Conference on Multimedia Retrieval
PB - Association for Computing Machinery (ACM)
T2 - 11th ACM International Conference on Multimedia Retrieval, ICMR 2021
Y2 - 16 November 2021 through 19 November 2021
ER -