TY - JOUR
T1 - Convolution by Multiplication
T2 - Accelerated Two- Stream Fourier Domain Convolutional Neural Network for Facial Expression Recognition
AU - Huang, Mengyu
AU - Zhang, Xingming
AU - Lan, Xiangyuan
AU - Wang, Haoxiang
AU - Tang, Yan
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2022/3
Y1 - 2022/3
N2 - Facial expression plays an important role in human communication as a type of nonverbal language and has been widely used in various areas such as psychology, human-computer interaction and robotics. Nowadays, convolutional neural network is a promising approach for facial expression recognition. However, convolutional layers can be time-consuming and computationally expensive because a large number of parameters participate in the calculations and need to be updated during training. To improve the performance of deep neural network in facial expression recognition and accelerate training and calculation, we propose a novel framework which adopts efficient element-wise multiplication to replace traditional convolution. To disentangle reliable feature representation for more effective recognition and further enhance the recognition performance while maintaining the efficiency, we propose a representation scheme which can retain informative feature components while removing unreliable ones in Fourier domain based on the proposed multiplication framework. Extensive comparison and ablation studies are conducted on several benchmark datasets, which shows the efficiency and effectiveness of the proposed model.
AB - Facial expression plays an important role in human communication as a type of nonverbal language and has been widely used in various areas such as psychology, human-computer interaction and robotics. Nowadays, convolutional neural network is a promising approach for facial expression recognition. However, convolutional layers can be time-consuming and computationally expensive because a large number of parameters participate in the calculations and need to be updated during training. To improve the performance of deep neural network in facial expression recognition and accelerate training and calculation, we propose a novel framework which adopts efficient element-wise multiplication to replace traditional convolution. To disentangle reliable feature representation for more effective recognition and further enhance the recognition performance while maintaining the efficiency, we propose a representation scheme which can retain informative feature components while removing unreliable ones in Fourier domain based on the proposed multiplication framework. Extensive comparison and ablation studies are conducted on several benchmark datasets, which shows the efficiency and effectiveness of the proposed model.
KW - Facial Expression Recognition
KW - Frequency Domain
KW - Neural Network
UR - http://www.scopus.com/inward/record.url?scp=85104606288&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2021.3073558
DO - 10.1109/TCSVT.2021.3073558
M3 - Journal article
AN - SCOPUS:85104606288
SN - 1051-8215
VL - 32
SP - 1431
EP - 1442
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 3
ER -