TY - GEN
T1 - Learning perceptual embeddings with two related tasks for joint predictions of media interestingness and emotions
AU - LIU, Yang
AU - Gu, Zhonglei
AU - Ko, Tobey H.
AU - Hua, Kien A.
N1 - Funding information:
This work was supported in part by the National Natural Science Foundation of China under Grant 61503317, in part Special Session 1: Predicting User Perceptions of Multimedia Content ICMR’18, June 11-14, 2018, Yokohama, Japan by the Grant from the Research Grant Council of Hong Kong SAR under Project RGC/HKBU12202417, in part by the Science and Technology Research and Development Fund of Shenzhen with Project Code JCYJ20170307161544087, and in part by the Faculty Research Grant of Hong Kong Baptist University (HKBU) under Project FRG2/16-17/032.
Publisher copyright:
© 2018 Association for Computing Machinery.
PY - 2018/6/5
Y1 - 2018/6/5
N2 - Integrating media elements of various medium, multimedia is capable of expressing complex information in a neat and compact way. Early studies have linked different sensory presentation in multimedia with the perception of human-like concepts. Yet, the richness of information in multimedia makes understanding and predicting user perceptions in multimedia content a challenging task both to the machine and the human mind. This paper presents a novel multi-task feature extraction method for accurate prediction of user perceptions in multimedia content. Differentiating from the conventional feature extraction algorithms which focus on perfecting a single task, the proposed model recognizes the commonality between different perceptions (e.g., interestingness and emotional impact), and attempts to jointly optimize the performance of all the tasks through uncovered commonality features. Using both a media interestingness dataset and a media emotion dataset for user perception prediction tasks, the proposed model attempts to simultaneously characterize the individualities of each task and capture the commonalities shared by both tasks, and achieves better accuracy in predictions than other competing algorithms on real-world datasets of two related tasks: MediaEval 2017 Predicting Media Interestingness Task and MediaEval 2017 Emotional Impact of Movies Task.
AB - Integrating media elements of various medium, multimedia is capable of expressing complex information in a neat and compact way. Early studies have linked different sensory presentation in multimedia with the perception of human-like concepts. Yet, the richness of information in multimedia makes understanding and predicting user perceptions in multimedia content a challenging task both to the machine and the human mind. This paper presents a novel multi-task feature extraction method for accurate prediction of user perceptions in multimedia content. Differentiating from the conventional feature extraction algorithms which focus on perfecting a single task, the proposed model recognizes the commonality between different perceptions (e.g., interestingness and emotional impact), and attempts to jointly optimize the performance of all the tasks through uncovered commonality features. Using both a media interestingness dataset and a media emotion dataset for user perception prediction tasks, the proposed model attempts to simultaneously characterize the individualities of each task and capture the commonalities shared by both tasks, and achieves better accuracy in predictions than other competing algorithms on real-world datasets of two related tasks: MediaEval 2017 Predicting Media Interestingness Task and MediaEval 2017 Emotional Impact of Movies Task.
KW - Multi-task feature extraction
KW - Multimedia emotional impact prediction
KW - Multimedia interestingness prediction
KW - Perceptual embedding
UR - http://www.scopus.com/inward/record.url?scp=85053869136&partnerID=8YFLogxK
U2 - 10.1145/3206025.3206071
DO - 10.1145/3206025.3206071
M3 - Conference proceeding
AN - SCOPUS:85053869136
SN - 9781450350464
T3 - ICMR 2018 - Proceedings of the 2018 ACM International Conference on Multimedia Retrieval
SP - 420
EP - 427
BT - ICMR 2018 - Proceedings of the 2018 ACM International Conference on Multimedia Retrieval
PB - Association for Computing Machinery (ACM)
T2 - 8th ACM International Conference on Multimedia Retrieval, ICMR 2018
Y2 - 11 June 2018 through 14 June 2018
ER -