Deep Reinforcement Learning for Unsupervised Video Summarization With Diversity-Representativeness Reward

Kaiyang Zhou, Yu Qiao*, Tao Xiang

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

327 Citations (Scopus)

Abstract

Video summarization aims to facilitate large-scale video browsing by producing short, concise summaries that are diverse and representative of original videos. In this paper, we formulate video summarization as a sequential decision-making process and develop a deep summarization network (DSN) to summarize videos. DSN predicts for each video frame a probability, which indicates how likely a frame is selected, and then takes actions based on the probability distributions to select frames, forming video summaries. To train our DSN, we propose an end-to-end, reinforcement learning-based framework, where we design a novel reward function that jointly accounts for diversity and representativeness of generated summaries and does not rely on labels or user interactions at all. During training, the reward function judges how diverse and representative the generated summaries are, while DSN strives for earning higher rewards by learning to produce more diverse and more representative summaries. Since labels are not required, our method can be fully unsupervised. Extensive experiments on two benchmark datasets show that our unsupervised method not only outperforms other state-of-the-art unsupervised methods, but also is comparable to or even superior than most of published supervised approaches.

Original languageEnglish
Title of host publicationProceedings of the 32nd AAAI Conference on Artificial Intelligence, AAAI 2018
PublisherAAAI press
Pages7582-7589
Number of pages8
ISBN (Print)9781577358008
DOIs
Publication statusPublished - 8 Feb 2018
Externally publishedYes
Event32nd AAAI Conference on Artificial Intelligence, AAAI 2018 - New Orleans, United States
Duration: 2 Feb 20187 Feb 2018
https://ojs.aaai.org/index.php/AAAI/issue/view/301
https://aaai.org/papers/530-ws0496-aaaiw-18-17111/

Publication series

NameProceedings of the AAAI Conference on Artificial Intelligence
Number1
Volume32
ISSN (Print)2159-5399
ISSN (Electronic)2374-3468

Conference

Conference32nd AAAI Conference on Artificial Intelligence, AAAI 2018
Country/TerritoryUnited States
CityNew Orleans
Period2/02/187/02/18
Internet address

Scopus Subject Areas

  • Artificial Intelligence

User-Defined Keywords

  • Video Summarization
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning for Unsupervised Video Summarization With Diversity-Representativeness Reward'. Together they form a unique fingerprint.

Cite this