Published October 8, 2023 | Version v1
Publication Open

Selecting a Diverse Set of Aesthetically-pleasing and Representative Video Thumbnails using Reinforcement Learning

  • 1. CERTH-ITI
  • 2. ROR icon Queen Mary University of London

Description

This paper presents a new reinforcement-based method for video thumbnail selection (called RL-DiVTS), that relies on estimates of the aesthetic quality, representativeness and visual diversity of a small set of selected frames, made with the help of tailored reward functions. The proposed method integrates a novel diversity-aware Frame Picking mechanism that performs a sequential frame selection and applies a reweighting process to demote frames that are visually-similar to the already selected ones. Experiments on two benchmark datasets (OVP and YouTube), using the top-3 matching evaluation protocol, show the competitiveness of RL-DiVTS against other SoA video thumbnail selection and summarization approaches from the literature.

Files

icip2023_preprint.pdf

Files (521.4 kB)

Name Size Download all
md5:e64ff9bdb854f9c54de79fb06d4a9b51
521.4 kB Preview Download

Additional details

Funding

European Commission
AI4Media – A European Excellence Centre for Media, Society and Democracy 951911