Published December 31, 2016 | Version v1
Conference paper Open

Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers

  • 1. CERTH-ITI, Thessaloniki, Greece / Queen Mary University of London, UK
  • 2. CERTH-ITI, Thessaloniki, Greece
  • 3. Queen Mary University of London, UK

Description

The problem of Near-Duplicate Video Retrieval (NDVR) has attracted increasing interest due to the huge growth of video content on the Web, which is characterized by a high degree of near-duplication. This calls for efficient NDVR approaches. Motivated by the outstanding performance of Convolutional Neural Networks (CNNs) on a wide variety of computer vision problems, we leverage intermediate CNN features in a novel global video representation by means of a layer-based feature aggregation scheme. We perform extensive experiments on the widely used CC_WEB_VIDEO dataset, evaluating three popular deep architectures (AlexNet, VGGNet, GoogLeNet) and demonstrating that the proposed approach outperforms the state of the art, achieving a mean Average Precision (mAP) score of 0.976.
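
For readers unfamiliar with the reported metric, the following is a minimal sketch of how mean Average Precision is commonly computed for a retrieval task. It is not taken from the paper: the function names and the input layout (one list of booleans per query, ordered by rank, where `True` marks a near-duplicate) are assumptions made for illustration, and the sketch assumes each ranked list covers the whole collection, so every relevant video appears somewhere in it.

```python
from typing import Sequence


def average_precision(ranked_relevance: Sequence[bool]) -> float:
    """Average Precision for one query (hypothetical helper, not from the paper).

    ranked_relevance[k] is True when the item at rank k + 1 is relevant.
    AP is the mean of precision@rank taken over the ranks of relevant items.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_relevant in enumerate(ranked_relevance, start=1):
        if is_relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    # Assumption: a query with no relevant items scores 0.0.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(queries: Sequence[Sequence[bool]]) -> float:
    """Mean of the per-query AP values; 0.0 for an empty query set."""
    if not queries:
        return 0.0
    return sum(average_precision(q) for q in queries) / len(queries)
```

For example, a ranked list `[True, False, True]` has precision 1/1 at the first relevant item and 2/3 at the second, giving an AP of about 0.833. A score close to 1, such as the 0.976 reported above, means that near-duplicates are ranked almost entirely ahead of unrelated videos.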

Files

duplicate-video-retrieval.pdf (1.5 MB)
md5:956387b9cf4f35a7240a1da13b902a58

Additional details

Funding

European Commission
InVID – In Video Veritas – Verification of Social Media Video Content for the News Industry (grant no. 687786)