Published October 23, 2017 | Version v1
Conference paper Open

Near-Duplicate Video Retrieval with Deep Metric Learning

  • 1. CERTH-ITI, Thessaloniki, Greece / Queen Mary University of London, UK
  • 2. CERTH-ITI, Thessaloniki, Greece
  • 3. Queen Mary University of London, UK

Description

This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tandem with a Deep Metric Learning (DML) framework with two fusion variations, trained to approximate an embedding function for accurate distance calculation between two near-duplicate videos. In contrast to most state-of-the-art methods, which exploit information deriving from the same source of data for both development and evaluation (which usually results to dataset-specific solutions), the proposed model is fed during training with sampled triplets generated from an independent dataset and is thoroughly tested on the widely used CC_WEB_VIDEO dataset, using two popular deep CNN architectures (AlexNet, GoogleNet). We demonstrate that the proposed approach achieves outstanding performance against the state-of-the-art, either with or without access to the evaluation dataset.

Files

ndvr_w_dml.pdf

Files (1.4 MB)

Name Size Download all
md5:791315a064d664c71b60d5541cacbe16
1.4 MB Preview Download

Additional details

Funding

InVID – In Video Veritas – Verification of Social Media Video Content for the News Industry 687786
European Commission