Near-Duplicate Video Retrieval with Deep Metric Learning
- 1. CERTH-ITI, Thessaloniki, Greece / Queen Mary University of London, UK
- 2. CERTH-ITI, Thessaloniki, Greece
- 3. Queen Mary University of London, UK
Description
This work addresses the problem of Near-Duplicate Video Retrieval (NDVR). We propose an effective video-level NDVR scheme based on deep metric learning that leverages Convolutional Neural Network (CNN) features from intermediate layers to generate discriminative global video representations in tandem with a Deep Metric Learning (DML) framework with two fusion variations, trained to approximate an embedding function for accurate distance calculation between two near-duplicate videos. In contrast to most state-of-the-art methods, which exploit information deriving from the same source of data for both development and evaluation (which usually results to dataset-specific solutions), the proposed model is fed during training with sampled triplets generated from an independent dataset and is thoroughly tested on the widely used CC_WEB_VIDEO dataset, using two popular deep CNN architectures (AlexNet, GoogleNet). We demonstrate that the proposed approach achieves outstanding performance against the state-of-the-art, either with or without access to the evaluation dataset.
Files
ndvr_w_dml.pdf
Files
(1.4 MB)
Name | Size | Download all |
---|---|---|
md5:791315a064d664c71b60d5541cacbe16
|
1.4 MB | Preview Download |