Conference paper Open Access

Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers

Kordopatis-Zilos, Giorgos; Papadopoulos, Symeon; Patras, Ioannis; Kompatsiaris, Yiannis

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Kordopatis-Zilos, Giorgos</dc:creator>
  <dc:creator>Papadopoulos, Symeon</dc:creator>
  <dc:creator>Patras, Ioannis</dc:creator>
  <dc:creator>Kompatsiaris, Yiannis</dc:creator>
  <dc:description>The problem of Near-Duplicate Video Retrieval (NDVR) has attracted increasing interest due to the huge growth of video content on the Web, which is characterized by a high degree of near duplicity. This calls for efficient NDVR approaches. Motivated by the outstanding performance of Convolutional Neural Networks (CNNs) over a wide variety of computer vision problems, we leverage intermediate CNN features in a novel global video representation by means of a layer-based feature aggregation scheme. We perform extensive experiments on the widely used CC_WEB_VIDEO dataset, evaluating three popular deep architectures (AlexNet, VGGNet, GoogLeNet) and demonstrating that the proposed approach exhibits superior performance over the state of the art, achieving a mean Average Precision (mAP) score of 0.976.</dc:description>
  <dc:subject>Video retrieval</dc:subject>
  <dc:subject>Bag of keyframes</dc:subject>
  <dc:title>Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers</dc:title>
</oai_dc:dc>