InVID FIVR-200K

doi:10.5281/zenodo.2564864

Published February 14, 2019 | Version v1

Dataset Open

InVID FIVR-200K

1. CERTH-ITI, Thessaloniki, Greece / Queen Mary University of London, UK
2. CERTH-ITI, Thessaloniki, Greece
3. Queen Mary University of London, UK

The InVID FIVR-200K dataset has been developed in the context of the InVID project with the aim of simulating the problem of Fine-grained Incident Video Retrieval (FIVR). FIVR is the problem where: given a query video, the objective is to retrieve all associated videos, considering several types of associations that range from duplicate videos to videos from the same incident. To address the benchmarking needs of such problem, the large-scale video dataset FIVR-200K has been constructed. It comprises 225,960 YouTube videos collected based on 4,687 major news events crawled from Wikipedia, and 100 video queries selected based on an automatic selection process. For the annotation of the dataset, an annotation protocol has been devised with respect to four types of video associations, i.e., Near-Duplicate Videos (ND), Duplicate Scene Videos (DS), Complementary Scene Videos (CS), and Incident Scene Videos (IS). To this end, FIVR-200K dataset contains the list of the collected Youtube ids, the crawled events from Wikipedia and the video annotations, which include the set of videos for each associations type for each query in the dataset.

Files

fivr_200k_dataset.zip

Files (5.8 MB)

Name	Size	Download all
fivr_200k_dataset.zip md5:30727c4f373fb8b5541ef3c27a886e90	5.8 MB	Preview Download

Additional details

Is supplement to: https://arxiv.org/abs/1809.04094 (URL); 10.1109/TMM.2019.2905741 (DOI)

InVID – In Video Veritas – Verification of Social Media Video Content for the News Industry 687786: European Commission

	All versions	This version
Views	877	877
Downloads	96	96
Data volume	648.6 MB	648.6 MB

InVID FIVR-200K

Files

fivr_200k_dataset.zip

Files (5.8 MB)

Additional details

Related works

Funding

InVID FIVR-200K

Creators

Description

Files

fivr_200k_dataset.zip

Files (5.8 MB)

Additional details

Related works

Funding