Published February 20, 2022 | Version V1
Dataset Open

HISTORIAN: a large-scale HISTORIcal film dataset with cinematographic ANnotation

  • 1. TU Wien, Institute of Visual Computing & Human-Centered Technology, Computer Vision Lab Austria, Vienna

Description

Developing automated tools for sustainable film preservation of extensive historical film collections assumes an understanding of fundamental cinematographic settings. In order to be able to investigate new approaches to detect and classify cinematographic settings, this paper proposes a novel large-scale historical film dataset with cinematographic annotations (HISTORIAN), i.e., shot boundaries, shot types, camera movements. The dataset consists of 98 digitized original analog film reels related to the Second World War and 10593 film shots manually annotated by human film experts. Moreover, annotations for overscan areas such as sprocket holes are included. A baseline film analysis pipeline is introduced and evaluated. To the best of our knowledge, HISTORIAN is the first dataset that covers the challenges and characteristics of historical film documentaries and provides novel possibilities for exploring automatic film analysis tools.

This repository presents a tiny set including a few examples for demonstration.

A link to the Github repository (including helper scripts and readme) can be found here

 

 

 

 

Files

Files (95.6 GB)

Name Size Download all
md5:a9e286dc5f8511794e5b3750b3cb3ad1
95.6 GB Download

Additional details

Funding

VHH – Visual History of the Holocaust: Rethinking Curation in the Digital Age 822670
European Commission