Published June 10, 2020 | Version v1
Dataset Open

HisVis: Scene Detection Pilot Training Set

Authors/Creators

  • 1. University of Amsterdam

Description

The De Boer photo collection, provided by the Noord-Hollands Archief (NHA), consists of approximately two million negatives and accompanying metadata covering the period 1945-2004. This training set is a subset of 2,545 images drawn from the NHA online repository, an early batch of digitized images, and digitized small negatives.

We took the Places-365 categorization scheme as a starting point and combined it with information from the catalog cards that De Boer used. In defining the categories, we considered whether each category was historically and visually consistent and whether it would be useful to users of the collection. To gauge user interest, we conducted several interviews.

Together with archivists and cultural historians, we studied the images in the training set and settled on a categorization scheme of 159 categories. During annotation, we found it difficult to distinguish between scenes characterized by a particular object and scenes defined by the location or by the action performed in the image.

Notes

The repository contains three models, each trained with a different type of augmentation, as well as the training data.

Files

Files (2.3 GB)

Checksum                                Size
md5:c5cb0aca0e0d64e384298d76d36ae79a    2.0 GB
md5:d1e3c8047f40219cccec4beebe146676    104.4 MB
md5:3a0c8ecda5002c925ba8e215286aa93f    120.9 MB
md5:7eeb32d4e12d900b731727c71af63f64    103.2 MB
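
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch in Python using only the standard library. The function names `md5_of_file` and `matches_listed_checksum` are illustrative and not part of this record, and the file name in the usage comment is a hypothetical placeholder, since the listing does not show file names.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that multi-gigabyte archives do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Usage (hypothetical local file name; substitute the file you downloaded):
# ok = matches_listed_checksum("downloaded_file.zip",
#                              "md5:c5cb0aca0e0d64e384298d76d36ae79a")
# print("checksum OK" if ok else "checksum mismatch, download again")
```

A mismatch usually indicates an incomplete or corrupted download, in which case the file should be fetched again before use.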