There is a newer version of the record available.

Published September 12, 2022 | Version 0.1
Dataset Open

Fotopersbureau De Boer Training Set on Scene Detection

Creators

  • 1. Melvin
  • 1. Noord-Hollands Archief
  • 2. Universiteit van Amsterdam

Description

This dataset was created as part of the projects HisVis and Fotografisch Geheugen conducted by the Noord-Hollands Archief and University of Amsterdam in the Netherlands. These projects examined to what extent Computer Vision, and more specifically, scene detection could be applied to a collection of historical press photographs. A computer was trained to automatically recognize specific scenes on historical press photographs, like a ‘protest’, ‘marriage’, ‘shopping street’ or ‘baseball game’. The key aspect of scene recognition is to identify the place in which the objects seat.

The specific aim of the enrichments provided by scene detection was to benefit users of the archive and cultural historians studying historical photographs.

The training set contains historical press photographs of the collection of Fotopersbureau De Boer (1945-2005). This file includes the training data used for the training of a scene detection model as well as a model to detect whether a picture was taken indoors or outdoors. The model cards, data sheet, and label sheet include more information on the dataset.

- examples.tar.gz contains example images for each label. The label sheet contains more information

- indoor_out.tar.gz contains the training set for the indoor / outdoor model

- scene_detection.tar.gz contains the training set for the scene detection model.

- no_description_found.tar.gz contains images that were not linked to any class during annotation.

- HisVis2-0.1.beta.tar.gz contains the source code for the software. It stems from this GitHub repo: https://github.com/melvinwevers/HisVis2

- models.tar.gz contains the trained models. These should be placed in the repo. The models are not included here because of their size.

 

More information on the data is available in the data sheet:

 


 

 

Files

Files (13.0 GB)

Name Size Download all
md5:1b7299e646b5fc4cf22c7107c1a17f25
19.7 MB Download
md5:4055efcae065091d156a1a5afc772a16
96.4 MB Download
md5:73b69e14ecaa7e368ad1bf31df0f12c2
1.7 GB Download
md5:6dd9c9a52005d096cbd88a46b4a70f31
369.1 MB Download
md5:55e34a058c19c4f63ed019cb6b1f9d37
240.7 MB Download
md5:3594aa28b65d5fb13590f71524622621
10.6 GB Download

Additional details

Related works

Is supplemented by
Dataset: 10.5281/zenodo.3888006 (DOI)