Published February 14, 2020 | Version 0.3.1
Dataset Open

Example computer vision classification training data derived from British Library 19th Century Books Image collection

Description

Example computer vision classification training data derived from British Library 19th Century Books Image collection

This dataset provides training data for image classification for use in a computer vision workshop. The images are derived from 'Digitised Books - Images identified as Embellishments. c. 1510 - c. 1900. JPG' from the year '1839'.

Currently, included are four folders containing a variety of images derived from the BL books corpus.

  • 'cv_workshop_exercise_data' include images of: 'building', 'people', 'coat of arms'
  • 'humancats' contains images of humans and images of cats

The 'fashion' and 'portraits' folders both contain images of people organised into 'female' and 'male'. These labels were annotated by a single annotator and these categories may themselves not be meaningful. They are included in the workshop data as a point of discussion about how we should label data both in general and when working with historical data. 

This data is intended primarily as an educational resource.

Files

cv_workshop_exercise_data.zip

Files (215.3 MB)

Name Size Download all
md5:dfcf9e9b1546ed4011c01553ee328b1c
23.7 MB Preview Download
md5:f97282d59d01ac4f16a52b3d6e24bea5
70.1 MB Preview Download
md5:a56e691d9ac5f06c4ff7fb570e8b57d5
94.9 MB Preview Download
md5:0fbcdd95002cdd8c6c577ff1190c49b6
1.2 kB Preview Download
md5:0114ea8a9b09aa2a00bc1fbd2def4647
26.7 MB Preview Download

Additional details

Related works

Is derived from
Dataset: 10.21250/db17 (DOI)

Funding

Living with Machines AH/S01179X/1
UK Research and Innovation