Published April 26, 2024 | Version 3.0.2
Dataset Open

The Object Detection for Olfactory References (ODOR) Dataset

  • 1. Friedrich-Alexander-Universität Erlangen-Nürnberg
  • 2. Philipps-Universität Marburg
  • 1. Friedrich-Alexander-Universität Erlangen-Nürnberg
  • 2. University of Cambridge
  • 3. KNAW Humanities Cluster Amsterdam
  • 4. Anglia Ruskin University Cambridge

Description

The Object Detection for Olfactory References (ODOR) Dataset

Real-world applications of computer vision in the humanities require algorithms to be robust against artistic abstraction, peripheral objects, and subtle differences between fine-grained target classes. 

Existing datasets provide instance-level annotations on artworks but are generally biased towards the image centre and limited with regard to detailed object classes. The ODOR dataset fills this gap, offering 38,116 object-level annotations across 4,712 images, spanning an extensive set of 139 fine-grained categories. 

It has challenging dataset properties, such as a detailed set of categories, dense and overlapping objects, and spatial distribution over the whole image canvas. 

Inspiring further research on artwork object detection and broader visual cultural heritage studies, the dataset challenges researchers to explore the intersection of object recognition and smell perception.

How to use

The annotations are provided in COCO JSON format. To represent the two-level hierarchy of the object classes, we make use of the supercategory field in the categories array as defined by COCO. In addition to the object-level annotations, we provide an additional CSV file with image-level metadata, which includes content-related fields, such as Iconclass codes or image descriptions, as well as formal annotations, such as artist, license, or creation year. 

In addition to a zip containing the dataset images, we provide links to their source collections in the metadata file and a Python script to conveniently download the artwork images (`download_imgs.py`).

The mapping between the `images` array of the `annotations.json` and the `metadata.csv` file can be accomplished via the `file_name` attribute of the elements of the `images` array and the unique `File Name` column of the `metadata.csv` file, respectively.

Files

images.zip

Files (758.0 MB)

Name Size Download all
md5:f92ec2a59615789d5ca2e976a6d673ec
3.9 kB Download
md5:88992eb2acaf09ee7b20e9741751bdce
735.0 MB Preview Download
md5:7d69325d3b4761f794b1addde5e40a38
10.3 MB Preview Download
md5:196d8ee82d5e13788324f724932f8a9a
1.6 MB Preview Download
md5:8ff0f03b359abb46a56e64173cde3e4f
8.7 MB Preview Download
md5:04103f2217906411bfe8b8c3811ae732
2.4 MB Preview Download

Additional details

Funding

ODEUROPA – ODEUROPA: Negotiating Olfactory and Sensory Experiences in Cultural Heritage Practice and Research 101004469
European Commission

Software

Repository URL
https://github.com/mathiaszinnen/odor-dataset
Programming language
Python