The Object Detection for Olfactory References (ODOR) Dataset
Creators
- 1. Friedrich-Alexander-Universität Erlangen-Nürnberg
- 2. Philipps-Universität Marburg
Contributors
Data collectors:
Project leader:
Project members:
- 1. Friedrich-Alexander-Universität Erlangen-Nürnberg
- 2. University of Cambridge
- 3. KNAW Humanities Cluster Amsterdam
- 4. Anglia Ruskin University Cambridge
Description
The Object Detection for Olfactory References (ODOR) Dataset
Real-world applications of computer vision in the humanities require algorithms to be robust against artistic abstraction, peripheral objects, and subtle differences between fine-grained target classes.
Existing datasets provide instance-level annotations on artworks but are generally biased towards the image centre and limited with regard to detailed object classes. The ODOR dataset fills this gap, offering 38,116 object-level annotations across 4,712 images, spanning an extensive set of 139 fine-grained categories.
It has challenging dataset properties, such as a detailed set of categories, dense and overlapping objects, and spatial distribution over the whole image canvas.
Inspiring further research on artwork object detection and broader visual cultural heritage studies, the dataset challenges researchers to explore the intersection of object recognition and smell perception.
How to use
The annotations are provided in COCO JSON format. To represent the two-level hierarchy of the object classes, we make use of the supercategory field in the categories array as defined by COCO. In addition to the object-level annotations, we provide an additional CSV file with image-level metadata, which includes content-related fields, such as Iconclass codes or image descriptions, as well as formal annotations, such as artist, license, or creation year.
In addition to a zip containing the dataset images, we provide links to their source collections in the metadata file and a Python script to conveniently download the artwork images (`download_imgs.py`).
The mapping between the `images` array of the `annotations.json` and the `metadata.csv` file can be accomplished via the `file_name` attribute of the elements of the `images` array and the unique `File Name` column of the `metadata.csv` file, respectively.
Files
images.zip
Files
(758.0 MB)
Name | Size | Download all |
---|---|---|
md5:f92ec2a59615789d5ca2e976a6d673ec
|
3.9 kB | Download |
md5:88992eb2acaf09ee7b20e9741751bdce
|
735.0 MB | Preview Download |
md5:7d69325d3b4761f794b1addde5e40a38
|
10.3 MB | Preview Download |
md5:196d8ee82d5e13788324f724932f8a9a
|
1.6 MB | Preview Download |
md5:8ff0f03b359abb46a56e64173cde3e4f
|
8.7 MB | Preview Download |
md5:04103f2217906411bfe8b8c3811ae732
|
2.4 MB | Preview Download |
Additional details
Funding
Software
- Repository URL
- https://github.com/mathiaszinnen/odor-dataset
- Programming language
- Python