Published June 27, 2023 | Version 0.5
Dataset Restricted

Digital Humanities Multi-Task Image Classification Dataset (DHMTIC)

  • 1. Bauhaus-University Weimar

Description

This dataset was assembled to train a neural network to undertake a multi-task, multi-class classification challenge, as well as to facilitate parameterized image searches for research queries in the humanities. This is significant for several reasons. To illustrate, large quantities of textual and visual data can be processed more efficiently through specially trained neural networks. Additionally, parameterized image searches permit a detailed examination and analysis of visual data, enabling tasks such as identifying image copies or tracking the evolution and reuse of image motifs. The images hail from diverse research fields, including art and architecture, design, and life sciences. Each image has been classified for two tasks: media type classification and content type classification. The content type, which refers to the subject depicted in the image, is categorized into 14 classes, whereas the media type, denoting whether the image is a graphic, photograph, or drawing, is divided into 3 classes. The class distribution is skewed, with a single class housing a large number of entries and the remaining classes having fewer. For example, in the content type, the "object" class contains the bulk of the data, whereas classes like "musical notation" and "map" form the tail, each with fewer than 100 examples in the training and validation subsets. Similarly, in the media type, the "graphic" subset contains the majority of samples, with the "photography" and "drawing" subsets containing considerably fewer.

Files

Restricted

The record is publicly accessible, but files are restricted. <a href="https://zenodo.org/account/settings/login?next=https://zenodo.org/records/8069425">Log in</a> to check if you have access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

You are currently not logged in. Do you have an account? Log in here

Additional details

Related works

Is published in
Conference paper: 10.1145/3604951.3605524 (DOI)