Published February 1, 2023 | Version test_private
Dataset Open

Toloka Visual Question Answering Dataset

Description

Our dataset consists of images paired with textual questions. Each entry (instance) is a question-image pair labeled with the ground truth coordinates of a bounding box that contains the visual answer to the question. The images were obtained from a CC BY-licensed subset of the Microsoft Common Objects in Context dataset, MS COCO. All data labeling was performed on the Toloka crowdsourcing platform, https://toloka.ai/.

Our dataset has 45,199 instances split among three subsets: train (38,990 instances), public test (1,705 instances), and private test (4,504 instances). The entire train set was available to everyone from the start of the challenge. The public test set was available from the evaluation phase of the competition, but without ground truth labels. After the competition ended, the public and private test sets were released.

The datasets are provided as comma-separated values (CSV) files with the following columns.

Column    Type     Description
image     string   URL of an image on a public content delivery network
width     integer  image width
height    integer  image height
left      integer  bounding box coordinate: left
top       integer  bounding box coordinate: top
right     integer  bounding box coordinate: right
bottom    integer  bounding box coordinate: bottom
question  string   question in English
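
The column layout above could be read with Python's standard csv module, as in the minimal sketch below. It assumes that each CSV file starts with a header row using exactly these column names, and that the box coordinates are pixel values measured from the top-left corner of the image; neither detail is stated in the description. The names Instance, read_instances, and box_is_valid are hypothetical, and the sample row is made up for illustration.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class Instance:
    """One question-image pair with its ground truth bounding box."""
    image: str
    width: int
    height: int
    left: int
    top: int
    right: int
    bottom: int
    question: str


# Columns documented as integers in the table above.
INT_COLUMNS = ("width", "height", "left", "top", "right", "bottom")


def read_instances(handle):
    """Parse rows from an open CSV text handle into Instance records.

    Assumes a header row whose names match the documented columns.
    """
    instances = []
    for row in csv.DictReader(handle):
        numbers = {name: int(row[name]) for name in INT_COLUMNS}
        instances.append(
            Instance(image=row["image"], question=row["question"], **numbers)
        )
    return instances


def box_is_valid(inst):
    """Assumed sanity check: the box is non-empty and lies inside the image."""
    return (0 <= inst.left < inst.right <= inst.width
            and 0 <= inst.top < inst.bottom <= inst.height)


# Made-up sample row in the documented layout (not real dataset content).
SAMPLE = io.StringIO(
    "image,width,height,left,top,right,bottom,question\n"
    "https://example.com/img.jpg,640,480,10,20,110,220,What is used to cut paper?\n"
)
sample_instances = read_instances(SAMPLE)
```

For a downloaded file, the same function could be called as read_instances(open("test_private.csv", newline="", encoding="utf-8")), ideally inside a with block so the file is closed afterwards.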

This upload also contains a ZIP file with the images from MS COCO.

Files

test_private.csv

Files (7.5 GB)

Checksum                              Size
md5:0277e1380cea0ae3d6ee38017eb40916  555.6 kB
md5:59a86de28cc8d6be229545a529e2f0b1  729.9 MB
md5:43ff094242e654888705d164f6a98090  211.4 kB
md5:e5fc5668f7000577d4f0ccf7e26d9c44  274.3 MB
md5:65bbfff1ad9fe258c8eb31d831375c46  4.8 MB
md5:31fe4d950e1e0357db7a35d30fc6769a  6.3 GB
md5:32f10a2ff738822fcdb952f629d55e5d  123.6 kB
md5:443bfd2782bbe2384ea372804087a69a  160.7 MB
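
The md5 values listed above can be used to check that a download completed without corruption. The following is a minimal sketch using Python's standard hashlib module; the function names md5_of_file and matches_listing are hypothetical, and the caller supplies both the local file path and the checksum string copied from the listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a listed value such as 'md5:0277e138...'.

    The optional 'md5:' prefix is dropped before comparing.
    """
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

Reading in chunks matters here because the largest file in the listing is 6.3 GB, which may not fit in memory on every machine.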

Additional details