Toloka Visual Question Answering Dataset
Contributors
- 1. Toloka
Description
Our dataset consists of the images associated with textual questions. One entry (instance) in our dataset is a question-image pair labeled with the ground truth coordinates of a bounding box containing the visual answer to the given question. The images were obtained from a CC BY-licensed subset of the Microsoft Common Objects in Context dataset, MS COCO. All data labeling was performed on the Toloka crowdsourcing platform, https://toloka.ai/.
Our dataset has 45,199 instances split among three subsets: train (38,990 instances), public test (1,705 instances), and private test (4,504 instances). The entire train dataset was available for everyone since the start of the challenge. The public test dataset was available since the evaluation phase of the competition, but without any ground truth labels. After the end of the competition, public and private sets were released.
The datasets will be provided as files in the comma-separated values (CSV) format containing the following columns.
Column | Type | Description |
image | string | URL of an image on a public content delivery network |
width | integer | image width |
height | integer | image height |
left | integer | bounding box coordinate: left |
top | integer | bounding box coordinate: top |
right | integer | bounding box coordinate: right |
bottom | integer | bounding box coordinate: bottom |
question | string | question in English |
This upload also contains a ZIP file with the images from MS COCO.
Files
test_private.csv
Files
(7.5 GB)
Name | Size | Download all |
---|---|---|
md5:0277e1380cea0ae3d6ee38017eb40916
|
555.6 kB | Preview Download |
md5:59a86de28cc8d6be229545a529e2f0b1
|
729.9 MB | Preview Download |
md5:43ff094242e654888705d164f6a98090
|
211.4 kB | Preview Download |
md5:e5fc5668f7000577d4f0ccf7e26d9c44
|
274.3 MB | Preview Download |
md5:65bbfff1ad9fe258c8eb31d831375c46
|
4.8 MB | Preview Download |
md5:31fe4d950e1e0357db7a35d30fc6769a
|
6.3 GB | Preview Download |
md5:32f10a2ff738822fcdb952f629d55e5d
|
123.6 kB | Preview Download |
md5:443bfd2782bbe2384ea372804087a69a
|
160.7 MB | Preview Download |