Published July 4, 2023 | Version 0.1.0
Dataset
Open
Simple Shapes Dataset
- 1. CNRS, CerCo
- 2. Paul Sabatier University; CerCo; IRIT
Description
This dataset is used in the paper "Semi-supervised Multimodal Representation Learning through a Global Workspace" (Devillers et al., 2023, under review).
To load and work with this dataset, use the code provided at https://github.com/bdvllrs/bimGW.
It consists of 32×32-pixel images of shapes that vary in several attributes (size, location, rotation, color). Each image is paired with its ground-truth attributes and with an English natural-language description of the image.
The dataset is composed of:
- a train set of 500,000 samples,
- a validation set and a test set of 1,000 samples each.
It also contains precomputed 12-dimensional visual features (extracted with a VAE) and precomputed BERT features of the text descriptions.
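The sizes stated above can be turned into a quick sanity check after loading the data. The following is a minimal sketch, not the loader from the bimGW repository: the helper name `expected_shapes`, the channels-last array layout, and the default of 3 color channels are assumptions made for illustration. The BERT feature dimension is not stated here, so it is left out.

```python
# Split sizes, image resolution and VAE feature size as stated in the description.
SPLIT_SIZES = {"train": 500_000, "val": 1_000, "test": 1_000}
IMAGE_SIZE = 32          # images are 32x32 pixels
VISUAL_FEATURE_DIM = 12  # precomputed VAE features


def expected_shapes(split, channels=3):
    """Return the array shapes one would expect for a given split.

    Assumption: images are stored channels-last with `channels` color
    channels (3 is a guess based on the images being colored).
    """
    if split not in SPLIT_SIZES:
        raise ValueError(f"unknown split: {split!r}")
    n = SPLIT_SIZES[split]
    return {
        "images": (n, IMAGE_SIZE, IMAGE_SIZE, channels),
        "visual_features": (n, VISUAL_FEATURE_DIM),
    }


if __name__ == "__main__":
    for name in SPLIT_SIZES:
        print(name, expected_shapes(name))
```

Comparing these tuples against the `.shape` of whatever arrays the actual loader returns is a cheap way to catch a truncated download or a mismatched split.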
Files
(6.8 GB)
| Name | Size |
|---|---|
| md5:7e769b1039c6c93fa440bd17f16fb478 | 6.8 GB |