Published May 9, 2024 | Version v2
Dataset Open

fruit-SALAD

  • 1. ROR icon Tallinn University

Description

fruit-SALAD is a synthetic image dataset with 10,000 generated images of fruit depictions. This combined semantic category and style benchmark comprises 100 instances each of 10 easily recognizable fruit categories and 10 easy distinguishable styles. 

See the paper on Scientific Data or visit our project page.

The carefully designed Style Aligned Artwork Dataset (SALAD) provides a controlled and balanced platform for the comparative analysis of similarity perception of different computational models. The SALAD framework allows the comparison of how these models perform semantic category and style recognition tasks, going beyond the level of anecdotal knowledge, making them robustly quantifiable and qualitatively interpretable.

We used Stable Diffusion XL and StyleAligned to create the fruit-SALAD by carefully crafting text prompts and overseing the image generation process.

The code to reproduce the fruit-SALAD_10k is available at GitHub.

Please note that this dataset is available for academic research purposes only.

Files

embeddings.zip

Files (15.8 GB)

Name Size Download all
md5:f420e296104d28b769cef7a650ba785f
616.8 MB Preview Download
md5:fa5b042dc1b05bcce8f199c4776a8070
14.7 GB Preview Download
md5:070ad44487e6c3c24577389f26b42fc3
473.0 MB Preview Download
md5:a027fc35558ff097b83b1b5cfa80a528
182 Bytes Preview Download
md5:20402b57e1fcc68dc7e0c4a89c96360c
16.3 MB Preview Download
md5:a5a98dd192dcc44b7c17827d8fddb089
2.1 MB Preview Download
md5:e4739bac82f38ba3137bc0fa4856428e
6.8 kB Preview Download
md5:dc3ca95c75c9f70568071d777a6f1586
11.1 MB Preview Download

Additional details

Related works

Is described by
Data paper: 10.1038/s41597-025-04529-4 (DOI)

Funding

European Commission
CUDAN – Cultural Data Analytics 810961

Software

Repository URL
https://github.com/Style-Aligned-Artwork-Datasets/fruit-SALAD
Programming language
Python, Jupyter Notebook