Published February 17, 2026 | Version v1
Dataset Open

Corpus of Multilingual Gender-Specified Fashion Prompts

  • 1. University Institute of Computer Research (IUII), University of Alicante

Description

This dataset provides a multilingual corpus of fashion-related prompts designed for the analysis of gender bias in text-to-image generation systems. It includes Spanish and Chinese prompts, with and without nationality specification. For each language–nationality configuration, the dataset contains three controlled variants (male, female, and neutral) derived from the same base prompts.

Each CSV file corresponds to one language–nationality setting and contains 100 base prompts, each expressed as three textual variants (male, female, and neutral). The dataset is intended for reproducible evaluation of gender representation and cross-linguistic effects in multimodal generative models, particularly within the fashion domain.
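As a rough illustration of that layout, the following sketch reads one such CSV into a list with one entry per base prompt. The column names `male`, `female`, and `neutral` are assumptions made for this example, since the actual headers are not documented here, and the sample row is invented rather than taken from the dataset.

```python
import csv
import io

# Hypothetical column names; the real CSV headers may differ.
GENDER_COLUMNS = ("male", "female", "neutral")


def load_prompt_variants(handle, columns=GENDER_COLUMNS):
    """Return one dict per base prompt, keyed by gender variant."""
    reader = csv.DictReader(handle)
    # Fail early if the file does not have the columns we assumed.
    missing = [c for c in columns if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing expected columns: {missing}")
    return [{c: row[c] for c in columns} for row in reader]


# Tiny in-memory stand-in for one CSV file (invented row, not dataset content).
sample = io.StringIO(
    "male,female,neutral\n"
    "a man wearing a coat,a woman wearing a coat,a person wearing a coat\n"
)
rows = load_prompt_variants(sample)
print(len(rows), rows[0]["neutral"])
```

With a downloaded file, the same function could be called on `open(path, encoding="utf-8", newline="")`. If the assumed layout holds, the returned list should have 100 entries per file, matching the description above.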

Files

chinese no nationality.csv

Files (240.3 kB)

md5:5f29665caa815163909d5f5213809d21 (53.7 kB)
md5:90a2b167547e5821905c1fab2f69306f (55.6 kB)
md5:1bf43682eeea476d9c6b8824fd7e3928 (64.1 kB)
md5:83d5df7a5ed2d420ace732b40bab3019 (66.9 kB)