Published July 13, 2023 | Version v1
Dataset Open

FaceAttDB: A Multilingual Dataset for Facial Attribute Captioning

  • 1. Manarat International University

Description

The FaceCaption dataset is a curated collection specifically created for the purpose of research in the field of facial attribute captioning. It consists of 2,000 portrait images sourced from the CelebA dataset, showcasing a diverse range of facial characteristics such as age, gender, expression, and hair color. The dataset includes five captions per image, providing both English and Google-translated Bangla versions.

The dataset is designed to facilitate the exploration of multilingual caption generation on portrait images. Each image in the dataset is accompanied by descriptive and informative captions that accurately describe the visual characteristics present in the image. The captions were generated based on the attribute annotations available in the CelebA dataset, ensuring a close alignment between the captions and the visual attributes.

The images in the BanglaFaceCaption dataset are conveniently stored in a single folder, making them easily accessible for training and evaluation purposes. Additionally, an accompanying Excel sheet is provided, linking each image file with its corresponding English and Bangla captions.

While the current version of the dataset comprises 2,000 images with five captions each, future work aims to expand the dataset size to enhance the diversity and robustness of models trained on it. The BanglaFaceCaption dataset serves as a valuable resource for researchers and practitioners interested in advancing the field of facial attribute captioning and exploring multilingual caption generation capabilities.

Files

img_caption.csv

Files (115.5 MB)

Name Size Download all
md5:90450c0fcacb1b60ea22518576771888
100.4 MB Download
md5:1a45f254877745f7320ea6562a7ce8a3
13.9 MB Download
md5:320e239408573477609f09ca842dead0
1.3 MB Preview Download