FaceAttDB: A Multilingual Dataset for Facial Attribute Captioning

Naimul Haque; Abida Sultana

doi:10.5281/zenodo.8144361

Published July 13, 2023 | Version v1

Dataset Open

FaceAttDB: A Multilingual Dataset for Facial Attribute Captioning

1. Manarat International University

The FaceCaption dataset is a curated collection specifically created for the purpose of research in the field of facial attribute captioning. It consists of 2,000 portrait images sourced from the CelebA dataset, showcasing a diverse range of facial characteristics such as age, gender, expression, and hair color. The dataset includes five captions per image, providing both English and Google-translated Bangla versions.

The dataset is designed to facilitate the exploration of multilingual caption generation on portrait images. Each image in the dataset is accompanied by descriptive and informative captions that accurately describe the visual characteristics present in the image. The captions were generated based on the attribute annotations available in the CelebA dataset, ensuring a close alignment between the captions and the visual attributes.

The images in the BanglaFaceCaption dataset are conveniently stored in a single folder, making them easily accessible for training and evaluation purposes. Additionally, an accompanying Excel sheet is provided, linking each image file with its corresponding English and Bangla captions.

While the current version of the dataset comprises 2,000 images with five captions each, future work aims to expand the dataset size to enhance the diversity and robustness of models trained on it. The BanglaFaceCaption dataset serves as a valuable resource for researchers and practitioners interested in advancing the field of facial attribute captioning and exploring multilingual caption generation capabilities.

Files

img_caption.csv

Files (115.5 MB)

Name	Size	Download all
image_array.npy md5:90450c0fcacb1b60ea22518576771888	100.4 MB	Download
img2k.rar md5:1a45f254877745f7320ea6562a7ce8a3	13.9 MB	Download
img_caption.csv md5:320e239408573477609f09ca842dead0	1.3 MB	Preview Download

	All versions	This version
Views	561	558
Downloads	433	432
Data volume	13.5 GB	13.5 GB

FaceAttDB: A Multilingual Dataset for Facial Attribute Captioning

Authors/Creators

Description

Files

img_caption.csv

Files (115.5 MB)