Published June 27, 2023 | Version 0.0.1
Dataset Open

Swahili Image Captioning Dataset

Description

The SwaFlickr8k dataset is an extension of the well-known Flickr8k dataset, specifically designed for image captioning tasks. It includes a collection of images and corresponding captions written in Swahili. With 8,091 unique images and 40,455 captions, this dataset provides a valuable resource for research and development in the field of image understanding and language processing, particularly in the context of Swahili language.

Files

captions_sw.csv

Files (6.0 MB)

Name Size Download all
md5:0eb45ca78c57b4e1d8fcdfc37fbc5b2d
6.0 MB Preview Download