Published February 29, 2024 | Version v1
Dataset Open

Flickr8k-ca

  • 1. Barcelona Supercomputing Center

Description

The Flickr8k-ca dataset is a professional translation of the Flickr8k dataset into Catalan, commissioned by the BSC LangTech Unit.

Flickr8k is a dataset for sentence-based image description. It consists of 8,000 images collected from Flickr, each paired with 5 reference captions written by human annotators (https://www.kaggle.com/datasets/adityajn105/flickr8k/data).

This work was funded by the Departament de la Vicepresidència i de Polítiques Digitals i Territori de la Generalitat de Catalunya within the framework of Projecte AINA.

Files

Name: ca_flickr8k.csv
Size: 3.6 MB
md5:69c0ba8133bf93d04b19d553534831b4
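
The MD5 checksum listed for ca_flickr8k.csv can be used to confirm that a download is intact. The following is a minimal sketch, assuming the file has been saved in the working directory under its original name; the helper names md5_of_file and matches_published_md5 are illustrative and are not part of this record.

```python
import hashlib
from pathlib import Path

# Checksum as listed for ca_flickr8k.csv in this record.
EXPECTED_MD5 = "69c0ba8133bf93d04b19d553534831b4"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected one."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed download location; adjust to wherever the file was saved.
    local_copy = Path("ca_flickr8k.csv")
    if local_copy.exists():
        status = "OK" if matches_published_md5(local_copy) else "MISMATCH"
        print(f"{local_copy}: {status}")
    else:
        print(f"{local_copy} not found; download it from the record first")
```

A mismatch usually points to an incomplete or corrupted download, in which case fetching the file again is the simplest fix.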