Published July 29, 2024 | Version v1

RoMEMES

  • 1. Research Institute for Artificial Intelligence "Mihai Draganescu", Romanian Academy

Description

RoMEMES is a dataset of Romanian language memes, collected from public social media platforms. The dataset was manually annotated with: associated text in Romanian language, image complexity, sentiment, political content. In addition, the dataset contains associated metadata and the text part was automatically annotated in the RELATE platform with part-of-speech tags, lemmas, and dependency parsing.

Files and folders in this dataset:

  • metadata.tsv - contains metadata and annotations; the first column is the file ID
  • text - folder containing text files, following the file naming convention ID.txt
  • images - folder containing image files, following the file naming convention ID.extension, where extension is the original file extension (sometimes this may not correspond with the mime/type of the file, as indicated in metadata.tsv)
  • conllup - folder containing automatic text annotations, created in the RELATE platform, following the file naming convention ID.conllup

 

Files

romemes.zip

Files (84.7 MB)

Name Size Download all
md5:ab0dfe42640b6aecd7bed3a7889d57d6
84.7 MB Preview Download