Published July 29, 2024 | Version v1
Dataset
Open
RoMEMES
Authors/Creators
- 1. Research Institute for Artificial Intelligence "Mihai Draganescu", Romanian Academy
Description
RoMEMES is a dataset of Romanian-language memes collected from public social media platforms. Each meme was manually annotated with its associated Romanian text, image complexity, sentiment, and political content. The dataset also includes associated metadata, and the text was automatically annotated in the RELATE platform with part-of-speech tags, lemmas, and dependency parses.
Files and folders in this dataset:
- metadata.tsv - contains metadata and annotations; the first column is the file ID
- text - folder containing text files, following the file naming convention ID.txt
- images - folder containing image files, following the file naming convention ID.extension, where extension is the original file extension (this sometimes does not match the file's actual MIME type, as indicated in metadata.tsv)
- conllup - folder containing automatic text annotations, created in the RELATE platform, following the file naming convention ID.conllup
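The layout above can be traversed with a short loader. The following is a minimal sketch rather than an official reader: the function name `load_romemes`, the `has_header` flag, and the keys of the returned dictionaries are hypothetical, and it assumes that metadata.tsv is tab-separated, UTF-8 encoded, unquoted, and starts with a header row. Only the folder names, the naming conventions, and the fact that the first column holds the file ID come from the description.

```python
import csv
from pathlib import Path


def load_romemes(root, has_header=True):
    """Yield one dict per metadata row, paired with the files that share its ID.

    Assumptions (not confirmed by the dataset description): metadata.tsv is
    UTF-8, tab-separated, unquoted, and has a header row.
    """
    root = Path(root)
    with open(root / "metadata.tsv", newline="", encoding="utf-8") as handle:
        # QUOTE_NONE treats quote characters in fields as literal text.
        rows = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        header = next(rows, None) if has_header else None
        for row in rows:
            if not row:
                continue  # skip blank lines
            file_id = row[0]  # the first column is the file ID
            text_path = root / "text" / f"{file_id}.txt"
            conllup_path = root / "conllup" / f"{file_id}.conllup"
            # The image extension varies, so match any file named ID.<something>.
            images = sorted((root / "images").glob(f"{file_id}.*"))
            yield {
                "id": file_id,
                "metadata": dict(zip(header, row)) if header else row,
                "text": text_path.read_text(encoding="utf-8") if text_path.exists() else None,
                "image": images[0] if images else None,
                "conllup": conllup_path if conllup_path.exists() else None,
            }
```

Calling `list(load_romemes("romemes"))` on the extracted archive would give one record per row, assuming the archive unpacks into a folder of that name. Because the image extension may not match the real MIME type, it is safer to rely on the type recorded in metadata.tsv than on the file suffix when decoding images.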
Files
| Name | Size | Checksum |
|---|---|---|
| romemes.zip | 84.7 MB | md5:ab0dfe42640b6aecd7bed3a7889d57d6 |
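The MD5 checksum listed for romemes.zip can be used to confirm that a download is complete. The snippet below is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `verify_archive` and the local path in the usage note are hypothetical, while the expected digest is the one given in the file listing.

```python
import hashlib

# Digest copied from the file listing for romemes.zip.
EXPECTED_MD5 = "ab0dfe42640b6aecd7bed3a7889d57d6"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()
```

For example, `verify_archive("romemes.zip")` should return `True` for an intact copy saved under that name in the current directory. MD5 is suitable here only as an integrity check against accidental corruption, not as protection against tampering.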