Published September 18, 2023 | Version v1
Dataset Open

PolyMeme

  • 1. Aristotle University of Thessaloniki
  • 2. CERTH-ITI

Description

Internet Memes have emerged as a dominant new form of mass media and communication and they often consist of images that combine text with image and aim to express humor, irony, sarcasm, or sometimes convey hatred and misinformation. By recognizing them, we can characterize the trend of today's culture and avoid issues related to the spread of hateful and harmful content.

Memes can take various forms which can be split into several categories. Existing datasets typically do not recognize these categories and do not provide a set of images with significant size and diversity, so we created one that sufficiently satisfies those requirements. The collection is gathered from Reddit and is semi-automatically labelled. More precisely the dataset contains ~27k Internet image memes categorized as "Image Macro", "Object Labeling", "Screenshots" and "Text out of Image".

Files

polymeme.csv

Files (1.7 MB)

Name Size Download all
md5:64a8db4818c75ec9f1a137f46397a4f3
1.7 MB Preview Download

Additional details

Funding

vera.ai – vera.ai: VERification Assisted by Artificial Intelligence 101070093
European Commission