A Dataset for Classification of Tamil Memes

Shardul Suryawanshi; Bharathi Raja Chakravarthi; Pranav Verma; Mihael Arcan; John Philip McCrae; Paul Buitelaar

doi:10.5281/zenodo.3842631

Published May 11, 2020 | Version v1

Conference paper Open

A Dataset for Classification of Tamil Memes

1. National University of Ireland Galway

Social media are interactive platforms that facilitate the creation or sharing of information, ideas or other forms of expression among people. This exchange is not free from offensive, trolling or malicious contents targeting users or communities. One way of trolling is by making memes, which in most cases combines an image with a concept or catchphrase. The challenge of dealing with memes is that they are region-specific and their meaning is often obscured in humour or sarcasm. To facilitate the computational modelling of trolling in the memes for Indian languages, we created a meme dataset for Tamil (TamilMemes). We annotated and released the dataset containing suspected trolls and not-troll memes. In this paper, we use the a image classification to address the difficulties involved in the classification of troll memes with the existing methods. We found that the identification of a troll meme with such an image classifier is not feasible which has been corroborated with precision, recall and F1-score

Files

suryawanshi2020dataset.pdf

Files (530.7 kB)

Name	Size	Download all
suryawanshi2020dataset.pdf md5:dda3f955cffa35801b42ab76dfdd528a	530.7 kB	Preview Download

Additional details

European Commission
Pret-a-LLOD - Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors 825182

383

Views

184

Downloads

Show more details

	All versions	This version
Views	383	382
Downloads	184	184
Data volume	110.4 MB	110.4 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Conference

5th Workshop on Indian Language Data: Resources and Evaluation (WILDRE-5) at LREC-2020

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: May 25, 2020
Modified: July 19, 2024

A Dataset for Classification of Tamil Memes

Authors/Creators

Description

Files

suryawanshi2020dataset.pdf

Files (530.7 kB)

Additional details

Funding