Dataset Open Access

SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets

Patwa, Parth; Aguilar, Gustavo; Kar, Sudipta; Pandey, Suraj; PYKL, Srinivas; Gamb ̈ack, Bj ̈orn; Chakraborty, Tanmoy; Solorio, Thamar; Das, Amitava

There are 2 sub-tasks: sentiment analysis for Spanglish (Spanish-English) and for Hinglish (Hindi-English).

The sentiment classes are Positive, negative, neutral. 

Hinglish dataset has 20k instances.

Spanglish dataset has ~19k instances. 

Website: https://ritual-uh.github.io/sentimix2020/

Files (2.8 MB)
Name Size
Semeval_2020_task9_data.zip
md5:4d93553c213b3a966b6d55b7a7c03be7
2.8 MB Download
715
353
views
downloads
All versions This version
Views 715715
Downloads 353353
Data volume 991.1 MB991.1 MB
Unique views 642642
Unique downloads 339339

Share

Cite as