Published October 17, 2022 | Version v1
Dataset Open

Code-mixed Indonesian-Javanese-English Twitter Dataset

  • 1. Universitas Islam Indonesia

Description

This is a Twitter dataset for code-mixed language identification. The dataset contains mixed Indonesian, Javanese, and English words. 

Files

Files (4.9 MB)

Name Size Download all
md5:2de5f69c8011d6bb9c825770057f81f3
2.4 MB Download
md5:54bcb5a3bec8fd0ceb4f49b988ae48ac
733.2 kB Download
md5:d7f650ac0ed78866d587dcfba1383d37
1.4 MB Download
md5:87c2b8721abcd8a934266aa3fccf774c
343.8 kB Download