There is a newer version of the record available.

Published October 17, 2022 | Version v1
Dataset Open

Code-mixed Indonesian-Javanese-English Twitter Dataset

  • 1. Universitas Islam Indonesia

Description

This is a Twitter dataset for code-mixed language identification. The dataset contains mixed Indonesian, Javanese, and English words. 

Files

Files (4.8 MB)

Name Size Download all
md5:f63b0af18b6ec8c8a51423267ed6e173
2.4 MB Download
md5:00c42ee77979589bcff87e8150bb7746
718.3 kB Download
md5:65fa93278936254ae967132de5e863e4
1.3 MB Download
md5:00baa79777cdf22fd3ff39fabc50f6ee
336.9 kB Download