Published July 11, 2025
| Version Version 1
Dataset
Open
ID2 : Indonesian Dataset2
Authors/Creators
Description
ID2 (Indonesian Dataset 2) is an Indonesian speech dataset that features dialectal variations recorded from 31 speakers belonging to various ethnic groups in Indonesia, namely Javanese, Sundanese, Batak, Balinese, and Minang. The speakers comprise both male and female individuals aged between 17 and 25 years. This dataset includes 330 sentences from diverse domains, accompanied by manually created transcriptions. The dataset has a total of 10,230 sentences, spanning 7 hours, 40 minutes, and 48 seconds.
Notes
Files
Files
(1.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:fa05ecfdde7661ac16568aa0dde1f4ea
|
1.3 GB | Download |
Additional details
Funding
- Ministry of Education and Culture
- Indonesian Education Scholarship, Center for Higher Education Funding and Assessment, and Indonesian Endowment Fund for Education, Indonesia 01431/J5.2.3./BPI.06/9/2022