Published July 10, 2023 | Version 1.0.0
Report Open

Automatic transcriptions of intonational phonology - Generating ToDI transcriptions with a small training set

Authors/Creators

  • 1. Leiden University

Description

This study evaluates and compares the performance of automatic transcription systems for the ToBI (Tones and Break Indices) and ToDI (Transcription of Dutch Intonation) frameworks. The focus is on matching or surpassing the results of previous systems while training on a relatively small data set. Using recent advances in Natural Language Processing (NLP), the research shows the potential to achieve comparable or superior performance in generating ToDI transcriptions of intonational phonology for boundary detection and boundary classification, even with limited labelled data. For accent detection and accent classification, no results substantially better than the majority-class baseline were obtained.

Notes

This paper was written as a BA thesis.

Files

Archive.zip

Files (117.4 kB)

Name Size
md5:85e60619c67e6f375ed8ae0a83c23da3 116.6 kB
md5:b9e9856d4e9fecd4ff0208818d969197 772 Bytes