Automatic Geminate Insertion Algorithm for Japanese Audio Data

Hirofumi Maeda; Kenta Yamamoto

doi:10.35940/ijrte.B6284.0710221

Published July 30, 2021 | Version v1

Journal article Open

Automatic Geminate Insertion Algorithm for Japanese Audio Data

1. Department of Information Science and Technology, National Institute of Technology (KOSEN), Yuge College, Ehime Prefecture, Japan.
2. Department of General Education, National Institute of Technology (KOSEN), Yuge College, Ehime Prefecture, Japan

Contributors

Sponsor:

Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP)¹

1. Publisher

Generally, it is quite difficult for Japanese language learners to acquire Japanese special morae, namely, geminate, syllabic nasals and long vowels compared to independent morae. Among these three special morae, geminate is particularly difficult, and it takes much longer to fully acquire both production and perception of it. Especially for learners of Chinese native speakers, previous studies has shown that both production and perception of geminate are difficult in terms of the fact that not only no geminate is found in Chinese language, but also the phonological interaction between Japanese accent and Chinese tones. However, in the field of Japanese speech acquisition, research has not making progress because of a major problem, that is, researchers themselves manually create the acoustic experiment stimuli. Therefore, in this study, as a method to solve this problem, we propose an algorithm that automatically inserts geminate into the audio data used in Japanese speech acquisition research. This algorithm automates the insertion of geminate by performing three processes in order: mora extraction by noise removal, matching of original audio data and extracted mora, and insertion of soundless duration and geminate. The algorithm makes it possible to remove the noise, which is -50 dBFS and continues for 10ms or more, and replace it with soundless duration instead, allowing Japanese native speakers to percept it as geminate. The accuracy was equivalent as a result of comparing the data that was manually modified by a phonology researcher with the data that was generated by the algorithm. The result shows that the algorithm can be a practical solution for the automation of geminate insertion.

Files

B62840710221.pdf

Files (752.8 kB)

Name	Size	Download all
B62840710221.pdf md5:1d30f18bf501967b6dc31b67faa8be03	752.8 kB	Preview Download

Additional details

Is cited by: Journal article: 2277-3878 (ISSN)

ISSN: 2277-3878
Retrieval Number:: 100.1/ijrte.B62840710221

	All versions	This version
Views	112	112
Downloads	94	93
Data volume	72.3 MB	71.5 MB

Contributors

Sponsor:

B62840710221.pdf

Files (752.8 kB)

Related works

Subjects

Automatic Geminate Insertion Algorithm for Japanese Audio Data

Authors/Creators

Contributors

Sponsor:

Description

Files

B62840710221.pdf

Files (752.8 kB)

Additional details

Related works

Subjects