Speech corpus of Armenian question-answer dialogues

Samuel Chakmakjian; Hossep Dolatian

doi:10.5281/zenodo.7088127

Published September 17, 2022 | Version v1.0

Video/Audio Open

Speech corpus of Armenian question-answer dialogues

1. INALCO
2. Stony Brook University

This is a corpus of elicited controlled speech. The stimuli was a sequence of dialogues with intermittent fillers. This repository is for only the stimuli. The stimuli was designed to elicit intonation patterns for questions and answers in two Armenian dialects: Western Armenian (WA) and Eastern Armenian (EA). The recordings can be used for topics like intonation prosody or ASR (Automatic Speech Recognition).

The dataset is is open-access at 8,852 dialogues, consisting of 23,711 utterances (individual sound files), for a total of 2.7GB and 8.5hrs. Each utterance has a sound file, a Praat TextGrid (with full linguistic annotation), and text file that has orthographic forms for easier ASR uses. Pronunciation dictionaries are provided for ASR purposes as well.

Files

jhdeov/armenian-intonation-v1.0.zip

Files (2.2 GB)

Name	Size	Download all
jhdeov/armenian-intonation-v1.0.zip md5:705d0533b5017b68aa3a5507c05c7552	2.2 GB	Preview Download

Additional details

Is supplement to: https://github.com/jhdeov/armenian-intonation/tree/v1.0 (URL)

DiCanio, Christian. 2020. Sound file subdivision. Praat script. Accessed April 15 2022 from https://www.acsu.buffalo.edu/~cdicanio/scripts/Sound_file_division.praat

	All versions	This version
Views	231	23
Downloads	49	7
Data volume	114.9 GB	15.5 GB

jhdeov/armenian-intonation-v1.0.zip

Files (2.2 GB)

Related works

References

Speech corpus of Armenian question-answer dialogues

Authors/Creators

Description

Files

jhdeov/armenian-intonation-v1.0.zip

Files (2.2 GB)

Additional details

Related works

References