Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published December 19, 2017 | Version v1
Journal article Open

Syntactically Coded Corpus of Spoken Lithuanian: Developmental Issues and Pilot Studies

  • 1. Vytautas Magnus University, Lithuania
  • 2. Saint-Petersburg State Pediatric Medicine University, Russian Federation

Description

The paper deals with the main methodological issues of development of the Corpus of Spoken Lithu­anian with particular attention to its syntactic coding and applications for automatized language anal­ysis. First, we consider a methodology of development of the Corpus as well as the principles of tran­scribing and coding Lithuanian speech data. The main concepts, such as "utterance" "sentence", etc. are discussed. Second, we present results of a pilot study in interrogatives that are typical for natural spontaneous spoken Lithuanian. Results of the automatized analysis of interrogatives revealed that a frequency and distribution of the Wh- and yes/ no questions is rather similar. Among the Wh- ques­tions, the questions non-containing the interrogative particle seem to be dominant, while the ques­tions containing the interrogative particle at the beginning ot at the end were much rarer. Among the different functional subtypes of Wh- questions, adverbial ones seem to be the most freequent; among the adverbial Wh- questions, the spatial ones were the most frequent. Certainly, the present study is rather pilot due to the novelty of automatized syntactic approach to the data of spoken Lithuanian, thus much more complex studies still await for future investigations. A use of interrogative sentences will be studied from the perspective of different genres (e.g., monologue vs dialogue), social characteristic of the speakers, and a situation of conversation (e.g., public vs private speech). Generally, we believe that future systematic corpus-based research of spontaneous spoken language will give more possi­bilities to identify, evaluate, and elaborate the development of the Lithuanian language.

Files

10.5755_j01.sal.0.28.15131.pdf

Files (692.9 kB)

Name Size Download all
md5:e0ea059464048af6017f6a94cc631a7b
692.9 kB Preview Download