Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study

doi:10.5281/zenodo.4026188

Published September 9, 2020 | Version v1

Conference paper Open

Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study

1. Saint-Petersburg State University
2. Saint Petersburg State University
3. St. Petersburg State University
4. St Petersburg state university
5. SPBU

The paper presents recent results of a multilevel analysis of representative corpus data, conducted in order to identify key speech parameters (lexical, morphological and syntactic) that can diagnose some social/biological characteristics of a speaker or, more broadly, a modern Russian urban sociolect. The study is based on the everyday Russian speech corpus One Speakers Day. Specific data were obtained on the analysis of the annotated subcorpus of 289,205 tokens, which includes recorded speech days of 57 men and 48 women, which were the research participants, as well as speech fragments of 87 men and 139 women, which were their interlocutors. Thus, the total number of speakers in the subsample amounts to 144 men and 187 women. The article also begs the question of Data Mining approach usability to the subcorpus and possibilities of further research using machine learning. The results obtained are important for the optimization of speech technologies systems, for theoretical understanding of linguistic processes, as well as for monitoring various social processes taking place in modern Russian metropolis.

Files

CUsersRussiaFRUCTprocessing3.Zenodo_DOI..2.FRUCT_PublicationFRUCT27papersBog.pdf

Files (1.3 MB)

Name	Size	Download all
CUsersRussiaFRUCTprocessing3.Zenodo_DOI..2.FRUCT_PublicationFRUCT27papersBog.pdf md5:4cb669b39c849d5599ee6997637be148	1.3 MB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	42	42
Downloads	36	36
Data volume	50.3 MB	50.3 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

FRUCT Oy

Published in

Proceedings of the 27th FRUCT conference, 27, 288-293, 2020.

Imprint

ISBN: 978-952-69244-3-4.

Conference

The 27th IEEE Conference of Open Innovations Association FRUCT (FRUCT27) , Trento, Italy, 7-9 September 2020

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: September 12, 2020
Modified: July 19, 2024

Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study

Creators

Description

Files

CUsersRussiaFRUCTprocessing3.Zenodo_DOI..2.FRUCT_PublicationFRUCT27papersBog.pdf

Files (1.3 MB)