Published June 6, 2026 | Version v1

Dataset for "From Typification to Self-Analysis: Semantics of Human Properties in Zoshchenko's Novellas Based on Corpus Data"

  • 1. ROR icon Almaty University of Power Engineering and Telecommunications

Description

This dataset contains corpus-derived card indexes prepared for the article “От типизации к самоанализу: семантика свойств человека в повестях Зощенко на основе корпусных данных” [“From Typification to Self-Analysis: Semantics of Human Properties in Zoshchenko’s Novellas Based on Corpus Data”], intended for submission to Vestnik of Saint Petersburg University. Language and Literature.

The dataset is based on materials extracted from the Russian National Corpus and includes data on the semantic group “Human properties” in two novellas by Mikhail Zoshchenko: Returned Youth [«Возвращенная молодость»] and Before Sunrise [«Перед восходом солнца»]. It also includes an additional grammatical layer containing adjectival and adverbial contexts relevant to the analysis of human qualities, social characterization, self-description, and evaluative semantics.

The archive contains the original corpus exports, cleaned and annotated card indexes, normalized frequency calculations using IPM (instances per million words), CSV versions of the tables for machine readability, a data dictionary, methodological notes, file manifest, checksums, citation metadata, and rights/licensing information.

The dataset is intended to support reproducibility of the quantitative and qualitative analysis presented in the article. It may be useful for researchers working in corpus linguistics, Russian literary studies, Zoshchenko studies, semantic analysis, and the study of evaluative vocabulary in Russian prose of the 1930s–1940s.

Files

Zoshchenko_Zenodo_SPbU_Yazyk_i_literatura_dataset_v1.zip

Files (623.0 kB)