Published December 4, 2024 | Version v1
Dataset Open

Lec-nominalizations with an adjusted secondary imperfective morpheme in Slovenian

Description

Lec-nominalizations with an adjusted secondary imperfective morpheme in Slovenian



This dataset is a derivative of Arsenijević et al. (2024). The added columns are E, F, G, H, I,  and J. All other columns are retained from the original dataset, and the original source should be referenced for them. 

 

Data collection and annotation procedure

The goal of the data collection is to identify Slovenian lec-nominalizations (in the original dataset listed as lc-) that have an adjustment of the secondary imperfectivizing morpheme not attested in the corresponding verb. For instance, the verb obračunavati ‘to calculate’ contains the secondary imprefectivizer -av-, which gets adjusted to -ov- in the corresponding lec-nominalization: obračunovalec ‘calculator’. As a rule, the adjustment is always in the direction of -ov- (which has the allomorph -ev- after a set of consonants).

 

To obtain all relevant nominalizations, the national corpus Gigafida 2.0 was searched for nominalizations ending in -ovalec and -evalec using the regular expression [lemma=".*(e|o)valec"]. The search results were then cross-referenced with the verbs listed in Arsenijević et al. (2024), and all lec-nominalizations containing an adjusted version of a verb from Arsenijević et al. (2024) were extracted.

For each verb with a corresponding nominalization involving an adjusted secondary imperfectivizer, separate corpus searches were conducted. These searches focused on:

  • The lec-nominalization without adjustment (e.g., obračunavalec), regardless of whether that nominalization is marked as possible in Arsenijević et al. (2024).

  • All attested -lec nominalizations with an adjustment of the secondary imperfective suffix (e.g., obračunovalec).

 

Summary of the columns

The results are in 3 of the added columns, 3 columns give frequencies of each individual lec-nominalization:

 

E: regular lc

Lists the expected lec-nominalization without adjustment, formed by replacing the inflectional ending -ti from the citation form of the verb with -lec.

 

F: frequency regular lc

Contains the frequency of the lec-nominalization without adjustment in Gigafida 2.0. The CQL targeted the specific lemma. For instance, the CQL used for obračunavalec was  [lemma="obračunavalec"].

 

G: lc with adjustment 1

Lists the first lec-nominalization with an adjustment derived from the verb in question and attested in Gigafida 2.0. Given the procedure described above, all items listed in this column contain an -ov-/-ev- that is not present in the base verb. 

 

H: frequency lc with adjustment 1

Contains the frequency of the lec-nominalization with an adjustment in Gigafida 2.0. The CQL targeted the specific lemma. For instance, the CQL used for obračunovalec was  [lemma="obračunovalec"].

 

I: lc with adjustment 2

Lists the second lec-nominalization with an adjustment derived from the verb in question and attested in Gigafida 2.0 (if available). 

 

J: frequency lc with adjustment 2

Contains the frequency of the lec-nominalization with an adjustment in Gigafida 2.0. The CQL targeted the specific lemma. 

 

References

Arsenijević, B., Marušič, F. L., Milosavljević, S., Mišmaš, P., Simonovic, M., & Žaucer, R. (2024). Database of the Western South Slavic Verb HyperVerb (WeSoSlaV) – Deverbal Nominalizations [Data set]. Zenodo. https://doi.org/10.5281/zenodo.14230589

 









Files

Instructions for_Lec-nominalizations with an adjusted secondary imperfective morpheme in Slovenian.pdf

Additional details

Funding

FWF Austrian Science Fund
Multifunctionality in morphology I6258
The Slovenian Research and Innovation Agency
Multifunctionality in morphology J6-4614