Possible uses of syntactic complexity annotation in a parallel corpus: The case of French -ant forms in converbal function and their Czech counterparts
Authors/Creators
Contributors
Project leader:
Project member:
Researcher:
Description
The content consists of data and tools used and produced during the research presented in the article specified below. It includes some additional materials which are not described in the article, including visualizations as plots and heatmaps. Because the article is in Czech, the Protocol provides some rudimentary background needed for introducing the content. Links to datasets, plots and scripts refer to an external repository, but their names correspond to files in the zipped archive within this project.
Olga Nádvorníková, Alexandr Rosen & Karolína Pavlíková (Faculty of Arts, Charles University, Prague):
Možnosti využití anotace syntaktické komplexity v paralelním korpusu: příklad francouzských tvarů na -ant v konverbální funkci a jejich českých protějšků (Possible uses of syntactic complexity annotation in a parallel corpus: The case of French -ant forms in converbal function and their Czech counterparts)
Časopis pro moderní filologii 107, 2025, č. 1, s. 80–101
https://doi.org/10.14712/23366591.2025.1.7
The abstract:
This study explores new research opportunities offered by the InterCorp v16ud parallel corpus, annotated using the Universal Dependencies scheme and enriched with syntactic complexity (SC) measures. The analysis focuses on French sentences containing -ant forms (gerund and present participle) and their Czech translations, with participles restricted to adverbial (converbal) usage for comparability. The results show significant SC variation in literary texts, with Czech translations displaying lower values than French originals. Coefficient of variation and correlation analyses suggest that participles may function as stylistic markers, unlike gerunds. At the sentence level, participles are associated with higher SC than gerunds, though the differences are moderate. The contrastive analysis reveals substantial reductions in clausal SC measures in the Czech translations, probably due to the replacement of subordination by coordination. These shifts affect SC information hierarchy, and occasionally temporal relations. The study underscores the potential of InterCorp v16ud for syntactic research in contrastive linguistics and beyond, while emphasizing the multidimensional nature of SC.
Files
PROTOCOL.pdf
Files
(11.6 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:486c326d5f4c611aa328b5289687ea66
|
11.3 MB | Preview Download |
|
md5:6926ed06b6d336dfc0c9f3e0333ff3ce
|
281.6 kB | Preview Download |
Additional details
Additional titles
- Translated title (Czech)
- Možnosti využití anotace syntaktické komplexity v paralelním korpusu: příklad francouzských tvarů na -ant v konverbální funkci a jejich českých protějšků
Related works
- Is supplement to
- Journal article: 10.14712/23366591.2025.1.7 (DOI)
Funding
- Ministry of Education Youth and Sports
- A lifetime with language: the nature and ontogeny of linguistic communication CZ.02.01.01/00/23_025/0008726
Dates
- Issued
-
2025
Software
- Programming language
- Python