Published November 30, 2025 | Version v1
Dataset Open

Possible uses of syntactic complexity annotation in a parallel corpus: The case of French -ant forms in converbal function and their Czech counterparts

  • 1. Charles University
  • 2. Charles University, Faculty of Arts

Contributors

Project leader:

Project member:

Researcher:

  • 1. Charles University
  • 2. Charles University, Faculty of Arts

Description

This record contains the data and tools used and produced in the research presented in the article cited below. It also includes materials that are not described in the article, such as plots and heatmaps. Because the article is in Czech, the Protocol gives the basic background needed to understand the content. Links to datasets, plots and scripts point to an external repository, but their names correspond to files in the zipped archive in this record.

Olga Nádvorníková, Alexandr Rosen & Karolína Pavlíková (Faculty of Arts, Charles University, Prague):

Možnosti využití anotace syntaktické komplexity v paralelním korpusu: příklad francouzských tvarů na -ant v konverbální funkci a jejich českých protějšků (Possible uses of syntactic complexity annotation in a parallel corpus: The case of French -ant forms in converbal function and their Czech counterparts)

Časopis pro moderní filologii 107, 2025, no. 1, pp. 80–101

https://doi.org/10.14712/23366591.2025.1.7

The abstract:

This study explores new research opportunities offered by the InterCorp v16ud parallel corpus, annotated using the Universal Dependencies scheme and enriched with syntactic complexity (SC) measures. The analysis focuses on French sentences containing -ant forms (gerund and present participle) and their Czech translations, with participles restricted to adverbial (converbal) usage for comparability. The results show significant SC variation in literary texts, with Czech translations displaying lower values than French originals. Coefficient of variation and correlation analyses suggest that participles may function as stylistic markers, unlike gerunds. At the sentence level, participles are associated with higher SC than gerunds, though the differences are moderate. The contrastive analysis reveals substantial reductions in clausal SC measures in the Czech translations, probably due to the replacement of subordination by coordination. These shifts affect SC, information hierarchy, and occasionally temporal relations. The study underscores the potential of InterCorp v16ud for syntactic research in contrastive linguistics and beyond, while emphasizing the multidimensional nature of SC.
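The abstract mentions coefficient of variation and correlation analyses without saying how they were computed. The following is a minimal Python sketch of both measures, under stated assumptions: it uses the sample standard deviation and Pearson's r, which may differ from the choices made in the article, and the function names and the numbers in the usage example are invented for illustration and are not taken from the dataset.

```python
from math import sqrt
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (assumed definition)."""
    m = mean(values)
    if m == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return stdev(values) / m


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 items")
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sx * sy)


if __name__ == "__main__":
    # Made-up illustrative values, NOT from the dataset:
    # per-text frequency of a form and a per-text SC score.
    form_frequency = [1.2, 0.8, 2.5, 1.9, 0.4]
    sc_score = [3.1, 2.7, 4.0, 3.6, 2.2]
    print(coefficient_of_variation(form_frequency))
    print(pearson_r(form_frequency, sc_score))
```

A high coefficient of variation across texts would indicate that the use of a form differs strongly from one text to another, which is the kind of evidence the abstract associates with stylistic markers.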

Files

PROTOCOL.pdf

Files (11.6 MB)

md5:486c326d5f4c611aa328b5289687ea66 (11.3 MB)
md5:6926ed06b6d336dfc0c9f3e0333ff3ce (281.6 kB)
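The md5 values listed for the files can be used to check that a downloaded copy is intact. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listed_md5` and the path in the usage comment are hypothetical, since the record's file names are not shown in the listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file's digest with a value written as 'md5:<hex>' or '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Usage (hypothetical local path; substitute the file you downloaded):
# matches_listed_md5("downloaded_archive.zip",
#                    "md5:486c326d5f4c611aa328b5289687ea66")
```

MD5 is suitable here only as a check against accidental corruption during download, not as a security guarantee.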

Additional details

Additional titles

Translated title (Czech)
Možnosti využití anotace syntaktické komplexity v paralelním korpusu: příklad francouzských tvarů na -ant v konverbální funkci a jejich českých protějšků

Related works

Is supplement to
Journal article: 10.14712/23366591.2025.1.7 (DOI)

Funding

Ministry of Education, Youth and Sports
A lifetime with language: the nature and ontogeny of linguistic communication (CZ.02.01.01/00/23_025/0008726)

Dates

Issued
2025

Software

Programming language
Python