Published March 13, 2026 | Version 1.1
Dataset Open

Annotated Corpus of English–Ukrainian Simultaneous Interpreting with Cognitive Indices (2015–2025)

  • 1. ROR icon Taras Shevchenko National University of Kyiv

Description

This dataset contains a small annotated corpus of English–Ukrainian simultaneous interpreting excerpts prepared for research in interpreting studies. The material was collected in the course of work on a dissertation devoted to cognitive processes in simultaneous interpreting.

The corpus includes 200 short fragments reflecting typical situations of institutional and conference interpreting between 2015 and 2025. The examples are based on publicly available speeches, meetings, press briefings and conference recordings. In order to make the material comparable, the excerpts were normalized and formatted as separate research segments.

Each segment is supplied with basic contextual information such as year, institutional domain, event type and interpreting direction (English–Ukrainian or Ukrainian–English). Reference links to public recordings are provided for orientation, although the corpus itself contains only short analytical fragments.

In addition to the textual data, the dataset includes cognitive index annotation used in the analysis of interpreting performance. The indices (EVS, BKN, IR, IDA, KTI, IBS) were calculated for experimental and statistical analysis within a multidimensional model of interpreting processes.

The dataset is intended for academic use in interpreting studies, translation research and corpus-based analysis.

Files

Annotated_Reference_Corpus_EN_UK_2015_2025.csv

Files (146.6 kB)

Name Size Download all
md5:a805499674c0ab7cad654a4164cf6313
113.3 kB Preview Download
md5:b3da5a524537fd58a8a1f537f216fe13
26.7 kB Download
md5:7a1153575fa2d64f5f582e95bdb21845
6.6 kB Preview Download

Additional details

Software

Development Status
Active