Published April 24, 2024 | Version v1

ELTeC-NIF: European Literary Text Collection LLOD

  • 1. University of Belgrade, Faculty of Mining and Geology
  • 2. Goethe-Universität Frankfurt am Main

Contributors

Data curator:

  • 1. University of Belgrade, Faculty of Mining and Geology

Description

The  European Literary Text Collection - ELTeC is transformed into Linguistic Linked Open Data text corpora using the NLP Interchange Format (NIF). Namely, the ELTEC corpus subset, which consists of 1000 novels from the period 1840-1920 for 10 European languages, served as the basis for this edition. From each novel, not more than 1000 sentences were used. The annotated version of the novels, in the so-called TEI level-2 format, was transformed into NIF, an RDF/OWL-based format that aims to achieve interoperability between NLP tools, language resources, and annotations.  

Files

NIF2-deu-1000.zip

Files (1.2 GB)

Name Size
md5:60036c393cacb25df5a9adf9e2639d41
120.7 MB Preview Download
md5:1d4b1e71cc9ad45a7a31902b82f04d95
107.9 MB Preview Download
md5:b6c690bb7efff6cdc9c012e5e2411d83
248.6 MB Preview Download
md5:890771db91b8bcc0896d926af9c7b491
104.5 MB Preview Download
md5:ddc680c5ecd2b0563ee1008c92a73c38
92.2 MB Preview Download
md5:e6b46d9af88d21ec64411d411ddcd512
97.8 MB Preview Download
md5:b0f97260d5e751028897d3c85b831d34
80.0 MB Preview Download
md5:e0b9f592a55ac01985de66bbf595af94
102.5 MB Preview Download
md5:5b4b1e7c74333a7477c09af05f2f467a
126.9 MB Preview Download
md5:c3d0c9a2b987f7cf785869c4f3c5f4a0
97.5 MB Preview Download

Additional details

Related works

Is identical to
Dataset: 10.57771/kwep-2b70 (DOI)
References
Conference paper: 10.5281/zenodo.10995982 (DOI)