Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data

Stanković, Ranka; Chiarcos, Christian; Utvić, Miloš; Kitanović, Olivera

doi:10.5281/zenodo.10995982

Published April 19, 2024 | Version v1

Conference proceeding Open

Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data

1. University of Belgrade, Faculty of Mining and Geology
2. Goethe-Universität Frankfurt am Main
3. University of Belgrade

This paper describes a case study on the generation of Linked Data text corpora using the NLP Interchange Format (NIF). The ELTEC corpus subset, which consists of 900 novels from the period 1840-1920 for 9 European languages, served as the basis for this research. The annotated version of the novels, in the so-called TEI level-2 format, was transformed into NIF, an RDF/OWL-based format that aims to achieve interoperability between NLP tools, language resources, and annotations. In this paper, we present our approach for transformation, and the implemented pipeline, and offer the code and results for similar use cases.

Files

2023.ldk-1.16.pdf

Files (407.3 kB)

Name	Size	Download all
2023.ldk-1.16.pdf md5:fad65ef8ab44d410cc779f03d3415402	407.3 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	79	79
Downloads	37	37
Data volume	15.9 MB	15.9 MB

More info on how stats are collected....

DOI

Resource type

Conference proceeding

Publisher

Zenodo

Conference

Proceedings of the 4th Conference on Language, Data and Knowledge (LDK2023) , Vienna, Austria

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: April 19, 2024
Modified: July 6, 2024

Towards ELTeC-LLOD: European Literary Text Collection Linguistic Linked Open Data

Authors/Creators

Description

Files

2023.ldk-1.16.pdf

Files (407.3 kB)