Planned intervention: On Wednesday April 3rd 05:30 UTC Zenodo will be unavailable for up to 2-10 minutes to perform a storage cluster upgrade.
Published July 19, 2022 | Version v1
Conference paper Open

UsingWiktionary to Create Specialized Lexical Resources and Datasets

  • 1. Austrian Academy of Sciences
  • 2. DFKi GmbH

Description

This paper describes an approach aiming at utilizing Wiktionary data for creating specialized lexical datasets which can be
used for enriching other lexical (semantic) resources or for generating datasets that can be used for evaluating or improving
NLP tasks, like Word Sense Disambiguation, Word-in-Context challenges, or Sense Linking across lexicons and dictionaries.
We have focused on Wiktionary data about pronunciation information in English, and grammatical number and grammatical
gender in German.

Files

2022.lrec-1.370(1).pdf

Files (170.5 kB)

Name Size Download all
md5:10c843140df08e3e6c5f58cc14ca1c19
170.5 kB Preview Download

Additional details

Funding

ELEXIS – European Lexicographic Infrastructure 731015
European Commission
Pret-a-LLOD – Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors 825182
European Commission