Enriching Slovene wordnet with domain-specific terms

Špela Vintar; Darja Fišer

doi:10.5281/zenodo.283489

Published February 9, 2017 | Version v1

Book chapter Open

1. Dept. of Translation, Faculty of Arts, University of Ljubljana

The paper describes an innovative approach to expanding the domain coverage of the Slovene wordnet (sloWNet) by exploiting multiple resources. In the experiment described here, we use a large monolingual Slovene corpus of texts from the domain of informatics as the source of terminology, and a parallel English-Slovene corpus and an online dictionary as bilingual resources to facilitate the mapping of terms to sloWNet. We first identify the core terms of the domain in English using Princeton University's WordNet 2.1, and then translate them into Slovene using a bilingual lexicon produced from the parallel corpus. In the next step, we extract multi-word terms from the Slovene domain-specific corpus using a hybrid approach, and finally match the term candidates to existing wordnet synsets. The proposed method appears to be a successful way to improve the domain coverage of the wordnet, as it yields abundant term candidates and exploits various multilingual resources.
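The pipeline outlined in the abstract (core English terms, translation through a bilingual lexicon, matching of Slovene term candidates to synsets) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the synset ids, the toy lexicon, the candidate list, and the head-word matching rule are hypothetical and are not taken from the chapter, which does not specify its matching procedure in the abstract.

```python
from collections import defaultdict

# Toy data, invented for illustration only (not from the chapter).
# English core domain terms keyed by a hypothetical synset id.
core_synsets = {
    "syn-disk": ["disk"],
    "syn-network": ["network"],
}

# Hypothetical English -> Slovene lexicon, standing in for one
# derived from a parallel corpus.
lexicon = {
    "disk": ["disk"],
    "network": ["omrežje"],
}

# Hypothetical multi-word term candidates as (term, head word) pairs.
# The head word is supplied by hand here; a real system would need
# linguistic analysis to find it.
candidates = [
    ("trdi disk", "disk"),
    ("lokalno omrežje", "omrežje"),
    ("spletna stran", "stran"),
]


def translate_synsets(synsets, lexicon):
    """Index each Slovene translation by the synset ids it can express."""
    index = defaultdict(set)
    for synset_id, literals in synsets.items():
        for literal in literals:
            for translation in lexicon.get(literal, []):
                index[translation].add(synset_id)
    return index


def match_candidates(candidates, index):
    """Link each candidate to the synsets whose translation equals its head.

    Assumption: head-word equality is used as the matching criterion.
    """
    matched, unmatched = {}, []
    for term, head in candidates:
        synset_ids = index.get(head)
        if synset_ids:
            matched[term] = sorted(synset_ids)
        else:
            unmatched.append(term)
    return matched, unmatched


index = translate_synsets(core_synsets, lexicon)
matched, unmatched = match_candidates(candidates, index)
print(matched)
print(unmatched)
```

In this sketch, candidates whose head word has no translated synset are collected separately, so they could be reviewed by hand instead of being silently dropped.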

Files

3.pdf (144.1 kB), md5:800cd1714b455f378392eaf6ab1626a7

Additional details

Is part of: 10.5281/zenodo.283376 (DOI)

Resource type

Book chapter

Publisher

Language Science Press

Imprint

Annotation, exploitation and evaluation of parallel corpora, 35–53. Berlin.

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: March 13, 2017
Modified: August 3, 2024
