Published June 18, 2018 | Version v1
Book chapter Open

Multilingual knowledge in aligned Wiktionary and OmegaWiki for translation applications

Description

Multilingual lexical-semantic resources play an important role in translation applications. However, multilingual resources with sufficient quality and coverage are rare as the effort of manually constructing such a resource is substantial. In recent years, the emergence of Web 2.0 has opened new possibilities for constructing large-scale lexical-semantic resources. We identified Wiktionary and OmegaWiki as two important multilingual initiatives where a community of users (“crowd”) collaboratively edits and refines the lexical information. They seem especially appropriate in the multilingual domain as users from all languages and cultures can easily contribute. However, despite their advantages such as open access and coverage of multiple languages, these resources have hardly been systematically investigated and utilized until now. Therefore, the goals of our contribution are threefold: (1) We analyze how these resources emerged and characterize their content and structure; (2) We propose an alignment at the word sense level to exploit the complementary information contained in both resources for increased coverage; (3) We describe a mapping of the resources to a standardized, unified model (UBY-LMF), thus creating a large freely available multilingual resource designed for easy integration into applications such as machine translation or computer-aided translation environments.

 

Files

9.pdf (2.7 MB)
md5:f70fe20a409cc29bb086dd4573f13c31