Published May 15, 2020 | Version v1
Conference paper Open

Defying Wikidata: Validation of Terminological Relations in the Web of Data

  • 1. Ontology Engineering Group
  • 2. Insight Center for Data Analytics

Description

In this paper we present an approach to validate terminological data retrieved from open encyclopaedic knowledge bases. This need arises from the enrichment of automatically extracted terms with information from existing resources in the Linguistic Linked Open Data cloud. Specifically, the resource employed for this enrichment is WIKIDATA, since it is one of the biggest knowledge bases freely available within the Semantic Web. During the experiment, we noticed that certain RDF properties in the Knowledge Base did not contain the data they are intended to represent, but a different type of information. In this paper we propose an approach to validate the retrieved data based on four axioms that rely on two linguistic theories: the x-bar theory and the multidimensional theory of terminology. The validation process is supported by a second knowledge base specialised in linguistic data; in this case, CONCEPTNET. In our experiment, we validate terms from the legal domain in four languages: Dutch, English, German and Spanish. The final aim is to generate a set of sound and reliable terminological resources in RDF to contribute to the population of the Linguistic Linked Open Data cloud.

Files

2020.lrec-1.694.pdf

Files (342.8 kB)

Name Size Download all
md5:3b63eb98001993ed2cb33c1c44052aec
342.8 kB Preview Download

Additional details

Funding

European Commission
Pret-a-LLOD – Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors 825182
European Commission
Lynx – Building the Legal Knowledge Graph for Smart Compliance Services in Multilingual Europe 780602