Mapping WordNet Instances to Wikipedia
Creators
- 1. Insight Centre for Data Analytics, National University of Ireland Galway
Description
Lexical resource differ from encyclopaedic resources and represent two distinct types of resource covering general language and named entities respectively. However, many lexical resources, including Princeton WordNet, contain many proper nouns, referring to named entities in the world yet it is not possible or desirable for a lexical resource to cover all named entities that may reasonably occur in a text. In this paper, we propose that instead of including synsets for instance concepts PWN should instead provide links to Wikipedia articles describing the concept. In order to enable this we have created a gold-quality mapping between all of the 7,742 instances in PWN and Wikipedia (where such a mapping is possible). As such, this resource aims to provide a gold standard for link discovery, while also allowing PWN to distinguish itself from other resources such as DBpedia or BabelNet. Moreover, this linking connects PWN to the Linguistic Linked Open Data cloud, thus creating a richer, more usable resource for natural language processing.
Files
mccrae2018mapping.pdf
Files
(296.7 kB)
Name | Size | Download all |
---|---|---|
md5:e554994ba7c695e49492b4eeb58232da
|
296.7 kB | Preview Download |