Published May 20, 2019 | Version v1
Conference paper Open

Towards the Detection and Formal Representation of Semantic Shifts in Inflectional Morphology

  • 1. DFKI GmbH
  • 2. University of Vienna

Description

Semantic shifts caused by derivational morphemes is a common subject of investigation in language modeling, while inflectional morphemes are frequently portrayed as semantically more stable. This study is motivated by the previously established observation that inflectional morphemes can be just as variable as derivational ones. For instance, the English plural “-s” can turn the fabric silk into the garments of a jockey, silks. While humans know that silk in this sense has no plural, it takesmore for machines to arrive at this conclusion. Frequently utilized computational language resources, such as WordNet, or models for representing computational lexicons, like OntoLex-Lemon, have no descriptive mechanism to represent such inflectional semantic shifts. To investigate this phenomenon, we extract word pairs of different grammatical number from WordNet that feature additional senses in the plural and evaluate their distribution in vector space, i.e., pre-trained word2vec and fastText embeddings. We then propose an extension of OntoLex-Lemon to accommodate this phenomenon that we call inflectional morpho-semantic variation to provide a formal representation accessible to algorithms, neural networks, and agents. While the exact scope of the problem is yet to be determined, this first dataset shows that it is not negligible.

Notes

European Union Grant Number 825182, Prêt-à-LLOD

Files

OASIcs-LDK-2019-21.pdf

Files (446.6 kB)

Name Size Download all
md5:e8d9359c9006ab5832f74d26feda8f49
446.6 kB Preview Download

Additional details

Funding

European Commission
ELEXIS - European Lexicographic Infrastructure 731015
European Commission
Pret-a-LLOD - Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors 825182