Published August 2, 2019 | Version v1
Conference paper Open

Identification of Adjective-Noun Neologisms using Pretrained Language Models

  • 1. National University of Ireland Galway

Description

Neologism detection is a key task in the constructing of lexical resources and has wider implications for NLP, however the identification of multiword neologisms has received little attention. In this paper, we show that we can effectively identify the distinction between compositional and non-compositional adjective-noun pairs by using pretrained language models and comparing this with individual word embeddings. Our results show that the use of these models significantly improves over baseline linguistic features, however the combination with linguistic features still further improves the results, suggesting the strength of a hybrid approach.

Files

mccrae2019identification.pdf

Files (267.2 kB)

Name Size Download all
md5:6fad29a2a1985c6f7a247265cc01f02d
267.2 kB Preview Download

Additional details

Funding

ELEXIS – European Lexicographic Infrastructure 731015
European Commission