Published July 8, 2023 | Version v1
Conference paper Open

Generating Polarity Lexicons with WordNet Propagation in 5 Languages

  • 1. VU University Amsterdam
  • 2. Istituto di Linguistica Computazionale, CNR, Pisa
  • 3. IXA NLP Group, The University of the Basque Country, San Sebastian
  • 4. Vicomtech-IK4, San Sebastian

Description

In this paper we focus on the creation of general-purpose (as opposed to domain-specific) polarity lexicons in five languages: French, Italian, Dutch, English and Spanish using WordNet propagation. WordNet propagation is a commonly used method to generate these lexicons as it gives high coverage of general purpose language and the semantically rich WordNets where concepts are organized in synonym , antonym and hyperonym/hyponym structures seem to be well suited to the identification of positive and negative words. However, WordNets of different languages may vary in many ways such as the way they are compiled, and their numbers of synsets, synonyms and relations. We investigate whether this variability translates into differences of performance when these WordNets are used for polarity propagation. Although many variants of the propagation method are developed for English, little is known about how they perform with WordNets of other languages. We implemented a propagation algorithm and designed a method to obtain seed lists similar with respect to quality and size, for each of the five languages. We evaluated the results against gold standards also developed according to a common method in order to achieve as less variance as possible between the different languages.

Files

847_Paper.pdf

Files (120.3 kB)

Name Size Download all
md5:a2f257e266928921eae0d44d76729b0a
120.3 kB Preview Download

Additional details

Funding

European Commission
OPENER - Open Polarity Enhanced Named Entity Recognition 296451