sentiwordnet_it 1.0
Description
This repository contains a sentiment lexicon for Italian, based on SentiWordNet 3.0 (Baccianella, Esuli, and Sebastiani 2010; Esuli [2019] 2025) and MultiWordNet (Pianta, Bentivogli, and Girardi 2002).
Unlike previous resources—SentiWordNet, which provides sentiment scores without Italian lexical coverage, and MultiWordNet, which offers Italian synsets without sentiment annotation—this dataset bridges the two by mapping Italian lexical entries to sentiment scores in a ready-to-use CSV format.
This integration enables direct use in sentiment analysis and other NLP applications for Italian, filling a gap in existing resources.
The included files, in the data/ folder are:
swn_it.csv: A dataset of 35,001 Italian synsets with polarity scores, POS, synset, offset, English synset lemmas, and gloss (in English).swn_it_tidy.csv: A tidy (one token per row) dataset of 41,725 lemmas, with polarity scores. It is designed for use in R.
It also contains a folder with examples in R, and scripts to use and manipulate the datasets:
examples-R/:custom_dataset.R: Create a custom tidy dataset from the original one, for treating duplicate entries differently.example.R: Examples of how to use the dataset for sentiment analysis on a sample text.uso.md: Instructions for using the dataset in R (in Italian), referred to inexample.R.
Files
agnesevardanega/sentiwordnet_it-v1.0.1.zip
Files
(2.0 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:05dbc36bc0306b20cf0a63ad34f96260
|
2.0 MB | Preview Download |
Additional details
Related works
- Is derived from
- Dataset: https://github.com/aesuli/SentiWordNet (URL)
- Dataset: https://nlplab.fbk.eu/tools-and-resources/lexical-resources-and-corpora/multiwordnet (URL)
- Is source of
- Dataset: 10.5281/zenodo.15609215 (DOI)
- Is supplement to
- Software: https://github.com/agnesevardanega/sentiwordnet_it/tree/v1.0.1 (URL)
Software
- Repository URL
- https://github.com/agnesevardanega/sentiwordnet_it
References
- Baccianella, Stefano, Andrea Esuli, and Fabrizio Sebastiani. 2010. "Sentiwordnet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining." In Lrec, 10:2200–2204. 2010. Valletta.
- Basile, Valerio, and Malvina Nissim. 2013. "Sentiment Analysis on Italian Tweets." In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, 100–107.
- Bond, Francis, Michael Wayne Goodman, Ewa Rudnicka, Luis Morgado da Costa, Alexandre Rademaker, and John P. McCrae. 2023. "Documenting the Open Multilingual Wordnet." In Proceedings of the 12th Global Wordnet Conference, edited by German Rigau, Francis Bond, and Alexandre Rademaker, 150–57. University of the Basque Country, Donostia - San Sebastian, Basque Country: Global Wordnet Association.
- Denecke, Kerstin. 2008. "Using Sentiwordnet for Multilingual Sentiment Analysis." In 2008 IEEE 24th International Conference on Data Engineering Workshop, 507–12. IEEE.
- Esuli, Andrea. (2019) 2025. "Aesuli/SentiWordNet." https://github.com/aesuli/SentiWordNet.
- Pianta, Emanuele, Luisa Bentivogli, and Christian Girardi. 2002. "MultiWordNet: Developing an Aligned Multilingual Database." In First International Conference on Global WordNet, 293–302.