Lists of stopwords, polarity shifters and AnAwords of Bosnian language
Description
The dataset comprises three lists, a list of stopwords, a list of polarity shifters and a list of AnAwords (in two files) of the Bosnian language.
Stopwords refer to a set of words contained in a stop list that are deliberately filtered out or "stopped" during the processing of natural language data, specifically text. These words are typically common and frequently occurring words in a language that are considered to have little or no significance in determining the meaning or context of a text.
AnAwords (intensifiers and diminishers) refer to a set of words primarily functioning as intensifiers and diminishers, often manifesting as adverbs of manner and adjectives. The compilation of AnAwords is based on categorization, which includes six sublists: maximizers, boosters, approximators, relative intensifiers, diminishers, and minimizers. The list is split into two parts (intensifiers and diminishers) in two separate files.
Polarity shifters are words that can affect the polarity of a phrase, inverting or weakening it. When these words are content words, such as verbs, nouns, and adjectives, we refer to them as polarity shifters.