Lists of stopwords and AnAwords of Bosnian language
Description
The dataset comprises two lists, a list of stopwords and a list of AnAwords of the Bosnian language.
Stopwords refer to a set of words contained in a stop list that are deliberately filtered out or "stopped" during the processing of natural language data, specifically text. These words are typically common and frequently occurring words in a language that are considered to have little or no significance in determining the meaning or context of a text.
AnAwords refer to a set of words primarily functioning as intensifiers and diminishers, often manifesting as adverbs of manner and adjectives. The compilation of AnAwords is based on categorization, which includes six sublists: maximizers, boosters, approximators, relative intensifiers, diminishers, and minimizers.
Files
BOSNIAN_AnAwords_2023.txt
Files
(3.9 kB)
Name | Size | Download all |
---|---|---|
md5:082b26ec0ace5753c19e68a9bda231b6
|
1.3 kB | Preview Download |
md5:d7d6bae65540d392418894b56a88b8e6
|
2.6 kB | Preview Download |