trinker/sentimentr: version 0.4.0
Description
Releases will be numbered with the following semantic versioning format:
<major>.<minor>.<patch>
And constructed with the following guidelines:
- Breaking backward compatibility bumps the major (and resets the minor and patch)
- New additions without breaking backward compatibility bumps the minor (and resets the patch)
- Bug fixes and misc changes bumps the patch
BUG FIXES
- Missing documentation for `but' conjunctions added to the documentation.
Spotted by Richard Watson (see #23).
NEW FEATURES
extract_sentiment_termsadded to enable users to extract the sentiment terms from text aspolaritywould return in the qdap package.
MINOR FEATURES
update_polarity_tableandupdate_valence_shifter_tableadded to abstract away thinking about thecomparisonargument toupdate_key.
IMPROVEMENTS
CHANGES
sentimentr 0.2.0 - 0.2.3BUG FIXES
Commas were not handled properly in some cases. This has been fixed (see #7).
highlightparsed sentences differently than the mainsentimentfunction resulting in an error whenoriginal.textwas supplied that contained a colon or semi-colon. Spotted by Patrick Carlson (see #2).
MINOR FEATURES
as_keyandupdate_keynow coerce the first column of thexargument data.frame to lower case and warn if capital letters are found.
IMPROVEMENTS
A section on creating and updating dictionaries was added to the README: https://github.com/trinker/sentimentr#making-and-updating-dictionaries
plot.sentiment_byno longer color codes by grouping variables. This was distracting and removed. A jitter + red average sentiment + boxplot visual representation is used.
CHANGES
- Default sentiment and valence shifters get the following additions:
polarity_table: "excessively", 'overly', 'unduly', 'too much', 'too many', 'too often', 'i wish', 'too good', 'too high', 'too tough'valence_shifter_table: "especially"
BUG FIXES
get_sentencesconverted to lower case too early in the regex parsing, resulting in missed sentence boundary detection. This has been corrected.highlightfailed for some occasions when usingoriginal.textbecause the splitting algorithm forsentimentwas different.sentiment's split algorithm now matches and is more accurate but at the cost of speed.
NEW FEATURES
emoticonsdictionary added. This is a simple dataset containing common emoticons (adapted from Popular Emoticon List)replace_emoticonfunction added to replace emoticons with word equivalents.get_sentences2added to allow for users that may want to get sentences from text and retain case and non-sentence boundary periods. This should be preferable in such instances where these features are deemed important to the analysis at hand.highlightadded to allow positive/negative text highlighting.cannon_reviewsdata set added containing Amazon product reviews for the Cannon G3 Camera compiled by Hu and Liu (2004).replace_ratingsfunction +ratingsdata set added to replace ratings.polarity_tablegets an upgrade with new positive and negative words to improve accuracy.valence_shifters_tablepicks up a few non-traditional negators. Full list includes: "could have", "would have", "should have", "would be", "would suggest", "strongly suggest".is_keyandupdate_keyadded to test and easily update keys.gradesdictionary added. This is a simple dataset containing common grades and word equivalents.replace_gradefunction added to replace grades with word equivalents.
IMPROVEMENTS
plot.sentimentnow uses...to pass parameters to syuzhet'sget_transformed_values.as_key,is_key, &update_keyall pick up a logicalsentimentargument that allows keys that have character y columns (2nd column).
This package is designed to quickly calculate text polarity sentiment at the sentence level and optionally aggregate by rows or grouping variable(s).
Files
trinker/sentimentr-v0.4.0.zip
Files
(1.5 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:f8f21a2ad1641e457dd2fcd75527ea61
|
1.5 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/trinker/sentimentr/tree/v0.4.0 (URL)