Published March 6, 2018
| Version v1.1.0
Software
Open
quanteda/quanteda: CRAN v1.1.0
Creators
- Kenneth Benoit (1)
- Kohei Watanabe (1)
- Haiyan Wang (2)
- Paul Nulty (3)
- Adam Obeng (4)
- Stefan Müller (5)
- Aki Matsuo (6)
- Benjamin Lauderdale (7)
- Will Lowe
- Pablo Barberá (7)
- Tyler Rinker (8)
- Christopher Gandrud (9)
- Christian Mueller (1)
- tpaskhalis (1)
- mark padgham (10)
- hofaichan
- hotzeplotz
- Thomas J. Leeper (1)
- Stas Malavin (11)
- Michael W. Kearney (12)
- Michael Chirico (13)
- 1. London School of Economics and Political Science
- 2. LSE
- 3. University of Cambridge
- 4. Columbia University, London School of Economics
- 5. Trinity College Dublin
- 6. Department of Methodology, London School of Economics
- 7. London School of Economics
- 8. Campus Labs
- 9. @zalando
- 10. @ATFutures
- 11. Soil Cryology Lab
- 12. @MUDSA
- 13. @myteksi
Description
New Features
- Added as.dfm() methods for tm DocumentTermMatrix and TermDocumentMatrix objects (#1222).
- predict.textmodel_wordscores() now includes an include_reftexts argument to exclude training texts from the predicted model object (#1229). The default behaviour is include_reftexts = TRUE, producing the same behaviour as existed before the introduction of this argument. This allows rescaling based on the reference documents (since rescaling requires prediction on the reference documents) but provides an easy way to exclude the reference documents from the predicted quantities.
- textplot_wordcloud() now uses code entirely internal to quanteda, instead of using the wordcloud package.
- Eliminated unnecessary dependency on the digest package.
- Updated the vignette title to be less generic.
- Improved the robustness of dfm_trim() and dfm_weight() for previously weighted dfm objects and when supplied thresholds are proportions instead of counts (#1237).
- Fixed a problem in summary.corpus(x, n = 101) when ndoc(x) > 100 (#1242).
- Fixed a problem in predict.textmodel_wordscores(x, rescaling = "mv") that always reset the reference values for rescaling to the first and second documents (#1251).
- Issues in the color generation and labels for textplot_keyness() are now resolved (#1233).
- textmodel methods are now exported, to facilitate extension packages for other textmodel methods (e.g. wordshoal).
- Changed the default in textmodel_wordfish() to sparse = FALSE, in response to #1216.
- dfm_group() now preserves docvars that are constant for the group aggregation (#1228).
Files
Name | Size |
---|---|
quanteda/quanteda-v1.1.0.zip (md5:0879745f05b90396ed4abf220bab5cff) | 24.1 MB |
Additional details
Related works
- Is supplement to: https://github.com/quanteda/quanteda/tree/v1.1.0 (URL)