There is a newer version of this record available.

Software Open Access

quanteda/quanteda: CRAN v3.2.0

Kenneth Benoit; Kohei Watanabe; Haiyan Wang; Paul Nulty; Adam Obeng; Stefan Müller; Jiong Wei Lua; Aki Matsuo; Christian Mueller; José Tomás Atria; Will Lowe; Pablo Barberá; Christopher Gandrud; mark padgham; Tyler Rinker; Johannes Gruber; Katrin Leinweber; Kevin Reuning; Michael Chirico; Michael W. Kearney; Stas Malavin; Thomas J. Leeper; hotzeplotz; Chung-hong Chan; etienne-s; hofaichan; lindbrook; mmzmm; nicmer; Tom Paskhalis

Bug fixes and stability enhancements
  • dfm() returns a dfm with the identical column order even if tokens_compound() or tokens_ngrams() is used in the upstream (#2100).
  • dfm_group() with NA values in a grouping variable now drops those, similar to the behaviour of tokens_group() and corpus_group() (#2134).
Changes and additions
  • char_wordstem() now has a a new argument check_whitespace, which will not throw an error when lower-casing text containing a whitespace character.
  • dfm_remove() now has a new argument padding = FALSE that when TRUE, collects counts of the removed features in the first column. This produces results consistent with what is compiled as a dfm built from tokens where some have been removed with padding = TRUE (#2152).
Files (37.6 MB)
Name Size
37.6 MB Download
All versions This version
Views 3,05086
Downloads 3102
Data volume 9.1 GB75.2 MB
Unique views 2,88381
Unique downloads 1952


Cite as