Published May 21, 2025 | Version v4.3.0
Software Open

quanteda/quanteda: CRAN v4.3.0

  • 1. London School of Economics and Political Science
  • 2. Tracr
  • 3. Birkbeck, University of London
  • 4. Columbia University, London School of Economics
  • 5. University College Dublin
  • 6. MIT
  • 7. Department of Government, University of Essex
  • 8. Hertie School
  • 9. University of Southern California
  • 10. @spotify
  • 11. @rOpenSci
  • 12. Kangarootime
  • 13. @gesistsa
  • 14. @uc3m @IBiDat
  • 15. @wildlifeevoeco
  • 16. @gitlabhq
  • 17. National Institutes of Health
  • 18. Meijer
  • 19. Israel Oceanographic and Limnological Research

Description

Changes and additions

  • Added corpus_chunk() for chunking texts into smaller documents.

  • Significantly reduced the memory usage of c() on large tokens and tokens_xptr objects.

  • Further improved the verbose messages for corpus, tokens, dfm and fcm objects.

  • tokens_ngrams() now has a new argument apply_if, which functions similarly to the same argument in tokens_compound() and tokens_lookup() (#2390).

  • Replaced remove_unigram with match_pattern in object2id() to control the matching of single-word or multi-word patterns.

  • Updated data_corpus_inaugural to include Trump's 2025 inaugural address.

Files

quanteda/quanteda-v4.3.0.zip (32.4 MB)
md5:26702a08bd1e9d29d218814d65fd904e

Additional details

Related works

Software