There is a newer version of the record available.

Published February 23, 2021 | Version v0.3.3
Software Open

cwbtools

  • 1. University of Duisburg-Essen

Description

The 'Corpus Workbench' ('CWB', <http://cwb.sourceforge.net/>) offers a classic and mature approach for working with large, linguistically and structurally annotated corpora. The 'CWB' is memory-efficient, and its design makes running queries fast (Evert and Hardie 2011, <http://www.stefan-evert.de/PUB/EvertHardie2011.pdf>). The 'cwbtools' package offers pure R tools to create indexed corpus files, as well as high-level wrappers for the original C implementation of the CWB as exposed by the 'RcppCWB' package (<https://CRAN.R-project.org/package=RcppCWB>). Additional functionality to add and modify corpus annotations from within R makes working with CWB-indexed corpora more flexible and convenient. Combined with the R packages 'RcppCWB' and 'polmineR' (<https://CRAN.R-project.org/package=polmineR>), 'cwbtools' offers a lightweight infrastructure for combining quantitative and qualitative approaches to working with textual data.

Files

cwbtools_0.3.3.zip

Files (1.3 MB)

Checksum                                Size
md5:4479169355a9696ea413af835eb19915    326.0 kB
md5:b52a31b2cdec1783a0d004023ff8ec4b    483.2 kB
md5:b772e8845c4c8f08e7063745a7113479    483.1 kB