There is a newer version of the record available.

Published February 17, 2021 | Version v0.3.2
Dataset Open

cwbtools

  • 1. University of Duisburg-Essen

Description

The 'Corpus Workbench' ('CWB', <http://cwb.sourceforge.net/>) offers a classic and mature approach for working with large, linguistically and structurally annotated corpora. The 'CWB' is memory efficient and its design makes running queries fast (Evert and Hardie 2011, <http://www.stefan-evert.de/PUB/EvertHardie2011.pdf>). The 'cwbtools' package offers pure R tools to create indexed corpus files as well as high-level wrappers for the original C implementation of CWB as exposed by the 'RcppCWB' package <https://CRAN.R-project.org/package=RcppCWB>. Additional functionality to add and modify annotations of corpora from within R makes working with CWB indexed corpora much more flexible and convenient. The 'cwbtools' package in combination with the R packages 'RcppCWB' (<https://CRAN.R-project.org/package=RcppCWB>) and 'polmineR' (<https://CRAN.R-project.org/package=polmineR>) offers a lightweight infrastructure to support the combination of quantitative and qualitative approaches for working with textual data.

Files

cwbtools_0.3.2.zip

Files (1.3 MB)

Name Size Download all
md5:917cf8a745963c0a7f2bb4b719925d29
335.2 kB Download
md5:8a700e386ae2d9e07e9ddaa68231cd1c
489.9 kB Download
md5:f03e54a7828c1f35e178ded5ee81ddda
511.2 kB Preview Download