The Corpus Workbench (CWB) is a classic indexing and query engine for working efficiently with large, linguistically annotated corpora. The cwbtools package offers a set of tools to conveniently create, modify and manage CWB-indexed corpora from within R. It complements R packages that use the CWB as a backend for text mining, namely the RcppCWB package for low-level access to CWB-indexed corpora, and polmineR as a toolset for implementing common text mining workflows.
The package is available via CRAN and can be installed as follows on Windows, macOS and Linux.
install.packages("cwbtools")
To install the development version of the package from GitHub, use the installation mechanism offered by the devtools package. On Windows, an installation of Rtools may be necessary.
# Install devtools first if it is not yet available
if (!requireNamespace("devtools", quietly = TRUE)) install.packages("devtools")
devtools::install_github("PolMine/cwbtools")
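After either installation route, it can be useful to confirm that the package is actually found in the library path and to see which version was installed. The following is a minimal sketch using base R only; the helper name `installed_version` is hypothetical and not part of cwbtools.

```r
# Hypothetical helper (not part of cwbtools): return the installed version
# of a package as a string, or NA if the package cannot be loaded.
installed_version <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    return(NA_character_)
  }
  as.character(utils::packageVersion(pkg))
}

# Report the cwbtools version, or NA if the installation did not succeed
installed_version("cwbtools")
```

An `NA` result suggests that the installation did not complete or that the package was installed into a library that is not on the current `.libPaths()`.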