Published August 30, 2022 | Version v0.3.8
Software Open

cwbtools

  • 1. University of Duisburg-Essen

Description

The 'Corpus Workbench' ('CWB', <https://cwb.sourceforge.io/>) offers a classic and mature approach for working with large, linguistically and structurally annotated corpora. The 'CWB' is memory efficient and its design makes running queries fast, see Evert (2011) <https://eprints.lancs.ac.uk/id/eprint/62721>. The 'cwbtools' package offers pure 'R' tools to create indexed corpus files as well as high-level wrappers for the original 'C' implementation of 'CWB' as exposed by the 'RcppCWB' package (<https://CRAN.R-project.org/package=RcppCWB>). Additional functionality to add and modify annotations of corpora from within 'R' makes working with 'CWB' indexed corpora much more flexible and convenient. The 'cwbtools' package in combination with the 'R' packages 'RcppCWB' (<https://CRAN.R-project.org/package=RcppCWB>) and 'polmineR' (<https://CRAN.R-project.org/package=polmineR>) offers a lightweight infrastructure to support the combination of quantitative and qualitative approaches for working with textual data.

Files

cwbtools_0.3.8.zip

Files (1.3 MB)

Name Size Download all
md5:bda8c65565e9214507e16c1ad8ddbac9
323.0 kB Download
md5:e679ecf080058e6cf7613483caf8ef3a
502.9 kB Download
md5:1692c3e0f3c42351481099c8101510e5
517.8 kB Preview Download