registry_file.Rd
A set of functions to parse, create and write registry files.
registry_file_parse(corpus, registry_dir = Sys.getenv("CORPUS_REGISTRY")) registry_file_compose(x) registry_data(name, id, home, info = file.path(home, ".info", fsep = "/"), properties = c(charset = "utf-8"), p_attributes, s_attributes = character()) registry_file_write(data, corpus, registry_dir = Sys.getenv("CORPUS_REGISTRY"), ...)
corpus | A CWB corpus indicated by a length-one |
---|---|
registry_dir | Directory with registry files. |
x | An object of class |
name | Long descriptive name of corpus ( |
id | Short name of corpus ( |
home | Path with data directory for indexed corpus. |
info | A |
properties | Named |
p_attributes | A |
s_attributes | A |
data | A |
... | further parameters |
registry_file_parse
will return an object of class registry_data
.
See the appendix to the 'Corpus Encoding Tutorial' (http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf), which includes an explanation of the registry file format.
registry_file_compose
will turn an
registry_data
-object into a character vector with a registry file
that can be written to disk.
registry_file_write
will compose a registry file from
data
and write it to disk.
regdata <- registry_file_parse( corpus = "REUTERS", registry_dir = system.file(package = "RcppCWB", "extdata", "cwb", "registry") )