registry_file.RdA set of functions to parse, create and write registry files.
registry_file_parse(corpus, registry_dir = Sys.getenv("CORPUS_REGISTRY")) registry_file_compose(x) registry_data(name, id, home, info = file.path(home, ".info", fsep = "/"), properties = c(charset = "utf-8"), p_attributes, s_attributes = character()) registry_file_write(data, corpus, registry_dir = Sys.getenv("CORPUS_REGISTRY"), ...)
| corpus | A CWB corpus indicated by a length-one |
|---|---|
| registry_dir | Directory with registry files. |
| x | An object of class |
| name | Long descriptive name of corpus ( |
| id | Short name of corpus ( |
| home | Path with data directory for indexed corpus. |
| info | A |
| properties | Named |
| p_attributes | A |
| s_attributes | A |
| data | A |
| ... | further parameters |
registry_file_parse will return an object of class registry_data.
See the appendix to the 'Corpus Encoding Tutorial' (http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf), which includes an explanation of the registry file format.
registry_file_compose will turn an
registry_data-object into a character vector with a registry file
that can be written to disk.
registry_file_write will compose a registry file from
data and write it to disk.
regdata <- registry_file_parse( corpus = "REUTERS", registry_dir = system.file(package = "RcppCWB", "extdata", "cwb", "registry") )