CWB
A specialised hashtable for computing frequency distributions over tuples of lexicon IDs.
Data Fields
int | N |
number of keys, including constraint-only keys
Attribute * | att [MAX_N] |
list of the N attributes corresponding to the keys of the hash
int | offset [MAX_N] |
list of optional corpus position offsets
int | max_offset |
largest offset of all keys (to avoid scanning past end of corpus)
int | is_structural [MAX_N] |
list of flags identifying s-attributes (all others are p-attributes)
CL_Regex | regex [MAX_N] |
optional regex constraint (compiled regular expression)
int | is_negated [MAX_N] |
whether regex constraint is negated (!=)
Attribute * | frequency_values |
int * | frequency |
pre-computed integer values for the attribute keys
int * | id_list [MAX_N] |
optional regex constraint (stored as a list of matching lexicon IDs)
int | id_list_size [MAX_N] |
size of this list
int | current_struc [MAX_N] |
number of current or next structure
int | start_cpos [MAX_N] |
start of this structure (cpos)
int | end_cpos [MAX_N] |
end of this structure (cpos)
int | constraint_ok [MAX_N] |
whether constraint is satisfied (initialised at start_cpos, reset at end_cpos)
int | virtual_id [MAX_N] |
virtual ID of a region's annotation string (constant within region)
char * | source_base [MAX_N] |
base pointers to compute virtual IDs (= offsets) from annotation strings
int | is_constraint [MAX_N] |
list of flags marking constraint keys ("?...")
int | K |
number of non-constraint keys, i.e. the actual hash table stores K-tuples
cl_ngram_hash | table |
the actual hash table, a cl_ngram_hash object
A specialised hashtable for computing frequency distributions over tuples of lexicon IDs.
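As a reading aid, the data fields listed above can be pictured as a single C struct. The following is only a sketch: the struct name scan_hash, the helper scan_hash_clear(), the value of MAX_N and the opaque stand-in typedefs are assumptions made so the block compiles on its own; the real declarations live in the CWB sources and the corpus library header.

```c
#include <stddef.h>

#define MAX_N 16  /* assumed value for illustration; the real limit is set in the CWB sources */

/* Opaque stand-ins for corpus library types, declared here only so that the
   sketch is self-contained (assumption: real code includes the CL header). */
typedef struct Attribute_ Attribute;
typedef struct CL_Regex_ *CL_Regex;
typedef struct cl_ngram_hash_ *cl_ngram_hash;

/* Hypothetical struct name; field names and types follow the field list. */
struct scan_hash {
  int N;                        /* number of keys, including constraint-only keys */
  Attribute *att[MAX_N];        /* attributes corresponding to the keys */
  int offset[MAX_N];            /* optional corpus position offsets */
  int max_offset;               /* largest offset of all keys */
  int is_structural[MAX_N];     /* flags identifying s-attributes */
  CL_Regex regex[MAX_N];        /* optional compiled regex constraint */
  int is_negated[MAX_N];        /* whether the regex constraint is negated */
  Attribute *frequency_values;
  int *frequency;               /* pre-computed integer values for the attribute keys */
  int *id_list[MAX_N];          /* regex constraint as list of matching lexicon IDs */
  int id_list_size[MAX_N];      /* size of each ID list */
  int current_struc[MAX_N];     /* number of current or next structure */
  int start_cpos[MAX_N];        /* start of this structure (cpos) */
  int end_cpos[MAX_N];          /* end of this structure (cpos) */
  int constraint_ok[MAX_N];     /* whether the constraint is satisfied */
  int virtual_id[MAX_N];        /* virtual ID of a region's annotation string */
  char *source_base[MAX_N];     /* base pointers for computing virtual IDs */
  int is_constraint[MAX_N];     /* flags marking constraint keys */
  int K;                        /* number of non-constraint keys */
  cl_ngram_hash table;          /* the actual hash table */
};

/* Hypothetical helper: reset every field to zero / NULL before keys are added. */
void scan_hash_clear(struct scan_hash *h) {
  struct scan_hash empty = {0};
  *h = empty;
}
```

All per-key arrays are indexed by the same key number, so entry i of each array describes key i.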
Attribute* att[MAX_N] |
list of the N attributes corresponding to the keys of the hash
Referenced by main(), and scancorpus_add_key().
int constraint_ok[MAX_N] |
whether constraint is satisfied (initialised at start_cpos, reset at end_cpos)
Referenced by main(), and scancorpus_add_key().
int current_struc[MAX_N] |
number of current or next structure
Referenced by main(), and scancorpus_add_key().
int end_cpos[MAX_N] |
end of this structure (cpos)
Referenced by main(), and scancorpus_add_key().
int* frequency |
pre-computed integer values for the attribute keys
Referenced by main().
int* id_list[MAX_N] |
optional regex constraint (stored as a list of matching lexicon IDs)
Referenced by main(), and scancorpus_add_key().
int id_list_size[MAX_N] |
size of this list
Referenced by main(), and scancorpus_add_key().
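A regex constraint stored as an ID list can be checked without running the regex again: look the lexicon ID up in the list and flip the result if the constraint is negated. The sketch below illustrates that idea only. It assumes the list is sorted in ascending order and that a NULL list means "no constraint"; neither assumption is stated on this page, and the function names are hypothetical.

```c
#include <stddef.h>

/* Returns 1 if `id` occurs in the ascending array `list` of length `size`. */
int id_in_list(const int *list, int size, int id) {
  int lo = 0, hi = size - 1;
  while (lo <= hi) {
    int mid = lo + (hi - lo) / 2;
    if (list[mid] == id)
      return 1;
    if (list[mid] < id)
      lo = mid + 1;
    else
      hi = mid - 1;
  }
  return 0;
}

/* Evaluates one key's constraint for a lexicon ID.
   Assumption: a NULL list means the key carries no regex constraint. */
int id_constraint_ok(const int *id_list, int id_list_size, int is_negated, int id) {
  int found;
  if (id_list == NULL)
    return 1;
  found = id_in_list(id_list, id_list_size, id);
  return is_negated ? !found : found;
}
```

For key i, a call would pass id_list[i], id_list_size[i] and is_negated[i] together with the lexicon ID found at the current corpus position.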
int is_constraint[MAX_N] |
list of flags marking constraint keys ("?...")
Referenced by main(), and scancorpus_add_key().
int is_negated[MAX_N] |
whether regex constraint is negated (!=)
Referenced by main(), and scancorpus_add_key().
int is_structural[MAX_N] |
list of flags identifying s-attributes (all others are p-attributes)
Referenced by main(), and scancorpus_add_key().
int K |
number of non-constraint keys, i.e. the actual hash table stores K-tuples
Referenced by main(), and scancorpus_add_key().
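The relation between N and K can be illustrated as follows: of the N values looked up at a corpus position, only those belonging to non-constraint keys end up in the tuple that is counted. This is a minimal sketch with a hypothetical function name, not the code used by the tool.

```c
/* Copies the IDs of non-constraint keys from `ids` (length n) into `tuple`
   and returns how many were copied, i.e. K for this set of keys. */
int build_tuple(const int *ids, const int *is_constraint, int n, int *tuple) {
  int k = 0;
  int i;
  for (i = 0; i < n; i++) {
    if (!is_constraint[i])
      tuple[k++] = ids[i];  /* constraint-only keys are checked but not stored */
  }
  return k;
}
```

With three keys of which the second is a constraint key, the function returns 2 and the tuple holds the first and third ID.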
int max_offset |
largest offset of all keys (to avoid scanning past end of corpus)
Referenced by main(), and scancorpus_add_key().
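The purpose of max_offset can be shown with a small sketch: if every key is read at cpos + offset, the scan has to stop early enough that cpos + max_offset is still a valid corpus position. The function names are hypothetical, and starting the maximum at 0 is an assumption.

```c
/* Largest offset among n keys; assumed to start from 0, so that keys with
   only zero or negative offsets do not shorten the scan. */
int compute_max_offset(const int *offset, int n) {
  int max = 0;
  int i;
  for (i = 0; i < n; i++) {
    if (offset[i] > max)
      max = offset[i];
  }
  return max;
}

/* Exclusive upper bound for the scan position: every cpos below the returned
   value satisfies cpos + max_offset < corpus_size. */
int scan_end(int corpus_size, int max_offset) {
  int end = corpus_size - max_offset;
  return end > 0 ? end : 0;
}
```

A scan loop would then run for cpos from 0 up to, but not including, scan_end(corpus_size, max_offset).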
int N |
number of keys, including constraint-only keys
Referenced by main(), and scancorpus_add_key().
int offset[MAX_N] |
list of optional corpus position offsets
Referenced by main(), and scancorpus_add_key().
CL_Regex regex[MAX_N] |
optional regex constraint (compiled regular expression)
Referenced by main(), and scancorpus_add_key().
char* source_base[MAX_N] |
base pointers to compute virtual IDs (= offsets) from annotation strings
Referenced by main(), and scancorpus_add_key().
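Since a virtual ID is described as an offset, it can be obtained by pointer subtraction from the base pointer, and the string can be recovered by adding the ID back. The sketch below assumes that all annotation strings of one attribute sit in a single contiguous, NUL-separated buffer starting at source_base; the function names are hypothetical.

```c
#include <stddef.h>

/* Virtual ID of an annotation string = its byte offset from the base pointer.
   Assumption: `annotation` points into the buffer that starts at `source_base`. */
int virtual_id_of(const char *source_base, const char *annotation) {
  ptrdiff_t diff = annotation - source_base;
  return (int)diff;
}

/* Inverse mapping: the annotation string that a virtual ID refers to. */
const char *annotation_of(const char *source_base, int virtual_id) {
  return source_base + virtual_id;
}
```

Identical strings stored at different places in the buffer would receive different virtual IDs under this scheme, so the IDs identify storage locations rather than string values.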
int start_cpos[MAX_N] |
start of this structure (cpos)
Referenced by main(), and scancorpus_add_key().
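The fields current_struc, start_cpos and end_cpos suggest a cursor that moves forward through the regions of an s-attribute while the corpus is scanned. The sketch below shows one way such a cursor could work. It assumes regions are sorted, non-overlapping and have inclusive end positions, and it takes region boundaries from plain arrays instead of corpus library calls; all names are hypothetical.

```c
/* Cursor state for one structural key (mirrors three of the per-key fields). */
struct region_cursor {
  int current_struc;  /* number of current or next structure */
  int start_cpos;     /* start of this structure (cpos) */
  int end_cpos;       /* end of this structure (cpos), inclusive */
};

/* Moves the cursor to the region containing or following `cpos`.
   Returns 1 if `cpos` lies inside that region, 0 otherwise. */
int region_advance(struct region_cursor *c, const int *starts, const int *ends,
                   int n_regions, int cpos) {
  /* skip regions that end before the current position */
  while (c->current_struc < n_regions && ends[c->current_struc] < cpos)
    c->current_struc++;
  if (c->current_struc >= n_regions)
    return 0;  /* no regions left */
  c->start_cpos = starts[c->current_struc];
  c->end_cpos = ends[c->current_struc];
  return cpos >= c->start_cpos && cpos <= c->end_cpos;
}
```

Because the cursor only moves forward, calling it with increasing cpos values visits each region once, which matches a single left-to-right pass over the corpus.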
cl_ngram_hash table |
the actual hash table, a cl_ngram_hash object
Referenced by main().
int virtual_id[MAX_N] |
virtual ID of a region's annotation string (constant within region)
Referenced by main().