compute_lexdiv_stats.Rd
Internal functions used in textstat_lexdiv
, for computing
lexical diversity measures on dfms or tokens objects
compute_lexdiv_dfm_stats(x, measure = NULL, log.base = 10) compute_lexdiv_tokens_stats(x, measure = c("MATTR", "MSTTR"), MATTR_window, MSTTR_segment)
x | a dfm object |
---|---|
measure | a list of lexical diversity measures. |
log.base | a numeric value defining the base of the logarithm (for measures using logs) |
MATTR_window | a numeric value defining the size of the moving window for computation of the Moving-Average Type-Token Ratio (Covington & McFall, 2010) |
MSTTR_segment | a numeric value defining the size of the each segment for the computation of the the Mean Segmental Type-Token Ratio (Johnson, 1944) |
a data.frame
with a document
column containing the
input document name, followed by columns with the lexical diversity
statistic, in the order in which they were supplied as the measure
argument.
compute_lexdiv_dfm_stats
in an internal function that
computes the lexical diversity measures from a dfm input.
compute_lexdiv_tokens_stats
in an internal function that
computes the lexical diversity measures from a dfm input.