ndoc.RdGet the number of documents or features in an object.
ndoc(x) nfeat(x) nfeature(x)
| x | a quanteda object: a corpus, dfm, or tokens object, or a readtext object from the readtext package. |
|---|
an integer (count) of the number of documents or features
ndoc returns the number of documents in an object
whose texts are organized as "documents" (a corpus,
dfm, or tokens object, a readtext object from the
readtext package).
nfeat returns the number of features from a dfm; it is an
alias for ntype when applied to dfm objects. This function is only
defined for dfm objects because only these have "features". (To count
tokens, see ntoken.)
nfeature is the deprecated form of nfeat.
# number of documents ndoc(data_corpus_inaugural)#> [1] 58#> [1] 10#> [1] 58#> [1] 10# number of features nfeat(dfm(corpus_subset(data_corpus_inaugural, Year > 1980), remove_punct = FALSE))#> [1] 3260#> [1] 3247