Software Open Access

# digitallinguistics/tags2dlx: v0.3.0

Daniel W. Hieber

This is a breaking release that changes how utterances are determined in a text. Utterances are now delimited by newlines rather than by punctuation. The change was motivated by the fact that some portions of major corpora (such as the Open American National Corpus) do not include punctuation.

The `utteranceSeparators` option has been removed. The `punctuation` option has been updated so that its default list now also includes the punctuation typically placed at the end of a sentence or utterance.
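
The newline-based behavior described in these notes can be illustrated with a minimal sketch. It is written in Python for illustration only and is not the library's code: the function name `split_utterances`, the whitespace tokenization, and the contents of `PUNCTUATION` are all assumptions, and the actual default punctuation list is defined by tags2dlx itself.

```python
# Hypothetical punctuation list for illustration; the real default
# list is defined by tags2dlx and may differ.
PUNCTUATION = set(".,;:!?")


def split_utterances(text, punctuation=PUNCTUATION):
    """Split a text into utterances, one utterance per non-empty line.

    Each utterance is returned as a list of whitespace-separated tokens.
    Tokens made up entirely of punctuation characters are dropped, so
    punctuation no longer plays any role in finding utterance boundaries.
    """
    utterances = []
    for line in text.splitlines():
        tokens = [
            token
            for token in line.split()
            if not all(ch in punctuation for ch in token)
        ]
        if tokens:  # skip blank lines and punctuation-only lines
            utterances.append(tokens)
    return utterances


text = "so I went home\nand then what happened ?\n\nnothing ."
for utterance in split_utterances(text):
    print(utterance)
```

Under these assumptions, a text with no punctuation at all is still segmented correctly, since only the line breaks determine where one utterance ends and the next begins.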

Files (28.8 kB)

Name: digitallinguistics/tags2dlx-v0.3.0.zip
Size: 28.8 kB
md5: 57d850c5515989c7e59347d3ecbe8c1c