Software Open Access
Daniel W. Hieber
This is a breaking release which changes the method for determining utterances in a text. Utterances are now determined based on newlines rather than punctuation. This was motivated by the fact that some portions of major corpora (such as the Open American National Corpus) do not include punctuation.
utteranceSeparators option has been removed, and the
punctuation option has been updated so that the default list of punctuation now includes punctuation typically placed at the end of a sentence/utterance.