There is a newer version of the record available.

Published December 19, 2020 | Version v0.10.0
Software Open

bab2min/tomotopy: 0.10.0

  • 1. Kakao Enterprise

Description

  • The interfaces of tomotopy.utils.Corpus and tomotopy.LDAModel.docs were unified. Documents in a corpus can now be accessed in the same manner as documents in a model.
  • __getitem__ of tomotopy.utils.Corpus was improved. In addition to indexing by int, indexing by Iterable[int] and slicing are now supported, as is indexing by uid.
  • New methods tomotopy.utils.Corpus.extract_ngrams and tomotopy.utils.Corpus.concat_ngrams were added. They extract n-gram collocations using PMI and concatenate them into single words, respectively.
  • A new method tomotopy.LDAModel.add_corpus was added, and tomotopy.LDAModel.infer can now receive a corpus as input.
  • A new module tomotopy.coherence was added. It provides a way to calculate the coherence of a model.
  • A parameter window_size was added to tomotopy.label.FoRelevance.
  • An issue where NaN often occurred when training tomotopy.HDPModel was fixed.
  • Python 3.9 is now supported.
  • The dependency on py-cpuinfo was removed and module initialization was improved.
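
To illustrate the idea behind the n-gram methods mentioned above, here is a minimal pure-Python sketch of PMI-based collocation extraction and concatenation, restricted to bigrams. It is not tomotopy's implementation and does not use its API: the helper names extract_bigrams and concat_bigrams, their parameters, and the toy documents are all assumptions made for illustration.

```python
import math
from collections import Counter


def extract_bigrams(docs, min_cf=2, min_score=0.0):
    """Score adjacent word pairs by PMI (hypothetical helper, not tomotopy code)."""
    unigrams = Counter(w for doc in docs for w in doc)
    bigrams = Counter(pair for doc in docs for pair in zip(doc, doc[1:]))
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    scored = {}
    for (a, b), cf in bigrams.items():
        if cf < min_cf:
            continue  # skip pairs that are too rare to trust
        # PMI = log( p(a, b) / (p(a) * p(b)) )
        p_ab = cf / n_bi
        p_a = unigrams[a] / n_uni
        p_b = unigrams[b] / n_uni
        pmi = math.log(p_ab / (p_a * p_b))
        if pmi >= min_score:
            scored[(a, b)] = pmi
    return scored


def concat_bigrams(docs, cands, delimiter="_"):
    """Merge each candidate pair into a single token, scanning left to right."""
    merged = []
    for doc in docs:
        out, i = [], 0
        while i < len(doc):
            if i + 1 < len(doc) and (doc[i], doc[i + 1]) in cands:
                out.append(doc[i] + delimiter + doc[i + 1])
                i += 2  # consume both words of the collocation
            else:
                out.append(doc[i])
                i += 1
        merged.append(out)
    return merged


# Toy corpus (assumed data, for illustration only)
docs = [
    "new york is a big city".split(),
    "i moved to new york last year".split(),
    "a big dog ran to a big park".split(),
]
cands = extract_bigrams(docs, min_cf=2)
merged = concat_bigrams(docs, cands)
print(merged[0])
```

In this toy corpus only the pairs that occur at least twice survive the min_cf filter, so "new york" and "a big" are merged into single tokens. A real implementation would typically handle longer n-grams and larger vocabularies more carefully.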

Files

bab2min/tomotopy-v0.10.0.zip (1.1 MB)
md5:6cce150fd49ff9444f0b7578c87dfd04
