There is a newer version of the record available.

Published October 12, 2021 | Version v0.9.2
Software Open

MaartenGr/BERTopic: v0.9.2

  • 1. Van Spaendonck
  • 2. Huggingface

Description

A release focused on algorithmic optimization and fixing several issues:

Highlights:

  • Update the non-multilingual paraphrase- models to the all- models due to improved performance
  • Reduce necessary RAM in c-TF-IDF top 30 word extraction

Fixes:

  • Fix topic mapping
    • When reducing the number of topics, these need to be mapped to the correct input/output which had some issues in the previous version
    • A new class was created as a way to track these mappings regardless of how many times they were executed
    • In other words, you can iteratively reduce the number of topics after training the model without the need to continuously train the model
  • Fix typo in embeddings page (#200)
  • Fix link in README (#233)
  • Fix documentation .visualize_term_rank() (#253)
  • Fix getting correct representative docs (#258)
  • Update memory FAQ with HDBSCAN pr

Files

MaartenGr/BERTopic-v0.9.2.zip

Files (6.2 MB)

Name Size Download all
md5:02decd55dcb9052d6b58c2313b151f03
6.2 MB Preview Download

Additional details

Related works