Published July 22, 2024
| Version v0.16.3
Software
Open
MaartenGr/BERTopic: v0.16.3
Creators
- Maarten Grootendorst1
- Heinz-Alexander Fuetterer2
- Freddy Heppell3
- zilch42
- Anubhab Das4
- Chris Inskip
- Ian Randman
- Jakub Ciszek
- Joshua Sundance Bailey5
- azikoss
- dschwalm
- Ahmed Elashry
- Alex Gamble
- Anastasia Simonoff
- Anoop Thomas Mathew6
- Aratako
- Bob7
- Carlos Pegueros8
- Daniel Kapitan9
- Daniel van Strien10
- Danny Huang
- David Dai
- David DiCato11
- Domenic Rosati12
- Elton Liao
- Felipe Alves Siqueira13
- Franz Louis Cesista
- Gus Moir14
- James Nicholson15
- Jamie Snape
- 1. IKNL
- 2. Freie Universität Berlin
- 3. @GateNLP
- 4. Textify AI (@T3xtifyai)
- 5. @swca
- 6. @iterflow
- 7. Proton AG
- 8. @Nubank
- 9. freelance data scientist
- 10. Hugging Face
- 11. CloseFactor
- 12. @scitedotai
- 13. ICMC, University of São Paulo
- 14. Mustang Analytics
- 15. @HarvardChanSchool
Description
<h1><b>Highlights</a></b></h1>
- Simplify zero-shot topic modeling by @ianrandman in #2060
- Option to choose between c-TF-IDF and Topic Embeddings in many functions by @azikoss in #1894
- Use the
use_ctfidf
parameter in the following function to choose between c-TF-IDF and topic embeddings:hierarchical_topics
,reduce_topics
,visualize_hierarchy
,visualize_heatmap
,visualize_topics
- Use the
- Linting with Ruff by @afuetterer in #2033
- Switch from setup.py to pyproject.toml by @afuetterer in #1978
- In multi-aspect context, allow Main model to be chained by @ddicato in #2002
<h1><b>Fixes</a></b></h1>
- Added templates for issues and pull requests
- Update River documentation example by @Proteusiq in #2004
- Fix PartOfSpeech reproducibility by @Greenpp in #1996
- Fix PartOfSpeech ignoring first word by @Greenpp in #2024
- Make sklearn embedding backend auto-select more cautious by @freddyheppell in #1984
- Fix typos by @afuetterer in #1974
- Fix hierarchical_topics(...) when the distances between three clusters are the same by @azikoss in #1929
- Fixes to chain strategy example in outlier_reduction.md by @reuning in #2065
- Remove obsolete flake8 config and update line length by @afuetterer in #22066
Files
MaartenGr/BERTopic-v0.16.3.zip
Files
(5.8 MB)
Name | Size | Download all |
---|---|---|
md5:48f9021b099e9f13a8c206a0510a4a40
|
5.8 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/MaartenGr/BERTopic/tree/v0.16.3 (URL)