There is a newer version of the record available.

Published November 3, 2022 | Version v0.7.0
Software Open

MaartenGr/KeyBERT: v0.7.0

  • 1. IKNL
  • 2. IIT kanpur
  • 3. @Gruveo
  • 4. IIIT, Hyderabad
  • 5. The University of Tokyo
  • 6. @explosion

Description

Highlights
  • Cleaned up documentation and added several visual representations of the algorithm (excluding MMR / MaxSum)
  • Added functions to extract and pass word- and document embeddings which should make fine-tuning much faster
from keybert import KeyBERT

kw_model = KeyBERT()

# Prepare embeddings
doc_embeddings, word_embeddings = kw_model.extract_embeddings(docs)

# Extract keywords without needing to re-calculate embeddings
keywords = kw_model.extract_keywords(docs, doc_embeddings=doc_embeddings, word_embeddings=word_embeddings)

Do note that the parameters passed to .extract_embeddings for creating the vectorizer should be exactly the same as those in .extract_keywords.

Fixes
  • Redundant documentation was removed by @mabhay3420 in #123
  • Fixed Gensim backend not working after v4 migration (#71)
  • Fixed candidates not working (#122)

Files

MaartenGr/KeyBERT-v0.7.0.zip

Files (140.6 kB)

Name Size Download all
md5:b022ca4a632ce611b084f21c2e37a323
140.6 kB Preview Download

Additional details

Related works