There is a newer version of the record available.

Published January 25, 2024 | Version v0.8.0
Software Open

KennethEnevoldsen/scandinavian-embedding-benchmark: v0.8.0

  • 1. Center for Humanities Computing Aarhus
  • 2. Center for Humanities Computing
  • 3. Aarhus University

Description

v0.8.0 (2024-01-25)

CI

  • ci: fix misspecified yaml syntax (ca5567c)

Documentation

  • docs: formatting code blocks (cee41f3)

  • docs: update docs to not run all models (90cef3d)

Feature

  • feat: Added VG clustering dataset (49e75d5)

  • feat: Add swedn clustering (0786ec5)

Fix

  • fix: fixed error arising from merge (11e28d6)

  • fix: updated based on static type checks (4752f07)

  • fix: move description to the end so as to make printing of the task object prettier (f8ec70d)

  • fix: reduced size of SwednClustering and ensured that clusters match with document size (0b70730)

Test

  • test: Performance using 5x2048 examples is 8.13 (ed5cb5d)

  • test: Performance using 5x10000 examples is 13.80 (ed36b82)

  • test: Performance using 2x10000 examples is 8.70 (6fe30b7)

  • test: Performance using 10000 examples is 8.46 (630769c)

  • test: Performance using 1000 examples is 8.12 (7732c32)

  • test: Performance using 100 examples is 21.07 (82f7b3f)

Unknown

  • Merge pull request #96 from KennethEnevoldsen/add-swedn-clustering: Add Swedn and VG clustering datasets (8537e12)

  • Merge branch 'main' of https://github.com/KennethEnevoldsen/scandinavian-embedding-benchmark into add-swedn-clustering (18f9afb)

  • tests: refactored tests to not be highly dependent on a few tasks (4b1eaa5)

  • Added a bunch of experiments for the VG summarization. (d9a13cb)

  • Merge pull request #90 from KennethEnevoldsen/types: Moved task types to task interface and deleted types module (7c3b582)

  • Added English to Language type (221bdd8)

  • Removed faulty import in E5 models (601002c)

  • Merge pull request #91 from KennethEnevoldsen/new_models: Added Jina base (95c515e)

  • Fixed import error in speed task (cfccbdf)

  • Added Jina base (6d1ec69)

  • Moved task types to task interface and deleted types module (2f1adf1)

Files (2.0 MB)

KennethEnevoldsen/scandinavian-embedding-benchmark-v0.8.0.zip
