Published March 13, 2023
| Version v1.2.0
Software
Open
CouncilDataProject/speakerbox: Speakerbox: Few-Shot Learning for Speaker Identification with Transformers
Authors/Creators
- 1. University of Washington Information School, University of Washington, Seattle
- 2. University of Washington, Seattle
Description
Speakerbox is a library which enables:
- the creation of an audio-based speaker identification datasets
- training an audio-based speaker identification transformer model
- applying a pre-trained audio-based speaker identification to a new audio file and predicting portions of audio as the known speakers
This release completes the work for our Journal of Open Source Software (JOSS) paper.
The changes from v1.0.0 include:
- An example video attached to the README which demonstrates how to use this library (on a toy example) -- YouTube Video Link.
- A more thorough workflow diagram attached to the README which explains how all the components of this library fit together.
- The example data used for model reproduction is now available for download directly from a Python command.
- Upgrading to newer dependency versions.
- The JOSS paper content: paper.md.
- Upgraded linting with ruff.
- Minor improvements to logging.
Files
CouncilDataProject/speakerbox-v1.2.0.zip
Files
(38.9 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:655ebe4c313e2b02c9036a511fb7e400
|
38.9 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/CouncilDataProject/speakerbox/tree/v1.2.0 (URL)