Published June 23, 2024
| Version 0.9.1
Software
Open
Diart: A Python Library for Real-Time Speaker Diarization
Creators
- 1. Université Paris-Saclay CNRS, LISN, Orsay, France
- 2. IRIT, Université de Toulouse, CNRS, Toulouse, France
- 3. Ava, France
- 4. Verbally GmbH, Germany
- 5. Indian Institute of Technology, Tirupati, India
- 6. Tridhya Intuit Pvt Ltd, Gujarat, India
Description
Diart is a python framework to build AI-powered real-time audio applications. Its key feature is the ability to recognize different speakers in real time with state-of-the-art performance, a task commonly known as "speaker diarization".
The pipeline diart.SpeakerDiarization
combines a speaker segmentation and a speaker embedding model to power an incremental clustering algorithm that gets more accurate as the conversation progresses.
With diart you can also create your own custom AI pipeline, benchmark it, tune its hyper-parameters, and even serve it on the web using websockets.
Files
Files
(21.6 MB)
Name | Size | Download all |
---|---|---|
md5:a57ecbce98907f903cb9ded75a662475
|
21.6 MB | Download |
Additional details
Software
- Repository URL
- https://github.com/juanmc2005/diart
- Programming language
- Python
- Development Status
- Active