Diart: A Python Library for Real-Time Speaker Diarization

doi:10.5281/zenodo.12510278

Published June 23, 2024 | Version 0.9.1

Software | Open

1. Université Paris-Saclay CNRS, LISN, Orsay, France
2. IRIT, Université de Toulouse, CNRS, Toulouse, France
3. Ava, France
4. Verbally GmbH, Germany
5. Indian Institute of Technology, Tirupati, India
6. Tridhya Intuit Pvt Ltd, Gujarat, India

Diart is a Python framework for building AI-powered real-time audio applications. Its key feature is the ability to recognize different speakers in real time with state-of-the-art performance, a task commonly known as "speaker diarization".

The pipeline diart.SpeakerDiarization combines a speaker segmentation and a speaker embedding model to power an incremental clustering algorithm that gets more accurate as the conversation progresses.
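To make the idea of incremental clustering concrete, here is a minimal sketch in plain Python. It is not diart's implementation: the class name `IncrementalClustering`, the `new_speaker_threshold` parameter, and the running-mean centroid update are all assumptions chosen for illustration. Each incoming speaker embedding is either assigned to the closest existing centroid, which is then refined, or used to open a new speaker.

```python
import math


def cosine_distance(a, b):
    """Cosine distance between two vectors (1.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


class IncrementalClustering:
    """Hypothetical online clustering of speaker embeddings (illustration only)."""

    def __init__(self, new_speaker_threshold=0.5):
        # Assumed hyper-parameter: distance above which a new speaker is created.
        self.new_speaker_threshold = new_speaker_threshold
        self.centroids = []  # one running-mean embedding per known speaker
        self.counts = []     # number of embeddings merged into each centroid

    def assign(self, embedding):
        """Return the speaker index for `embedding`, updating the centroids."""
        if self.centroids:
            distances = [cosine_distance(embedding, c) for c in self.centroids]
            best = min(range(len(distances)), key=distances.__getitem__)
            if distances[best] < self.new_speaker_threshold:
                # Refine the matched centroid with a running mean, so the
                # estimate improves as more speech from that speaker arrives.
                n = self.counts[best]
                self.centroids[best] = [
                    (c * n + e) / (n + 1)
                    for c, e in zip(self.centroids[best], embedding)
                ]
                self.counts[best] = n + 1
                return best
        # No centroid is close enough: register a new speaker.
        self.centroids.append(list(embedding))
        self.counts.append(1)
        return len(self.centroids) - 1


clustering = IncrementalClustering(new_speaker_threshold=0.5)
labels = [
    clustering.assign(e)
    for e in ([1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.1, 0.9])
]
```

Because every assignment also updates a centroid, later decisions rely on averages over more speech, which is one simple way a clustering step can become more accurate as the conversation progresses.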

With diart you can also create your own custom AI pipeline, benchmark it, tune its hyper-parameters, and even serve it on the web using websockets.
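As an illustration of what benchmarking a diarization pipeline involves, the sketch below computes a simplified frame-level diarization error rate (DER), the metric commonly used for this task. It is not diart's benchmarking code: the function name is hypothetical, and it assumes at most one speaker per frame, `None` for non-speech, and hypothesis labels already mapped to reference labels.

```python
def frame_level_der(reference, hypothesis):
    """Simplified diarization error rate over aligned per-frame labels.

    DER = (false alarm + missed detection + confusion) / reference speech frames
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have the same length")

    false_alarm = missed = confusion = speech = 0
    for ref, hyp in zip(reference, hypothesis):
        if ref is not None:
            speech += 1
            if hyp is None:
                missed += 1        # speech in the reference, silence predicted
            elif hyp != ref:
                confusion += 1     # speech attributed to the wrong speaker
        elif hyp is not None:
            false_alarm += 1       # silence in the reference, speech predicted

    if speech == 0:
        raise ValueError("reference contains no speech frames")
    return (false_alarm + missed + confusion) / speech


reference = ["A", "A", None, "B", "B"]
hypothesis = ["A", None, "A", "B", "A"]
der = frame_level_der(reference, hypothesis)
```

In this toy input there is one missed frame, one false-alarm frame, and one confused frame against four reference speech frames. Hyper-parameter tuning then amounts to searching for the configuration that minimizes such a score on held-out data.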

Files

Files (21.6 MB)

Name	MD5 checksum	Size
diart-0.9.1.tar.gz	a57ecbce98907f903cb9ded75a662475	21.6 MB

Additional details

Repository URL: https://github.com/juanmc2005/diart
Programming language: Python
Development Status: Active

	All versions	This version
Views	41	41
Downloads	7	7
Data volume	150.9 MB	150.9 MB

DOI: 10.5281/zenodo.12510278
Resource type: Software
Publisher: Zenodo

MIT License

A short and simple permissive license with conditions only requiring preservation of copyright and license notices. Licensed works, modifications, and larger works may be distributed under different terms and without source code.

Technical metadata

Created: June 23, 2024
Modified: July 7, 2024
