Published June 23, 2024 | Version 0.9.1
Software Open

Diart: A Python Library for Real-Time Speaker Diarization

  • 1. Université Paris-Saclay CNRS, LISN, Orsay, France
  • 2. IRIT, Université de Toulouse, CNRS, Toulouse, France
  • 3. Ava, France
  • 4. Verbally GmbH, Germany
  • 5. Indian Institute of Technology, Tirupati, India
  • 6. Tridhya Intuit Pvt Ltd, Gujarat, India

Description

Diart is a python framework to build AI-powered real-time audio applications. Its key feature is the ability to recognize different speakers in real time with state-of-the-art performance, a task commonly known as "speaker diarization".

The pipeline diart.SpeakerDiarization combines a speaker segmentation and a speaker embedding model to power an incremental clustering algorithm that gets more accurate as the conversation progresses.

With diart you can also create your own custom AI pipeline, benchmark it, tune its hyper-parameters, and even serve it on the web using websockets.

Files

Files (21.6 MB)

Name Size Download all
md5:a57ecbce98907f903cb9ded75a662475
21.6 MB Download

Additional details

Software

Repository URL
https://github.com/juanmc2005/diart
Programming language
Python
Development Status
Active