Published December 3, 2019 | Version v1
Dataset Restricted


  • 1. Idiap Research Institute


ODESSA is a speaker diarization dataset. It contains 42 short VoIP conversations, each between a pair of speakers drawn from 14 individuals. The speakers are given pre-prepared scripts and read their assigned roles. Each speaker uses a PC, and the session animator records the conversation on a third PC while muted.



The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

Access to the dataset is based on an End-User License Agreement. The use of the dataset is strictly restricted to non-commercial research.

Please provide us with the following information about the authorized signatory (who MUST hold a permanent position):

  • Full name
  • Name of organization
  • Position / job title
  • Academic / professional email address
  • URL where we can verify the information details

Only academic/professional email addresses from the same organization as the signatory are accepted for the online request. All online requests coming from generic email providers such as Gmail will be rejected.


Additional details


Online Diarization Enhanced by recent Speaker identification and Structured prediction Approaches (ODESSA), 200021E-164336
Swiss National Science Foundation