DISPLACE-2024 Dataset

KALLURI, SHAREEF BABU; Baghel,, Shikha; Chowdhuri, Pratik Roy; Singh, Prachi; Ramoji, Shreyas; Jain, Somil; Sidharth; Ranjana, H; Vijayasenan, Deepu; Ganapathy, Sriram

doi:10.5281/zenodo.12166687

Published June 19, 2024 | Version v1

Dataset Restricted

DISPLACE-2024 Dataset

1. Indian Institute of Science Bangalore
2. National Institute of Technology Karnataka

Inspired by the previous session of DISPLACE 2023 challenge, we have launched the DISPLACE 2024 challenge (https://displace2024.github.io/). Compared to the first DISPLACE challenge, the current challenge includes an additional track on automatic speech recognition (ASR) in code-switched multi-accent conversational scenarios along with speaker and language diarization tracks. We release supervised data for exploring new directions on multilingual multispeaker, multi accent conversational data. To the best of our knowledge, no publicly available dataset matches the diverse characteristics observed in the DISPLACE dataset, including code-mixing/switching, natural overlaps, reverberation, and noise. For this challenge, a natural multi-lingual, multi-speaker conversational dataset will be distributed for development and evaluation purposes. There will be no training data given and the participants will be free to use any resource for training the models. The challenge reflects the theme of Interspeech 2024 - "Speech and Beyond" in its true sense.

The dataset can be obtained by orgainisers by sending the request form and duly signed the terms and condtions

Link for registration for obtaining the data : Registration Link

Files

Restricted

The record is publicly accessible, but files are restricted. <a href="https://zenodo.org/account/settings/login?next=https://zenodo.org/records/12166687">Log in</a> to check if you have access.

Additional details

Alternative title: The Second DISPLACE Challenge data

Is new version of: Conference proceeding: https://www.isca-archive.org/interspeech_2023/baghel23_interspeech.pdf (URL); Journal: https://doi.org/10.1016/j.specom.2024.103080 (URL)

Repository URL: https://github.com/displace2024/Displace2024_baseline
Programming language: Shell , Python

	All versions	This version
Views	1,355	590
Downloads	576	79
Data volume	2.7 TB	411.9 GB

Files

Restricted

Additional titles

Related works

Software

DISPLACE-2024 Dataset

Authors/Creators

Description

Files

Restricted

Additional details

Additional titles

Related works

Software