Creating speech zones with self-distributing acoustic swarms (Simulated + Clutter)

Itani, Malek; Chen, Tuochao

doi:10.5281/zenodo.8219720

Published August 7, 2023 | Version 1.0

Dataset Open

1. University of Washington

Datasets used in the paper: "Creating speech zones with self-distributing acoustic swarms"

This deposit contains two distinct datasets:

A dataset of speech mixtures containing 2 to 5 speakers, simulated using PyRoomAcoustics. The dataset consists of 8000 training mixtures, 500 validation mixtures, and 1000 testing mixtures.
A dataset of speech mixtures containing 3 to 5 speakers, created from synchronized recordings in reverberant rooms with objects cluttering the table. The dataset consists of 500 testing mixtures.

The source sounds are utterances from the VCTK dataset. For the real-world data, the utterances are played over a Rokono Bass+ Mini Speaker. The audio is captured by an array of 7 microphones, recorded by our robotic swarm while it is distributed across the table. The real-world recordings have been compressed and decompressed with the Opus codec to enable multiple simultaneous streams.

Please see the Readme for more information, and the related identifiers for other datasets.

Files

Files (29.5 GB)

Name	MD5	Size
clutter_test.tar	255f14a08613ec34c28bb336d140179a	3.2 GB
Readme.txt	33f31503d9f2696d23094e857ad9b66e	2.5 kB
synthetic_2_5spk.tar.gz	99c574d4479592c31611c6bcd52bd97c	26.3 GB
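
The MD5 checksums in the file listing above can be used to confirm that a download completed without corruption, which is worth doing for archives of this size. The following is a minimal sketch, assuming the files were saved under their original names in a single directory. The helper names `md5_of_file` and `verify_downloads` are illustrative and are not part of the dataset or its Readme.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this deposit.
EXPECTED_MD5 = {
    "clutter_test.tar": "255f14a08613ec34c28bb336d140179a",
    "Readme.txt": "33f31503d9f2696d23094e857ad9b66e",
    "synthetic_2_5spk.tar.gz": "99c574d4479592c31611c6bcd52bd97c",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that multi-gigabyte archives are never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory=".", expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'.

    Assumption: files keep their original names inside `directory`.
    """
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == checksum:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    for name, status in verify_downloads().items():
        print(f"{name}: {status}")
```

A file reported as "mismatch" is most likely a truncated or interrupted download and should be fetched again before extraction.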

Additional details

Is supplemented by: 10.5281/zenodo.8222714 (DOI); 10.5281/zenodo.8222784 (DOI)

	All versions	This version
Views	645	617
Downloads	402	382
Data volume	9.9 TB	6.7 TB
