Published August 7, 2023 | Version 1.0

Creating speech zones with self-distributing acoustic swarms (Simulated + Clutter)

  • 1. University of Washington

Description

Datasets used in the paper: "Creating speech zones with self-distributing acoustic swarms"

This deposit contains 2 distinct datasets: 

  1. A dataset of speech mixtures containing 2-5 speakers simulated using PyRoomAcoustics. The dataset consists of 8000 training mixtures, 500 validation mixtures and 1000 testing mixtures.
  2. A dataset of speech mixtures containing 3-5 speakers created from synchronized recordings in reverberant rooms with objects cluttering the table. The dataset consists of 500 testing mixtures.

The source sounds are various utterances from the VCTK dataset. For real world data, the utterances are played over a Rokono Bass+ Mini Speaker. The recordings are captured from an array of 7 microphones, as they are recorded by our robotic swarm as it is distributed across the table. The recorded audio in the real world has been subjected to audio compression and decompression using the Opus Codec to enable multiple simultaneous streams.

Please see the Readme for more infromation. Please see related identifiers for other datasets.

Files

Readme.txt

Files (29.5 GB)

Name Size
md5:255f14a08613ec34c28bb336d140179a
3.2 GB Download
md5:33f31503d9f2696d23094e857ad9b66e
2.5 kB Preview Download
md5:99c574d4479592c31611c6bcd52bd97c
26.3 GB Download

Additional details

Related works