Published March 28, 2018 | Version 1.0.0
Video/Audio Open

Pyramic Dataset : 48-Channel Anechoic Audio Recordings of 3D Sources (Raw)

  • 1. Tokyo Metropolitan University

Description

The Pyramic Dataset contains recordings done using the
Pyramic 48 channel microphone array in an
anechoic chamber. The recordings consist of 8 different samples (2x sweeps, 1x
noise, 5x speech) repeated at 180 angles (every 2 degrees) and from 3 different
heights. The audio samples recorded are

  • Linear and exponential sweeps
  • Noise sequence
  • 2x male and 3x female speech

This dataset allows to evaluate the performance of array processing algorithms
on real-life recordings done using MEMS microphones similar to those used in
mobile phones with all the non-idealities involved. The dataset is suitable for both 2D
and 3D scenarios. By subsampling the 48
microphones, a large number of array configurations can be tested.  Example of
algorithms are:

  • Direction of arrival (DOA) estimation
  • Beamforming
  • Source separation
  • Array calibration

Another application is the generation of realistic room impulse by combining
the impulse responses of microphones from sources at multiple angles with a
variant of the image source model.

In addition to the raw (compressed or not) and segmented
recordings, the impulse responses of all the microphones for every source
locations were recovered from the exponential sweep measurements and are
distributed together with the dataset. The initial manual measurement of loudspeakers
and microphones locations was improved upon using a blind calibration method.

This record contains only the raw measurements in wav format and archive of the documentation and code.

The post-processed data is available in a separate record that contains:

  • The compressed recordings (TTA format)
  • Segmented recorded samples
  • Impulse responses
  • Documentation and code (also available on github)

The best way to get started is to only get the documentation and code from github and download the data as needed into the unzipped archive. Then follow the instructions in README.md.

Notes

The author would like to acknowledge Juan Azcarreta Ortiz, Corentin Ferry, and René Beuchat for their help in the design and usage of the Pyramic array. Hanjie Pan, Miranda Kreković, Mihailo Kolundzija, and Dalia El Badawy for lending a hand, or even two, during experiments. Finally, Juan Azcarreta Ortiz, Eric Bezzam, Hanjie Pan and Ivan Dokmanić for feedback on the documentation and dataset organization.

Files

Files (38.1 GB)

Name Size Download all
md5:b9a34a2d25ce09d9c405891b1239b290
38.1 GB Download