Published March 29, 2018 | Version 1.0.0
Video/Audio Open

Pyramic Dataset : 48-Channel Anechoic Audio Recordings of 3D Sources

  • 1. Tokyo Metropolitan University

Description

The Pyramic Dataset contains recordings done using the
Pyramic 48 channel microphone array in an
anechoic chamber. The recordings consist of 8 different samples (2x sweeps, 1x
noise, 5x speech) repeated at 180 angles (every 2 degrees) and from 3 different
heights. The audio samples recorded are

  • Linear and exponential sweeps
  • Noise sequence
  • 2x male and 3x female speech

This dataset allows to evaluate the performance of array processing algorithms
on real-life recordings done using MEMS microphones similar to those used in
mobile phones with all the non-idealities involved. The dataset is suitable for both 2D
and 3D scenarios. By subsampling the 48
microphones, a large number of array configurations can be tested.  Example of
algorithms are:

  • Direction of arrival (DOA) estimation
  • Beamforming
  • Source separation
  • Array calibration

Another application is the generation of realistic room impulse by combining
the impulse responses of microphones from sources at multiple angles with a
variant of the image source model.

In addition to the raw (compressed or not) and segmented
recordings, the impulse responses of all the microphones for every source
locations were recovered from the exponential sweep measurements and are
distributed together with the dataset. The initial manual measurement of loudspeakers
and microphones locations was improved upon using a blind calibration method.

This record contains

  • The compressed recordings (TTA format)
  • Segmented recorded samples
  • Impulse responses
  • Documentation and code (also available on github)

The raw measurements in wav format are available as a separate record (10.5281/zenodo.1209005).

The best way to get started is to only get the documentation and code from github (a copy is available in pyramic-dataset-doc-d2a456b4.zip) and follow the instructions in the README. The version on github is most up-to-date. If possible, please use that one.

Notes

The author would like to acknowledge Juan Azcarreta Ortiz, Corentin Ferry, and René Beuchat for their help in the design and usage of the Pyramic array. Hanjie Pan, Miranda Kreković, Mihailo Kolundzija, and Dalia El Badawy for lending a hand, or even two, during experiments. Finally, Juan Azcarreta Ortiz, Eric Bezzam, Hanjie Pan and Ivan Dokmanić for feedback on the documentation and dataset organization.

Files

pyramic-dataset-doc-d2a456b4.zip

Files (42.3 GB)

Name Size Download all
md5:cc33a48cba19a4971b02bcc8d266de08
15.3 MB Preview Download
md5:4a38b221fc66e8aa57da595ce3aff986
18.2 GB Download
md5:250a6fb3608ce5b49cdc7796ae233adb
295.0 MB Download
md5:cbb58661b4c741e90b84a40cdbcb0078
1.7 GB Download
md5:0bac047d92ea5e5c723400341fe07d60
1.4 GB Download
md5:2c7851fa409b919b0ccf54a2faa795b6
1.1 GB Download
md5:15a535f980bf76129c7bb72dc65fe748
954.0 MB Download
md5:1bb0b08767bf8420e985e99f8d16de5f
1.3 GB Download
md5:d8386b1b83ba7e9f343b9bde0aca7cd0
5.3 GB Download
md5:3ca226802181d68f572e56f388de0e5b
1.6 GB Download
md5:a060669cd29184a75c7051ca9b160254
5.0 GB Download
md5:7ea382d3830e72c383617eede9740a7c
5.5 GB Download