1216072
doi
10.5281/zenodo.1216072
oai:zenodo.org:1216072
Soumitro Chakrabarty
Internation Audio Laboratories Erlangen
Emanuël Habets
Internation Audio Laboratories Erlangen
Bernd Edler
Internation Audio Laboratories Erlangen
LibriCount, a dataset for speaker count estimation
Fabian-Robert Stöter
Internation Audio Laboratories Erlangen
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
audio
dataset
speaker count estimation
<p><strong>LibriCount10 0dB Dataset</strong></p>
<p>This is the description to the LibriCount10 synthetic dataset for speaker count estimation.</p>
<p>Therefore for each recording we provide the ground truth number of speakers within the file name, where `k` in, `k_uniquefile.wav` is the maximum number of concurrent speakers with the 5 seconds of recording.</p>
<p>The dataset contains a simulated cocktail party environment of [0..10] speakers, mixed with 0dB SNR from random utterances of different speakers from the <a href="http://www.openslr.org/12/">LibriSpeech</a> `CleanTest` dataset.</p>
<p>All recordings are of 5s durations, and all speakers are active for the most part of the recording.</p>
<p>For each unique recording, we provide the audio wave file (16bits, 16kHz, mono) and an annotation `json` file with the same name as the recording.</p>
<p><strong>Metadata</strong></p>
<p>In the annotation file we provide information about the speakers sex, their unique speaker_id, and vocal activity within the mixture recording in samples. Note that these were automatically generated using a <a href="https://github.com/wiseman/py-webrtcvad">voice activity detection</a> system.</p>
<p>In the following example the annotation shows a speaker count of 3 speakers as can be extracted from the number of elements in the list:</p>
<pre><code class="language-json">[
{
"sex": "F",
"activity": [[0, 51076], [51396, 55400], [56681, 80000]],
"speaker_id": 1221
},
{
"sex": "F",
"activity": [[0, 51877], [56201, 80000]],
"speaker_id": 3570
},
{
"sex": "M",
"activity": [[0, 15681], [16161, 68213], [73498, 80000]],
"speaker_id": 5105
}
]</code></pre>
<p><br>
</p>
Zenodo
2018-04-16
info:eu-repo/semantics/other
1216071
v1.0.0
1579893962.130719
832499962
md5:30c8f844dc59fa65d216d53db9dc37e2
https://zenodo.org/records/1216072/files/LibriCount10-0dB.zip
public
10.5281/zenodo.1216071
isVersionOf
doi