LibriCount, a dataset for speaker count estimation

Fabian-Robert Stöter; Soumitro Chakrabarty; Emanuël Habets; Bernd Edler

doi:10.5281/zenodo.1216072

Published April 16, 2018 | Version v1.0.0

Dataset Open

LibriCount, a dataset for speaker count estimation

1. Internation Audio Laboratories Erlangen

LibriCount10 0dB Dataset

This is the description to the LibriCount10 synthetic dataset for speaker count estimation.

Therefore for each recording we provide the ground truth number of speakers within the file name, where `k` in, `k_uniquefile.wav` is the maximum number of concurrent speakers with the 5 seconds of recording.

The dataset contains a simulated cocktail party environment of [0..10] speakers, mixed with 0dB SNR from random utterances of different speakers from the LibriSpeech `CleanTest` dataset.

All recordings are of 5s durations, and all speakers are active for the most part of the recording.

For each unique recording, we provide the audio wave file (16bits, 16kHz, mono) and an annotation `json` file with the same name as the recording.

Metadata

In the annotation file we provide information about the speakers sex, their unique speaker_id, and vocal activity within the mixture recording in samples. Note that these were automatically generated using a voice activity detection system.

In the following example the annotation shows a speaker count of 3 speakers as can be extracted from the number of elements in the list:

[
    {
        "sex": "F",
        "activity": [[0, 51076], [51396, 55400], [56681, 80000]], 
        "speaker_id": 1221
    },
    {
        "sex": "F",
        "activity": [[0, 51877], [56201, 80000]],
        "speaker_id": 3570
    },
    {
        "sex": "M",
        "activity": [[0, 15681], [16161, 68213], [73498, 80000]], 
        "speaker_id": 5105
    }
]

Files

LibriCount10-0dB.zip

Files (832.5 MB)

Name	Size	Download all
LibriCount10-0dB.zip md5:30c8f844dc59fa65d216d53db9dc37e2	832.5 MB	Preview Download

	All versions	This version
Views	3,428	3,416
Downloads	1,010	1,008
Data volume	1.1 TB	1.1 TB

LibriCount, a dataset for speaker count estimation

Creators

Description

Files

LibriCount10-0dB.zip

Files (832.5 MB)