Labeled songs of domestic canary M1-2016-spring (Serinus canaria)
Creators
- 1. Inria Bordeaux Sud-Ouest, France and LaBRI, Bordeaux INP, CNRS, UMR 5800, France and Institut des Maladies Neurodégénératives, Université de Bordeaux, CNRS, UMR 5293, France
- 2. Paris-Saclay University, UMR 9197 CNRS, Paris-Saclay Institute of Neuroscience, France
Description
J. Giraudon* (1,2,3), N. Trouvain* (1,2,3), A. Cazala (4), C. Del Negro (4), X. Hinaut (1,2,3)
1 Inria Bordeaux Sud-Ouest, France
2 LaBRI, Bordeaux INP, CNRS, UMR 5800, France
3 Institut des Maladies Neurodégénératives, Université de Bordeaux, CNRS, UMR 5293, France
4 Paris-Saclay University, UMR 9197 CNRS, Paris-Saclay Institute of Neuroscience, France
* These authors contributed equally to this work.
General information
This dataset contains about 3 hours of labeled songs (459 songs) from one male canary (called M1), recorded between May 24th and June 15th, 2016. Songs were recorded in a sound-isolation chamber using a RODE M3 microphone, an external sound card for microphone amplification (M-Audio Fast Track Ultra 8R), and the software Sound Analysis Pro 2011 (SAP). SAP parameters were set with conservative thresholds (software threshold set to 4-6) in order to capture the onset of canary songs, which can be low in volume.
Songs were hand-labeled by one human expert using Audacity. The labels were then checked and corrected by a second human expert, assisted by an automated program based on recurrent neural networks (see References).
Dataset description
Canary songs are labeled using 30 classes in total: 27 identified syllable classes, 1 "call" class for simple off-song calls, 1 "TRASH" class for irrelevant sounds (very rare vocalizations or non-bird sounds), and 1 "SIL" class for silence between vocalizations. Songs are annotated at the phrase level: a phrase consists of repetitions of a single syllable type, and each phrase type is assigned a label.
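As an illustration of how the special classes might be handled in practice, the sketch below sums the time spent in each syllable class while skipping "call", "TRASH" and "SIL". The phrase tuples and the syllable labels "A" and "B" are made-up placeholders, not labels taken from the dataset; only the three special label names come from the description above.

```python
from collections import defaultdict

# Special (non-syllable) classes named in the dataset description.
SPECIAL_LABELS = {"call", "TRASH", "SIL"}


def duration_per_label(phrases, skip=SPECIAL_LABELS):
    """Sum phrase durations (in seconds) per label, ignoring labels in `skip`."""
    totals = defaultdict(float)
    for label, start, end in phrases:
        if label not in skip:
            totals[label] += end - start
    return dict(totals)


# Hypothetical (label, start, end) phrases; "A" and "B" are placeholder labels.
phrases = [
    ("SIL", 0.0, 0.5),
    ("A", 0.5, 1.75),
    ("SIL", 1.75, 2.0),
    ("B", 2.0, 3.5),
    ("A", 3.5, 4.0),
    ("TRASH", 4.0, 4.25),
]

totals = duration_per_label(phrases)
```

Whether "call" should be grouped with the syllable classes or with the special classes depends on the analysis; here it is skipped purely as an example.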
Annotations are provided in CSV format in the "M1-2016-spring_annotations.zip" archive. There is one file per song, containing:
- a "wave" column indicating the song's audio filename;
- "start" and "end" columns indicating the start and end time of each labeled phrase, measured from the beginning of the song, in seconds;
- a "syll" column indicating the labels.
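A minimal sketch of reading one annotation file with Python's standard `csv` module is given below. It assumes the files are comma-delimited with a header row using the column names listed above; the delimiter, the sample filename `song_000.wav`, and the syllable labels "A" and "B" are assumptions for illustration and should be checked against the actual files.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class Phrase:
    wave: str     # audio filename of the song
    start: float  # phrase start, seconds from the beginning of the song
    end: float    # phrase end, seconds from the beginning of the song
    syll: str     # phrase label

    @property
    def duration(self) -> float:
        return self.end - self.start


def read_annotations(fileobj):
    """Parse one annotation file (open text file object) into Phrase records."""
    reader = csv.DictReader(fileobj)
    return [
        Phrase(row["wave"], float(row["start"]), float(row["end"]), row["syll"])
        for row in reader
    ]


# Synthetic stand-in for one annotation file (hypothetical content).
SAMPLE = """wave,start,end,syll
song_000.wav,0.0,0.5,SIL
song_000.wav,0.5,1.75,A
song_000.wav,1.75,2.0,SIL
song_000.wav,2.0,3.5,B
"""

phrases = read_annotations(io.StringIO(SAMPLE))
```

For a real file, the same function can be called on `open(path, newline="")`. Extra columns, if any, are ignored because rows are accessed by column name.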
Songs are provided in WAV format (44 kHz sampling rate) in the "M1-2016-spring_audio.zip" archive. There is one file per song, and each audio filename matches the corresponding annotation filename.
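Because annotation times are given in seconds, extracting the audio for one phrase amounts to converting "start" and "end" into frame indices using the file's sampling rate. The sketch below does this with the standard `wave` module and reads the rate from the WAV header rather than hard-coding it. The helper `make_silent_wav` only builds a synthetic test file; its mono, 16-bit, 44100 Hz settings are assumptions, not properties confirmed for the dataset.

```python
import io
import wave


def read_segment(wav_file, start_s, end_s):
    """Return (raw PCM bytes, sample rate) for the window [start_s, end_s) in seconds."""
    with wave.open(wav_file, "rb") as wav:
        rate = wav.getframerate()
        first = int(round(start_s * rate))  # first frame of the phrase
        last = int(round(end_s * rate))     # one past the last frame
        wav.setpos(first)
        frames = wav.readframes(last - first)
    return frames, rate


def make_silent_wav(duration_s, rate=44100):
    """Build an in-memory mono 16-bit WAV of silence (synthetic test data)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(duration_s * rate))
    buf.seek(0)
    return buf


# Extract a hypothetical phrase spanning 0.5 s to 1.75 s from a 2 s file.
segment, rate = read_segment(make_silent_wav(2.0), 0.5, 1.75)
```

`read_segment` also accepts a file path. The returned bytes are raw PCM frames, so decoding them into samples requires the sample width and channel count from the same header.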
References
This dataset was used in:
N. Trouvain, X. Hinaut (2021) Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs. HAL preprint 〈hal-03203374〉
Files
M1-2016-sping_audio.zip
Total size: 856.1 MB
Size | MD5 checksum |
---|---|
855.7 MB | d42020e8338f304b6af0860ddfd839ed |
424.3 kB | 8ab6b3dbee59e9108f3fdc3eb02ab022 |