Soundscape Datasets for Few-Shot Bird Sound Classification
Description
This repository provides easy access to open-source soundscape datasets of bird sounds, specifically optimized for few-shot classification.
soundscapes.zip
contains evaluation soundscape datasets from the BIRB benchmark (https://arxiv.org/abs/2312.07439), downsampled to 16kHz, preprocessed using CNN14 from PANNs (https://arxiv.org/abs/1912.10211), to select a 6-second window with the highest bird activation, and converted to Pytorch (.pt) format to facilitate usability for evaluating deep neural networks.
These preprocessed datasets are employed in the work "Domain-Invariant Representation Learning of Bird Sounds" (https://arxiv.org/abs/2409.08589), which evaluates the few-shot learning capabilities of deep learning models trained on focal recordings (e.g., Xeno-Canto) and tested on soundscape recordings.
Dataset Structure
Validation Dataset
- POW (
pow.pt
): The validation dataset consists of 16,047 examples across 43 classes and is organized as a dictionary with'data'
and'label'
keys representing bird sounds and their corresponding labels. Storing the entire validation dataset in a single tensor enables rapid loading and efficient processing, significantly accelerating the validation process. Classes with only one example are removed, as they are insufficient for one-shot classification tasks. Source: https://zenodo.org/records/4656848#.Y7ijhOxudhE
Test Datasets
Each test dataset is structured with multiple subfolders, each labeled with an eBird species code to represent data for a specific bird species.
- SSW (
ssw/
): Contains 50,760 examples across 96 classes. Source: https://zenodo.org/records/7079380#.Y7ijHOxudhE - NES (
coffee_farms/
): Contains 6,952 examples across 89 classes. Source: https://zenodo.org/records/7525349#.ZB8z_-xudhE - UHH (
hawaii/
): Contains 59,583 examples across 27 classes. Source: https://zenodo.org/records/7078499#.Y7ijPuxudhE - HSN (
high_sierras/
): Contains 10,296 examples across 19 classes. Source: https://zenodo.org/records/7525805#.ZB8zsexudhE - SNE (
sierras_kahl/
): Contains 20,147 examples across 56 classes. Source: https://zenodo.org/records/7050014#.Y7ijWexudhE - PER (
peru/
): Contains 14,768 examples across 132 classes. Source: https://zenodo.org/records/7079124#.Y7iis-xudhE
Code and detailed instructions, including data loading, model implementation, and few-shot evaluation, can be found at: https://github.com/ilyassmoummad/ProtoCLR
Files
soundscapes.zip
Files
(45.0 GB)
Name | Size | Download all |
---|---|---|
md5:70f29da76177fa590f761e1d69c9754f
|
45.0 GB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/ilyassmoummad/ProtoCLR