Published October 26, 2024 | Version v2
Dataset Open

Soundscape Datasets for Few-Shot Bird Sound Classification

  • 1. ROR icon IMT Atlantique

Description

This repository provides easy access to open-source soundscape datasets of bird sounds, specifically optimized for few-shot classification.

soundscapes.zip contains evaluation soundscape datasets from the BIRB benchmark (https://arxiv.org/abs/2312.07439), downsampled to 16kHz, preprocessed using CNN14 from PANNs (https://arxiv.org/abs/1912.10211), to select a 6-second window with the highest bird activation, and converted to Pytorch (.pt) format to facilitate usability for evaluating deep neural networks. 

These preprocessed datasets are employed in the work "Domain-Invariant Representation Learning of Bird Sounds" (https://arxiv.org/abs/2409.08589), which evaluates the few-shot learning capabilities of deep learning models trained on focal recordings (e.g., Xeno-Canto) and tested on soundscape recordings.

Dataset Structure

Validation Dataset

  • POW (pow.pt): The validation dataset consists of 16,047 examples across 43 classes and is organized as a dictionary with 'data' and 'label' keys representing bird sounds and their corresponding labels. Storing the entire validation dataset in a single tensor enables rapid loading and efficient processing, significantly accelerating the validation process. Classes with only one example are removed, as they are insufficient for one-shot classification tasks. Source: https://zenodo.org/records/4656848#.Y7ijhOxudhE

Test Datasets 

Each test dataset is structured with multiple subfolders, each labeled with an eBird species code to represent data for a specific bird species.

  • SSW (ssw/): Contains 50,760 examples across 96 classes. Source: https://zenodo.org/records/7079380#.Y7ijHOxudhE
  • NES (coffee_farms/): Contains 6,952 examples across 89 classes. Source: https://zenodo.org/records/7525349#.ZB8z_-xudhE
  • UHH (hawaii/): Contains 59,583 examples across 27 classes. Source: https://zenodo.org/records/7078499#.Y7ijPuxudhE
  • HSN (high_sierras/): Contains 10,296 examples across 19 classes. Source: https://zenodo.org/records/7525805#.ZB8zsexudhE
  • SNE (sierras_kahl/): Contains 20,147 examples across 56 classes. Source: https://zenodo.org/records/7050014#.Y7ijWexudhE
  • PER (peru/): Contains 14,768 examples across 132 classes. Source: https://zenodo.org/records/7079124#.Y7iis-xudhE

Code and detailed instructions, including data loading, model implementation, and few-shot evaluation, can be found at: https://github.com/ilyassmoummad/ProtoCLR

Files

soundscapes.zip

Files (45.0 GB)

Name Size Download all
md5:70f29da76177fa590f761e1d69c9754f
45.0 GB Preview Download

Additional details