The Effect of Soundscape Composition on Bird Vocalization Classification in a Citizen Science Biodiversity Monitoring Project

Matthew Clark; Leonardo Salas; Shrishail Baligar; Colin Quinn

doi:10.5281/zenodo.6512218

Published March 7, 2023 | Version 2023-03-07

Dataset Open

The Effect of Soundscape Composition on Bird Vocalization Classification in a Citizen Science Biodiversity Monitoring Project

1. Sonoma State University, Rohnert Park, CA USA
2. Point Blue Conservation Science, Petaluma, CA, USA
3. University of California, Merced, CA, USA
4. Northern Arizona University, Flagstaff, AZ, USA

This archive includes sound clips (.wav files) and associated mel-scale spectrograms of bird vocalizations for 54 species in Sonoma County, California, USA. These data were used for training and validating convolutional neural network (CNN) models for bird species detection. We also include xeno-canto training and validation mel spectrograms used to pretrain CNNs. Details on these data are explained in the paper by Clark et al. (2023) titled "The effect of soundscape composition on bird vocalization classification in a citizen science biodiversity monitoring project". These data are available for use without restrictions, with no warranty on data quality or utility for a given application. We request that any work that does use these data cite the Clark et al. (2023) paper.

Clark, M.L., Salas, L., Baligar, S., Quinn, C., Snyder, R.L., Leland, D., Schackwitz, W., Goetz, S.J., Newsam, S. (2023). The effect of soundscape composition on bird vocalization classification in a citizen science biodiversity monitoring project. Ecological Informatics. https://doi.org/10.1016/j.ecoinf.2023.102065

Associated code for training CNN models, performing inference, and applying post-classification corrections can be found in the GitHub archive https://github.com/pointblue/Soundscapes2Landscapes/tree/master/CNN_Bird_Species

Raw sound data from the Soundscapes to Landscapes project are available upon request: Dr. Matthew Clark, matthew.clark@sonoma.edu

These data were collected as part of the Soundscapes to Landscapes project (soundscapes2landscapes.org), funded by NASA’s Citizen Science for Earth Systems Program (CSESP) 16-CSESP 2016-0009 under cooperative agreement 80NSSC18M0107.

----------------------------
This depository includes the following archives:

mel_specs.zip: contains 2-sec mel spectrograms split into training (“tr”), validation (“val”), testing (“test”) data for each target bird species (n = 54) used to fine-tune the CNNs. Select spectrogram files are appended with “aug” if they are augmented versions for the training data.
wav.zip: contains the associated wav-format sound recordings used to generate the training, validation, testing mel spectrograms found in mel_specs.zip.
Xeno-canto_pretrain.tar: contains 2-sec mel spectrograms split into training and validation data for 40 bird species used for CNN pre-training that were generated using a warbleR segmentation methodology described in the paper. The sound files used to generate these mel spectrograms came from the Kaggle competition, https://www.kaggle.com/datasets/imoore/xenocanto-bird-recordings-dataset
Mel spectrogram naming reflects the XC number used for cataloging on Xeno-canto in the format XC123456_2.png. The six numbers following the XC characters can be used to search for unique recordings on Xeno-canto (https://xeno-canto.org/) using the search query “nr:123456” in the search tool or queried using the Xeno-canto API (https://xeno-canto.org/explore/api). Unique recording names can be extracted from the mel spectrogram filenames.
soundscape_test_wavs.zip: the wav-format sound recordings used to perform soundscape testing.

Files

mel_specs.zip

Files (34.7 GB)

Name	Size	Download all
mel_specs.zip md5:e2420ca6d7e2592e9043ede3a28e0fe4	8.9 GB	Preview Download
soundscape_test_wavs.zip md5:847f6ef035d5ec4100a5e7b1d7fa7c94	2.4 GB	Preview Download
wav.zip md5:01942956e1b026805ddeb0c2b9d5ce89	3.6 GB	Preview Download
Xeno-canto_pretrain.tar md5:0407463057fcb27f36ddb06749eeb270	19.8 GB	Download

Additional details

Clark, M.L., Salas, L., Baligar, S., Quinn, C., Snyder, R.L., Leland, D., Schackwitz, W., Goetz, S.J., Newsam, S. (2023). The effect of soundscape composition on bird vocalization classification in a citizen science biodiversity monitoring project. Ecological Informatics. https://doi.org/10.1016/j.ecoinf.2023.102065

	All versions	This version
Views	236	236
Downloads	174	174
Data volume	1.8 TB	1.8 TB

The Effect of Soundscape Composition on Bird Vocalization Classification in a Citizen Science Biodiversity Monitoring Project

Authors/Creators

Description

Files

mel_specs.zip

Files (34.7 GB)

Additional details

References