Published January 21, 2022 | Version v1
Dataset Open

An annotated set of audio recordings of Eastern North American birds containing frequency, time, and species information

  • 1. University of Pittsburgh
  • 2. Carnegie Mellon University

Description

Acoustic recordings of soundscapes are an important category of audio data which can be useful for answering a variety of questions, and an entire discipline within ecology, dubbed "soundscape ecology," has risen to study them. Bird sound is often the focus of studies of soundscapes due to the ubiquitousness of birds in most terrestrial environments and their high vocal activity. Autonomous acoustic recorders have increased the quantity and availability of recordings of natural soundscapes while mitigating the impact of human observers on community behavior. However, such recordings are of little use without analysis of the sounds they contain. Manual analysis currently stands as the best means of processing this form of data for use in certain applications within soundscape ecology, but it is a laborious task, sometimes requiring many hours of human review to process comparatively few hours of recording. For this reason, few annotated datasets of soundscape recordings are publicly available. Further still, there are no publicly available strongly-labeled soundscape recordings of bird sounds which contain information on timing, frequency, and species. Therefore, we present the first dataset of strongly-labeled bird sound soundscape recordings under free use license. These data were collected in the Northeastern United States at Powdermill Nature Reserve, Rector, PA. Recordings encompass 385 minutes of dawn chorus recordings collected by autonomous acoustic recorders between the months of April through July 2018. Recordings were collected in continuous bouts on four days during the study period, and contain 48 species and 16,052 annotations. Applications of this dataset may be numerous, and include the training, validation, and testing of certain advanced machine learning models which detect or classify bird sounds. 

Notes

README.txt

This text contains a summary of the collection, annotation, and structure of these audio data. For more detailed metadata, see the supplementary information of the associated data paper.

mp3_Files.zip

This file contains the audio data in mp3 format organized by recording. Individual audio files are separated into five-minute segments of the full recording. The standard naming convention "Recording_#_Segment_##.mp3" provides information necessary to determine the associated wav audio file and annotation file.

wav_Files.zip

This file contains the audio data in wav format organized by recording. Individual audio files are separated into five-minute segments of the full recording. The standard naming convention "Recording_#_Segment_##.wav" provides information necessary to determine the associated mp3 audio file and annotation file.

annotation_Files.zip

This file contains the annotations of the audio data in tab-separated text files organized by recording. Individual annotation files reflect respective five-minute audio segments. The standard naming convention "Recording_#_Segment_##.Table.1.selections.txt" provides information necessary to determine the associated mp3 and wav audio files.

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DBI-IIBR #1935507

Files

annotation_Files.zip

Files (1.6 GB)

Name Size Download all
md5:8f78ba7cf9fc5d25656f7747ed3406d1
434.3 kB Preview Download
md5:c654bd85f43ad6259977940536350fba
182.3 MB Preview Download
md5:a1b4cbf97261285f9363d555d5b01259
7.9 kB Preview Download
md5:2876c0cfc6ace1845ac04814ea9180f1
1.4 GB Preview Download