Published February 14, 2020 | Version 1.0
Journal article Open

BirdVox-14SD: a dataset of flight calls with species annotation

  • 1. Cornell Lab of Ornithology
  • 2. New York University

Description

   

BirdVox 14 Species Dataset (BirdVox-14SD)
=============
Version 1.0, May 2020.

Created By
----------

Vincent Lostanlen (1, 2, 3), Andrew Farnsworth (1), Jason Cramer (2, 3), and Juan Pablo Bello (2, 3).

(1): Cornell Lab of Ornithology (CLO)
(2): Center for Urban Science and Progress, New York University
(3): Music and Audio Research Lab, New York University

https://wp.nyu.edu/birdvox


Description
-----------

The BirdVox 14 Species Dataset (BirdVox-14SD) contains 14,336 audio clips of avian flight calls, each ranging from about 150 ms to 500 ms in duration. These recordings come from ROBIN autonomous recording units, placed near Ithaca, NY, USA during the 2015 migration season (August - November). Nine different sensors were used, originally numbered 1, 2, 3, 4, 5, 6, 7, 8, and 10. These sensors acquired audio recordings in intervals of two hours across the season.

A subsample of 150 these two-hour recordings were chosen for annotation, using the Entrofy library [3] in order to maximize diversity across sensor locations, time of day, week in the season, and background noise characteristics (as represented by vector quantizations of median MFCCs). Andrew Farnsworth used the Raven software to pinpoint every avian flight call and labeled the corresponding order, family, and species.

The dataset can be used, among other things, for the research,
development and testing of bioacoustic classification models, including the reproduction of the results reported in [1].

For details on the hardware of ROBIN recording units, we refer the reader to [2].

[1] J. Cramer, V. Lostanlen, A. Farnsworth, J. Salamon, J.P. Bello. Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020.

[2] J. Salamon, J. P. Bello, A. Farnsworth, M. Robbins, S. Keen, H. Klinck, and S. Kelling. Towards the Automatic Classification of Avian Flight Calls for Bioacoustic Monitoring. PLoS One, 2016.

[3] D. Huppenkothen, B. McFee, L. Norén. Entrofy Your Cohort: A Data Science Approach to Candidate Selection. PLoS One, 2020.


Taxonomic Annotations
-----------------------

Classification annotations for each flight call are given at three taxonomic levels: order, family, and species. These annotations are condensed into a three-number-code which largely follow "<family>.<order>.<species>". The specific numeric codes are:

* Order
    * 1.\*.\* - Passerine
* Family
    * 1.1.\* - American Sparrow
    * 1.2.\* - Cardinals
    * 1.3.\* - Thrushes
    * 1.4.\* - New World warblers
* Species
    * 1.1.1 - American tree sparrows (ATSP)
    * 1.1.2 - Chipping sparrow (CHSP)
    * 1.1.3 - Savannah sparrow (SAVS)
    * 1.1.4 - White-throated sparrow (WTSP)
    * 1.2.1 - Rose-breasted grosbeak (RBGR)
    * 1.3.1 - Gray-cheeked thrush (GCTH)
    * 1.3.2 - Swainson's thrush (SWTH)
    * 1.4.1 - American redstart (AMRE)
    * 1.4.2 - Bay-breasted warbler (BBWA)
    * 1.4.3 - Black-throated blue warbler (BTBW)
    * 1.4.4 - Canada warbler (CAWA)
    * 1.4.5 - Common yellowthroat (COYE)
    * 1.4.6 - Mourning warbler (MOWA)
    * 1.4.7 - Ovenbird (OVEN)

Additionally, at any level of the taxonomy, the numeric code "0" is reserved for "other" and the code "X" refers to unknown. For example, 1.1.0 corresponds to an American Sparrow with a species outside of our scope of interest, and 1.1.X corresponds to an American Sparrow of unknown species. At the top level (family), the "other" codes (0.\*.\*) deviate from the family-order-species in order to capture a variety of other out-of-scope sounds, including anthropophony, non-avian biophony, and biophony of avians outside of the scope of interest. The file `taxonomy.yaml` details this taxonomy structure.


Data Files
------------

BirdVox-14SD contains the recordings as HDF5 files, sampled at 22,050 Hz, with a single channel (mono). Each HDF5 file contains flight call vocalizations of a particular species. The name of each HDF5 file follows the format: `BirdVox-14SD_<taxonomy-code>_original.h5`. The name of the HDF5 dataset in each file is "waveforms", with the corresponding key for each audio recording following the format: `unit<unit-id>-<recording-timestamp>_<clip-offset>_<ave-call-freq>`.


Please acknowledge BirdVox-14SD in academic research
--------------------------------------------------------------------------

When BirdVox-14SD is used for academic research, we would highly appreciate it if  scientific publications of works partly based on this dataset cite the following publication:

J. Cramer, V. Lostanlen, A. Farnsworth, J. Salamon, J.P. Bello. Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020.

The creation of this dataset was supported by NSF grants 1633259 (BIRDVOX).


Conditions of Use
----------------------

Dataset created by Vincent Lostanlen, Andrew Farnsworth, Jason Cramer, Justin Salamon, and Juan Pablo Bello.

The BirdVox-14SD dataset is offered free of charge under the terms of the Creative Commons Attribution 4.0 International License.

The dataset and its contents are made available on an "as is" basis and without  warranties of any kind, including without limitation satisfactory quality and  conformity, merchantability, fitness for a particular purpose, accuracy or  completeness, or absence of errors. Subject to any liability that may not be excluded or limited by law, CLO is not liable for, and expressly excludes all liability for, loss or damage however and whenever caused to anyone by any use of the BirdVox-14SD dataset or any part of it.


Feedback
-----------

Please help us improve BirdVox-14SD by sending your feedback to:
vincent.lostanlen@gmail.com and jtcramer@nyu.edu

In case of a problem, please include as many details as possible.


Acknowledgements
------------------------
Jessie Barry, Ian Davies, Tom Fredericks, Jeff Gerbracht, Sara Keen, Holger Klinck, Anne Klingensmith, Ray Mack, Peter Marchetto, Ed Moore, Matt Robbins, Ken Rosenberg, and Chris Tessaglia-Hymes.

We acknowledge that the land on which the data was collected is the unceded territory of the Cayuga nation, which is part of the Haudenosaunee (Iroquois) confederacy.

Files

Files (1.3 GB)

Name Size Download all
md5:e3188151c903568ee143f9d5a211b61a
96.4 kB Download
md5:d88d6f0748c4da98a49787987c34f0f2
10.1 MB Download
md5:4e6b8e8012b76c0b2c2fd5de24c99353
363.3 kB Download
md5:8f46253425fbb2d636460d913b587085
12.9 MB Download
md5:3d1f3c72737d2b62bdd503d67851eb0f
1.1 MB Download
md5:2dd1e390954764b7cc2c9836efd0565d
453.3 kB Download
md5:a32096ec107ccc0302956782efb7f3ac
33.6 MB Download
md5:38e3c4500fda6a83d31e27221cd60659
6.6 MB Download
md5:7e6d2b7885377074f835537ccbc7811a
3.8 MB Download
md5:1e7a2951d8020b58f89589b3d33f2a13
17.0 MB Download
md5:e5d6ec1139595e534c0c3d5384ff5aab
2.0 MB Download
md5:907f3efac89262505716da335d5fd421
2.2 MB Download
md5:ba8d12a18a8447e812efa65d175caa5d
5.0 MB Download
md5:dc61556d9d0e1eeaf9649d7cee8597b9
25.0 MB Download
md5:d179085d07fbc0b35ec812f13a56fc88
11.9 MB Download
md5:05a7f0307535219808e6fd6346f14bc8
13.6 MB Download
md5:5e79aad141384feb0cc670c4748e5ed5
10.2 MB Download
md5:5844abfedf59187cd8194ec06bbe6a4c
24.8 MB Download
md5:fd554ffd0e18436e4e7c7479db0185f0
34.8 MB Download
md5:e55d98069fbe9fee1e62f20bdd22ed6c
87.5 MB Download
md5:f36b9b85b704d262e1b85b2664e25af4
2.1 MB Download
md5:f0d84ca0e907349f846891d4d6b51ee3
21.4 MB Download
md5:734d2f7e037bfab2166d001396b259ed
2.4 MB Download
md5:d016db10024605f249c1619aa0045fc3
62.6 MB Download
md5:1e0fc051f04dd260bebf68bcca127089
11.5 MB Download
md5:b78417eff913d4c6c4b59a586bf27064
154.5 MB Download
md5:c302c429c904795b10f20fbcae3dd646
86.0 MB Download
md5:95f93b30e8eabd11050f9f18910730e6
40.6 MB Download
md5:1d3e83757e56472e84bb9680156a3698
9.0 MB Download
md5:9a41ab695302f1fec35b44fbd160c155
7.6 MB Download
md5:7bd4be6ca376e894da7431b85d443ceb
11.0 MB Download
md5:72f9e0c607165827e970810bc6cd90e5
1.3 MB Download
md5:02a46f7b67c4baa45c34d969dc18e50a
18.6 MB Download
md5:bbeb83552c1c77ca1b5b95d26e335787
1.3 MB Download
md5:cff1ee7ab75ec805030aaaa588c68a3c
36.3 MB Download
md5:99924836ca7978b1644808d0ed9b22ea
98.1 MB Download
md5:67138e0a0a49d4ab2c295bbdbe5c94ef
350.5 MB Download
md5:06ca6fea6a9b9342a053a2609dc61706
59.2 MB Download
md5:6d2b23d9a96312b358cfcf545e4396c0
3.2 kB Download