Published February 14, 2020 | Version 1.0
Journal article
Open
BirdVox-14SD: a dataset of flight calls with species annotation
- 1. Cornell Lab of Ornithology
- 2. New York University
Description
BirdVox 14 Species Dataset (BirdVox-14SD)
=============

Version 1.0, May 2020.

Created By
----------

Vincent Lostanlen (1, 2, 3), Andrew Farnsworth (1), Jason Cramer (2, 3), and Juan Pablo Bello (2, 3).

(1): Cornell Lab of Ornithology (CLO)
(2): Center for Urban Science and Progress, New York University
(3): Music and Audio Research Lab, New York University

https://wp.nyu.edu/birdvox

Description
-----------

The BirdVox 14 Species Dataset (BirdVox-14SD) contains 14,336 audio clips of avian flight calls, each ranging from about 150 ms to 500 ms in duration. These recordings come from ROBIN autonomous recording units, placed near Ithaca, NY, USA during the 2015 migration season (August - November). Nine different sensors were used, originally numbered 1, 2, 3, 4, 5, 6, 7, 8, and 10. These sensors acquired audio in two-hour recordings throughout the season. A subsample of 150 of these two-hour recordings was chosen for annotation, using the Entrofy library [3] in order to maximize diversity across sensor locations, time of day, week in the season, and background noise characteristics (as represented by vector quantizations of median MFCCs). Andrew Farnsworth used the Raven software to pinpoint every avian flight call and labeled the corresponding order, family, and species.

The dataset can be used, among other things, for the research, development, and testing of bioacoustic classification models, including the reproduction of the results reported in [1].

For details on the hardware of ROBIN recording units, we refer the reader to [2].

[1] J. Cramer, V. Lostanlen, A. Farnsworth, J. Salamon, J.P. Bello. Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020.

[2] J. Salamon, J. P. Bello, A. Farnsworth, M. Robbins, S. Keen, H. Klinck, and S. Kelling.
Towards the Automatic Classification of Avian Flight Calls for Bioacoustic Monitoring. PLoS One, 2016.

[3] D. Huppenkothen, B. McFee, L. Norén. Entrofy Your Cohort: A Data Science Approach to Candidate Selection. PLoS One, 2020.

Taxonomic Annotations
-----------------------

Classification annotations for each flight call are given at three taxonomic levels: order, family, and species. These annotations are condensed into a three-number code which largely follows the pattern "<order>.<family>.<species>". The specific numeric codes are:

* Order
    * 1.\*.\* - Passerine
* Family
    * 1.1.\* - American Sparrow
    * 1.2.\* - Cardinals
    * 1.3.\* - Thrushes
    * 1.4.\* - New World warblers
* Species
    * 1.1.1 - American tree sparrow (ATSP)
    * 1.1.2 - Chipping sparrow (CHSP)
    * 1.1.3 - Savannah sparrow (SAVS)
    * 1.1.4 - White-throated sparrow (WTSP)
    * 1.2.1 - Rose-breasted grosbeak (RBGR)
    * 1.3.1 - Gray-cheeked thrush (GCTH)
    * 1.3.2 - Swainson's thrush (SWTH)
    * 1.4.1 - American redstart (AMRE)
    * 1.4.2 - Bay-breasted warbler (BBWA)
    * 1.4.3 - Black-throated blue warbler (BTBW)
    * 1.4.4 - Canada warbler (CAWA)
    * 1.4.5 - Common yellowthroat (COYE)
    * 1.4.6 - Mourning warbler (MOWA)
    * 1.4.7 - Ovenbird (OVEN)

Additionally, at any level of the taxonomy, the numeric code "0" is reserved for "other" and the code "X" refers to unknown. For example, 1.1.0 corresponds to an American Sparrow with a species outside of our scope of interest, and 1.1.X corresponds to an American Sparrow of unknown species. At the top level (order), the "other" codes (0.\*.\*) deviate from the order-family-species scheme in order to capture a variety of other out-of-scope sounds, including anthropophony, non-avian biophony, and biophony of avians outside of the scope of interest. The file `taxonomy.yaml` details this taxonomy structure.

Data Files
------------

BirdVox-14SD contains the recordings as HDF5 files, sampled at 22,050 Hz, with a single channel (mono). Each HDF5 file contains flight call vocalizations of a particular species.
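
As an illustration of the three-part codes listed under Taxonomic Annotations, the sketch below decodes a code string into a label at each level. It is a minimal example under stated assumptions: the function `describe` and the lookup tables are hypothetical names, the tables were transcribed by hand from the list in this README rather than read from `taxonomy.yaml`, and codes whose first part is "0" are not decoded further because they deviate from the regular scheme.

```python
from typing import Optional, Tuple

# Lookup tables transcribed from the README list (not loaded from taxonomy.yaml).
ORDERS = {"1": "Passerine"}
FAMILIES = {
    "1.1": "American Sparrow",
    "1.2": "Cardinals",
    "1.3": "Thrushes",
    "1.4": "New World warblers",
}
SPECIES = {
    "1.1.1": "ATSP", "1.1.2": "CHSP", "1.1.3": "SAVS", "1.1.4": "WTSP",
    "1.2.1": "RBGR",
    "1.3.1": "GCTH", "1.3.2": "SWTH",
    "1.4.1": "AMRE", "1.4.2": "BBWA", "1.4.3": "BTBW", "1.4.4": "CAWA",
    "1.4.5": "COYE", "1.4.6": "MOWA", "1.4.7": "OVEN",
}
# Reserved codes that may appear at any level.
SPECIAL = {"0": "other", "X": "unknown"}


def _label(part: str, key: str, table: dict) -> str:
    """Return the label for one level, honoring the reserved codes."""
    if part in SPECIAL:
        return SPECIAL[part]
    if key not in table:
        raise ValueError(f"code {key!r} is not in the transcribed list")
    return table[key]


def describe(code: str) -> Tuple[str, Optional[str], Optional[str]]:
    """Decode '<order>.<family>.<species>' into (order, family, species) labels."""
    parts = code.split(".")
    if len(parts) != 3:
        raise ValueError(f"expected three dot-separated parts, got {code!r}")
    order, family, species = parts
    if order == "0":
        # 0.*.* codes do not follow the regular scheme; leave lower levels undecoded.
        return ("other", None, None)
    return (
        _label(order, order, ORDERS),
        _label(family, f"{order}.{family}", FAMILIES),
        _label(species, code, SPECIES),
    )
```

For instance, `describe("1.1.X")` yields the Passerine order, the American Sparrow family, and "unknown" at the species level. Species are reported by their four-letter codes to keep the table short.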
The name of each HDF5 file follows the format: `BirdVox-14SD_<taxonomy-code>_original.h5`.

The name of the HDF5 dataset in each file is "waveforms", with the corresponding key for each audio recording following the format: `unit<unit-id>-<recording-timestamp>_<clip-offset>_<ave-call-freq>`.

Please acknowledge BirdVox-14SD in academic research
--------------------------------------------------------------------------

When BirdVox-14SD is used for academic research, we would highly appreciate it if scientific publications of works partly based on this dataset cite the following publication:

J. Cramer, V. Lostanlen, A. Farnsworth, J. Salamon, J.P. Bello. Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020.

The creation of this dataset was supported by NSF grant 1633259 (BIRDVOX).

Conditions of Use
----------------------

Dataset created by Vincent Lostanlen, Andrew Farnsworth, Jason Cramer, Justin Salamon, and Juan Pablo Bello.

The BirdVox-14SD dataset is offered free of charge under the terms of the Creative Commons Attribution 4.0 International License.

The dataset and its contents are made available on an "as is" basis and without warranties of any kind, including without limitation satisfactory quality and conformity, merchantability, fitness for a particular purpose, accuracy or completeness, or absence of errors. Subject to any liability that may not be excluded or limited by law, CLO is not liable for, and expressly excludes all liability for, loss or damage however and whenever caused to anyone by any use of the BirdVox-14SD dataset or any part of it.

Feedback
-----------

Please help us improve BirdVox-14SD by sending your feedback to: vincent.lostanlen@gmail.com and jtcramer@nyu.edu

In case of a problem, please include as many details as possible.

Acknowledgements
------------------------

Jessie Barry, Ian Davies, Tom Fredericks, Jeff Gerbracht, Sara Keen, Holger Klinck, Anne Klingensmith, Ray Mack, Peter Marchetto, Ed Moore, Matt Robbins, Ken Rosenberg, and Chris Tessaglia-Hymes.

We acknowledge that the land on which the data was collected is the unceded territory of the Cayuga nation, which is part of the Haudenosaunee (Iroquois) confederacy.
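
As a practical note on the naming formats given under Data Files, the sketch below splits a file name and a clip key into their fields. It rests on several assumptions that this README does not confirm: the helper names (`taxonomy_code_from_filename`, `parse_clip_key`, `iter_clips`) are hypothetical, all fields are kept as raw strings because their units are not stated, the last two underscores in a key are taken to delimit `<clip-offset>` and `<ave-call-freq>`, and "waveforms" is assumed to be an HDF5 group holding one array per clip key. Reading the audio requires the third-party `h5py` package, which is imported only when `iter_clips` is called.

```python
import re
from typing import NamedTuple

# Pattern for `BirdVox-14SD_<taxonomy-code>_original.h5`.
FILENAME_RE = re.compile(r"^BirdVox-14SD_(?P<code>.+)_original\.h5$")


class ClipKey(NamedTuple):
    """Fields of a clip key, kept as strings (units are not documented)."""
    unit_id: str
    recording_timestamp: str
    clip_offset: str
    ave_call_freq: str


def taxonomy_code_from_filename(filename: str) -> str:
    """Extract `<taxonomy-code>` from an HDF5 file name."""
    match = FILENAME_RE.match(filename)
    if match is None:
        raise ValueError(f"unexpected file name: {filename!r}")
    return match.group("code")


def parse_clip_key(key: str) -> ClipKey:
    """Split `unit<unit-id>-<recording-timestamp>_<clip-offset>_<ave-call-freq>`."""
    if not key.startswith("unit"):
        raise ValueError(f"unexpected clip key: {key!r}")
    # Assumption: the last two underscores separate the offset and frequency.
    head, clip_offset, ave_call_freq = key[len("unit"):].rsplit("_", 2)
    # Assumption: the first hyphen separates the unit id from the timestamp.
    unit_id, recording_timestamp = head.split("-", 1)
    return ClipKey(unit_id, recording_timestamp, clip_offset, ave_call_freq)


def iter_clips(path: str):
    """Yield (ClipKey, waveform array) pairs from one HDF5 file.

    Assumes "waveforms" is a group with one array per clip key.
    """
    import h5py  # third-party; imported lazily so the parsers work without it

    with h5py.File(path, "r") as f:
        for key, dataset in f["waveforms"].items():
            yield parse_clip_key(key), dataset[()]
```

Given the stated sampling rate of 22,050 Hz, the duration of a clip in seconds would be the length of its waveform array divided by 22050.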
Files
(1.3 GB)
MD5 checksum | Size |
---|---|
md5:e3188151c903568ee143f9d5a211b61a | 96.4 kB |
md5:d88d6f0748c4da98a49787987c34f0f2 | 10.1 MB |
md5:4e6b8e8012b76c0b2c2fd5de24c99353 | 363.3 kB |
md5:8f46253425fbb2d636460d913b587085 | 12.9 MB |
md5:3d1f3c72737d2b62bdd503d67851eb0f | 1.1 MB |
md5:2dd1e390954764b7cc2c9836efd0565d | 453.3 kB |
md5:a32096ec107ccc0302956782efb7f3ac | 33.6 MB |
md5:38e3c4500fda6a83d31e27221cd60659 | 6.6 MB |
md5:7e6d2b7885377074f835537ccbc7811a | 3.8 MB |
md5:1e7a2951d8020b58f89589b3d33f2a13 | 17.0 MB |
md5:e5d6ec1139595e534c0c3d5384ff5aab | 2.0 MB |
md5:907f3efac89262505716da335d5fd421 | 2.2 MB |
md5:ba8d12a18a8447e812efa65d175caa5d | 5.0 MB |
md5:dc61556d9d0e1eeaf9649d7cee8597b9 | 25.0 MB |
md5:d179085d07fbc0b35ec812f13a56fc88 | 11.9 MB |
md5:05a7f0307535219808e6fd6346f14bc8 | 13.6 MB |
md5:5e79aad141384feb0cc670c4748e5ed5 | 10.2 MB |
md5:5844abfedf59187cd8194ec06bbe6a4c | 24.8 MB |
md5:fd554ffd0e18436e4e7c7479db0185f0 | 34.8 MB |
md5:e55d98069fbe9fee1e62f20bdd22ed6c | 87.5 MB |
md5:f36b9b85b704d262e1b85b2664e25af4 | 2.1 MB |
md5:f0d84ca0e907349f846891d4d6b51ee3 | 21.4 MB |
md5:734d2f7e037bfab2166d001396b259ed | 2.4 MB |
md5:d016db10024605f249c1619aa0045fc3 | 62.6 MB |
md5:1e0fc051f04dd260bebf68bcca127089 | 11.5 MB |
md5:b78417eff913d4c6c4b59a586bf27064 | 154.5 MB |
md5:c302c429c904795b10f20fbcae3dd646 | 86.0 MB |
md5:95f93b30e8eabd11050f9f18910730e6 | 40.6 MB |
md5:1d3e83757e56472e84bb9680156a3698 | 9.0 MB |
md5:9a41ab695302f1fec35b44fbd160c155 | 7.6 MB |
md5:7bd4be6ca376e894da7431b85d443ceb | 11.0 MB |
md5:72f9e0c607165827e970810bc6cd90e5 | 1.3 MB |
md5:02a46f7b67c4baa45c34d969dc18e50a | 18.6 MB |
md5:bbeb83552c1c77ca1b5b95d26e335787 | 1.3 MB |
md5:cff1ee7ab75ec805030aaaa588c68a3c | 36.3 MB |
md5:99924836ca7978b1644808d0ed9b22ea | 98.1 MB |
md5:67138e0a0a49d4ab2c295bbdbe5c94ef | 350.5 MB |
md5:06ca6fea6a9b9342a053a2609dc61706 | 59.2 MB |
md5:6d2b23d9a96312b358cfcf545e4396c0 | 3.2 kB |
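
The MD5 checksums in the file listing above can be used to check that a download arrived intact. The following is a minimal sketch using only the Python standard library; the function names `md5_of_file` and `matches_listing` are hypothetical, and the caller must supply the local path of the downloaded file, since the listing here does not preserve file names.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # Read fixed-size chunks so large HDF5 files need not fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path: str, listed: str) -> bool:
    """Compare a file against a listing entry such as 'md5:<hex digest>'."""
    # Drop the optional 'md5:' prefix and normalize case before comparing.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A mismatch would suggest an incomplete or corrupted download, in which case fetching the file again is the usual remedy.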