Dataset Open Access

HumBugDB: a large-scale acoustic mosquito dataset

Ivan Kiskin; Lawrence Wang; Marianne Sinka; Adam D. Cobb; Benjamin Gutteridge; Davide Zilli; Waqas Rafique; Rinita Dam; Theodoros Marinos; Yunpeng Li; Gerard Killeen; Dickson Msaky; Emmanuel Kaindoa; Kathy Willis; Steve J. Roberts

A large-scale multi-species dataset of acoustic recordings

Dataset accompanying code and paper: HumBugDB: a large-scale acoustic mosquito dataset.

A large-scale multi-species dataset containing recordings of mosquitoes collected from multiple locations globally, as well as via different collection methods. In total, we present 71,286 seconds (20 hours) of labelled mosquito data with 53,227 seconds (15 hours) of corresponding background noise, recorded at the sites of 8 experiments.  Of these, 64,843 seconds contain species metadata, consisting of 36 species (or species complexes).

This repository contains:

This data is supplemented by a GitHub repository, https://github.com/HumBug-Mosquito/HumBugDB, which aids as follows:

  • The multi-part zip is intended to be extracted into the folder: /data/audio/ in the repository.
  • Latest metadata is hosted on GitHub to allow the modification of additional metadata as it becomes available in the database or bug-fixing.
  • Documentation for code use, and a complete Datasheet for Datasets also available on GitHub.
  • Example code for data splitting, feature extraction, model training, and evaluation in the top-level notebook main.ipynb.
  • Bayesian Convolutional Neural Network models, in both Keras and PyTorch, trained on this data available at GitHub release v1.0

 

Funding from the 2014 Google Impact Challenge Award, and The Bill and Melinda Gates Foundation (https://www.gatesfoundation.org/about/committed-grants/2019/07/opp1209888)
Files (4.1 GB)
Name Size
humbugdb_neurips_2021_1.zip
md5:888d315776833ab3ed3065a2cac3c15e
1.0 GB Download
humbugdb_neurips_2021_2.zip
md5:93012a495d9cbcc4a549459c8acaf6a4
1.1 GB Download
humbugdb_neurips_2021_3.zip
md5:84931b88c31189e3864094e3b3aa9bea
1.3 GB Download
humbugdb_neurips_2021_4.zip
md5:5312bb0c33b3e2b85126ae6eb205f4b6
592.8 MB Download
neurips_2021_zenodo_0_0_1.csv
md5:b93f185c437fd07eb29185e3539c7fc0
1.5 MB Download
426
197
views
downloads
All versions This version
Views 426426
Downloads 197197
Data volume 170.9 GB170.9 GB
Unique views 380380
Unique downloads 6363

Share

Cite as