Published December 12, 2022 | Version v1
Dataset Open

The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics

  • 1. The Ohio State University

Description

Vocalizations in animals, particularly birds, are critically important behaviors that influence their reproductive fitness. While bioacoustic recordings have been captured and stored in collections for decades, the automated extraction of data from these recordings has only recently been facilitated by artificial intelligence methods. These methods have yet to be evaluated with respect to the accuracy of different automation strategies and features. Here, we use a recently published machine learning framework to extract syllables from ten bird species that diverged from one another between 1 and 85 million years ago, and we compare how phylogenetic relatedness influences accuracy. We also evaluate the utility of applying trained models to novel species. Our results indicate that model performance is best on conspecifics, with accuracy progressively decreasing as phylogenetic distance between taxa increases. However, we also find that models trained on multiple distantly related species can raise overall accuracy to levels near those achieved when a model is trained and applied on the same species. When planning big-data bioacoustics studies, care must be taken in sample design to maximize sample size and minimize human labor without sacrificing accuracy.

Notes

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DEB 2016189

Files

9Spp_TrainValTest.txt

Files (1.2 GB)

Checksum  Size
md5:74a1a433f7fa2927342dd565e5248862  264.5 kB
md5:597420e0be4d542756d06775b9d60e81  204.0 kB
md5:211c00840c2a6156be50801334d6ccca  7.9 MB
md5:f9885aa22a242bd23e29a289f7c4c38b  67.6 MB
md5:0be6153dd806ec132731534271763d0c  41.5 kB
md5:e3d4ff9b5d6b7fe4267b528524f21dda  17.1 kB
md5:e2aee40b7b6df2394d5a088bb739e26a  551.4 kB
md5:0f13291559b4508780b89c1c19d538d1  444.5 kB
md5:1370a45018209258f06e485bacf4b82e  662 Bytes
md5:0d1197769cc23b4daa673914e174c9fe  13.8 kB
md5:a54bbcfb4b606664e9248e9735c6700e  15.6 kB
md5:e9aa7dcfb855d9333382cef2a567b1a6  16.3 kB
md5:39d63fca4745a2739f15f17c229119f4  1.1 GB
md5:3f529fe8269caa29da2da8c2fabdfe02  36.8 kB
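
The MD5 checksums listed above can be used to confirm that a downloaded file arrived intact. The following is a minimal Python sketch, not part of the dataset itself; the function names `md5_of_file` and `matches_listing` are hypothetical helpers introduced for illustration, and the file is read in chunks so that the 1.1 GB entry does not need to fit in memory.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a local file against a listing entry such as 'md5:<hex digest>'.

    The 'md5:' prefix is optional; comparison is case-insensitive.
    """
    expected = listed.strip().split(":", 1)[-1].lower()
    return md5_of_file(path) == expected
```

Calling `matches_listing(local_path, listed_checksum)` with a downloaded file and the entry listed for it returns `True` when the digests agree. The listing as reproduced here does not show which file name belongs to which checksum, so that pairing has to be taken from the record page.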

Additional details