Published April 1, 2026 | Version v1
Software Open

Data and code from: Disentangling the evolutionary cause-effect relationships of environment, sexual selection and body size with birdsong frequency

  • 1. Universidad de Antioquia
  • 2. Instituto de Ecología

Description

This project integrates a large comparative dataset and phylogenetic information to study the evolution of birdsong across 472 Neotropical passerine species. The dataset includes acoustic, morphological, and ecological variables (birdsong_data.csv), additional model coefficients (coefficients-1.csv), and 100 phylogenetic trees (birdstrees_100_McTavish.nex). An accompanying R script (Code_PPA_birdsong-evolution.r) performs all analyses, model selection, and figure generation.

Using these data, the study employs Phylogenetic Path Analysis to test causal relationships among habitat structure, sexual dimorphism, morphology, and song frequency parameters. Across all phylogenies, a single causal structure was consistently supported. The analyses show that greater tree cover increases minimum, peak, and maximum song frequencies, while bandwidth remains unaffected. Sexual dimorphism decreases bandwidth and influences frequency values, whereas morphological traits impose biomechanical constraints on song frequencies and shape bandwidth differently. Habitat structure and sexual dimorphism also affect morphological traits, producing additional indirect pathways that influence birdsong. Furthermore, tree cover itself impacts sexual dimorphism, embedding it within a broader causal network.
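
The causal network described above can be sketched as a small directed graph to make the direct and indirect pathways explicit. This is only an illustrative simplification, not the analysis in Code_PPA_birdsong-evolution.r: the node names are hypothetical, the nine morphological traits are collapsed into a single "morphology" node, and minimum, peak, and maximum frequency are collapsed into a single "frequency" node.

```python
# Edges follow the relationships stated in the description (simplified nodes).
causal_graph = {
    "tree_cover": ["frequency", "dimorphism", "morphology"],
    "dimorphism": ["frequency", "bandwidth", "morphology"],
    "morphology": ["frequency", "bandwidth"],
}


def directed_paths(graph, source, target, path=None):
    """Return every directed path from source to target in an acyclic graph."""
    path = (path or []) + [source]
    if source == target:
        return [path]
    paths = []
    for nxt in graph.get(source, []):
        paths.extend(directed_paths(graph, nxt, target, path))
    return paths


frequency_paths = directed_paths(causal_graph, "tree_cover", "frequency")
bandwidth_paths = directed_paths(causal_graph, "tree_cover", "bandwidth")

for p in frequency_paths + bandwidth_paths:
    print(" -> ".join(p))
```

Under these assumptions, tree cover reaches song frequency both directly and through dimorphism and morphology, whereas it reaches bandwidth only through indirect pathways, which is consistent with the statement that bandwidth is not directly affected by tree cover.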

Together, the dataset and analyses reveal that the evolution of birdsong emerges from interacting environmental, sexual, and morphological forces. The results support key hypotheses—including acoustic adaptation, sexual selection, and morphological constraints—and demonstrate that trait evolution is best understood through multicausal and phylogenetically informed models, rather than simple linear associations.

Notes

All data were analysed in R.

Funding provided by: N/A
Crossref Funder Registry ID: 0
Award Number:

Methods

Datasets

We collected data for 472 bird species belonging to the order Passeriformes (228 Oscines, 244 Suboscines) distributed in Colombia. For each species we obtained song recordings, morphological data, and a proxy for the environment in which it lives: tree cover percentage. In addition, we used imputed sexual size dimorphism from Bulla et al. (2020) as a proxy for sexual selection. Details of our dataset are presented below.

Acoustic data

Song recordings were obtained from the Macaulay Library and Xeno-canto. Recordings from Macaulay were in WAV format (sampling rate: 44 kHz, 16 bit). Recordings from Xeno-canto were in MP3 format and were converted with Ocenaudio V. 3.6.3 to match the characteristics of Macaulay's recordings. All acoustic data were analyzed using Avisoft (Avisoft SAS-LAB Pro V. 5.2, Berlin, Germany). First, sonograms of all recordings were visually inspected to determine their quality (signal-to-noise ratio). Sonogram parameters were: Hamming window, FFT length 512, frame size 75%, overlap 50%. Only recordings of sufficient quality were retained for analysis. At least three different recordings per species, and a minimum of five strophes per recording, were analyzed. Strophes were selected in Avisoft using an automatic selection method with a -30 dB threshold relative to the peak amplitude. This threshold excluded background noise while capturing variation in the frequency characteristics of the song and avoiding the bias of manual selection.
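
To illustrate what a -30 dB threshold relative to the peak amplitude means, the following minimal sketch converts the threshold to a linear amplitude ratio and keeps the stretches of an amplitude envelope that stay at or above it. It is not Avisoft's selection algorithm, whose internals are not described here; the function name and the toy envelope values are hypothetical.

```python
def select_segments(envelope, threshold_db=-30.0):
    """Return (start, end) index pairs where the envelope is at or above
    the threshold, expressed in dB relative to the envelope's peak."""
    peak = max(envelope)
    # Amplitude ratio for a dB value: 10 ** (dB / 20); -30 dB is about 0.0316.
    cutoff = peak * 10 ** (threshold_db / 20.0)
    segments = []
    start = None
    for i, value in enumerate(envelope):
        if value >= cutoff and start is None:
            start = i                      # segment opens
        elif value < cutoff and start is not None:
            segments.append((start, i))    # segment closes (end is exclusive)
            start = None
    if start is not None:
        segments.append((start, len(envelope)))
    return segments


# Toy envelope (made-up values): two bursts separated by low-level noise.
toy_envelope = [0.0, 0.01, 0.5, 1.0, 0.4, 0.02, 0.0, 0.2, 0.3, 0.01]
segments = select_segments(toy_envelope)
print(segments)
```

With a peak of 1.0 the cutoff is roughly 0.032, so samples such as 0.01 and 0.02 are treated as background and the two louder bursts are kept as separate segments.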

Morphological data

A total of nine morphological measurements from males (culmen, bill depth, bill width, gape, wing length, tail length, tarsus length, hallux, body mass) were used to evaluate morphological variation. Measurements were obtained from a published dataset for Colombian species (Montoya et al. 2018), by measuring museum specimens following a standardized protocol (Lopez-Ordoñez et al. 2016), or from a database collected by the Ecology and Evolution of Vertebrates Research Group. Since both the published dataset and the database included several individuals per species, average values were calculated for each species. Museum specimens were measured at the Museo Universitario de la Universidad de Antioquia and the Museo de Ciencias Naturales de La Salle in Medellín. OG collected all measurements; several individuals per species were measured and averaged.
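
The per-species averaging described above can be sketched as follows. This is a minimal illustration, assuming one record per measured individual; the field names (species, wing_length, body_mass), the species labels, and the numbers are hypothetical and do not come from birdsong_data.csv.

```python
from collections import defaultdict
from statistics import mean


def species_means(records, traits):
    """Average each trait over all individuals of the same species."""
    grouped = defaultdict(list)
    for rec in records:
        grouped[rec["species"]].append(rec)
    return {
        species: {trait: mean(row[trait] for row in rows) for trait in traits}
        for species, rows in grouped.items()
    }


# Made-up individuals: two of one species, one of another.
records = [
    {"species": "species_a", "wing_length": 60.0, "body_mass": 12.0},
    {"species": "species_a", "wing_length": 62.0, "body_mass": 14.0},
    {"species": "species_b", "wing_length": 80.0, "body_mass": 30.0},
]
means = species_means(records, ["wing_length", "body_mass"])
print(means)
```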

Environment

We used the species distribution polygons from BirdLife (BirdLife International & World 2023) and the tree canopy cover raster data from Hansen et al. (2013). This raster layer is defined as canopy closure for all vegetation taller than five meters, expressed as a percentage within each 30-meter grid cell. To improve computational efficiency, we processed the tree cover data for the Americas using Google Earth Engine and resampled it from its original 30-meter resolution to a 1-kilometer resolution, assigning each resampled cell the mean of the original cells it covers. Finally, we applied the zonal statistics tool in ArcGIS Pro (ESRI 2024) to estimate the mean tree cover percentage within the distribution range of each species.
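
The two raster steps described above (mean resampling to a coarser grid, then a zonal mean inside a species range) can be sketched on a toy grid. This is an illustration of the idea only, not the Google Earth Engine or ArcGIS Pro workflow: the function names are hypothetical, the grid values are made up, and an integer aggregation factor of 2 stands in for the real change from 30 m to 1 km cells.

```python
def block_mean(grid, factor):
    """Aggregate a 2D grid by averaging non-overlapping factor x factor blocks."""
    rows, cols = len(grid), len(grid[0])
    out = []
    for r in range(0, rows, factor):
        out_row = []
        for c in range(0, cols, factor):
            cells = [
                grid[i][j]
                for i in range(r, min(r + factor, rows))
                for j in range(c, min(c + factor, cols))
            ]
            out_row.append(sum(cells) / len(cells))
        out.append(out_row)
    return out


def zonal_mean(grid, mask):
    """Mean of the grid cells where the mask (the species range) is True."""
    values = [
        value
        for grid_row, mask_row in zip(grid, mask)
        for value, inside in zip(grid_row, mask_row)
        if inside
    ]
    return sum(values) / len(values)


# Made-up tree cover percentages on a 4 x 4 fine grid.
fine = [
    [100, 100, 0, 0],
    [100, 100, 0, 0],
    [50, 50, 20, 20],
    [50, 50, 20, 20],
]
coarse = block_mean(fine, 2)

# Hypothetical species range covering the left column of the coarse grid.
range_mask = [[True, False], [True, False]]
species_tree_cover = zonal_mean(coarse, range_mask)
print(coarse, species_tree_cover)
```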

Phylogenetic tree

We selected 472 passeriform species belonging to different families (228 Oscines, 244 Suboscines). To build a reliable phylogeny, we used a recently published complete bird phylogeny (McTavish et al., 2025), which provides a standardized phylogeny with a robust, validated backbone. This is suitable in the absence of a complete phylogenetic analysis of all the species in our study. Using a random imputation procedure for the 10 species that were not included in the McTavish phylogeny, we generated a total of 100 different trees with the rtrees package in R (Li, 2023). All trees were used in the analysis to account for phylogenetic uncertainty.
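
The reason random imputation yields a set of alternative trees can be sketched as follows. This is a conceptual illustration only and does not reproduce the grafting rules of the rtrees package: it assumes, hypothetically, that each missing species is attached next to a randomly chosen congener already in the tree, and all taxon names are placeholders.

```python
import random


def impute_placements(missing_species, tips_by_genus, n_trees=100, seed=1):
    """For each replicate tree, draw a random congener as the attachment
    point of every missing species (assumed placement rule)."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_trees):
        placement = {}
        for species in missing_species:
            genus = species.split("_")[0]
            placement[species] = rng.choice(tips_by_genus[genus])
        replicates.append(placement)
    return replicates


# Placeholder taxa: tips present in the backbone tree, grouped by genus.
tips_by_genus = {
    "GenusA": ["GenusA_sp1", "GenusA_sp2", "GenusA_sp3"],
    "GenusB": ["GenusB_sp1", "GenusB_sp2"],
}
missing = ["GenusA_sp4", "GenusB_sp3"]

replicates = impute_placements(missing, tips_by_genus)
print(len(replicates), replicates[0])
```

Because each replicate draws its placements independently, running the downstream analysis on every replicate shows whether the conclusions depend on where the missing species are attached.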

Files

Files (8.7 kB)

md5:cc8c2d462a47eef23236d55d643d020b (8.7 kB)

Additional details

Related works

Is source of
10.5061/dryad.g1jwstqtc (DOI)