Published November 20, 2023 | Version v1
Dataset Open

Western burrowing owl genomics

  • 1. University of California Los Angeles

Description

Migration is driven by a combination of environmental and genetic factors, but many questions remain about those drivers. Potential interactions between genetic and environmental variants associated with different migratory phenotypes are rarely the focus of study. We pair low coverage whole genome resequencing with a de novo genome assembly to examine population structure, inbreeding, and the environmental factors associated with genetic differentiation between migratory and resident breeding phenotypes in a species of conservation concern, the western burrowing owl (Athene cunicularia hypugaea). Our analyses reveal a dichotomy in gene flow depending on whether the population is resident or migratory, with the former being genetically structured and the latter exhibiting no signs of structure. Among resident populations, we observed significantly higher genetic differentiation, significant isolation‐by‐distance, and significantly elevated inbreeding. Among migratory breeding groups, on the other hand, we observed lower genetic differentiation, no isolation‐by‐distance, and substantially lower inbreeding. Using genotype–environment association analysis, we find significant evidence for relationships between migratory phenotypes (i.e., migrant versus resident) and environmental variation associated with cold temperatures during the winter and barren, open habitats. In the regions of the genome most differentiated between migrants and residents, we find significant enrichment for genes associated with the metabolism of fats. This may be linked to the increased pressure on migrants to process and store fats more efficiently in preparation for and during migration. Our results provide a significant contribution toward understanding the evolution of migratory behavior and vital insight into ongoing conservation and management efforts for the western burrowing owl.

Notes

ANGSD

Funding provided by: California Energy Commission
Crossref Funder Registry ID: https://ror.org/05eaakg28
Award Number: EVA-2023-004-OA.R2

Funding provided by: National Geographic Society
Crossref Funder Registry ID: https://ror.org/04bqh5m06
Award Number: WW-202R-17

Funding provided by: National Science Foundation
Crossref Funder Registry ID: https://ror.org/021nxhr62
Award Number: NSF-1942313

Funding provided by: Santa Clara Valley Habitat Agency*
Crossref Funder Registry ID:
Award Number:

Methods

Details regarding sample collection, genome sequencing, and sequence processing may be found in Methods S1; but notably, we sequenced a reference genome to high coverage and 202 burrowing owl samples collected across their migratory and resident breeding range to low coverage. Because our resequencing dataset was low coverage, we used variant detection and analytical methods that largely did not require called genotypes. This included both genotype likelihoods as estimated in the program ANGSD (Korneliussen et al., 2014) and a single‐read‐sampling (SRS) method that randomly selects one read per variant to temper the bias of high variation in locus‐to‐locus depths. Using these methods, files were prepared for analyses as described below using the following four filtering and genotyping frameworks and conditions: (1) Using ANGSD to produce genotype likelihood files for all individuals in the BEAGLE format (‐doGlf 3) and a minor allele frequency file (‐domaf 1) with restrictive filtering that uses a conservative minimum minor allele frequency (‐minmaf 0.05), a low maximum likelihood of being polymorphic (‐SNP_pval 1e‐6), adjusting mapQ scores for excessive mismatches from the reference genome (‐C 50), and confirming variants using a base alignment quality estimation (‐baq 1). (2) For SRS analyses, we used the 'HaplotypeCaller' module in GATK (McKenna et al., 2010) to call genotypes for all individuals sequenced, filtered by removing insert/deletion variants, and kept only biallelic variants found in 50% of the individuals. (3) We used ANGSD to create population‐specific site frequency spectra (SFSs) from site allele frequency files using the reference genome to polarize allele calls (‐anc), adjusting frequencies using individual FIS (‐indF), and with strict filtering conditions including discarding reads without unique mapping (‐uniqueOnly 1), removing bad reads (‐remove_bads 1), using only reads for which mates are mapped (‐only_proper_pairs 1), discarding reads with low mapping quality (‐minMapQ 1), keeping reads with high base quality (‐minQ 20), dropping reads with low or high depth across samples (‐setMinDepth 10 ‐setMaxDepth 500), keeping only biallelic sites (‐skipTriallelic 1), and also previously described conditions (‐minMaf 0.05 ‐C 50 ‐baq 1). (4) Minor allele frequency files (MAFs) were also generated for each sample site (‐doMaf 1), sampling all the sites identified in the overall MAF file, and only generating minor allele frequencies for variants found in a minimum of four individuals in each population.

Files

environmental_data_for_gradient_forest.csv

Files (4.5 GB)

Name Size Download all
md5:e027b01d0bd655fbd6e37ad42c255a35
730.4 MB Download
md5:4922484957c31c0665fa7177bf6c7165
979.5 MB Download
md5:b073e7e1292d2030afee1c99f59870c2
4.8 kB Preview Download
md5:c7b3f76d5744413fad761a5b8732cb61
548.5 MB Download
md5:b0104e9301867cef1fa76a8df8e87dba
899.5 kB Preview Download
md5:7337a9ba8862140ba7448e0c1dce9241
107.2 MB Download
md5:5d97594562a7430cf1323b1937cc78dc
125.0 MB Download
md5:39fa5021b1ec18b931df2dac2eebaddc
69.4 MB Download
md5:355973026ee07b400d027061f76c3b56
39.0 MB Download
md5:cc352ce059503cdae1b145a6f04ca75d
51.0 MB Download
md5:fbb89dc88f02de5e5a5fe2f0969798c8
13.4 MB Download
md5:c0d380adb16da27f2f8f416cd63cf92b
14.8 MB Download
md5:9125fdf42622575cd8335cf627cca48d
84.2 MB Download
md5:af17abb7668a26fd82f465125ce96c3c
4.5 MB Download
md5:b9fb06801f1acd4c27b073ca9abc3123
13.3 MB Download
md5:623107e4e1b7339236d8cc8ab9f78492
17.4 MB Download
md5:fe123c9fd58d4f8cc09d4651ee8fd8ac
43.0 MB Download
md5:74b98ba440397b4f8d1fd20c6d1d755f
6.1 MB Download
md5:b1f1111ab3e419e53a9a7523db4dd2f3
941.1 MB Download
md5:74e09277781ecd329050e335f2a12071
97.3 MB Download
md5:b07d8c64182ef93f882a3abceb939fa7
37.2 MB Download
md5:4d08d06000971640d6a2a8f302a61396
122.0 MB Download
md5:f09a9bd35a765f1ebf7f58e59ff06f10
147.0 MB Download
md5:b40e68157220c7386b78017356aa07da
8.7 kB Preview Download
md5:6d29225b1eddc74faf4e8487add2e910
68.7 MB Download
md5:3dff1d88069b76b58f51fc377767644a
8.8 MB Download
md5:a736a5e04441edf52058921e7279388b
105.9 MB Download
md5:46b0df2587bb3cfc3b650524f4cbdf8e
92.6 MB Download

Additional details

Related works

Is cited by
10.1111/eva.13600 (DOI)