Published January 6, 2023 | Version v1
Dataset Open

Data from: Whole genomes reveal evolutionary relationships and mechanisms underlying gene-tree discordance in Neodiprion sawflies

  • 1. University of Kentucky
  • 2. Daniel K. Inouye U.S. Pacific Basin Agricultural Research Center

Description

Rapidly evolving taxa are excellent models for understanding the mechanisms that give rise to biodiversity. However, developing an accurate historical framework for comparative analysis of such lineages remains a challenge due to ubiquitous incomplete lineage sorting and introgression. Here, we use a whole-genome alignment, multiple locus-sampling strategies, and locus-based and SNP-based species-tree methods to infer a species tree for eastern North American Neodiprion species, a clade of pine-feeding sawflies (Order: Hymenoptera; Family: Diprionidae). We recovered a well-supported species tree that, except for three uncertain relationships, is robust to different strategies for analyzing whole-genome data. Despite this consistency, underlying gene-tree discordance is high. To understand this discordance, we use multiple regression to model topological discordance as a function of several genomic features. We find that gene-tree discordance tends to be higher in regions of the genome that may be more prone to gene-tree estimation error, as indicated by a lower density of parsimony-informative sites, a higher density of genes, a higher average pairwise genetic distance, and gene trees with lower average bootstrap support. Also, contrary to the expectation that discordance via incomplete lineage sorting is reduced in low-recombination regions of the genome, we find a negative correlation between recombination rate and topological discordance. We offer potential explanations for this pattern and hypothesize that it may be unique to lineages that have diverged with gene flow. Our analysis also reveals an unexpected discordance hotspot on Chromosome 1, which contains several genes potentially involved in mitochondrial-nuclear interactions and produces a gene tree that resembles a highly discordant mitochondrial tree.
Based on these observations, we hypothesize that our genome-wide scan for topological discordance has identified a nuclear locus involved in a mito-nuclear incompatibility. Together, these results demonstrate how phylogenomic analysis coupled with high-quality, annotated genomes can generate novel hypotheses about the mechanisms that drive divergence and produce variable genealogical histories across genomes.

Notes

All files are in either FASTA or NEXUS format. FASTA is a standard text format for nucleotide sequences; a FASTA genome file is provided for each Neodiprion species. Using freely available scripts (https://github.com/LinnenLab/Herrig_etal_NeodiprionPhylogeny), the genome files can be used to produce window-based and gene-based datasets in NEXUS format. NEXUS is a standard format for character data in phylogenetic analysis, and the resulting files can be used as input for many phylogenetic programs.
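As a rough illustration of the FASTA-to-NEXUS step described above, the following Python sketch parses FASTA text, cuts a fixed-size window, and writes a minimal NEXUS data block. It is not the authors' pipeline (the scripts in the linked repository are the reference for that); the function names `read_fasta`, `window`, and `to_nexus` are hypothetical, and the sketch assumes the sequences being combined are already aligned to a common coordinate system.

```python
from io import StringIO


def read_fasta(handle):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    records = {}
    name = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Keep only the ID, i.e. the text before the first whitespace.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def window(seq, start, size):
    """Return the window of `size` bases beginning at 0-based `start`."""
    return seq[start:start + size]


def to_nexus(alignment):
    """Format equal-length sequences (taxon -> sequence) as a NEXUS DATA block."""
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all sequences must have the same length")
    nchar = lengths.pop()
    pad = max(len(name) for name in alignment) + 2
    lines = [
        "#NEXUS",
        "BEGIN DATA;",
        f"  DIMENSIONS NTAX={len(alignment)} NCHAR={nchar};",
        "  FORMAT DATATYPE=DNA MISSING=? GAP=-;",
        "  MATRIX",
    ]
    for name, seq in alignment.items():
        lines.append(f"    {name.ljust(pad)}{seq}")
    lines += ["  ;", "END;"]
    return "\n".join(lines) + "\n"


# Toy input standing in for one chromosome from two species' genome files.
# The taxon labels and sequences are made up for illustration.
species_a = read_fasta(StringIO(">chr1 demo\nACGTAC\nGTAA\n"))["chr1"]
species_b = read_fasta(StringIO(">chr1 demo\nACGTTC\nGTCA\n"))["chr1"]
nexus_text = to_nexus({
    "species_a": window(species_a, 2, 4),
    "species_b": window(species_b, 2, 4),
})
```

In practice each genome file is several hundred megabytes, so a real workflow would likely stream or index the FASTA files rather than hold whole chromosomes in strings.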

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: DEB-CAREER-1750946

Funding provided by: United States Department of Agriculture
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000199
Award Number: 2016-67014-2475

Files

README.md

Files (5.5 GB)

Checksum  Size
md5:fb96a7502a9c8817a4cca1d9e1ed15a6  52.5 kB
md5:99a5c956df5d59473adc002446f50a74  481.8 kB
md5:1c598bec6beb2f86df846a49ceb5e994  4.3 MB
md5:73267e6218bfdfe17c95e47a9fc8e85f  102.9 kB
md5:8a509971c9a722b23ac90fd41d494ad5  929.8 kB
md5:6dda4be16ab1714227734db965378125  276.7 MB
md5:34db2b566536a0791c624db786db77c3  274.6 MB
md5:b0a351655e3eba3199e30c0c4cb04ac9  276.7 MB
md5:3e103a74beef35da26d39414b759390b  276.7 MB
md5:619c2058bc58772fed84231a096fa79b  276.7 MB
md5:52c7284d84b030712d57a1447edce6b4  276.7 MB
md5:8b6f51cd75148e630a582037dca8ecaf  276.7 MB
md5:b805a1731aec94ff54f328e6f3570665  276.7 MB
md5:554d0536deda44184bf115ade0d84676  276.7 MB
md5:3c315ce71921cc29f23052824d51d107  276.7 MB
md5:2541f82967a287fcdadf4269e1541749  276.7 MB
md5:2d838ed4ac7a816c6a4eaf9b737681a3  276.7 MB
md5:e6ed92c09aef2a7219cb91dabe1e3217  276.7 MB
md5:d01028516345bbd07dfe5ec74abe64cc  276.7 MB
md5:0469855b3ede2c4bebf94656529754aa  276.7 MB
md5:1331dd04b0b2151a1cfa1615197d3841  4.0 kB
md5:370b0c047a5ed080c7ff63464a969f29  276.7 MB
md5:c151ca9012fa82c9daeac5e2f848d369  276.7 MB
md5:b9a70796eb703523a3ece160bcd78097  276.7 MB
md5:f3249f6aed9d94b9d6a7c607d047ef44  276.7 MB
md5:7e6c1ca9fdc7636184bd5f36da02345d  276.7 MB
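
The md5 values in the file listing can be used to confirm that a download is intact. A minimal Python sketch follows, using only the standard library; the helper names `md5_of_file` and `matches_listing` are hypothetical, and because file names are not shown in the listing, the path passed in is whatever name the file was saved under locally.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file's digest with a listed value such as 'md5:fb96a750...'."""
    # Accept the value with or without the 'md5:' prefix.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing("downloaded_file", "md5:6dda4be16ab1714227734db965378125")` would return `True` only if the local file (hypothetical name) matches that entry in the listing.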

Additional details