Published April 29, 2019 | Version v1
Dataset Open

Data from: Whole genome shotgun phylogenomics resolves the pattern and timing of swallowtail butterfly evolution

Description

Evolutionary relationships have remained unresolved in many well-studied groups, even though advances in next-generation sequencing and analysis, using approaches such as transcriptomics, anchored hybrid enrichment, or ultraconserved elements, have brought systematics to the brink of whole genome phylogenomics. Recently, it has become possible to sequence the entire genomes of numerous non-model organisms in parallel at reasonable cost, particularly with shotgun sequencing. Here we identify orthologous coding sequences from whole-genome shotgun sequences, which we then use to investigate the relevance and power of phylogenomic relationship inference and time-calibrated tree estimation. We study an iconic group of butterflies - swallowtails of the family Papilionidae - that has remained phylogenetically unresolved, with continued debate about the timing of their diversification. Low-coverage whole genomes were obtained using Illumina shotgun sequencing for all genera. Genome assembly coupled with BLAST-based orthology searches allowed extraction of 6,621 orthologous protein-coding genes for 45 Papilionidae species and 16 outgroup species (with 32% missing data after cleaning phases). Supermatrix phylogenomic analyses were performed with both maximum-likelihood (IQ-TREE) and Bayesian mixture models (PhyloBayes) for amino acid sequences, which produced a fully resolved phylogeny providing new insights into controversial relationships. Species tree reconstruction from gene trees was performed with ASTRAL and SuperTriplets and recovered the same phylogeny. We estimated gene site concordance factors to complement traditional node-support measures, which adds support for the inferred phylogenies.
Bayesian estimates of divergence times based on a reduced dataset (760 orthologs and 12% missing data) indicate a mid-Cretaceous origin of Papilionoidea around 99.2 million years ago (Ma) (95% credibility interval: 68.6-142.7 Ma) and Papilionidae around 71.4 Ma (49.8-103.6 Ma), with subsequent diversification of modern lineages well after the Cretaceous-Paleogene event. These results show that shotgun sequencing of whole genomes, even when highly fragmented, represents a powerful approach to phylogenomics and molecular dating in a group that has previously been refractory to resolution.

Files

Appendix S10 - Chronogram files.zip

Files (504.8 MB)

Checksum  Size
md5:5780717e0f6cbecdd13f253586eaabcc  13.7 kB
md5:b218f3861a1eba6ba73534c1e4031bf0  10.1 kB
md5:f6449888634c008065375bf3f508d270  123.3 kB
md5:d5839d9acdbc693a4aafc437586aa085  59.6 kB
md5:7ff5ba3b35a33ab2ce7a438fc5827c20  70.3 kB
md5:c843dfd615e58bf4413a1ae3bba2770d  17.6 MB
md5:e2f34135e755365724c864e0a17e765d  101.0 MB
md5:f8a6c1cf567210742ff2d681c16db635  55.3 MB
md5:73d7b0e81816042df5cdbd388149a0de  330.0 MB
md5:69c4ce3984a9c44babb2d983b5f445b6  205.5 kB
md5:26c88b67b32d82166cb94e9b256b7794  13.7 kB
md5:5752ce3e2cfaed0fd043f612bec45d93  279.1 kB
md5:0c313ea8b0fe587a13f871c5e15ddbdf  151.8 kB
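
The MD5 checksums in the file listing can be used to confirm that a downloaded file arrived intact. The following is a minimal Python sketch, not part of the dataset itself: the function names `md5_of_file` and `matches_listing` are illustrative, and the caller is assumed to know which listed checksum belongs to which downloaded file.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that large archives do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file's digest with a listed value.

    `listed` may be written as 'md5:<hex>' (as in the listing) or as
    the bare hex string; case and surrounding whitespace are ignored.
    """
    expected = listed.strip().lower().split(":", 1)[-1]
    return md5_of_file(path) == expected
```

For example, `matches_listing("downloaded_file.zip", "md5:5780717e0f6cbecdd13f253586eaabcc")` would return `True` only if the local file (a hypothetical name here) has exactly that digest. A mismatch usually points to an incomplete or corrupted download rather than a problem with the data.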