Published February 21, 2024 | Version v1
Software Open

Convergent genomic signatures associated with vertebrate viviparity

  • 1. Monash University

Description

Viviparity—live birth—is a complex and innovative mode of reproduction that has evolved repeatedly across the vertebrate Tree of Life. The genetic basis of viviparity has garnered increasing interest over recent years, however such studies are often undertaken on small evolutionary timelines, and thus are not able to address changes occurring on a broader scale. Using whole genome data, we investigated the molecular basis of this innovation across the diversity of vertebrates to answer a long held question in evolutionary biology: is the evolution of convergent traits driven by convergent genomic changes? This dataset includes the scripts and files used to investigate the genomic basis of viviparity in vertebrates. Specifically, we use genome alignments to investigate changes to protein families, protein-coding regions, introns and untranslated regions (UTRs). We assess changes in the sizes of protein families, as well as analyse differences in substitution rates in coding and noncoding sequences.

Notes

Funding provided by: Australian Government
Crossref Funder Registry ID: https://ror.org/0314h5y94
Award Number:

Funding provided by: Australian Research Council
Crossref Funder Registry ID: https://ror.org/05mmh0f86
Award Number: DP220100245

Funding provided by: Australian Research Council
Crossref Funder Registry ID: https://ror.org/05mmh0f86
Award Number: FT190100014

Funding provided by: Australian Research Council
Crossref Funder Registry ID: https://ror.org/05mmh0f86
Award Number: DP180104195

Funding provided by: Australian Research Council
Crossref Funder Registry ID: https://ror.org/05mmh0f86
Award Number: DE180101558

Methods

We used newly sequenced and publicly available whole genome data to generate multi-genome alignments that allowed us to make phylogenetic, genomic, and proteomic comparisons between viviparous and oviparous species. Specifically, we extracted coding sequences, UTRs and introns from the multi-genome alignments, before re-aligning them to generate both species and time trees. We additionally extracted fourfold degenerate (4d) sites from the genome alignments to generate a neutral phylogenetic model, which was used to analyse differences in substitution rates. Finally, we aligned genomes to sequences of protein families to obtain data corresponding to protein family sizes.

Files

01_genome_alignment.zip

Files (70.4 kB)

Name Size Download all
md5:31bc9f82cfd2dbb05ae930bddbbd0f15
1.9 kB Preview Download
md5:a3a6755235291d022e3546abf411af12
8.4 kB Preview Download
md5:f2e8d7bc8bf3d147e6599da45725ae8b
2.4 kB Preview Download
md5:20f504219db8c0997cbf781784cbb3fe
3.4 kB Preview Download
md5:8fbc43be72ae517c9d301f1104576f3d
2.5 kB Preview Download
md5:88080c871b4f77c5299727bca5568a90
22.9 kB Preview Download
md5:9a1d5e4bbed0254f589b81ffc015e027
9.2 kB Preview Download
md5:f55d97316c7a450baf5dad4bbd489be8
19.7 kB Preview Download

Additional details

Related works

Is source of
10.5061/dryad.rn8pk0pjx (DOI)