Published September 10, 2021 | Version v1
Software Open

Supporting data for: Gene-rich UV sex chromosomes harbor conserved regulators of sexual development (Carey et al., 2021)

  • 1. University of Florida
  • 2. Joint Genome Institute
  • 3. HudsonAlpha Institute for Biotechnology
  • 4. University of Paris-Saclay
  • 5. Duke University
  • 6. Philipp University of Marburg
  • 7. Clemson University
  • 8. RAPiD Genomics*
  • 9. Georgia Institute of Technology
  • 10. Center for International Forestry Research
  • 11. University of Turku
  • 12. Cornell University
  • 13. Chicago Botanic Garden*
  • 14. Texas Tech University

Description

Non-recombining sex chromosomes, like the mammalian Y, often lose genes and accumulate transposable elements, a process termed degeneration. The correlation between suppressed recombination and degeneration is clear in animal XY systems, but the absence of recombination is confounded with other asymmetries between the X and Y. In contrast, UV sex chromosomes, like those found in bryophytes, experience symmetrical population genetic conditions. Here we generate and use nearly gapless female and male chromosome-scale reference genomes of the moss Ceratodon purpureus to test for degeneration in the bryophyte UV sex chromosome system. We show that the moss sex chromosomes evolved over 300 million years ago and expanded via two chromosomal fusions. Although the sex chromosomes show signs of weaker purifying selection than autosomes, we find that suppressed recombination alone is insufficient to drive gene loss on sex-specific chromosomes. Instead, the U and V sex chromosomes harbor thousands of broadly expressed genes, including numerous key regulators of sexual development across land plants.

Notes

  • novaseq_FASTQ_de_interlacer.pl -- splits paired-end Illumina NovaSeq data into forward and reverse files
  • liverwort_trinity_assemblies.tar.gz -- contains all de novo Trinity assemblies for liverworts used in this study
  • moss_trinity_assemblies.tar.gz -- contains all de novo Trinity assemblies for mosses used in this study
  • all_pep_files_for_orthofinder.tar.gz -- all peptide files for all species used in the OrthoFinder run in this study
  • Orthogroups.txt -- all orthogroups identified by OrthoFinder clustering
  • orthogroup_filter.pl -- perl script to filter orthogroups ("clusters") output by OrthoFinder for a minimum number of species
  • all_cds.fa.gz and all_pep.fa.gz -- fasta files containing all cds and peptide sequences, respectively, combined across all species; used to write fasta files for each OrthoFinder gene cluster
  • fasta_from_OrthoFinder.pl -- perl script to write a separate fasta file for each Orthogroup ("cluster") output by OrthoFinder
  • alignment_length_filter.pl -- perl script to filter fasta files by a user input minimum number of nucleotides or amino acids
  • sexlinked_liverwort_alignments.tar.gz -- final, filtered cds alignments used to build gene trees of sex-linked genes in Marchantia polymorpha
  • sexlinked_moss_alignments.tar.gz -- final, filtered cds alignments used to build gene trees of sex-linked genes in Ceratodon purpureus
  • sexlinked_liverwort_trees.tar.gz -- RAxML gene trees with bootstrap support of sex-linked genes in Marchantia polymorpha
  • sexlinked_moss_trees.tar.gz -- RAxML gene trees with bootstrap support of sex-linked genes in Ceratodon purpureus
  • edlwtre2.pl -- perl script that roots gene trees and reduces isoforms of the same sample (within a clade) down to the longest isoform
  • physco_outgroup.py -- python script that uses ETE3 to identify C. purpureus sex-linked genes and the closest Physcomitrium patens outgroup
  • prune_tree.py -- python script that uses ETE3 to identify C. purpureus sex-linked genes and prune at the closest Physcomitrium patens outgroup. The script also randomly selects one isoform/homolog for each other species in the tree
  • array_hash_extractor_fasta_unlock_tree_mod.pl -- perl script that filters the original fasta file for those left after prune_tree.py
  • paml_header_prep.pl -- perl script for prepping the headers in gene trees and fasta files for PAML
  • paml_tree_prep.pl -- perl script for generating differently labeled trees for PAML, used to test whether sex-linked genes evolve differently than autosomes
  • paml_bash.sh -- bash script to run PAML on multiple genes and report the results of dN, dS, and dN/dS for C. purpureus sex-linked genes
  • paml_AIC.pl -- perl script necessary to run PAML in paml_bash.sh
  • array_hash_extractor_fasta_unlock_ks.pl -- perl script that searches for a user identified list of C. purpureus one-to-one orthologous UV genes across multiple alignments. The output is an individual alignment for each of the U and V-linked orthologous genes
  • aln_to_axt.pl -- perl script that converts an alignment of one-to-one UV genes into axt format for KaKs Calculator
  • ceratodon_genome_plots.R -- R script for generating gene tree plots, density plots, Ks on UV chromosome plot, codon metrics and dN/dS plots, and gene expression heatmaps
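
To illustrate the orthogroup filtering step described in the notes above, here is a minimal Python sketch of keeping only orthogroups that contain a minimum number of species. It is not the code in orthogroup_filter.pl, which is written in perl and may work differently. It assumes the usual Orthogroups.txt layout (one orthogroup per line, an ID followed by a colon and space-separated gene IDs) and, purely for illustration, that each gene ID starts with a species label followed by "|". The function names, the separator, and the example gene IDs are all hypothetical.

```python
def parse_orthogroups(lines):
    """Parse lines such as 'OG0000001: geneA geneB' into {orthogroup: [genes]}."""
    groups = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        name, _, members = line.partition(":")
        groups[name.strip()] = members.split()
    return groups


def species_of(gene_id, sep="|"):
    # ASSUMPTION: the species label is the part of the gene ID before `sep`.
    # Real gene IDs in this dataset may encode species differently.
    return gene_id.split(sep, 1)[0]


def filter_by_species(groups, min_species, sep="|"):
    """Keep orthogroups whose genes come from at least `min_species` species."""
    return {
        name: genes
        for name, genes in groups.items()
        if len({species_of(gene, sep) for gene in genes}) >= min_species
    }


# Hypothetical example input, not taken from the deposited Orthogroups.txt.
example = [
    "OG0000001: Cpur|g1 Cpur|g2 Ppat|g7 Mpol|g3",
    "OG0000002: Cpur|g5 Cpur|g6",
]
groups = parse_orthogroups(example)
kept = filter_by_species(groups, min_species=3)
print(sorted(kept))  # prints ['OG0000001']
```

Counting distinct species rather than genes means an orthogroup made up only of paralogs from one species is dropped, however many genes it contains.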

Files

Files (70.0 kB)

Checksum Size
md5:b2ea627510641ee4628f9af72405f12d 894 Bytes
md5:d13d8a233472a27878efd376df8b55f0 832 Bytes
md5:a4b3a6d0cc9a5c8d05c8e1a26b334d74 2.3 kB
md5:8d7f1c6952c34413319c4bb553f360f4 2.6 kB
md5:9ad33a8a6be0d1510a700ad44288d448 42.0 kB
md5:d07a2065f6b25591713445efbf0d82a0 3.7 kB
md5:6f41dbf19350d3fd516718e40b2b2cf2 1.6 kB
md5:bdae04be47c204fb165d9f10e3e1dbd9 1.2 kB
md5:196de011b07178d211a41763d84dbb61 946 Bytes
md5:96fcee9492b15aa1ff40ae4c033c4f29 2.7 kB
md5:87285e0b2c43086bbfb7c2b935e34295 5.1 kB
md5:9b8bb8bfa09878c2f43a636a6038873f 1.3 kB
md5:eba9eec6cf1be87ed1fec312bc134840 1.6 kB
md5:abc5a7aca2861f53bf1231c90e907fd4 1.3 kB
md5:b519b78bc450318664f1688fd58cb666 2.1 kB
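
The md5 values listed under Files can be used to check that a downloaded script arrived intact. The following is a minimal Python sketch using only the standard library. The function names are hypothetical, and because this listing does not pair file names with checksums, the path and checksum to compare must be supplied by the user.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a listed value such as 'md5:0123...' or a bare hex string."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_listed_checksum("some_downloaded_script.pl", "md5:...")` returns True only when the digests agree; both arguments there are placeholders. MD5 is suitable for detecting accidental corruption during download, not for security purposes.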

Additional details

Related works

Is source of
10.5061/dryad.v41ns1rsm (DOI)