Published February 17, 2021 | Version v1
Journal article Open

Traces of transposable elements in genome dark matter co-opted by flowering gene regulation networks

  • 1. Université Paris-Saclay, INRAE, URGI, 78026, Versailles, France.
  • 2. IJM, Institut Jacques Monod, CNRS UMR 7592, Université Paris-Diderot, Paris, France
  • 3. Université Paris-Saclay, INRAE, BioinfOmics, Plant bioinformatics facility, 78026, Versailles, France.
  • 4. IJM, Institut Jacques Monod, CNRS UMR 7592, Université Paris-Diderot, Paris, France / Université Paris-Sorbonne, Paris, France
  • 5. Université Paris-Saclay, INRAE, URGI, 78026, Versailles, France / Université Paris-Saclay, INRAE, BioinfOmics, Plant bioinformatics facility, 78026, Versailles, France.

Description

Transposable elements (TEs) are mobile, repetitive DNA sequences that make the largest contribution to genome bulk. They thus contribute to the so-called “dark matter of the genome”, the part of the genome in which nothing is immediately recognizable as biologically functional.en

We developed a new method, based on k-mers, to identify degenerate TE sequences. With this new algorithm, we detect up to 10% of the A. thaliana genome as derived from as yet unidentified TEs, bringing the proportion of the genome known to be derived from TEs up to 50%. A significant proportion of these sequences overlapped conserved non-coding sequences identified in crucifers and rosids, and transcription factor binding sites. They are overrepresented in some gene regulation networks, such as the flowering gene network, suggesting a functional role for these sequences that have been conserved for more than 100 million years, since the spread of flowering plants in the Cretaceous.

Files

547877v6.full.pdf

Files (2.7 MB)

Name Size Download all
md5:eb568a9ee6410c6638427c20544cd15c
2.7 MB Preview Download