Published November 26, 2012 | Version v1
Dataset Open

Data from: Patterns and evolution of nucleotide landscapes in seed plants

  • 1. Institut des Sciences de l'Evolution de Montpellier
  • 2. Montpellier SupAgro

Description

Nucleotide landscapes, which is the way base composition is distributed along a genome, strongly vary among species. The underlying causes of these variations have been much debated. Though mutational bias and selection were initially invoked, GC-biased gene conversion (gBGC), a recombination-associated process favoring the G and C over A and T bases, is increasingly recognized as a major factor. As opposed to vertebrates, evolution of GC content is less well known in plants. Most studies have focused on the GC-poor and homogeneous Arabidopsis genome and the much more GC-rich and heterogeneous rice (Oryza sativa) genome and has often been generalized as a dicot/monocot dichotomy. This vision is clearly phylogenetically biased and does not allow understanding the mechanisms involved in GC-content evolution in plants. To tackle these issues, we used EST data from more than 200 species and provided the most comprehensive description of gene GC content across the seed plant phylogeny so far available. As opposed to the classically assumed dicot/monocot dichotomy, we found continuous variations in GC content from the probably ancestral GC-poor and homogeneous genomes to the more derived GC-rich and highly heterogeneous ones, with several independent enrichment episodes. Our results suggest that gBGC could play a significant role in the evolution of GC content in plant genomes.

Notes

Files

README_for_tpc093674SupplementalDS4.txt

Files (40.5 MB)

Name Size Download all
md5:cf3f04cd188bb420a25e9f7fefaa048d
3.5 kB Preview Download
md5:cf3f04cd188bb420a25e9f7fefaa048d
3.5 kB Preview Download
md5:233b11410818f95ef0e3ccc63d70aa23
28.9 MB Preview Download
md5:2b24f7efabe2e87ba5a17490f11aab13
11.6 MB Preview Download

Additional details

Related works

Is cited by
10.1105/tpc.111.093674 (DOI)