Published December 8, 2022 | Version v2
Journal article Open

Environmental and genealogical effects on DNA methylation in a widespread apomictic dandelion lineage

  • 1. Institute of Plant Sciences Paris-Saclay (IPS2), Université Paris-Saclay, CNRS, INRAE, Université Evry, Université Paris Diderot, 91190 Gif sur Yvette, France.
  • 2. Department of Terrestrial Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Droevendaalsesteeg 10, 6708 PB Wageningen

Description

The following repository contains raw sequenced reads, R scripts, reports and processed DNA methylation files presented in the article "Environmental and genealogical effects on DNA methylation in a widespread apomictic dandelion lineage". 2023. Journal of Evolutionary Biology.


The demultiplexed sequencing raw data was deposited at ENA: PRJEB56325. To reproduce the results presented in this article, you can visit epiTree and epiGBS pipeline.

Genetic marker

  • SSR_Data.csv: genetic data set used to compare epigenetic variability.

epiGBS2 pipeline files:

  • run0202_epiGBS-superpool_S34_L008_R{1,2}_001.nophix.fastq.gz: raw sequenced reads for 80 samples multiplexed, half of them gDNA were fragmented with a combination of Csp6I-NsiI and the other half with AseI-NsiI restriction enzymes.
  • {AseI-NsiI,Csp6I-NsiI}_barcode.tsv: A barcode file used for demultiplexing and that contains the following information: Flowcell, Lane, Barcode_R1, Barcode_R2, Sample, ENZ_R1, ENZ_R2, Wobble_R1, Wobble_R2.

epiGBS2 pipeline output files:

  • {AseI-NsiI,Csp6I-NsiI}_config.yaml: configuration files used to run epiGBS pipeline.
  • {AseI-NsiI,Csp6I-NsiI}_consensus_cluster.renamed.fa: de novo epiGBS loci representing the consensus sequences obtained during the creation of the de novo reference sequence.
  • {AseI-NsiI,Csp6I-NsiI}_report.html: a report file that summarize all stats from the epiGBS analysis.
  • {AseI-NsiI,Csp6I-NsiI}_methylation.bed: main output file containing the methylation calling as a genome-wide methylation report for all sequenced cytosines.

Processed data:

  • {AseI-NsiI,Csp6I-NsiI}_methylation.filtered: The filtered DNA methylation data. This data was obtained after filtering the raw DNA methylation data. Cytosines which had a 10X coverage or higher and which were present in 80% of all samples were kept for further analysis.
  • {AseI-NsiI,Csp6I-NsiI}_mergedAnnot.csv: Files with gene, repeats and transposable elements annotation and non-classified genomic regions for each epiGBS fragment.

R scripts:

  • commonFunctions.R: To improve the readability of scripts, this file contains all hiden function used in the following scripts.
  • 00_mergeAnnotation.R: This script is used to merge gene, repeats and transposable elements annotation and detect non-classified genomic regions.
  • 01_filterMethylation.R: This script contain the code to filter methylation.bed files and to keep cytosines with 10X as minimum, that are presented in 80% of all samples and discarding the top 1% with the highest sequence coverage.
  • 02_characterizeOverallMethylation.R: This script contain the code to characterized overall methylation
  • 03_distances.R: This script contain the code to calculate and visualize the pairwise epigenetic distances, as the average methylation difference across all cytosines between two samples.
  • 04_differentialCytosineMethylationWithDSS.R: This script contain the code to calculate differences in DNA methylation between accessions or light treatments at each individual cytosine using the ‘DSS’ package. Resulting p-values were adjusted for multiple testing using False Discovery Rate control using a threshold of 0.05.
  • 05_manhattanPlot.R: This script contain the code to generate the manhattan plots of epiGBS loci using ‘qqman’
  • 06_dendrogram.R: This script contain the code to generate UPGMA dendrograms based on methylation levels using euclidean distance or on eight microsatellite loci using Nei distance.
  • 07_DMC_description.R: This script contain the code to characterized DMC over different genomic regions.
  • 08_mantelTest.R: This script contain the code to correlate CG methylation variation with SSR variation.

Files

AseI-NsiI_mergedAnnot.csv

Files (23.0 GB)

Name Size Download all
md5:d343e6e397ac61d9c22cc3cac4826047
1.5 kB Download
md5:63a539324db4039553bdf4e488a860f1
5.9 kB Download
md5:18122b7b20d251fbf1bf9adb8fd27932
5.7 kB Download
md5:0ce9e8545a52f2b3649dcecb2abc941f
2.4 kB Download
md5:4f507b410a8578091cc413c3fc525179
5.0 kB Download
md5:046f8afe081fa34fd7c7be93632ed3b3
2.1 kB Download
md5:1211fae5b2d21441fecb61bb974f3b32
4.4 kB Download
md5:3940d52a652ab02f52bea09c710b23a8
4.2 kB Download
md5:34b576ce9e4f798f285e8539449302dd
6.0 kB Download
md5:6cf7246530e0e99b91a3a087a1958da4
2.5 kB Download
md5:172be7365e82ad5e5233fd4457bf5c68
2.2 kB Download
md5:6032219ffd5ac71783e3218260adb3a0
26.1 MB Download
md5:8805c21374c0049a8bb5bc263b01135d
5.6 MB Preview Download
md5:bb1daacf44dfaef02995a3807149de52
2.6 GB Download
md5:a540b5fb2a369cbf1e25e97df60a954a
6.4 MB Download
md5:54cce6c3d33b3a6443bcbcd6288e7d23
1.1 MB Download
md5:e75acc6397b81208f06777a91727de1e
18.8 kB Download
md5:13f4c32f355d43e2bdbf4a7ba415d52b
4.9 GB Download
md5:866a5464dabdc6aba2e9f299af59db3d
2.5 kB Download
md5:5669703e314494ed62c1247457e12699
2.2 kB Download
md5:10e5e1b23fd09463a7f970738a3a6939
44.6 MB Download
md5:3afe7f9beff39e7a131bfb6e8c753d45
10.2 MB Preview Download
md5:609671686a66db3008308042d33c5f85
25.0 MB Download
md5:0351f352ccb226edf74dae162811350b
1.0 MB Download
md5:f1582f9810659226085a997e54eb2c99
7.8 GB Download
md5:e0dfefb2ce693caab06db2678c40eda2
7.6 GB Download
md5:8b1cbf6bc18f6b8bed5b1f76a3f8eef2
1.3 kB Preview Download