Published April 24, 2024 | Version v1
Dataset Open

SedDNA results along the sediment core of the Dzoumogné reservoir, Mayotte Island, Comoros Archipelago, France

Description

This repository is made up of two main folders containing information about the metadata and the database variables:

  • Metadata_SedDNA_dzoumogne_reservoir_afoucher.xlsx (description of the metadata associated with the sediment samples)
  • Variable_description_dzoumogne_reservoir_afoucher.xlsx (description of the database variables)

as well as 6 folders containing the data sets for each genetic marker:

  • Balitzki_12S-afoucher.xlsx
  • Hardy_18S-afoucher.xlsx
  • Kelly_16S-afoucher.xlsx
  • Pornon_ITS-afoucher.xlsx
  • Prosser_rtbl-afoucher.xlsx
  • White_ITS-afoucher.xlsx

Related publication: Foucher et al. submitted

---------------------------------------------------------

Context: Sediment DNA was extracted from 12 layers along a sediment core (IGSN number: 10.58052/IEFOU0003) collected in 2021 in the Dzoumogné reservoir (Mayotte Island, Comoros Archipelago, France), covering the 2012-2021 period. The aim of these analyses was to evaluate the consequences of landscape defragmentation, induced by deforestation and intensification of agricultural practices, on changes in biological communities.

Sample collection, storage and subsampling: Sediment core was collected using an Uwitec gravity corer equipped with a 63mm PVC liner. The core was stored at 4°C in a cold and dark room for 3 weeks prior to subsampling. The sediment core was opened and subsampled in a previously sterilized hoot (DNA cleaner and UV) in a dedicated biological room at the University of Paris-Saclay. Each sample was extracted from the sediment core using sterilized metal plates inserted on either side of the layer to be sampled (the central part of this sample was then sub-sampled to remove the part of the sediment in contact with the PVC tube of the core and the cutting tools). During this process the operators were equipped with gloves, masks and headgear to avoid any risk of contamination. A blank was placed in the fume cupboard to control contamination during core opening and sampling.

Extraction and assignation: SedDNA was retrieved using a standardized protocol that combines the use of commercial kits (Nucleospin soil – Macherey-Nagel) and the extracellular DNA pre-treatment described in Taberlet et al. (1). In order to limit amplification biases and to improve the metabarcoding sensitivity, a combination of 7 genetic markers was used to PCR-amplify each targeted taxonomic group (plants - 50-70 bp – trnL (2),  plants - 150 bp – rbcL (3), plants/fungi – 320 bp –ITS (4), plants – 380 bp – ITS , metazoa – 200 bp – 16S (5), Eukaryotes – 200 bp – 18S (6), vertebrates – 200 bp – 12S (7)). PCRs were realized in octuplicates with positive and negative controls. Extraction controls were also amplified with the dedicated primers. PCR products were cleaned using the MinElute purification kit (QIAGEN) and all purified amplicons were pooled in equimolar concentrations at the exception of the extraction and the PCR negative controls as they were not concentrated enough. Then, each sample pool was run on an Agilent Bioanalyzer using the DNA High Sensitivity LabChip kit (Agilent Technologies, Santa Clara, CA, USA) to verify amplicon length and each sample was finally quantified with a Qubit fluorometer (Invitrogen, Carlsbad, CA, USA). 
Libraries were generated with a PCR-free protocol in order to tremendously reduce the number of chimeras induced by the metabarcoding approach. One microgram of each purified PCR products pool was submitted to molecular end-repair and then ligated to Illumina adapters prior to sequencing on Illumina platforms (Next-Generation Sequencing; Fasteris, Switzerland) using a NovaSeq instrument (Illumina, San Diego, CA, USA) with a paired-end read mode covering the amplicon lengths. The objective was to obtain approximately 1 million reads per marker and per sample. 
Bioinformatics analyses were performed using the OBITools (8) and QIIME2 (9) pipelines. Reads were first demultiplexed using OBITOOLS. Then erroneous sequences were removed, and singletons were filtered out. Sequences were then clustered at 97% identity using VSEARCH (10). Sequences that were found in the extraction blanks were removed from the downstream analysis. Taxonomic assignment was realized in two steps. First, sequences were assigned by blasting them against a custom database generated from the GenBank latest release curated and restricted to all the documented taxa. Sequences with over 95% identity match were kept and their last common ancestor was computed using MEGAN (11). A second assignment pass was made using a Naïve Bayesian Classifier using QIIME’s rescript plugin (12).

Data validation: Discuss in the related paper.

 ----------------------------------------------------------

References:


1.     P. Taberlet, S. M. Prud’Homme, E. Campione, J. Roy, C. Miquel, W. Shehzad, L. Gielly, D. Rioux, P. Choler, J. C. Clément, C. Melodelima, F. Pompanon, E. Coissac, Soil sampling and isolation of extracellular DNA from large amount of starting material suitable for metabarcoding studies. Mol. Ecol., doi: 10.1111/j.1365-294X.2011.05317.x (2012).
2.     P. Taberlet, E. Coissac, M. Hajibabaei, L. H. Rieseberg, Environmental DNA. Mol. Ecol. 21, 1789–1793 (2012).
3.     S. W. J. Prosser, P. D. N. Hebert, Rapid identification of the botanical and entomological sources of honey using DNA metabarcoding. Food Chem. 214, 183–191 (2017).
4.     T. J. White, T. J. White, T. Bruns, S. Lee, J. Taylor, Amplification and Direct Sequencing of Fungal Ribosomal RNA Genes for Phylogenetics (Academic press, New York, USA, 1990).
5.     R. P. Kelly, Making environmental DNA count. Mol. Ecol. Resour. 16, 10–12 (2016).
6.     C. M. HARDY, E. S. KRULL, D. M. HARTLEY, R. L. OLIVER, Carbon source accounting for fish using combined DNA and stable isotope analyses in a regulated lowland river weir pool. Mol. Ecol. 19, 197–212 (2010).
7.     B. Balitzki-Korte, K. Anslinger, C. Bartsch, B. Rolf, Species identification by means of pyrosequencing the mitochondrial 12S rRNA gene. Int. J. Legal Med. 119, 291–294 (2005).
8.     F. Boyer, C. Mercier, A. Bonin, Y. Le Bras, P. Taberlet, E. Coissac, obitools : a unixinspired software package for DNA metabarcoding. Mol. Ecol. Resour. 16, 176–182 (2016).
9.     E. Bolyen, J. R. Rideout, M. R. Dillon, N. A. Bokulich, C. C. Abnet, G. A. Al-Ghalith, H. Alexander, E. J. Alm, M. Arumugam, F. Asnicar, Y. Bai, J. E. Bisanz, K. Bittinger, A. Brejnrod, C. J. Brislawn, C. T. Brown, B. J. Callahan, A. M. Caraballo-Rodríguez, J. Chase, E. K. Cope, R. Da Silva, C. Diener, P. C. Dorrestein, G. M. Douglas, D. M. Durall, C. Duvallet, C. F. Edwardson, M. Ernst, M. Estaki, J. Fouquier, J. M. Gauglitz, S. M. Gibbons, D. L. Gibson, A. Gonzalez, K. Gorlick, J. Guo, B. Hillmann, S. Holmes, H. Holste, C. Huttenhower, G. A. Huttley, S. Janssen, A. K. Jarmusch, L. Jiang, B. D. Kaehler, K. Bin Kang, C. R. Keefe, P. Keim, S. T. Kelley, D. Knights, I. Koester, T. Kosciolek, J. Kreps, M. G. I. Langille, J. Lee, R. Ley, Y.-X. Liu, E. Loftfield, C. Lozupone, M. Maher, C. Marotz, B. D. Martin, D. McDonald, L. J. McIver, A. V. Melnik, J. L. Metcalf, S. C. Morgan, J. T. Morton, A. T. Naimey, J. A. Navas-Molina, L. F. Nothias, S. B. Orchanian, T. Pearson, S. L. Peoples, D. Petras, M. L. Preuss, E. Pruesse, L. B. Rasmussen, A. Rivers, M. S. Robeson, P. Rosenthal, N. Segata, M. Shaffer, A. Shiffer, R. Sinha, S. J. Song, J. R. Spear, A. D. Swafford, L. R. Thompson, P. J. Torres, P. Trinh, A. Tripathi, P. J. Turnbaugh, S. Ul-Hasan, J. J. J. van der Hooft, F. Vargas, Y. Vázquez-Baeza, E. Vogtmann, M. von Hippel, W. Walters, Y. Wan, M. Wang, J. Warren, K. C. Weber, C. H. D. Williamson, A. D. Willis, Z. Z. Xu, J. R. Zaneveld, Y. Zhang, Q. Zhu, R. Knight, J. G. Caporaso, Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 37, 852–857 (2019).
10.     T. Rognes, T. Flouri, B. Nichols, C. Quince, F. Mahé, VSEARCH: a versatile open source tool for metagenomics. PeerJ 4, e2584 (2016).
11.     D. H. Huson, A. F. Auch, J. Qi, S. C. Schuster, MEGAN analysis of metagenomic data. Genome Res. 17, 377–386 (2007).
12.     M. S. Robeson, D. R. O’Rourke, B. D. Kaehler, M. Ziemski, M. R. Dillon, J. T. Foster, N. A. Bokulich, RESCRIPt: Reproducible sequence taxonomy reference database management. PLOS Comput. Biol. 17, e1009581 (2021).

Files

Files (8.8 MB)

Name Size Download all
md5:736784d37036de8378a4c7fd4301f648
242.5 kB Download
md5:967d15959f41c32f1cc290e9a1d67c50
1.6 MB Download
md5:ac374de5a301679521c8cdec78f77701
864.9 kB Download
md5:451be35680c3d5a304b52dafb90bc3df
11.3 kB Download
md5:00b20baed5a5edc222c7ad9c7284d90f
1.3 MB Download
md5:1ad9ee8e6e88f2bcd63db534407d7854
985.4 kB Download
md5:846668d8cf6d42a0901b8fac4f44f9ac
11.5 kB Download
md5:4be88b57ea28e70293ee2e35c7801440
3.8 MB Download

Additional details

Related works

Is published in
10.1126/sciadv.adn5941 (DOI)