Planned intervention: On Thursday 19/09 between 05:30-06:30 (UTC), Zenodo will be unavailable because of a scheduled upgrade in our storage cluster.
Published February 16, 2024 | Version v1
Dataset Open

Arctic-boreal bryophytes from sedimentary ancient DNA metabarcoding

  • 1. Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research, Polar Terrestrial Environmental Systems, Telegrafenberg, Potsdam 14473, Germany
  • 2. Institute of Environmental Science and Geography, University of Potsdam, Potsdam 14469, Germany
  • 3. Institute of Biology and Biochemistry, University of Potsdam, Potsdam 14469, Germany
  • 4. Institute of Plant Sciences and Microbiology, Universität Hamburg, Hamburg 22609, Germany
  • 5. Department of Earth, Environmental and Planetary Sciences, Brown University, USA
  • 6. School of Earth and Sustainability, Northern Arizona University, Flagstaff, Arizona, USA
  • 7. Institute of Geology and Mineralogy, University of Cologne, Cologne 50674, Germany
  • 8. North-Eastern Federal University: Yakutsk, Republic of Sakha (Yakutia), Russia

Description

A total of 26 lake-sediment cores collected from 23 sites spanning the glacial and interglacial transition are used in this study. These sites are distributed across Siberia, Beringia, and Alaska regions, with a gradient of vegetation types dominated by tundra in the northern region and transitioning to boreal forest in the southern extents. DNA samples from the sediment core were analysed with a standard sedimentary ancient DNA metabarcoding pipeline (see additional description), which resulted in a raw dataset of all DNA plant sequences, which were then filtered for Bryophytes (Bryophyte DNA dataset). The Bryophyte DNA dataset contains 127 unique ASV. Samples in the Bryophyte DNA dataset are then grouped into 1000-year time slices and are subsequently resampled to a base count of 500 read counts for each time slice. After that, a Bryophyte trait datastet is assigned to the Bryophyte DNA dataset. 

Input files

Bryophyte DNA dataset: "2024-02-23_combined_gh_bryophytes_sequences_unique_taxanames.csv"

Bryophyte ASV and taxa names: "combined_gh_datasets_table_sequences_taxanames_eco_revised.csv"

Bryophyte trait dataset: "Bryophytes_GoogleSpreadsheet_data_categorial.csv"

R script

Script: "2024-02-23_R-Script_gh_dataset_Bryophytes.R"

R outputs

Bryophyte DNA dataset 1000-year time slice: "2024-02-23_combined_gh_bryophytes_unique_taxanames_1000yr_timeslices_counts.csv"

Bryophyte DNA dataset 1000-year time slice, resampled 100x: "2024-02-23_combined_gh_bryophytes_resampled_percentages_mean_100runs_sequences.csv”

Bryophyte DNA dataset 1000-year time slice, resampled 100x, with Bryophyte traits: "2024-02-23_combined_gh_bryophytes_traits.csv“

 

 

Methods (English)

DNA samples from the sediment core were subsampled under cool and clean conditions and ancient DNA work was conducted in the paleogenetic laboratories at Alfred Wegener Institute, Potsdam, Germany. DNA was extracted in batches, each of the batches included nine samples (3-10 g sample–1) and one extraction control, using the PowerMax® Soil DNA Isolation Kit (Mo Bio Laboratories, Inc., USA). All subsequent steps were performed according to the instructions of the manufacturer Qiagen, except for the final elution volume which varied between 1.6 and 2.0 mL. PCR reactions were conducted using g and h primer targeting the vascular plant trnL P6 loop of the chloroplast genome, each modified with unique 8 bp tags differing by at least five base pairs to distinguish samples post-sequencing. After pooling the PCR samples from each sediment core into a PCR Pool, sequencing for each pool was performed by Fasteris Gensupport SA sequencing service (Switzerland) with the paired-end sequencing on a HiSeq or NextSeq Illumina platform. Raw sequences were analysed using OBITools software (version 3.0.0) against a specific plant database (SibAla_2023). The customised database SibAla_2023 has been compiled using different R packages with the following steps. Taxa selection from a given region (55–90°N, 50–150°E and 40–90°N, 150°E–140°W) and taxa occurrences (>10) using the Global Biodiversity Information Facility (GBIF, 71) resulting in 233 families, 1059 genera, and 4849 species. GBIF occurrences were aligned with available trnL P6 loop sequence types from public databases. The SibAla_2023 database compared with given occurrences in GBIF amounts to 95.7% (family-level), 89.4% (genus-level), and 70.1%(species-level) of taxonomic coverage. Finally, the SibAla_2023 database includes a total of 4939 entries, comprising 3398 species, 947 genera, and 223 families that collapse into 2371 unique P6 loop sequence types. 

Files

2024-02-23_combined_gh_bryophytes_sequences_unique_taxanames.csv

Additional details

Funding

Glacial Legacy 772852
European Research Council
PLOT (Paleolimnological Transect) 03G0859
Deutsche Forschungsgemeinschaft