Published July 29, 2022 | Version v1
Software Open

Permafrost-thaw lake development in Central Yakutia: sedimentary ancient DNA and element analyses from a Holocene sediment record

Description

In Central Yakutia (Siberia) livelihoods of local communities depend on alaas (thermokarst depression) landscapes and the lakes within. Development and dynamics of these alaas lakes are closely connected to climate change, permafrost thawing, catchment conditions, and land use. To reconstruct lake development throughout the Holocene we analyze sedimentary ancient DNA (sedaDNA) and biogeochemistry from a sediment core from Lake Satagay, spanning the last c. 10,800 calibrated years before present (cal yrs BP). SedaDNA of diatoms and macrophytes and microfossil diatom analysis reveal lake formation earlier than 10,700 cal yrs BP. The sedaDNA approach detected 42 amplicon sequence variants (ASVs) of diatom taxa, one ASV of Eustigmatophyceae (Nannochloropsis), and 12 ASVs of macrophytes. We relate diatom and macrophyte community changes to climate-driven shifts in water level and mineral and organic input, which result in variable water conductivity, in-lake productivity, and sediment deposition. We detect a higher lake level and water conductivity in the Early Holocene (c. 10,700–7000 cal yrs BP) compared to other periods, supported by the dominance of Stephanodiscus sp. and Stuckenia pectinata. Further climate warming towards the Mid-Holocene (7000–4700 cal yrs BP) led to a shallowing of Lake Satagay, an increase of the submerged macrophyte Ceratophyllum, and a decline of planktonic diatoms. In the Late Holocene (c. 4700 cal yrs BP–present) stable shallow water conditions are confirmed by small fragilarioid and staurosiroid diatoms dominating the lake. Lake Satagay has not yet reached the final stage of alaas development, but satellite imagery shows an intensification of anthropogenic land use, which in combination with future warming will likely result in a rapid desiccation of the lake.

Notes

The datasets* are prepared for the manuscripts: Baisheva et al. (2022): "Permafrost-thaw lake development in Central Yakutia – Sedimentary ancient DNA and element analyses from a Holocene sediment record" (submitted) and Glückler et al. (2022): "Holocene wildfire and vegetation dynamics in Central Yakutia, Siberia, reconstructed from lake-sediment proxies" (preprint). Also included is the processing of the raw sequencing data using bioinformatics tools.

*The datasets were uploaded into two separate directories containing data and scripts. Each directory contains two main folders, APMG_32 (diatoms) and APMG_33 (plants). After downloading, the files of APMG_32 and of APMG_33 have to be merged into the same folder, so that the structure of the datasets matches the one given below:

1)      APMG_32 contains several folders and files of different formats:

00.   APMG-32_Metadata - Metadata including lake geographic coordinates, sample depths and ages, laboratory codes, and the forward and reverse primer tag combinations needed to demultiplex the sequencing data

FILE:   APMG-32_Metadata.xlsx contains information on sequencing (run number, type, device, mode, forward and reverse tags, read length). It also includes information on individual samples (name, type, age, depth, extraction number, and PCR number) as well as the sediment core name and core section number.

Format: .xlsx

01. Raw_data_APMG-32 – Illumina sequencing raw data.

FILES: 210602_NB501850_A_L1-4_APMG-32_R1.fastq.gz

210602_NB501850_A_L1-4_APMG-32_R2.fastq.gz

Format: Illumina fastq format. The sequence files are compressed as .gz archives. Before using the data with the Obitools script (APMG_32_metabarcoding_rbcL_obi3_Dryad.sh) the datasets need to be uncompressed and converted into .fastq files.
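
The decompression step mentioned above can be sketched as follows. This is a minimal example rather than part of the deposited scripts: the function name gunzip_file is hypothetical, and it assumes the archives are plain single-file .gz archives, as the file names suggest.

```python
import gzip
import shutil
from pathlib import Path


def gunzip_file(src, dst=None):
    """Decompress a .gz file and return the path of the uncompressed copy.

    The compressed original is left in place.
    """
    src = Path(src)
    # Default target: the same name without the trailing ".gz"
    dst = Path(dst) if dst is not None else src.with_suffix("")
    with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
        # Stream in chunks so that large sequencing files are not loaded into memory
        shutil.copyfileobj(fin, fout)
    return dst
```

Calling gunzip_file("210602_NB501850_A_L1-4_APMG-32_R1.fastq.gz") would write 210602_NB501850_A_L1-4_APMG-32_R1.fastq next to the archive; the same call applies to the R2 file.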

02. Reference_data_rbcl – Database used for taxonomic assignment of diatoms.

FILES:   rbcl_embl143_db.fasta

Obi3_rbcL_database_build.sh - Script for the conversion step.

Format: .fasta and .sh. To use the rbcL database in the Obitools script (APMG_32_metabarcoding_rbcL_obi3_Dryad.sh), the rbcl_embl143_db.fasta needs to be converted to an obi3 database.

03. OBITools_APMG-32 – The metabarcoding pipeline for analyzing the raw sequencing data using OBITools3. 

FILES:   APMG_32_metabarcoding_rbcL_obi3_Dryad.sh - Script to run OBITools3 pipeline with short descriptions and output data.

APMG-32_embl143_rbcL.csv - Output file.

APMG32_tagfile.txt - File containing the primer tag combinations for demultiplexing with Obitools3 (see script: APMG_32_metabarcoding_rbcL_obi3_Dryad.sh).

Format: .csv, .txt and .sh

04. Final_resampled_data_APMG-32:

FILES:   APMG-32_identitylevel0.98_wideformat.csv - Final count data.

APMG-32_final_resampled_scientific_name.csv - Final dataset with filtering threshold of 98%, resampled to the minimal number of counts (n=2050), including diatoms and Nannochloropsis. 

Format: .csv

    • The file APMG-32_final_resampled_data.csv was used for further statistical analyses in Baisheva et al. (2022): "Permafrost-thaw lake development in Central Yakutia – Sedimentary ancient DNA and element analyses from a Holocene sediment record" (submitted). 
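
Resampling to the minimal number of counts can be illustrated with a simple rarefaction. The sketch below is an assumption about the general technique, not the procedure used to produce the deposited files: it draws reads once, at random and without replacement, and the function names and the {sample: {taxon: count}} layout are hypothetical.

```python
import random


def rarefy(counts, depth, rng):
    """Subsample a {taxon: read_count} mapping to exactly `depth` reads.

    Reads are drawn at random without replacement.
    """
    total = sum(counts.values())
    if depth > total:
        raise ValueError(f"sample has {total} reads, fewer than depth {depth}")
    # One list entry per read, labelled with its taxon
    pool = [taxon for taxon, n in counts.items() for _ in range(n)]
    resampled = {taxon: 0 for taxon in counts}
    for taxon in rng.sample(pool, depth):
        resampled[taxon] += 1
    return resampled


def rarefy_to_minimum(samples, seed=None):
    """Rarefy every sample to the smallest total read count among them."""
    depth = min(sum(counts.values()) for counts in samples.values())
    rng = random.Random(seed)  # a fixed seed makes the draw reproducible
    return {name: rarefy(counts, depth, rng) for name, counts in samples.items()}
```

With the diatom data, the smallest sample total would play the role of n=2050; a sample that already has exactly that many reads is returned unchanged.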

2)      APMG_33 contains several folders and files of different formats:

00.   APMG-33_Metadata - Contains information on sequencing (run number, type, device, mode, forward and reverse tags, read length). It also includes information on individual samples (name, type, age, depth, extraction number, and PCR number) as well as the sediment core name and core section number.

FILE:   APMG-33_Satagay2_metadata.xlsx

Format: .xlsx 

01.   Raw_data_APMG_33 – Illumina sequencing raw data.

FILES: 210602_NB501850_A_L1-4_APMG-33_R1.fastq.gz

210602_NB501850_A_L1-4_APMG-33_R2.fastq.gz

Format: Illumina fastq format. The sequence files are compressed as .gz archives. The archives can be uncompressed on Linux using the gzip -d command.

02. Reference_database_plants – Reference database to run OBITools pipeline with short instruction and script for the conversion step.

FILES:   arctborbryo_gh.fasta

gh_embl143_db_97.fasta

Obi3_arctborbryo_database_build.sh

Obi3_embl_database_build.sh

Format: .fasta and .sh. To use the arctborbryo and embl143 databases in the Obitools script (APMG-33_obi3_script.sh), the .fasta files need to be converted to obi3 databases.

03. OBITools_APMG-33 – The metabarcoding pipeline for analyzing the raw sequencing data using OBITools3. 

FILES:   APMG33_arc_anno.csv - Output file.

APMG33_embl143_anno.csv - Output file.

APMG-33_obi3_script.sh

APMG-33_tagfile.txt

Format: .csv, .txt and .sh. OBITools_APMG-33 has two output files because taxonomic assignment was performed against both the EMBL and the Arctic database.

04. Final_datasets_APMG-33 - EMBL and Arctic assignments were merged into one dataset and filtered with a 100% identity threshold. The final datasets are separated into macrophytes and terrestrial plants.

FILES:   APMG-33_identitylevel100_wideformat.csv - Final count data.

APMG-33_macrophytes_resampled_scientific_name.csv - Final dataset of macrophytes, resampled to the minimal number of counts (n=1653).

APMG-33_terrestrial_families.csv - Final dataset of separated terrestrial plants.

Format: .csv
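
The identity filtering described for this folder (100% for plants, 98% for diatoms) amounts to keeping only sequences whose best match to the reference database reaches the threshold. The following is a minimal sketch under stated assumptions: the column name best_identity and the row layout are hypothetical and may not match the headers of the deposited .csv files, and the merging of EMBL and Arctic assignments is not covered.

```python
def filter_by_identity(rows, threshold, key="best_identity"):
    """Keep rows whose identity to the reference is at least `threshold`.

    `rows` is a list of dicts (for example as produced by csv.DictReader),
    and identities are expected as fractions between 0 and 1.
    """
    # Values read from a CSV file are strings, so convert before comparing
    return [row for row in rows if float(row[key]) >= threshold]
```

A threshold of 1.0 corresponds to the 100% filter used here, and 0.98 to the filter used for the diatom dataset.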

  • The file "APMG-33_macrophytes_resampled_scientific_name.csv" from output data was used for further statistical analyses in Baisheva et al. (2022): "Permafrost-thaw lake development in Central Yakutia – Sedimentary ancient DNA and element analyses from a Holocene sediment record" (submitted). 
  • The file "APMG-33_terrestrial_families.csv" of separated terrestrial plants data was used for further statistical analyses in Glückler et al. (2022): "Holocene wildfire and vegetation dynamics in Central Yakutia, Siberia, reconstructed from lake-sediment proxies" (preprint).
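
The note at the top of this listing asks for the two downloads to be merged so that APMG_32 and APMG_33 each end up as a single folder. One way to do this is sketched below; the function name merge_downloads and the directory names in the usage note are hypothetical, and the sketch assumes that both downloads use the same folder names.

```python
import shutil
from pathlib import Path


def merge_downloads(download_dirs, target_dir, folders=("APMG_32", "APMG_33")):
    """Copy APMG_32 and APMG_33 from each download into one target directory."""
    target_dir = Path(target_dir)
    for download in map(Path, download_dirs):
        for folder in folders:
            source = download / folder
            if source.is_dir():
                # dirs_exist_ok=True (Python 3.8+) merges into an existing folder;
                # files with identical relative paths are overwritten by later copies
                shutil.copytree(source, target_dir / folder, dirs_exist_ok=True)
    return target_dir
```

For example, merge_downloads(["data_download", "scripts_download"], "Satagay_sedaDNA") would produce Satagay_sedaDNA/APMG_32 and Satagay_sedaDNA/APMG_33 with the subfolders from both downloads side by side.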

Funding provided by: European Research Council
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100010663
Award Number: Glacial Legacy: 772852

Funding provided by: Ministry of Education and Science of the Russian Federation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100003443
Award Number: FSRG-2020-0019

Funding provided by: Deutscher Akademischer Austauschdienst
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100001655
Award Number:

Funding provided by: AWI INSPIRES (International Science Program for Integrative Research)*
Crossref Funder Registry ID:
Award Number:

Funding provided by: German Academic Exchange Service e.V.
Award Number: 91775743

Funding provided by: Earth Systems Knowledge Platform (ESKP) of the Helmholtz Foundation

Funding provided by: Alfred-Wegener-Institut, Helmholtz-Zentrum für Polar- und Meeresforschung (AWI)

Files

Diatoms_and_plants_sedimentary_ancient_DNA_from_Lake_Satagay_(Central_Yakutia__Siberia)_covering_the_last_10_800_years_(scripts_and_supporting_tag_files).zip
