Published November 9, 2022 | Version v1
Dataset Open

Saw Kill river (NY, USA) metagenomics and environmental variables

  • 1. Federal University of Bahia
  • 2. New York University
  • 3. Bard College

Description

Microbial community structure and diversity in waterways are altered by wastewater treatment plant (WWTP) discharge which can introduce human-associated microbes, antimicrobial resistance (AMR), and significantly change environmental variables. To better understand these interactions, we investigated the impact of the Bard College WWTP on microbial communities collected from the surface water and sediment from four sites, two sites above and two sites below the Bard College outflow, as well as the outflow itself, this was performed over a period of five months. We measured physico-chemical parameters such as temperature, turbidity, conductivity, dissolved oxygen, and salinity as well as the bioindicators Escherichia coli, total coliforms, and Enterococcus sp. concentration, endotoxins, the intI1 gene marker, and the total bacterial abundance through 16S rRNA gene.

Notes

Original raw sequences are available in NCBI SRA (Accession: PRJNA565393)
Sample sites and sample names are in "Sawkill_mapping_and_env_var.csv"
All scripts used are located in "Combined_Scripts.rmd".

This dataset includes the following files:

  • Sampling_site_metadata_table.csv - Sample ID, Field ID, Date, Cumulative Rain Fall (mm), and Air Temperature (°C)
  • Physicochemical_characteristics.csv - Sample ID, Water Temperature (ºC), Turbidity (TU), Conductivity (µmhos/cm), Dissolved Oxygen (mgL), and Salinity (ppt)
  • Escherichia_coli_concentration.csv - Sample ID, E. coli (MPN·100mL-1)
  • Total_coliforms_concentration.csv - Sample ID, Coliform (MPN·100mL-1)
  • Enterococcus_sp_concentration.csv - Sample ID, Enterococcus sp. (MPN·100mL-1)
  • Endotoxins_concentation.csv - Sample ID, Endotoxin (EU/mL)
  • Integron_1_relative_abundance.csv - Sample ID, intI1 rel. abun.
  • ASV_and_taxa_assignment.tsv - Tab separated file with each ASV, assigned taxa, and percent confidence (aka, ASV_to_taxa_confidece.tsv)
  • Taxa_abundance_by_sample.csv - All taxa at species level resolution (or lowest possible) and abundances in each sample (aka, Species_resolved_taxa_and_counts.csv)
  • Sawkill_mapping_and_env_var.csv - comma-separated file containing sample names and sample sites and measured environmental variables.
    • detail: SampleID - Unique Identifier: SK-<sort_number>   
    • sort - Unique Identifier index number
    • FieldID - Combination field: <site>_<date> combination of site column and date column
    • Date - Calendar date
    • Day - Sample index starting at one (06/22/2015) and incrementing by one for each additional sample date
    • Site - Location of Sample site and Replicate number (Counting from 0):
    • OW = Outflow Water
    • AS# = Above Outflow Sediment (Far) (NB: AS = replicate 0, AS1 = replicate 1)
    • A1S# = Above Outflow Sediment (Near) (NB: A1S = replicate 0, A1S1 = replicate 1)
    • BS# = Below Outflow Sediment (Near) (NB: BS = replicate 0, BS1 = replicate 1)
    • B1S# = Below Outflow Sediment (Far) (NB: B1S = replicate 0, B1S1 = replicate 1)
    • AW = Above Outflow Sediment (Far)
    • A1W = Above Outflow Sediment (Near)
    • BW = Below Outflow Sediment (Near)
    • B1W = Below Outflow Sediment (Far)
    • GeoLocation - Spatial location of sample site
    • O = Outflow
    • A = Above Outflow (Far)
    • A1 = Above Outflow (Near)
    • B = Below Outflow (Near)
    • B1 = Below Outflow (Far)
    • Type - Physical matrix sampled (Outflow, Sediment, Water)
    • Desc - Brief text description of Sample
    • Pool - Brief text description of Sample Group
    • Rain12 - Cumulative Measure of Rain (mm) collected over the previous 12 hours.   
    • Rain24 - Cumulative Measure of Rain (mm) collected over the previous 24 hours.   
    • Rain48 - Cumulative Measure of Rain (mm) collected over the previous 48 hours.   
    • Rain72 - Cumulative Measure of Rain (mm) collected over the previous 72 hours.
    • AirTemp - Ambient air temperture (Celsius)
    • WaterTemp - Temperature of water at collection site (Celsius)
    • Turbidity - Water Turbidity (TU)
    • Conductivity - Water Conductivity (µmhos/cm)
    • DO_mgl - Water Dissolved Oxygen (mgL)
    • salinity_ppt - Water Salinity (ppt)
    • intI1 - Relative abundance of intI1 using qPCR
    • Ecoli - IDEXX Colilert assay (MPN·100mL-1)
    • Coliform - IDEXX Colilert assay (MPN·100mL-1)
    • Entero - IDEXX Enterolert (MPN·100mL-1)
    • Endotoxins - Charles River Endosafe system (EU/mL)
  • Denoising_qc_stats.tsv - tab separated file with the following read abundances per sample 'input', filtered', 'denoised', 'merged', and 'non-chimeric'.
  • Sample_frequency_detail.csv - Reads used per sample
  • Sample_site_GPS.txt - Map refined estimated GPS coordinates (decimal format) of all sample sites.
  • Sample_type_date_site_season_name.csv - Comma separated file containing Sample, type, date, site and season.

For ease of use, we have also included the original visualization files generated by QIIME2 (using the code in 'combined_scripts.rmd'). These are viewable using the QIIME2 viewer or manually by decompressing them.

  • Combined_Scripts.rmd - Rmarkdown containing code for scripts used for data processing and presentation
  • demux.qzv
  • denoising_stats.qzv
  • rep-seqs.qzv
  • table-dada2.qzv
  • taxa-bar-plots.qzv

Files

Endotoxins_concentation.csv

Files (49.6 MB)

Name Size Download all
md5:488f14bf9d5d3a15a188638632626db3
2.5 MB Download
md5:488f14bf9d5d3a15a188638632626db3
2.5 MB Download
md5:b46964e64232db0ef0e394a20da41458
305.0 kB Download
md5:706cf58e49bb2e2273bc940e7f12849b
1.2 MB Download
md5:d939471a1b0e79244b7810fb4ce510b2
4.6 kB Download
md5:93317eff4a414715e078096ec2493786
3.9 kB Preview Download
md5:89e2b06db7b4fc52a28fd7a1e3b84077
1.6 kB Preview Download
md5:6519bd073913bc4de918ca32d939b679
1.8 kB Preview Download
md5:5f9d33583d1032d828a1f4303169f093
4.9 kB Preview Download
md5:2d0287a5362cf93f0a11dac6fd16d858
3.6 kB Preview Download
md5:998688b915a558881d2b9ecf5fc41a89
4.4 kB Preview Download
md5:ac80b44e7f6c06a058595542e0aec1cb
2.7 MB Download
md5:dea27fc99ce2584d5165986337caa0fb
1.6 kB Preview Download
md5:6528293928c859892f487337209f8535
205 Bytes Preview Download
md5:ffe7ae45ce5a023b0c54afc01344701c
6.2 kB Preview Download
md5:3a8086b45ec9840264d8eb31d319ec0b
8.6 kB Preview Download
md5:f2f9b475c107b7a4158f80ad2c98624f
22.0 kB Preview Download
md5:540196710ec2a4e995a1608c6db3365f
1.1 MB Download
md5:5e50ce22800a1ee2990b194a79216911
37.2 MB Download
md5:76eaae9797fed833d2b5e7ce04a8ece5
2.1 MB Preview Download
md5:956d321c0eebb245207f548c4c89fdb3
1.9 kB Preview Download

Additional details

Related works