There is a newer version of the record available.

Published May 3, 2024 | Version v1
Model Open

Code for generating Landsat reflectance time series, water chemistry matchups, and machine learning models to predict water chemistry for US lakes

  • 1. ROR icon Michigan State University

Description

Code and models associated with the generation of the LAGOS-US LANDSAT extension module, which contains 45,867,023 sets of reflectances from Landsat 5, 7, and 8 as well predicted water quality for chlorophyll, Secchi deph, true water color, dissolved organic carbon, total suspended solids, and turbidity for 136,977 lakes 4+ hectares in the conterminous US based on matchups to in situ data from LAGOS-US LIMNO. Contained here are the following files:

GEE_Extraction_Code.txt: code used to extract time series of Landsat reflectances for a list of LAGOS-US lakes using the shapefiles contained within the LAGOS-US LOCUS data module using Google Earth Engine. The code harmonizes among sensors, calculates pixel-specific band ratios, and exports a CSV of the time series for each lake. 

lagos_us_matchup.R: Creates a matchup dataset for a window of +/- 7 calendar days using LAGOS-US LIMNO data and the compiled CSVs generated by GEE.

lagos_us_rs_model.R: Suite of 40 random forest models run for water quality variable predictions, including testing the usage of log10-transformed response, band ratio usage, the removal of scenes with negative median reflectances, and the removal of scenes with >50% cloud cover. The selected final model for each water quality variable is used to make final predictions for the data module.

Each of the final model fitted objects is contained within this model and code product in an individual RData file:

Colort: true water color (pcu) - log10-transformed

CHL: chlorophyll (micrograms/L) - log10-transformed

NTU: turbidity (NTU) - log10-transformed

TSS: total suspended solids (mg/L) - log10-transformed

DOC: dissolved organic carbon (mg/L) - log10-transformed

Secchi: Secchi depth (m) - untransformed

lagos_us_pred_obs.R: Script that calculates and plots predicted vs in situ output from each of the six water quality variable models and generates plots of mean absolute percentage error (MAPE) by range of each water quality variable.

Files

GEE_Extraction_Code.txt

Files (3.1 GB)

Name Size Download all
md5:bc1185948270bddb2af4a08463a2d5bb
213.0 MB Download
md5:a3c2ff0cde57ae9e4e06cb9e8ac77f58
150.2 MB Download
md5:deff5ee9ddaba914cc6c8590e41399c1
34.6 MB Download
md5:75fb664d773be4cb72462ca2ef29b888
9.6 kB Preview Download
md5:d2caa695039575150b916f11705fefd4
6.0 kB Download
md5:a3120241a6372532fa6ea8dc1dbb02f5
17.5 kB Download
md5:7cc54302059e0aeac0685bb553110064
21.8 kB Download
md5:aabc4a704f42d051da39e76a2aae3d07
286.9 MB Download
md5:33d1d7dcfe81582d2c63d7aa408db58a
2.4 GB Download
md5:00df3e821abbec5c83d280566cfb0448
47.9 MB Download