Code for generating Landsat reflectance time series, water chemistry matchups, and machine learning models to predict water chemistry for US lakes
Description
Code and models associated with the generation of the LAGOS-US LANDSAT extension module, which contains 45,867,023 sets of reflectances from Landsat 5, 7, and 8 as well predicted water quality for chlorophyll, Secchi deph, true water color, dissolved organic carbon, total suspended solids, and turbidity for 136,977 lakes 4+ hectares in the conterminous US based on matchups to in situ data from LAGOS-US LIMNO. Contained here are the following files:
GEE_Extraction_Code.txt: code used to extract time series of Landsat reflectances for a list of LAGOS-US lakes using the shapefiles contained within the LAGOS-US LOCUS data module using Google Earth Engine. The code harmonizes among sensors, calculates pixel-specific band ratios, and exports a CSV of the time series for each lake.
lagos_us_matchup.R: Creates a matchup dataset for a window of +/- 7 calendar days using LAGOS-US LIMNO data and the compiled CSVs generated by GEE.
lagos_us_rs_model.R: Suite of 40 random forest models run for water quality variable predictions, including testing the usage of log10-transformed response, band ratio usage, the removal of scenes with negative median reflectances, and the removal of scenes with >50% cloud cover. The selected final model for each water quality variable is used to make final predictions for the data module.
Each of the final model fitted objects is contained within this model and code product in an individual RData file:
Colort: true water color (pcu) - log10-transformed
CHL: chlorophyll (micrograms/L) - log10-transformed
NTU: turbidity (NTU) - log10-transformed
TSS: total suspended solids (mg/L) - log10-transformed
DOC: dissolved organic carbon (mg/L) - log10-transformed
Secchi: Secchi depth (m) - untransformed
lagos_us_pred_obs.R: Script that calculates and plots predicted vs in situ output from each of the six water quality variable models and generates plots of mean absolute percentage error (MAPE) by range of each water quality variable.
Files
GEE_Extraction_Code.txt
Files
(3.1 GB)
Name | Size | Download all |
---|---|---|
md5:bc1185948270bddb2af4a08463a2d5bb
|
213.0 MB | Download |
md5:a3c2ff0cde57ae9e4e06cb9e8ac77f58
|
150.2 MB | Download |
md5:deff5ee9ddaba914cc6c8590e41399c1
|
34.6 MB | Download |
md5:75fb664d773be4cb72462ca2ef29b888
|
9.6 kB | Preview Download |
md5:d2caa695039575150b916f11705fefd4
|
6.0 kB | Download |
md5:a3120241a6372532fa6ea8dc1dbb02f5
|
17.5 kB | Download |
md5:7cc54302059e0aeac0685bb553110064
|
21.8 kB | Download |
md5:aabc4a704f42d051da39e76a2aae3d07
|
286.9 MB | Download |
md5:33d1d7dcfe81582d2c63d7aa408db58a
|
2.4 GB | Download |
md5:00df3e821abbec5c83d280566cfb0448
|
47.9 MB | Download |