Published November 2024 | Version v2
Model Open

Code for generating Landsat reflectance time series, water chemistry matchups, and machine learning models to predict water chemistry for US lakes

  • 1. ROR icon Michigan State University

Description

Code and models associated with the generation of the LAGOS-US LANDSAT extension module, which contains 45,867,023 sets of reflectances from Landsat 5, 7, and 8 as well predicted water quality for chlorophyll, Secchi deph, true water color, dissolved organic carbon, total suspended solids, and turbidity for 136,977 lakes 4+ hectares in the conterminous US based on matchups to in situ data from LAGOS-US LIMNO. Contained here are the following files:

GEE_Extraction_Code.txt: code used to extract time series of Landsat reflectances for a list of LAGOS-US lakes using the shapefiles contained within the LAGOS-US LOCUS data module using Google Earth Engine. The code harmonizes among sensors, calculates pixel-specific band ratios, and exports a CSV of the time series for each lake. 

lagos_us_matchup.R: Creates a matchup dataset for a window of +/- 7 calendar days using LAGOS-US LIMNO data and the compiled CSVs generated by GEE.

lagos_us_rs_model.R: Suite of 40 random forest models run for water quality variable predictions, including testing the usage of log10-transformed response, band ratio usage, the removal of scenes with negative median reflectances, and the removal of scenes with >50% cloud cover. The selected final model for each water quality variable is used to make final predictions for the data module.

Each of the final model fitted objects is contained within this model and code product in an individual RData file:

Colort: true water color (pcu) - log10-transformed

CHL: chlorophyll (micrograms/L) - log10-transformed

NTU: turbidity (NTU) - log10-transformed

TSS: total suspended solids (mg/L) - log10-transformed

DOC: dissolved organic carbon (mg/L) - log10-transformed

Secchi: Secchi depth (m) - untransformed

lagos_us_pred_obs.R: Script that calculates and plots predicted vs in situ output from each of the six water quality variable models and generates plots of mean absolute percentage error (MAPE) by range of each water quality variable.

 

The addendum files associated with the lagos_us_tt_full_compare.R script contain Generalized Random Tessellation Stratified (GRTS) test and train data splits as well as a comparison of the original full data random forest model to a comparable model using test and train splits. The predictions from these data are also included in the EDI repository associated with this data product.

Files

GEE_Extraction_Code.txt

Files (6.0 GB)

Name Size Download all
md5:bc1185948270bddb2af4a08463a2d5bb
213.0 MB Download
md5:4fbfe705a0e0564e3d3c175dfa0a05e3
159.6 MB Download
md5:a3c2ff0cde57ae9e4e06cb9e8ac77f58
150.2 MB Download
md5:6289a6605b39919d05fd8acd16ee6f2e
113.2 MB Download
md5:deff5ee9ddaba914cc6c8590e41399c1
34.6 MB Download
md5:8709e39d7496d16a9113ff152c899d36
28.9 MB Download
md5:75fb664d773be4cb72462ca2ef29b888
9.6 kB Preview Download
md5:d2caa695039575150b916f11705fefd4
6.0 kB Download
md5:a3120241a6372532fa6ea8dc1dbb02f5
17.5 kB Download
md5:7cc54302059e0aeac0685bb553110064
21.8 kB Download
md5:c825c0dc3d6c540629e182f9b476cd1d
27.3 kB Download
md5:aabc4a704f42d051da39e76a2aae3d07
286.9 MB Download
md5:a453a1daa4551b5aeb8353f86e521700
41.9 MB Download
md5:33d1d7dcfe81582d2c63d7aa408db58a
2.4 GB Download
md5:badd30d2f9b101037fdf856b100269e8
1.8 GB Download
md5:591a7597be6aca62b30b0671acc647fd
10.2 MB Preview Download
md5:dc1cf0a077d75e0aa58612a9da9732d7
8.0 MB Preview Download
md5:7cc234fea9f02373a268580a446d439d
2.0 MB Preview Download
md5:d1f6aa9d4b0fee166c6013daf406c027
2.9 MB Preview Download
md5:a60fd1de5f3274993ab90050a856db9b
134.1 MB Preview Download
md5:f3b89b7e350a9c4b9e8ffafa46d58a0f
2.3 MB Preview Download
md5:11d9db0795f393d58a968ba6c51deda5
30.5 MB Preview Download
md5:d71a758fcac05d04ec1c1c1d6b107e6d
23.9 MB Preview Download
md5:bb38588633bf5f8c4d83da10d15ecf57
6.0 MB Preview Download
md5:f3e78ef8a5905826542cb45c64c1f6c6
8.8 MB Preview Download
md5:0169c97f78ccbdf44a291a8eef934359
402.2 MB Preview Download
md5:46976099a0fe8265f3b8c30f1cf350c0
6.8 MB Preview Download
md5:00df3e821abbec5c83d280566cfb0448
47.9 MB Download
md5:54511513c0ce0f72b99f9d2b0f888ca6
32.0 MB Download

Additional details

Dates

Updated
2024-11