Fractional Abundance Datasets for Salt Patch and Marshland Across the Delmarva Peninsula, v2
Authors/Creators
Description
Abstract:
Coastal agricultural lands in the eastern USA are increasingly plagued by escalating soil salinity, rendering them unsuitable for profitable farming. Increasing soil salinization can lead to both land cover modification and conversion. Two notable instances of such transformations include the conversion of farmland to marshland or to barren salt patches devoid of vegetation. However, quantifying these land cover changes across large geographic regions poses a significant challenge due to their varying spatial granularity. To tackle this issue, we first developed a machine-learning based method using Sentinel-2 imagery for 2022 where we used a non-linear spectral unmixing approach utilizing a Random Forest (RF) algorithm (Sarupria et al., 2025). The RF models were constructed using 100 trees and 27,437 reference data points, resulting in two sets of ten models: one for salt patches and another for marshland. Validation metrics for sub-pixel fractional abundances revealed a moderate R-squared value of 0.50 for the salt model ensemble and a high R-squared value of 0.90 for the marsh model ensemble. Building upon this methodology, we then generated annual gridded datasets of fractional abundance for salt patch and marshland across the Delmarva Peninsula (14 coastal counties in Delaware, Maryland and Virginia, USA) for 2019-2023. In these datasets, we only report mean fractional abundance values ranging from 0.4 to 1 for salt patches and 0.25 to 1 for marshland, along with the standard deviation associated with each value.
Description:
This data collection consists of 20 single-band raster files: fractional abundance mean values and associated standard deviation values for two land covers, salt patch and marshland, produced annually over the five-year period.
-
Salt Patch Fractional Abundance Mean (2019–2023): Five files, each representing the per-pixel fractional abundance mean of salt patches for the years 2019–2023. Values are derived from an ensemble of 10 Random Forest (RF) models. Only pixels with a salt patch fraction ≥ 0.40 were retained.
-
Salt Patch Fractional Abundance Standard Deviation (2019–2023): Five files representing the per-pixel standard deviation of the salt patch fractional abundance means for the years 2019–2023. Estimates are based on an ensemble of 10 RF models and include only pixels with a salt patch fraction ≥ 0.40.
-
Marsh Fractional Abundance Mean (2019–2023): Five files, each showing the per-pixel fractional abundance mean of marshland for the years 2019–2023, derived from an ensemble of 10 RF models. Only pixels with a marsh fraction ≥ 0.25 were retained.
-
Marsh Fractional Abundance Standard Deviation (2019–2023): Five files showing the per-pixel standard deviation of the marsh fractional abundance means for 2019–2023. Estimates are based on an ensemble of 10 RF models, limited to pixels with a marsh fraction ≥ 0.25.
Input Data:
This approach integrated Sentinel-2 surface reflectance imagery, a global land use/land cover dataset from ESRI (Karra et al., 2021), a NAIP-derived Delmarva land cover dataset (Mondal et al., 2022), high-resolution PlanetScope true color images (Planet Team, 2017), very high-resolution Unoccupied Aerial Vehicle (UAV) imagery, and ground truth data.
More details on input data can be found in Sarupria et al., 2025a and version 1 of this dataset (Sarupria et al., 2025b).
Method:
We utilized Sentinel-2 Level 2 A surface reflectance imagery for spectral unmixing. This multispectral dataset, corrected for atmospheric and radiometric effects, encompasses 13 spectral bands from visible to near-infrared wavelengths (0.443–2.190 micrometers). The imagery offers spatial resolutions ranging from 10 m to 60 m and is captured every 5 days. To aid in selecting reference points for model training and testing, high-resolution (60 cm) UAV images of specific farmlands in Dorchester and Somerset counties, Maryland, were acquired under optimal weather conditions.
The study incorporated multiple datasets to refine the analysis. The Sentinel-2 derived global land use/land cover dataset from ESRI was employed to isolate relevant land cover classes such as 'Crops' and 'Rangeland'. A NAIP-derived Delmarva land cover dataset with eight classes helped exclude non-agricultural land cover types. High-resolution PlanetScope true color images with 3 m spatial resolution were used as reference data for model validation.
In this study, we applied a Random Forest (RF) classifier for nonlinear spectral unmixing. The RF classifier functions by utilizing an ensemble of decision trees that are independently trained on random subsets of training data through bootstrap aggregation. The final classification is determined by aggregating votes from all trees, with the endmember receiving the highest total votes being selected as the final output. To access soft voting information from the RF classifier in python, we used its probability prediction function called ‘predict_proba’. This function enables each decision tree to produce a probability distribution for each endmember instead of making a single class decision. The probability distribution from a decision tree indicates how likely it is that an input pixel belongs to each endmember. The final predicted probabilities are calculated by averaging these distributions across all decision trees for each of the five endmembers. As a result, each pixel in the final output is represented by five probability values that indicate the fractional abundance of each corresponding endmember within that pixel. These probabilities sum to one, effectively illustrating the spectral unmixing of a mixed pixel. For a specific endmember, a pixel with fractional abundance value of 0 signifies the absence of it, while a value of 1 indicates a pure pixel. Values between 0 and 1 reflect varying levels of mixed endmembers.
More details on the methods and accuracy assessment can be found in Sarupria et al., 2025a.
Data format:
The spatial resolution of all the derived datasets is 10 m. These georeferenced datasets are distributed in GEOTIFF format and are compatible with GIS and/or image processing software, such as R and ArcGIS Pro. The GIS-ready raster files can be used directly in mapping and geospatial analysis.
Code:
Sample python code for performing spectral unmixing is available at: https://github.com/Manan-prog/Non-linear-Spectral-Unmixing. To run this code successfully, the user must provide training data for the desired land cover classes and an input raster image for spectral unmixing.
Datasets for download:
- Five layers for salt patch mean values for 2019-2023: SaltPatch_FrAb_Mean_<Year>
- Five layers for salt patch standard deviation values for 2019-2023: SaltPatch_FrAb_StdDev_<Year>
- Five layers for marsh mean values for 2019-2023: Marsh_FrAb_Mean_<Year>
- Five layers for marsh standard deviation values for 2019-2023: Marsh_FrAb_StdDev_<Year>
Notes
Files
Marsh_FrAb_Mean_2019.tif
Files
(718.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:7e6fdf8a220c0dac7c366a28a987a335
|
51.6 MB | Preview Download |
|
md5:fbe8e1cc6234169b24b37edabc8e4b32
|
53.5 MB | Preview Download |
|
md5:4c0153aa112f10441cfe5e99cfcda5e9
|
52.9 MB | Preview Download |
|
md5:07b15ff650375c980782ee9eba09d1fb
|
48.7 MB | Preview Download |
|
md5:597d10288fae67ed1a5a9aaa2c8852c4
|
49.5 MB | Preview Download |
|
md5:11faabd511928904f56c78c45c89e100
|
62.9 MB | Preview Download |
|
md5:6c05e936f9685dc71e130ac5b567a437
|
65.3 MB | Preview Download |
|
md5:6d53ff3408ef4b5e162d12548e51d74e
|
64.9 MB | Preview Download |
|
md5:34b7e55d9532c6e5e2bc82b5c5d85304
|
57.6 MB | Preview Download |
|
md5:0bc46642f8260560aea4070bba3ce913
|
58.6 MB | Preview Download |
|
md5:7a46f8c6744d556bc6fe956be6fb7b79
|
14.5 MB | Preview Download |
|
md5:ea236236ae8e31a05f8ffabadda3dd7e
|
15.0 MB | Preview Download |
|
md5:f93b4297e1911e0be594069fa40875c7
|
14.7 MB | Preview Download |
|
md5:d4a49019ac3b3813355987efdaa30187
|
15.0 MB | Preview Download |
|
md5:10eed674aa0d9128eb93c4cde97f2ee0
|
15.9 MB | Preview Download |
|
md5:dd97e28c0efc4511e86477d8d9eebf80
|
15.0 MB | Preview Download |
|
md5:2804d9010864b9566071abd189a4aa2a
|
15.7 MB | Preview Download |
|
md5:6e22f972e8a811a0054fd4d59474fe30
|
15.2 MB | Preview Download |
|
md5:26aaa232aff54804c51d4d2afb57b56e
|
15.1 MB | Preview Download |
|
md5:56e6256bcf20842756764649e78fe280
|
16.6 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Dataset: https://doi.org/10.5281/zenodo.6685695 (Other)
Software
- Repository URL
- https://github.com/Manan-prog/Non-linear-Spectral-Unmixing
- Programming language
- Python
References
- Karra, K., Kontgis, C., Statman-Weil, Z., Mazzariello, J.C., Mathis, M., Brumby, S.P., 2021. Global land use / land cover with Sentinel 2 and deep learning, in: 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS. Presented at the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, pp. 4704–4707. https://doi.org/10.1109/IGARSS47720.2021.9553499
- Mondal, P., Walter, M., Miller, J., Epanchin-Niell, R., Yawatkar, V., Nguyen, E., Gedan, K., Tully, K., 2022. High-resolution remotely sensed datasets for saltwater intrusion across the Delmarva Peninsula. https://doi.org/10.5281/zenodo.6685695
- Planet Team, 2017. Planet Application Program Interface: In Space for Life on Earth. San Francisco, CA. https://api.planet.com.
- Sarupria, M., Vargas, R., Walter, M., Miller, J., Mondal, P., 2025a. Non-linear spectral unmixing for monitoring rapidly salinizing coastal landscapes. Remote Sensing of Environment 319, 114642. https://doi.org/10.1016/j.rse.2025.114642
- Sarupria, M., Mondal, P., Vargas, R., Walter, M., Miller, J. 2025b. Fractional Abundance Datasets for Salt Patches and Marshes Across the Delmarva Peninsula, v1 [Data set]. In Remote Sensing of Environment 319, 114642). Zenodo. https://doi.org/10.5281/zenodo.14709313