Published November 30, 2022 | Version v1.0
Dataset Open

Images and 2-class labels for semantic segmentation of Sentinel-2 and Landsat RGB, NIR, and SWIR satellite images of coasts (water, other)

  • 1. Marda Science LLC

Description

Images and 2-class labels for semantic segmentation of Sentinel-2 and Landsat RGB, NIR, and SWIR satellite images of coasts (water, other)

Images and 2-class labels for semantic segmentation of Sentinel-2 and Landsat 5-band (R+G+B+NIR+SWIR) satellite images of coasts (water, other)

Description

3649 images and 3649 associated labels for semantic segmentation of Sentinel-2 and Landsat 5-band (R+G+B+NIR+SWIR) satellite images of coasts. The 2 classes are 1=water, 0=other. Imagery are a mixture of 10-m Sentinel-2 and 15-m pansharpened Landsat 7, 8, and 9 visible-band imagery of various sizes. Red, Green, Blue, near-infrared, and short-wave infrared bands only

These images and labels could be used within numerous Machine Learning frameworks for image segmentation, but have specifically been made for use with the Doodleverse software package, Segmentation Gym**.

Two data sources have been combined

Dataset 1

* 579 image-label pairs from the following data release**** https://doi.org/10.5281/zenodo.7344571
* Labels have been reclassified from 4 classes to 2 classes.
* Some (422) of these images and labels were originally included in the Coast Train*** data release, and have been modified from their original by reclassifying from the original classes to the present 2 classes.
* These images and labels have been made using the Doodleverse software package, Doodler*.

Dataset 2

  • 3070 image-label pairs from the Sentinel-2 Water Edges Dataset (SWED)***** dataset, https://openmldata.ukho.gov.uk/, described by Seale et al. (2022)******
  • A subset of the original SWED imagery (256 x 256 x 12) and labels (256 x 256 x 1) have been chosen, based on the criteria of more than 2.5% of the pixels represent water

File descriptions

  •     classes.txt, a file containing the class names
  •     images.zip, a zipped folder containing the 3-band RGB images of varying sizes and extents
  •     labels.zip, a zipped folder containing the 1-band label images
  •     nir.zip, a zipped folder containing the 1-band near-infrared (NIR) images
  •     swir.zip, a zipped folder containing the 1-band shorttwave infrared (SWIR) images
  •     overlays.zip, a zipped folder containing a semi-transparent overlay of the color-coded label on the image (red=1=water, blue=0=other)
  •     resized_images.zip, RGB images resized to 512x512x3 pixels
  •     resized_labels.zip, label images resized to 512x512x1 pixels
  •     resized_nir.zip, NIR images resized to 512x512x1 pixels
  •     resized_swir.zip, SWIR images resized to 512x512x1 pixels

References

*Doodler: Buscombe, D., Goldstein, E.B., Sherwood, C.R., Bodine, C., Brown, J.A., Favela, J., Fitzpatrick, S., Kranenburg, C.J., Over, J.R., Ritchie, A.C. and Warrick, J.A., 2021. Human‐in‐the‐Loop Segmentation of Earth Surface Imagery. Earth and Space Science, p.e2021EA002085https://doi.org/10.1029/2021EA002085. See https://github.com/Doodleverse/dash_doodler.

**Segmentation Gym: Buscombe, D., & Goldstein, E. B. (2022). A reproducible and reusable pipeline for segmentation of geoscientific imagery. Earth and Space Science, 9, e2022EA002332. https://doi.org/10.1029/2022EA002332 See: https://github.com/Doodleverse/segmentation_gym

***Coast Train data release: Wernette, P.A., Buscombe, D.D., Favela, J., Fitzpatrick, S., and Goldstein E., 2022, Coast Train--Labeled imagery for training and evaluation of data-driven models for image segmentation: U.S. Geological Survey data release, https://doi.org/10.5066/P91NP87I. See https://coasttrain.github.io/CoastTrain/ for more information

****Buscombe, Daniel. (2022). Images and 4-class labels for semantic segmentation of Sentinel-2 and Landsat RGB, NIR, and SWIR satellite images of coasts (water, whitewater, sediment, other) (v1.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.7344571

*****Seale, C., Redfern, T., Chatfield, P. 2022. Sentinel-2 Water Edges Dataset (SWED) https://openmldata.ukho.gov.uk/

******Seale, C., Redfern, T., Chatfield, P., Luo, C. and Dempsey, K., 2022. Coastline detection in satellite imagery: A deep learning approach on new benchmark data. Remote Sensing of Environment, 278, p.113044.

Files

classes.txt

Files (1.7 GB)

Name Size Download all
md5:1ecf2899a249144a91eccdc65ffc3520
11 Bytes Preview Download
md5:43ff09c2c75142f07cd3dd2e04f0dda7
63.0 MB Preview Download
md5:cf151c86eba84a481d247f344fc26aae
4.2 MB Preview Download
md5:78112a7eb0084a6e3668f823ee9ed103
57.7 MB Preview Download
md5:a0265f8d417b489dc6233697f7a8c70b
895.1 MB Preview Download
md5:083ed3f673de3f06e48f91ff5f5ecb0d
4.0 kB Preview Download
md5:214946dce378c15caf83f79888ae1e19
346.6 MB Preview Download
md5:1198d1d7108382a38cf6a535a848ae1f
7.7 MB Preview Download
md5:f5d7d5a13ee3d0add94f516013424503
177.3 MB Preview Download
md5:915fd56af636a3711aadc36345f2a38f
140.5 MB Preview Download
md5:81d919ee9a9b5d584a43d2e22ddec53b
46.5 MB Preview Download