Published March 20, 2026 | Version v1
Dataset Open

Datasets produced during the EOAfrica "Sentinel 2 user-relevant water quality monitoring in small southern African water bodies" project

  • 1. ROR icon Council for Scientific and Industrial Research
  • 2. EDMO icon University of Cape Town, Department of Oceanography
  • 3. ROR icon University of Stirling

Description

This repository contains the datasets produced during the "Sentinel 2 user-relevant water quality monitoring in small southern African water bodies (SWAM)" project. The project focused on water quality parameters (WQPs), specifically Chlorophyll a (Chl-a) concentration and Total Suspended Matter (TSM) concentration.  

File naming convention: {WQP}_{DatasetType}_{StartYear}_{EndYear}_{Lake}.{FileType}

  • {WQP}:
    • CHL: Chlorophyll-a concentration in units mg m$^{-3}$
    • TSM: Total Suspended Matter concentration in units g m$^{-3}$
  • {DatasetType}:
    • cube: netcdf cube in format (time, lat, lon)
    • timeseries: csv file containing columns for water body name (Lake_name), satellite overpass date and time (DateTime_UTC), date (Date), 10th percentile of WQP over the lake (e.g. CHL_10P), mean WQP concentration (e.g. TSM_mean), median WQP concentration (e.g. CHL_median), 90th percentile of WQP concentration over the lake (e.g. TSM_90P), number of valid satellite pixels remaining after application of atmospheric correction and WQP algorithm (N_valid). Data are only included where >5% of the total water body area contained valid pixels after atmospheric correction and WQP algorithm application. 
  • {StartYear}:
    • 2015: the year of the first timestamp 
  •  {EndYear}:
    • 2025: the year of the last timestamp
  • {Lake}:
    • TW: Theewaterskloof Dam
    • VV: Voelvlei Dam
    • CW: Clanwilliam Dam
    • MV: Misverstand Dam
    • ZV: Zeekoevlei
    • RV: Rietvlei
  • {FileType}:
    • csv: csv file
    • nc: netcdf4 file

All satellite data access and processing was performed on the EO Africa Innovation Lab, a cloud-based virtual research environment with co-located data and computing services. Access to Copernicus data is enabled through Data and Information Access Services (DIAS) technology, with direct data access to CreoDIAS. All workflows were produced in JupyterLab using python. The Earth Observation Data Access Gateway (EODAG) python library was used to search for the file location of all Sentinel 2 L1C data within CreoDIAS during the period from 1 August 2015 to 30 August 2025 for each water body. Each Sentinel 2 file was read, resampled to 20 m spatial resolution, spatially subset to the smallest region of interest encompassing the entire water body, atmospherically corrected and written to NetCDF. In the case of the C2-Nets, the IdePix S2-MSI processor (v9.0.2) was applied to the L1C data to provide additional data flagging and masking options. 

Total Suspended Matter (TSM) concentration products were produced using the Case 2 Regional CoastColour (C2RCC) atmopsheric correction processor (Brockmann et al., 2016) with the C2X-Complex neural network to produce the remote sensing reflectance product, and then deriving TSM using the Jiang et al (2023) algorithm. 

Chlorophyll-a (Chl-a) concentration products were produced using the ACOLITE-RAdCor atmospheric correction (Vanhellemont & Ruddick 2016; Castagna & Vanhellemont, 2025) to produce the remote sensing reflectance product, and then deriving Chl-a using the Mixture Density Network (MDN) from Pahlevan et al (2020). 

This work was supported by the EO Africa R&D Facility, a joint initiative of ESA and the African Union Commission, in the context of the bilateral research project 03_SWAM_ZA_GB_EOAC3 between the Council for Scientific and Industrial Research (South Africa) and the University of Stirling (United Kingdom of Great Britain and Northern Ireland) under ESA Contract No. 4000133905/21/I-EF

 

Files

CHL_timeseries_2015_2025_CW.csv

Files (223.4 MB)

Name Size Download all
md5:6ba55355723c8dd651879e89f71b876a
19.6 MB Download
md5:0322f1a8fdb626f0d8a50830b50ca709
1.3 MB Download
md5:0f4ddd3348cddf11796d5fcf41b25c22
2.1 MB Download
md5:82d8990e61ee7fba1b47f4cfe2b6c9c2
103.2 MB Download
md5:8ff693e89db72d7840fbf361c3a07998
39.1 MB Download
md5:7508f018bfa49ac3d4feb99460191cb3
6.5 MB Download
md5:1a522a5cb61fa7d23cbaf31fb4a55ea5
41.3 kB Preview Download
md5:66459fffa75b12f5dbeba98042781103
17.6 kB Preview Download
md5:af9040473a0a9c1afafcdb3ae112fec1
43.2 kB Preview Download
md5:0b2840674cb77d9dc29783755fbf77b1
47.1 kB Preview Download
md5:45c1b07afee14c8740f5550e543086ae
44.5 kB Preview Download
md5:81991d3693d66c9255fe94ec83e317f6
41.8 kB Preview Download
md5:bc415dbfe41c0731805c0682caf8af2e
25.3 MB Download
md5:95eacc94a7f007dccb609e8683c9ae2c
3.5 MB Download
md5:cd19c8f2d8548990af0199e38347d157
1.6 MB Download
md5:78d9ca08230bbdd96084c2cb68f3e16a
12.9 MB Download
md5:dbd73f4430d18fb76504af8f88aa8ad8
4.3 MB Download
md5:458038e1e6668d5a0b54b6b6978f833b
3.5 MB Download
md5:b44441eae8ebb6a1c718a072e25c8d95
48.5 kB Preview Download
md5:33026414ada8dbc9e668113c28381be6
31.1 kB Preview Download
md5:f2e26508992a92f32f48fb48ab5fe733
36.9 kB Preview Download
md5:8f17b82ba798281508b06aa6fbdd6da9
38.1 kB Preview Download
md5:9ba240b351f3d1d19b8799900e63e557
35.1 kB Preview Download
md5:5a60ab6b7265dcb3a6218c7abb11104e
24.2 kB Preview Download

Additional details

Funding

European Space Agency
EO Africa R&D 4000133905/21/I-EF

Software

Repository URL
https://github.com/DalinJiang/EO-Africa-SWAM
Programming language
Python , Jupyter Notebook
Development Status
Active

References

  • Brockmann, C., Roland, Peters, M., Kerstin, Sabine, & Ruescas, A. (2016). Evolution of the C2RCC neural network for Sentinel 2 and 3 for the retrieval of ocean colour products in normal and extreme optically complex waters. Proceedings of the Living Planet Symposium, 9-13 May 2016.
  • Castagna, A., & Vanhellemont, Q. (2025). A generalized physics-based correction for adjacency effects. Applied Optics, 64(10), 2719. https://doi.org/10.1364/ao.546766
  • Jiang, D., Matsushita, B., Pahlevan, N., Gurlin, D., Fichot, C. G., Harringmeyer, J., Sent, G., Brito, A. C., Brotas, V., Werther, M., Mascarenhas, V., Blake, M., Hunter, P., Tyler, A., & Spyrakos, E. (2023). Estimating the concentration of total suspended solids in inland and coastal waters from Sentinel-2 MSI: A semi-analytical approach. ISPRS Journal of Photogrammetry and Remote Sensing, 204, 362–377. https://doi.org/10.1016/j.isprsjprs.2023.09.020
  • Pahlevan, N., Smith, B., Schalles, J., Binding, C., Cao, Z., Ma, R., Alikas, K., Kangro, K., Gurlin, D., Hà, N., Matsushita, B., Moses, W., Greb, S., Lehmann, M. K., Ondrusek, M., Oppelt, N., & Stumpf, R. (2020). Seamless retrievals of chlorophyll-a from Sentinel-2 (MSI) and Sentinel-3 (OLCI) in inland and coastal waters: A machine-learning approach. Remote Sensing of Environment, 240. https://doi.org/10.1016/j.rse.2019.111604
  • Vanhellemont, Q., & Ruddick, K. (2016, May). Acolite for Sentinel-2: Aquatic applications of MSI imagery. In Proceedings of the 2016 ESA living planet symposium, Prague, Czech Republic (Vol. 9).
  • Vanhellemont, Q (2026). Acolite [Software]. GitHub. https://github.com/acolite/acolite
  • STREAM_RS (2026). MDN_Stream [Software]. Github. https://github.com/STREAM-RS/MDN-STREAM