Dataset Open Access

Sentinel-2 reference cloud masks generated by an active learning method

Louis Baetens; Olivier Hagolle


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/ae8ac2a1-35db-4869-b577-b0cf918d121b/SENTINEL_2_reference_cloud_masks_Baetens_Hagolle.tgz"
      }, 
      "checksum": "md5:ee035e0d22a441086cfaabcface3cf24", 
      "bucket": "ae8ac2a1-35db-4869-b577-b0cf918d121b", 
      "key": "SENTINEL_2_reference_cloud_masks_Baetens_Hagolle.tgz", 
      "type": "tgz", 
      "size": 234569318
    }
  ], 
  "owners": [
    44141
  ], 
  "doi": "10.5281/zenodo.1460961", 
  "stats": {
    "version_unique_downloads": 430.0, 
    "unique_views": 1846.0, 
    "views": 2078.0, 
    "version_views": 2079.0, 
    "unique_downloads": 430.0, 
    "version_unique_views": 1847.0, 
    "volume": 167951631688.0, 
    "version_downloads": 716.0, 
    "downloads": 716.0, 
    "version_volume": 167951631688.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.1460961", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.1460960", 
    "bucket": "https://zenodo.org/api/files/ae8ac2a1-35db-4869-b577-b0cf918d121b", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.1460960.svg", 
    "html": "https://zenodo.org/record/1460961", 
    "latest_html": "https://zenodo.org/record/1460961", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.1460961.svg", 
    "latest": "https://zenodo.org/api/records/1460961"
  }, 
  "conceptdoi": "10.5281/zenodo.1460960", 
  "created": "2018-10-12T15:38:57.540623+00:00", 
  "updated": "2020-01-24T19:25:13.182483+00:00", 
  "conceptrecid": "1460960", 
  "revision": 8, 
  "id": 1460961, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.1460961", 
    "description": "<p>&nbsp;<strong>Reference classifications generated with Active Learning for Cloud Detection (ALCD)</strong></p>\n\n<p>This data set provides a reference cloud mask data set for 38 Sentinel-2 scenes. These reference masks have been created with the ALCD tool, developed by Louis Baetens, under the direction of Olivier Hagolle at CESBIO/CNES[1]. They were created to validate the cloud masks generated by the MAJA software [2].</p>\n\n<p>- The `Reference_dataset` directory contains 31 scenes selected in 2017 or 2018.<br>\n- The `Hollstein` directory contains 7 scenes that were used to validate the ALCD tool by comparison to manually generated reference images kindlyprovided by Hollstein et al[3]<br>\nOne of these scenes is present in both directories. For the validation of MAJA, the &quot;Hollstein&quot; scenes were not used because of their acquisition at a time period when Sentinel-2 was not yet operational, with a degraded repetitivity of observations.</p>\n\n<p><strong># Description of the data structure</strong><br>\nThe name of each scene directory is the name of the corresponding Sentinel-2 L1C product.<br>\nIn the scene directory, three sub-directories can be found.<br>\n- `Classification`<br>\n- `Samples`<br>\n- `Statistics`</p>\n\n<p><strong># Description of the files</strong><br>\n- `Classification/classification_map.tif` --- the main product, which is the classified scene. 7 classes are available. Each one is represented with a different integer.<br>\n0: no_data.<br>\n1: not used.<br>\n2: low clouds.<br>\n3: high clouds.<br>\n4: clouds shadows.<br>\n5: land.<br>\n6: water.<br>\n7: snow.</p>\n\n<p>- `Classification/confidence_enhanced.tif` --- enhanced confidence map of the classification. The values are between 0 and 255 (coded on 1 bit).<br>\nThe original confidence map is, for each pixel, the proportion of votes for the majority class as the classification map has been created via a Random Forest algorithm.<br>\nA median filter has been applied to this confidence map. Finally, the value was saved on 1 bit, leading to the value being between 0 and 255.</p>\n\n<p>- `Classification/contours.png` --- the contours of the classes from the classification map, overlayed on the scene. The color code depends on each class.<br>\nGreen: low and high clouds. Yellow: cloud shadows. Blue: water. Purple: snow.</p>\n\n<p>- `Classification/used_parameters.json` --- the parameters that were used to classify the scene. It includes the tile code, the cloudy and clear dates, along with their product reference.</p>\n\n<p>- `Samples/` --- this directory contains all the shapefiles, one per class.</p>\n\n<p>- `Statistics/k_fold_summary.json` --- results of the 10-fold cross-validation on the scene.<br>\n5 metrics are computed, in the order given in the &quot;metrics_names&quot;. &quot;all_metrics&quot; is a list of the 10 folds, with the 5 metrics in the correct order for each fold.<br>\n&quot;means&quot; and &quot;stds&quot; are the means and standard deviations of the 10 folds.</p>\n\n<p><br>\n<strong># References</strong></p>\n\n<p>[1] Baetens, L.; Desjardins, C.; Hagolle, O. Validation of Copernicus Sentinel-2 Cloud Masks Obtained from MAJA, Sen2Cor, and FMask Processors Using Reference Cloud Masks Generated with a Supervised Active Learning Procedure. <em>Remote Sens.</em> <strong>2019</strong>, <em>11</em>, 433.</p>\n\n<p>[2] A multi-temporal method for cloud detection, applied to FORMOSAT-2, VEN&micro;S, LANDSAT and SENTINEL-2 images, O Hagolle, M Huc, D. Villa Pascual, G Dedieu, Remote Sensing of Environment 114 (8), 1747-1755, 2010</p>\n\n<p>[3] Hollstein, A.; Segl, K.; Guanter, L.; Brell, M.; Enesco, M. Ready-to-Use Methods for the Detection of Clouds, Cirrus, Snow, Shadow, Water and Clear Sky Pixels in Sentinel-2 MSI Images. Remote Sens. 2016, 8, 666</p>", 
    "language": "eng", 
    "title": "Sentinel-2 reference cloud masks generated by an active learning method", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "1460960"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "1460961"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "remote-sensing"
      }
    ], 
    "keywords": [
      "Sentinel-2", 
      "Cloud mask", 
      "Validation"
    ], 
    "publication_date": "2018-10-12", 
    "creators": [
      {
        "affiliation": "CESBIO/CNES", 
        "name": "Louis Baetens"
      }, 
      {
        "orcid": "0000-0003-2358-0493", 
        "affiliation": "CESBIO/CNES", 
        "name": "Olivier Hagolle"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.1460960", 
        "relation": "isVersionOf"
      }
    ]
  }
}
2,079
716
views
downloads
All versions This version
Views 2,0792,078
Downloads 716716
Data volume 168.0 GB168.0 GB
Unique views 1,8471,846
Unique downloads 430430

Share

Cite as