Dataset Open Access

The WorldStrat Dataset: Open High-Resolution Satellite Imagery With Paired Multi-Temporal Low-Resolution

Julien Cornebise; Ivan Oršolić; Freddie Kalaitzis

What is this dataset?

Nearly 10,000 km² of free high-resolution and matched low-resolution satellite imagery of unique locations which ensure stratified representation of all types of land-use across the world: from agriculture to ice caps, from forests to multiple urbanization densities.

Those locations are also enriched with typically under-represented locations in ML datasets: sites of humanitarian interest, illegal mining sites, and settlements of persons at risk.

Each high-resolution image (1.5 m/pixel) comes with multiple temporally-matched low-resolution images from the freely accessible lower-resolution Sentinel-2 satellites (10 m/pixel).

We accompany this dataset with a paper, datasheet for datasets and an open-source Python package to: rebuild or extend the WorldStrat dataset, train and infer baseline algorithms, and learn with abundant tutorials, all compatible with the popular EO-learn toolbox.

Why make this?

We hope to foster broad-spectrum applications of ML to satellite imagery, and possibly develop the same power of analysis allowed by costly private high-resolution imagery from free public low-resolution Sentinel2 imagery. We illustrate this specific point by training and releasing several highly compute-efficient baselines on the task of Multi-Frame Super-Resolution.

Licences

  • The high-resolution Airbus imagery is distributed, with authorization from Airbus, under Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).
  • The labels, Sentinel2 imagery, and trained weights are released under Creative Commons with Attribution 4.0 International (CC BY 4.0).
  • The source code (will be shortly released on GitHub) under 3-Clause BSD license.
Files (107.0 GB)
Name Size
hr_dataset.tar.gz
md5:ca7167334006f3c17f9071f14c435335
41.5 GB Download
hr_dataset_raw.tar.gz
md5:40a4fd5241e91bffeddee0ed109778da
11.3 GB Download
LICENSE.txt
md5:d97b8d86da83f7e51f2d3205509e4a7b
39.3 kB Download
lr_dataset_l1c.tar.gz
md5:d2dcafa207b1e1bc6c754607f15e9ed6
27.4 GB Download
lr_dataset_l2a.tar.gz
md5:8cfc6a477cee9e9cd8b20ea27227de65
26.8 GB Download
metadata.csv
md5:dfeb3348e79b719bf03c230d5d258839
14.6 MB Download
stratified_train_val_test_split.csv
md5:745035835d835280aa0298a9dc1996d1
282.6 kB Download
WorldStrat_article_and_datasheet.pdf
md5:58d8e87c52bfaec962ab1ca5c3bf48b1
2.6 MB Download
2,593
17,748
views
downloads
All versions This version
Views 2,5932,593
Downloads 17,74817,748
Data volume 211.3 TB211.3 TB
Unique views 2,2852,285
Unique downloads 2,8782,878

Share

Cite as