Published July 4, 2024 | Version 2.0.0
Dataset Open

BigEarthNet v2

  • 1. Berlin Institute for the Foundations of Learning and Data (BIFOLD)
  • 2. ROR icon Technische Universität Berlin
  • 3. ROR icon École Polytechnique Fédérale de Lausanne

Description

BigEarthNet v2.0

The BigEarthNet v2.0 dataset was constructed by the Remote Sensing Image Analysis (RSiM) Group and the Database Systems and Information Management (DIMA) Group at the Technische Universität Berlin (TU Berlin). This work is supported by the European Research Council under the ERC Starting Grant BigEarth and by the Berlin Institute for the Foundations of Learning and Data (BIFOLD).

BigEarthNet v2.0 is a benchmark dataset consisting of 549,488 pairs of Sentinel-1 and Sentinel-2 image patches. To construct BigEarthNet v2.0 with Sentinel-2 image patches (called as BigEarthNet-S2), 115 Sentinel-2 tiles acquired between June 2017 and May 2018 over 10 countries (Austria, Belgium, Finland, Ireland, Kosovo, Lithuania, Luxembourg, Portugal, Serbia, and Switzerland) of Europe were initially selected. All the tiles were atmospherically corrected by the Sentinel-2 Level 2A product generation and formatting tool (sen2cor v2.11). Then, they were divided into 549,488 image patches. Each image patch was associated with a pixel-level reference map and multiple land-cover class labels (i.e., multi-labels) that were derived from the most recent CORINE Land Cover database of the year 2018 (CLC2018 v2020_u1).

To construct BigEarthNet v2.0 with Sentinel-1 image patches (called as BigEarthNet-S1), 312 Sentinel-1 scenes acquired between June 2017 and May 2018 that jointly cover the area of all original 115 Sentinel-2 tiles with close temporal proximity were selected and processed. BigEarthNet-S1 consists of 549,488 preprocessed Sentinel-1 image patches – one for each Sentinel-2 patch.

The BigEarthNet v2.0 dataset includes several significant improvements compared to the previous 1.0 version. These changes include the application of the latest atmospheric correction tool (sen2cor), which results in higher-quality patches. Additionally, the most recent version of the CLC2018 database was utilized to extract label information, overcoming label noise present in BigEarthNet v1.0. Apart from providing patch-level labels, v2.0 additionally includes pixel-level reference maps, making the dataset suitable for pixel- and scene-based learning tasks. Furthermore, BigEarthNet v2.0 introduces a new geographical-based split assignment algorithm, which significantly reduces spatial correlation among the train, validation, and test sets compared to v1.0.

If you use this work, please cite:

K. Clasen, L. Hackel, T. Burgert, G. Sumbul, B. Demir, V. Markl, reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis”, IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2025.

 

Files

Files (118.0 GB)

Name Size Download all
md5:a55eaa2cdf6a917e296bd6601ec1e348
54.4 GB Download
md5:2245ed2d1a93f6ce637d839bc856396e
63.3 GB Download
md5:55687065e77b6d0b0f1ff604a6e7b49c
3.6 MB Download
md5:fe31856f4986d446c9468b59d6387c91
710.2 kB Download
md5:95d85a222fa983faddcac51a19f28917
282.4 MB Download

Additional details

Related works

Is derived from
Dataset: 10.2909/71c95a07-e296-44fc-b22b-415f42acfdf0 (DOI)
Is new version of
Dataset: 10.1109/IGARSS.2019.8900532 (DOI)
Dataset: 10.1109/MGRS.2021.3089174 (DOI)
Is published in
Preprint: arXiv:2407.03653 (arXiv)
Is supplemented by
Software: https://github.com/kai-tub/rico-hdl (URL)

Funding

European Research Council
ERC-2017-STG BigEarth Project 759764

Software