Globe230k: A Benchmark Dense-Pixel Annotation Dataset for Global Land Cover Mapping

Shi, Qian; He, Da; Liu, Zhengyu; Liu, Xiaoping; Xue, Jingqian

doi:10.5281/zenodo.10435661

Published January 4, 2024 | Version v3

Dataset Open

Globe230k: A Benchmark Dense-Pixel Annotation Dataset for Global Land Cover Mapping

1. Sun Yat-Sen University

We (Intelligent Mining and Analysis of Remote Sensing big data, IMARS) create a large-scale annotated dataset (Globe230k) for land use/land cover (LULC) mapping, which is annotated on Google Earth image of 1 m spatial resolution. Globe230k is annotated by numerous experts and students major in survey and mapping after necessary training, through visual interpretation on very high-resolution images, as well as in-situ field survey, under the guidance of the organized annotation pipeline. Globe230k has three superiorities:

1) Large scale: the Globe230k includes 232,819 annotated images with the size of 512x512 and spatial resolution of 1 m, with more than 3x1010 annotated pixels, and it includes 10 first-level categories.

2) Rich diversity: the annotated images are sampled from worldwide regions, with coverage area of over 60,000 km2, indicating a high variability and diversity. Besides, in order to ensure the category balance, we intentionally give more chance to the rare categories to be sampled, such as wetland, ice/snow, etc.

3) Multi-modal: Globe230k not only contains RGB bands, but also include other important features for Earth system research, such as Normalized differential vegetation index (NDVI), digital elevation model (DEM), vertical-vertical polarization (VV) bands, vertical-horizontal polarization (VH) bands, which can facilitate the multi-modal data fusion research. Due to the large size of the multi-modal dataset (DEM 1.91G, NDVI 164G, VVVH 372G), these dataset are stored on Baidu Yunpan, the download link is :https://pan.baidu.com/s/12AKbiqOXSf4fnm7mYkCE0g?pwd=230k, the extraction code is 230k.

The image patches and their corresponding annotated patches are respectively stored in "image_patch.zip" and "label_patch.zip" file. The RGB image is in forms of ".jpg", with size of 512x512, the pixel value is ranged from 0-255. The annotated patches is in forms of ".png", also with size of 512x512, the pixel value is ranged from 1-10, which respectively represent 1#cropland, 2#forest, 3#grass, 4#shrubland, 5#wetland, 6#water, 7#tundra, 8#impervious, 9#bareland, 10#ice/snow. The corresponding DEM, NDVI and VVVH patches are all in form of ".tif", with size of 512x512 (due to the different resolution of DEM, NDVI and VVVH patches, they are all uniformly resized to the same scale as the image patch).

The total 232,819 pairs are officially divided into training set, validation set, and test set, based on ratio of 7:1:2, which can be find in "train_num.txt","val_num.txt","test_num.txt" file. Based on this division, the official baseline accuracy of several state-of-the-art semantic segmentation can be found in the related arcticle (https://spj.science.org/doi/10.34133/remotesensing.0078).

We hope it can be used as a benchmark to promote further development of global land cover mapping and semantic segmentation algorithm development.

Files

Globe230k User Guides.pdf

Files (12.2 GB)

Name	Size	Download all
Globe230k User Guides.pdf md5:36eb4c63ebe04bd7a77815d4a8db2045	716.1 kB	Preview Download
image_patch.zip md5:054bce9d5d801de131d2dc78299b62ab	11.5 GB	Preview Download
kindly find the download URL of DEM, NDVI and VVVH data in this txt.txt md5:fbb1798dcec594c37b6cde0e9e99675d	241 Bytes	Preview Download
label_patch.zip md5:3ad0333540fff91253092f1eba1d017f	723.8 MB	Preview Download
test_num.txt md5:01bb37630ee50085790d2ca33e659c99	535.5 kB	Preview Download
train_num.txt md5:cb71db1567c769061476191bf16294ad	1.9 MB	Preview Download
val_num.txt md5:d1e060c2f08868137024b1422d25ad6d	269.8 kB	Preview Download

	All versions	This version
Views	5,322	1,302
Downloads	10,081	3,069
Data volume	109.0 TB	38.3 TB

Globe230k: A Benchmark Dense-Pixel Annotation Dataset for Global Land Cover Mapping

Authors/Creators

Description

Files

Globe230k User Guides.pdf

Files (12.2 GB)