Published June 8, 2025 | Version 0.1
Annotation collection Open

Open Land Use Reference Dataset for Palm Oil Landscapes in Indonesia

Description

This dataset was developed under the Lacuna Fund-supported initiative Advancing Oil Palm Mapping in Indonesia with Social Forestry and Machine Learning. It provides a high-resolution, open-access land use reference dataset for supporting machine learning applications in land cover classification. The dataset includes wall-to-wall labeled polygons across 6×6 km grid cells, corresponding monthly satellite imagery mosaics, and a verified validation dataset derived using Collect Earth Online (CEO). The data targets key oil palm production landscapes in Riau and West Sulawesi and supports research on forest change, social forestry, and sustainable land management.

Technical info

The dataset was developed through an integrative data collection and validation approach:

  • Wall-to-Wall Labeling:
    Training data was created through opportunistic sampling across 6x6 km grids overlapping known oil palm regions. Labels were digitized in QGIS and refined through multi-spectral analysis of Planet NICFI imagery from corresponding months. The land cover classification followed a two-tiered typology covering estate crops (e.g., oil palm, rubber, coconut), annual crops, forests, shrubland, wetlands, and other land uses.

  • Satellite Imagery Pairing:
    Each labeled grid is paired with PlanetScope monthly composite images clipped to grid extent. These cloud-minimized mosaics are consistent with the date of interpretation to support supervised machine learning training.

  • Validation Dataset (CEO):
    A stratified random sampling method was applied to generate verification points across diverse land cover classes. Interpreters labeled points in CEO using a standardized survey form and high-resolution imagery. Twenty percent of samples were cross-verified by multiple interpreters to calculate agreement scores as part of QA/QC procedures. Select ambiguous classes were also validated through targeted field checks.

  • Metadata and Format:
    All spatial layers are delivered in GeoJSON format, accompanied by metadata following ISO 19115 standards. Image tiles are delivered as GeoTIFFs, pre-aligned to vector boundaries.

Data Type:

Vector spatial dataset (Polygons, WKT), tabular metadata (.csv/.geojson), JSON-compatible attributes, raster images (GeoTIFF)

Dataset Structure:

Field Name

Type

Description

wkt_geom

Text (WKT)

Full geometric representation of each feature as a MultiPolygon in Well-Known Text format.

fid

Integer

Unique feature ID auto-generated to distinguish individual records.

plotid

Integer

Identifier for the spatial grid unit (e.g., 6x6 km) where the feature was digitized.

class_ENG

Text

Main land cover label following a hierarchical typology (e.g., Palm (Mature), Forest).

class_BAH

Text

Contextual or localized name for the land cover class (e.g., local language term).

class_ID

Integer

Numerical ID assigned to each land cover class to support raster encoding and model training.

timestamp

Date (YYYY-MM-DD)

Date when the feature was digitized and labeled.

sat_time

Date (YYYY-MM-DD)

Acquisition date of the satellite imagery used during interpretation.

 

Land Cover Class Mapping:

Bahasa Indonesia Label

English Label

Class ID

Kelapa sawit (awal tanam)

Palm (Initial planting)

1

Kelapa sawit

Palm

2

Lahan terbangun

Built Up Area

3

Kakao

Cacao

4

Kelapa

Coconut

5

Sawah

Rice Field

6

Karet

Rubber

7

Lahan pertanian lain

Other agricultural field

8

Hutan

Forest

9

Belukar

Shrubland

10

Lahan basah

Wetland

11

Mangrove

Mangrove

12

Badan air

Water body

13

Padang rumput

Grassland

14

Lahan kosong

Bareland

15

Kelas lain

Other class

16

Tidak teridentifikasi

Unknown

0

Annotations:

Labels are expert-generated through remote sensing interpretation and supplemented by input from local field teams. CEO platform allowed interpreters to draw and describe features using temporal image stacks. Cross-checking and interpreter discussions were used to maintain consistency.

Relations to Existing Work:

This dataset was independently created for this project but complements existing regional datasets such as the Forest Data Partnership oil palm probability maps.

Considerations for Using the Data:

This dataset is intended to support machine learning model training and validation for land cover mapping in tropical agricultural regions. Users should consider potential limitations related to cloud cover in source imagery and differences in seasonal appearance of crops. It is particularly valuable for distinguishing oil palm from visually similar classes such as coconut, rubber, and forest.

Associated Imagery:

Each labeled polygon is linked to pre-processed, high-resolution satellite imagery that was used during interpretation. These images are cloud-free and temporally aligned with the label data to facilitate immediate use in AI/ML workflows.

The dataset release also includes multiple layers of reference imagery and data products designed to support advanced model training and validation:

  1. Shapefile of labeled polygons

  2. Raster file encoding Polygon ID (0.3-meter resolution)

  3. Raster file encoding Land Cover Class ID (0.3-meter resolution)

  4. Sentinel-2 composite (RGB + NIR, 10-meter resolution)

  5. Sentinel-2 composite (Red Edge bands 1-3, SWIR1 & SWIR2, 20-meter resolution)

  6. Landsat 8/9 composite (RGBN, 30-meter resolution)

  7. PlanetScope composite (RGBN, 4.7-meter resolution)

  8. Additional high-resolution commercial imagery (in selected sample grids)

All imagery products have been processed for spatial alignment with the labeled data and provided in GeoTIFF format for direct use in remote sensing applications and AI model development.

 

Files

label_grid_01.zip

Files (1.3 GB)

Name Size Download all
md5:95daab79927739531d70c25d220fc8c8
18.5 MB Preview Download
md5:06ca39e90a6db3cfe4cdbb9c00c46b88
19.0 MB Preview Download
md5:3ee0730cec20227db1028e0580de3518
17.9 MB Preview Download
md5:0c8150cdeb055d1558fca4a0cb499241
20.1 MB Preview Download
md5:3dcdf92c7cb7d23ad53b595907e01f0a
17.3 MB Preview Download
md5:1404a015db5981b8278c18cf1c7b563f
16.9 MB Preview Download
md5:c0e344a707417c14a6c7bc4b519e67ba
18.5 MB Preview Download
md5:ace906c081fda2fd69351a85de5856d8
19.5 MB Preview Download
md5:50238424782cca8ecbd73e84e1ff2e7c
17.2 MB Preview Download
md5:4e227006423bb8fa69ab88b1146d126a
15.0 MB Download
md5:8e44848ff456b6082fc189e3e81c8a7c
16.9 MB Preview Download
md5:798def0a1803a6bff3d30bb8ba373094
16.3 MB Preview Download
md5:dd401c8e825c5e9fbeb2bd889dc18f65
18.9 MB Preview Download
md5:28598e28e50ad078b3840417c809ecde
19.0 MB Preview Download
md5:43c97538bdffeabb3d72b28e620c2e95
17.7 MB Preview Download
md5:2d15194e91addc52fba506e969f448a0
18.4 MB Preview Download
md5:bbee53d53aaeee3e34b5a38c414dd981
15.1 MB Preview Download
md5:deccd40b1205e622cda3c843ac28c1b8
20.8 MB Preview Download
md5:2846718b7890f68c1f14a51a25d7557a
21.5 MB Preview Download
md5:af1b086162a57442c53d5789c65ee812
15.6 MB Preview Download
md5:5c58f7620f095e4ad118e0c6ae1b21f2
16.9 MB Preview Download
md5:8023c3d1cf55bc085473cb192803d8b5
15.0 MB Preview Download
md5:900c70fd2966e079c75f763953a77865
16.3 MB Preview Download
md5:c0af66abf048f834ba6e5afe7462b7ea
19.3 MB Preview Download
md5:f9f495ada4b555845d9f44047ebdd634
15.5 MB Preview Download
md5:7794b2bbff11e1ffe460dc2f7cedd8dd
14.1 MB Preview Download
md5:f7a3c357d413566f1ac4c5c8a5b52972
17.2 MB Preview Download
md5:0b957af51ad4cef678401cc8acb484e4
15.1 MB Preview Download
md5:ac962dba285c6c5c9dff931a945528c7
18.0 MB Preview Download
md5:defb12feb1b824dd9876524410571e03
16.7 MB Preview Download
md5:b212a18e7e0bbc4e7a924ac691699a95
14.3 MB Preview Download
md5:2de1de22d51fcadf04fed2f348e1f054
19.3 MB Preview Download
md5:335d4a841aaa0ef2a0f2b5944020de24
17.4 MB Preview Download
md5:8c17a550efe0e4d65d36d2a331ca12be
19.5 MB Preview Download
md5:f404f28ad403c800f7af179185213aab
19.8 MB Preview Download
md5:381dcf77bf261d67e66cc26ce34754d8
17.8 MB Preview Download
md5:3e813acfc8c23de34003ed058ef28fc4
14.7 MB Preview Download
md5:d9b868dea9e4ef3733d039f1ddfd1b45
13.0 MB Preview Download
md5:6b79a7ae2fead09707d9ff1b0b0180f2
17.8 MB Preview Download
md5:e2c5467320078bceed9b5337fbe01321
15.5 MB Preview Download
md5:6025db571190f6afdd1eec55c37826cd
17.7 MB Preview Download
md5:4346748be7a4f408788a35295bca9274
17.8 MB Preview Download
md5:5711bff716690fc776264a1b28251394
19.4 MB Preview Download
md5:dcb3992ec5e7eeecdfb3218cc76aaa52
17.5 MB Preview Download
md5:cbbfa5585511ee6bda353a3707a0058b
16.4 MB Preview Download
md5:e27b5c5699841deb51c3de9a4b568878
18.9 MB Preview Download
md5:515e1c39777b573dea423be8c9efb58a
17.8 MB Preview Download
md5:0e8348e6f29801827eb7a243e5525316
20.0 MB Preview Download
md5:c624678ca29424a17353d0fc40d4c564
14.0 MB Preview Download
md5:17c8ff7e9f90be24ffd58dc2b51f00c1
16.3 MB Preview Download
md5:692d10ce6aa6b1ccbc3739b402ff7246
16.0 MB Preview Download
md5:1b52061f1fa134f10cd4d134bef62700
15.3 MB Preview Download
md5:91d54a5026593939e885c8450b827db3
13.9 MB Preview Download
md5:b30f827ebda2940ab23a254aa257f169
15.1 MB Preview Download
md5:1b433d5e7378b727425b2fb37afdeb3d
16.9 MB Preview Download
md5:07c104191549696c820b37e16c451890
14.5 MB Preview Download
md5:e3911d7dbf45a5b1df9ccee5542ee90e
14.8 MB Preview Download
md5:faf88f9fde03b393a585625ac9e01937
12.8 MB Preview Download
md5:59c2c0fd43ef4e8c51749bfebe16b8eb
20.9 MB Preview Download
md5:7f2a88f3dbd0e0a9f6fdb9b7f69d00e2
17.4 MB Preview Download
md5:0c8e0603b7f691b8be16d68254fd1c4b
16.9 MB Preview Download
md5:605ee9881a584edcd951ac87bdadef2a
15.6 MB Preview Download
md5:515717f7501b369efcc33da1847b9a55
15.3 MB Preview Download
md5:6e4081d691fcbb631e432c088bd8f46f
14.8 MB Preview Download
md5:3ef834cd892ef63140b75e96ee6c7237
17.1 MB Preview Download
md5:3b0e51279ecd20836a3f926ab4a1cc1d
16.0 MB Preview Download
md5:7b35ac281374ead630d9e9848f0d788b
16.1 MB Preview Download
md5:a1456191807ae5158de215c51466c5df
16.5 MB Preview Download
md5:1a4ca4776fb5a334b2e6866f7cba0150
17.5 MB Preview Download
md5:0720f409a9451f956fe0cffdc184606c
16.3 MB Preview Download
md5:7700d4e129b80448568cfdb71b9b459b
16.2 MB Preview Download
md5:3ba8e0de45ef5acc8a0f0d044f7dfa48
19.0 MB Preview Download
md5:c3e5d971c7a51edc28593371bdb72209
17.8 MB Preview Download
md5:0c44a6f13ec76302e993d666a08f4b8a
18.6 MB Preview Download
md5:cdaff039a099ace8acec750799bc8f45
17.8 MB Preview Download
md5:3d4a6c5d26e9d1567fcca3c357d83874
18.7 MB Preview Download

Additional details

Funding

Meridian Institute

Software

Development Status
Active