Published September 12, 2025 | Version v0.1.0
Dataset Open

Cambridge Arboreal Modelling Panoptic 3D (CAMP3D) Dataset

Description

Accurate tree segmentation is a key step in extracting individual-tree metrics from forest laser scans, which are essential for understanding ecosystem functions in carbon cycling and beyond. Over the past decade, tree segmentation algorithms have advanced rapidly with developments in AI. However, existing public 3D forest datasets remain too small to support robust segmentation systems. Motivated by the success of synthetic data in domains such as autonomous driving, we investigate whether similar approaches can benefit tree segmentation. By replacing expensive field data collection and annotation with synthetic data for pretraining, only minimal real plot annotation is required for fine-tuning.

We introduce a new synthetic data generation pipeline for forest vision tasks, integrating modern game engines with physics-based LiDAR simulation. This has produced a large-scale, diverse, annotated 3D forest dataset of unprecedented scope. Experiments with a state-of-the-art tree segmentation algorithm and a widely used real dataset demonstrate that our synthetic data can substantially reduce the need for labelled real data. Dataset generation pipeline: https://github.com/yihshe/CAMP3D.git

The current dataset provides virtual UAV laser scans for 12 forest scenes, including 4 coniferous and 6 deciduous forests (focused on European types), along with two additional scenes (Rainforest and Redwood). It contains instance labels for individual trees and three semantic classes: ground, leaf, and wood. To prepare the data for machine learning, point clouds from each scene are merged, tiled into 50 m × 50 m plots, and split into training (70%), validation (15%), and test (15%) sets. All plots have a point density greater than 1,000 points per square metre, ensuring sufficient resolution for effective tree segmentation learning. 

 

 

Files

Cambridge_Arboreal_Modelling_Panoptic_3D_Dataset.zip

Files (21.4 GB)

Additional details

Software

Repository URL
https://github.com/yihshe/CAMP3D.git
Programming language
Python

References