Published August 8, 2024 | Version 1.0
Dataset Open

Plot-level semantically labelled terrestrial laser scanning point clouds

Creators

Description

Abstract

Point clouds from Terrestrial Laser Scanning (TLS) are an increasingly popular source of data for studying plant structure and function. However, unlocking their full potential currently requires extensive manual processing to extract ecologically important information. One key task is the accurate semantic segmentation of different plant material within point clouds, particularly wood and leaves, which is required to understand plant productivity, architecture, competition, space optimisation and physiology, and is a key step in common approaches to individual tree extraction. Existing automated semantic segmentation methods are primarily developed for single ecosystem types, and whilst they show good accuracy for biomass assessment from the trunk and large branches, often perform less well within the crown. In this study, we demonstrate a new framework that uses a deep learning architecture developed from PointNet++ and pointNEXT for processing 3D point clouds to provide a reliable semantic segmentation of wood and leaf in TLS point clouds from the tree base to branch tips, applied to diverse natural European forests. Our model combines meticulously labelled data with voxel-based sampling and a novel gated reflectance integration module embedded throughout the feature extraction layers. We evaluate its performance across an extensive dataset, encompassing diverse ecosystem types and sensor characteristics. Our results show consistent outperformance against the most widely used PointNet++-based approach for leaf/wood segmentation on a high-density TLS dataset collected across diverse mixed forest plots across all major biomes in Europe. We tested our model against others’ open data from China, Eastern Cameroon, Germany and Finland, collected using both time-of-flight and phase-shift sensors, finding consistently strong performance, showcasing the transferability of our model to a wide range of ecosystems and sensors. Our newly developed evaluation metric for assessing performance in the outer parts of the canopy, such as in twigs and small branches, found our model to clearly outperform the most widely used approach.

Methods

Within each country, we scanned a subset of the 30 m x 30 m FUNDIV plots using a Riegl VZ400i TLS instrument (RIEGL Gmbh, Horn, Austria), scanning at 600MHz and with an angular resolution of 0.04 mrad. All plots were scanned following a 10m grid system with a minimum of 16 upright and 16 tilt scans (following Wilkes et al. 2017), with additional scans to minimise occlusion in dense areas, and on the plot perimeter. To ensure high-quality data with minimal noise, scanning was paused when wind conditions rose above 5 m/s (measured with an anemometer on the ground) or when gusts were visually evident. In order to create our labelled dataset, we used a semi-automated approach informed by existing approaches followed by significant manual cleaning. Vicari et al. (2019) found anisotropy, verticality and linearity to be informative features for leaf-wood separation, so we created these geometric features at spatial scales of approx. 5 cm - 0.5 m (using CloudCompare, 2023). Alongside these, we used reflectance and xyz information for each point and labelled leaf-wood by thresholding these features. We followed this with intensive manual checking and cleaning to ensure high label quality, especially in the smaller branches and twigs. Our dense scanning and labelling strategy means that this dataset is an ideal candidate for training and testing the capabilities of processing algorithms. Our dataset comprises nine 10 m x 10 m blocks of x, y, z coordinates with corresponding reflectance values and labels, all at 1 cm resolution achieved through voxel downsampling. This data was used for training and validation. For comprehensive evaluation, we incorporated additional openly available datasets, which we cropped to reduce size and cleaned to rectify erroneous labels. The original, unmodified versions of these datasets can be accessed as follows:

*Mspace Lab (2024) ‘ForestSemantic: A Dataset for Semantic Learning of Forest from Close-Range Sensing’, Geo-spatial Information Science. Zenodo. https://doi.org/10.5281/zenodo.13285640. Distributed under a Creative Commons Attribution Non Commercial No Derivatives 4.0 International licence.

Wang, Di; Takoudjou, Stéphane Momo; Casella, Eric (2021). LeWoS: A universal leaf‐wood classification method to facilitate the 3D modelling of large tropical trees using terrestrial LiDAR [Dataset]. Dryad. https://doi.org/10.5061/dryad.np5hqbzp6. Distributed under a Creative Commons 0 1.0 Universal licence.

Wan, Peng; Zhang, Wuming; Jin, Shuangna (2021). Plot-level wood-leaf separation for terrestrial laser scanning point clouds [Dataset]. Dryad. https://doi.org/10.5061/dryad.rfj6q5799. Distributed under a Creative Commons CC0 1.0 Universal licence.

Weiser, Hannah; Ulrich, Veit; Winiwarter, Lukas; Esmorís, Alberto M.; Höfle, Bernhard, 2024, "Manually labeled terrestrial laser scanning point clouds of individual trees for leaf-wood separation", https://doi.org/10.11588/data/UUMEDI, heiDATA, V1, UNF:6:9U7BGTgjjsWd1GduT1qXjA== [fileUNF]. Distributed under a Creative Commons Attribution 4.0 International Deed.

*For licensing reasons these data re not included in this repository but can be downladed from the doi provided. 

 

 

Files

evaluate.zip

Files (2.0 GB)

Name Size Download all
md5:a6268b918da9556f1ed96dc4a72cc4c5
1.0 GB Preview Download
md5:29c1a4f1f1bab1717eefc719af448136
339.6 MB Preview Download
md5:159f2ba273960fe20a7a6b6cf575dd8a
672.6 MB Preview Download

Additional details

Software

Programming language
Python
Development Status
Active