Published February 5, 2026 | Version v1

HINT-3D: Human-in-the-Loop Interactive Test-Time Adaptation for 3D Segmentation

  • 1. ROR icon American University of Beirut

Description

We present HINT-3D, a human-in-the-loop test-time
adaptation framework for 3D semantic segmentation. A few
corrective clicks are converted into region masks by a promptable
3D interface (PointSAM). These masks supervise stability-aware
updates to a pretrained backbone at inference. We persist the
updates so later scenes start from improved weights, enabling
cumulative learning. The wrapper is backbone-agnostic: it requires
only logits, a mask-to-index bridge, plus access to a small trainable
parameter set; we instantiate it on KPConv, RandLA-Net, and
Point Transformer v1. On S3DIS Area-5, HINT-3D delivers
strong effort-accuracy gains within a scene, consistent zero-click
improvements across scenes, and reduced Expected Calibration
Error (ECE), while maintaining responsiveness with head-only
updates and uncertainty-gated training. We report mIoU versus
saved masks, cross-scene transfer, ECE, latency, and class-specific
corrections on common indoor failure modes.

Files

HINT_3D.pdf

Files (3.4 MB)

Name Size Download all
md5:f2a0edb84dc9212d49ca350a2a33405b
3.4 MB Preview Download

Additional details

Funding

European Commission
DIDYMOS-XR - DIgital DYnaMic and respOnsible twinS for XR 101092875

Dates

Available
2026-02-05