Published July 9, 2025 | Version v1
Conference paper Open

Spatial-Semantic Reasoning using Large Language Models for Efficient UAV Search Operations

  • 1. Faculty of Electrical Engineering and Computing in Zagreb
  • 2. University of Zagreb Faculty of Electrical Engineering and Computing

Description

We present a real-time semantic navigation framework for Unmanned Aerial Vehicles (UAVs) focused on improving time efficiency in the Object Goal Navigation (ObjectNav) task. Central to our approach is a Large Language Model (LLM) that interprets user-provided natural language instructions and performs semantic reasoning over detected objects and spatial context to prioritize high-probability search regions. The system combines real-time object detection, 3D spatial mapping, and polynomial spline interpolation for smooth and feasible UAV trajectory planning. Unlike prior methods that rely on offline reasoning or simulator-constrained action spaces, our framework operates in real time, continuously updating semantic relevance based on new observations. Experiments in both simulated and real-world settings demonstrate reductions in mission duration while maintaining high search accuracy, underscoring the effectiveness of LLM-guided reasoning for time-efficient UAV-based ObjectNav.
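The description mentions polynomial spline interpolation for smooth trajectory planning but does not say which spline formulation is used. As an illustration only, the sketch below passes a piecewise cubic curve through a list of 3D waypoints using a uniform Catmull-Rom spline, which is one common interpolating polynomial spline and is an assumption here, not the paper's stated method. The function names, the waypoint values, and the sampling density are all hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]


def catmull_rom_point(p0: Point, p1: Point, p2: Point, p3: Point, t: float) -> Point:
    """Evaluate a uniform Catmull-Rom cubic between p1 and p2 at t in [0, 1]."""
    t2 = t * t
    t3 = t2 * t
    return tuple(
        0.5 * (2 * b
               + (-a + c) * t
               + (2 * a - 5 * b + 4 * c - d) * t2
               + (-a + 3 * b - 3 * c + d) * t3)
        for a, b, c, d in zip(p0, p1, p2, p3)
    )


def smooth_path(waypoints: List[Point], samples_per_segment: int = 10) -> List[Point]:
    """Return a densely sampled curve that passes through every waypoint.

    Assumption: the first and last waypoints are duplicated so that the
    end segments have the four control points the cubic needs.
    """
    if len(waypoints) < 2:
        raise ValueError("need at least two waypoints")
    if samples_per_segment < 1:
        raise ValueError("samples_per_segment must be >= 1")

    padded = [waypoints[0]] + list(waypoints) + [waypoints[-1]]
    path: List[Point] = []
    for i in range(len(waypoints) - 1):
        p0, p1, p2, p3 = padded[i], padded[i + 1], padded[i + 2], padded[i + 3]
        for k in range(samples_per_segment):
            path.append(catmull_rom_point(p0, p1, p2, p3, k / samples_per_segment))
    # Close the curve on the final waypoint exactly.
    path.append(tuple(float(v) for v in waypoints[-1]))
    return path


# Hypothetical search waypoints (x, y, z) in metres.
demo_waypoints = [(0.0, 0.0, 1.0), (2.0, 0.0, 1.0), (2.0, 2.0, 1.5), (0.0, 2.0, 1.5)]
demo_path = smooth_path(demo_waypoints, samples_per_segment=10)
```

The sampled curve passes through each waypoint at the start of its segment, so a planner could reorder or replace waypoints as search priorities change and simply resample. A real system would additionally have to check velocity and acceleration limits, which this sketch does not do.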

Files

mmaletic_uav-llm-reasoning.pdf (7.1 MB)
md5:92f560b4055bf2e980d88c1a62cf83d5

Additional details