Spatial-Semantic Reasoning using Large Language Models for Efficient UAV Search Operations
Authors/Creators
Description
We present a real-time semantic navigation frame-work for Unmanned Aerial Vehicles (UAVs) focused on improving time efficiency in the Object Goal Navigation (ObjectNav) task. Central to our approach is a Large Language Model (LLM) that interprets user-provided natural language instructions and performs semantic reasoning over detected objects and spatial context to prioritize high-probability search regions. The system combines real-time object detection, 3D spatial mapping, and polynomial spline interpolation for smooth and feasible UAV trajectory planning. Unlike prior methods that rely on offline reasoning or simulator-constrained action spaces, our framework can operate in real time, continuously updating semantic relevance based on new observations. Experiments in both simulated and real-world settings demonstrate reductions in mission duration while maintaining high search accuracy, underscoring the effectiveness of LLM-guided reasoning for time-efficient UAV-based ObjectNav.
Files
mmaletic_uav-llm-reasoning.pdf
Files
(7.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:92f560b4055bf2e980d88c1a62cf83d5
|
7.1 MB | Preview Download |
Additional details
Software
- Repository URL
- https://github.com/larics/UAV-LLM-semantic-reasoning.git