When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Published March 24, 2026 | Version v1

Conference paper Open

Files

Name	Size
2506.05551v2.pdf md5:ae114a4bfda1490db72682fa998be1cd	6.8 MB

Resource type

Conference paper

Publisher

Zenodo

Conference

Neural Information Processing Systems (NeurIPS), December 2025

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.