Published March 24, 2026
| Version v1
Conference paper
Open
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
Authors/Creators
Files
2506.05551v2.pdf
Files
(6.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:ae114a4bfda1490db72682fa998be1cd
|
6.8 MB | Preview Download |