Published May 27, 2026 | Version v5

Per-Entity Bias Mapping for AI Visibility: Why Brand Mentions Require Entity-Specific Calibration

Authors/Creators

  • 1. Neural Awareness, Budapest, Hungary

Description

AI-mediated answer systems increasingly influence how brands, organizations, experts, and products are represented to users. Existing visibility approaches reduce this representation to mention rate, citation rate, or aggregate platform visibility. This paper argues that such aggregate metrics are insufficient because entities exhibit systematically different AI visibility error profiles that aggregate scores cannot detect. We identify three distinct failure modes. First, small or underrepresented entities may suffer from invisibility due to weak structured data, limited knowledge graph presence, and low co-citation density. Second, large or highly familiar entities may suffer disproportionately from false attribution and hallucinated authority — a pattern we formalize as the Brand Hallucination Paradox, where model familiarity creates a stronger surface for plausible but incorrect completions. Third, entities from underrepresented regions such as Central and Eastern Europe face a structural CEE Entity Infrastructure Gap operating across three technical layers: knowledge graph absence (missing Wikidata identifiers), named entity recognition performance deficits for morphologically complex languages, and entity linking difficulty due to linguistic variation — producing AI visibility deficits that content optimization alone cannot resolve. A fourth dimension cuts across all three: the Parametric-Retrieval Lag Asymmetry. AI answer systems operate through two knowledge mechanisms with fundamentally different update speeds. Retrieval-augmented systems can incorporate new information within days, while parametric memory encoded in model weights updates only at retraining intervals spanning twelve to twenty-four months. Entities undergoing rapid change — rebranding, mergers, market entry, reputation events — may be accurately represented in retrieval-augmented outputs while remaining outdated or absent in parametric outputs, producing measurably divergent representations of the same entity across platforms or query modes. This paper introduces Per-Entity Bias Mapping as a framework for calibrating AI visibility at the level of individual entities. The framework distinguishes raw mentions from verified mentions and proposes a ten-dimensional measurement agenda covering retrieval inclusion, mention probability, hallucination rate, citation fidelity, source authority, and parametric-retrieval lag. We introduce ghost cartography as a unifying mechanism for entity-level AI misrepresentation: when entities occupy sparse latent regions, models produce confident, evidence-like output interpolated from neighboring dense regions. This yields a two-dimensional confabulation space in which the Brand Hallucination Paradox captures fabricated presence (Type 3) and Parametric-Retrieval Lag Asymmetry captures frozen representation (Type 4). We further frame AI visibility as a field phenomenon: entities become machine-recognizable figures through citation density, co-citation patterns, structured data, and knowledge graph anchoring, not through self-description alone. Empirical study: n=100 Hungarian B2B entities, 1,400 probe runs, 2,062 sources, non-AI HTTP+Crossref verification. Platforms: claude-sonnet-4-6 and gpt-4o-2024-08-06. Data collection: 2026-05-22, 07:25-12:16 UTC. Full data released as supplementary materials (CC BY 4.0).

Notes

v5 — Ghost Cartography Edition (2026-05-27). Extends v4 with theoretical and conceptual framing of entity-level AI misrepresentation. Introduces Ghost Cartography as a unifying mechanism for sparse-region entity confabulation; formalizes Type 3 (Brand Hallucination Paradox) and Type 4 (Parametric-Retrieval Lag Asymmetry) confabulation as a two-dimensional space; adds field model of AI visibility (element->figure, field navigation). Key additions: half-known zone (S1), brand != landing page (S2), S3.7-3.9 Ghost Cartography definitions, S3.10 field model, citation network as machine medium (S10), revised source-level logistic regression (OR=0.37, 95% CI [0.24-0.56], p<0.001), program statement (S14), contributions C8+C9. No new primary empirical dataset; empirical foundation remains v4 study (n=100 Hungarian B2B entities, 1,400 probe runs, 2,062 source records). New references: [48] Ardoin et al. EMNLP 2025; [49] Manovich (2020) Cultural Analytics; [50] Fisher (2016) The Weird and the Eerie.

Files

AIVISIBILITY-PER-ENTITY-BIAS-MAP-PREPRINT-DRAFT-EN-2026-05-27.md

Files (1.4 MB)

Name Size Download all
md5:07751df34d5cb20e56da36a1d409463f
121.8 kB Preview Download
md5:068aca72a41e358dfc33b95368355f4a
377.7 kB Preview Download
md5:dd350824ab833cdbbbc5e4b6c643e8c7
845.2 kB Preview Download
md5:4595bbeb1163a55b4c04a1ebdaca3422
4.5 kB Preview Download
md5:b5d1ef73bdcfa161c4e79368cceb7bb4
12.1 kB Download

Additional details

Related works