Explainability Analysis of Retrieval-Driven Behavior in RAG Pipelines

Kim, Yujin; Rajan, Ranjidha

doi:10.5281/zenodo.18945160

Published February 3, 2026 | Version v2

Publication Open

1. Metropolitan State University of Denver

This work presents a systematic explainability analysis of retrieval-driven behavior in Retrieval-Augmented Generation (RAG) pipelines. The study examines how embedding selection, FAISS-based vector retrieval, and generator architecture jointly influence answer correctness, reasoning stability, and hallucination behavior.
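To make the retrieval step concrete, the following is a minimal sketch of exact top-k cosine-similarity search over passage embeddings. It is a pure-Python stand-in for the kind of lookup a FAISS index performs, not the authors' implementation: the helper names `normalize` and `top_k`, the toy two-dimensional vectors, and the choice of cosine similarity are all assumptions made for illustration.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def top_k(query, passages, k):
    """Return the k passages most similar to the query as (index, score) pairs.

    Hypothetical helper: an exhaustive scan, standing in for a vector index.
    """
    q = normalize(query)
    scored = []
    for idx, vec in enumerate(passages):
        p = normalize(vec)
        score = sum(a * b for a, b in zip(q, p))
        scored.append((score, idx))
    # Highest score first; ties are broken by the lower passage index.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(idx, score) for score, idx in scored[:k]]


# Toy embeddings (assumed values, not data from the study).
passages = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
hits = top_k([1.0, 0.1], passages, k=2)
```

Here the query lies closest to passage 0, then passage 2. In a real pipeline the embeddings would come from the chosen embedding model, and the scan would be replaced by an index lookup.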

Using controlled experiments on the SQuAD v2 dataset, the analysis quantifies retrieval precision, semantic drift, and error propagation across the pipeline. Multiple explainability methods are applied, including attention analysis, integrated gradients attribution, and confidence calibration, to trace how retrieved evidence is consumed by the generator.
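The abstract does not state how retrieval precision or calibration is computed, so the sketch below shows one common formulation of each: precision@k for retrieval and expected calibration error (ECE) for generator confidence. The function names, the equal-width binning scheme, and the sample values are assumptions for illustration and may differ from the definitions used in the paper.

```python
def precision_at_k(retrieved_ids, relevant_ids, k):
    """Fraction of the top-k retrieved passages that are relevant.

    Divides by k even when fewer than k passages were retrieved,
    so missing results count against the score.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top = retrieved_ids[:k]
    hits = sum(1 for pid in top if pid in relevant_ids)
    return hits / k


def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted mean gap between confidence and accuracy over equal-width bins."""
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        return 0.0
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # A confidence of exactly 1.0 falls into the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, ok))
    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / n) * abs(avg_conf - accuracy)
    return ece


# Illustrative values only (not results from the study).
p_at_3 = precision_at_k(["p3", "p7", "p1"], {"p3", "p1"}, k=3)
ece = expected_calibration_error([0.9, 0.9, 0.1, 0.1], [True, False, False, False])
```

Tracking the two quantities per question makes it possible to check whether low-precision retrievals coincide with overconfident answers.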

The results show that retrieval precision is the dominant factor governing RAG reliability. When retrieval is semantically aligned with the question, the generator produces stable, grounded outputs. When retrieval drifts or becomes ambiguous, output variance and error rates increase sharply. These findings highlight the importance of retrieval-centric evaluation and provide actionable insights for designing more transparent and robust RAG systems.

Files

Explainability Analysis of Retrieval-Driven Behavior in RAG Pipelines.pdf (352.0 kB, md5:4a9a8d5d24e70ec3d40a0ad0e8b669fc)

	All versions	This version
Views	57	31
Downloads	42	30
Data volume	20.8 MB	13.0 MB
