Published June 11, 2026 | Version v0.9

KR-Housing-LongRAG-Bench

Authors/Creators

  • 1. Seoul School of Integrated Sciences and Technologies (aSSIST); Posicube Inc.

Description

A copyright-safe Korean long-context benchmark for evaluating long-context LLMs, RAG systems, and table/tool pipelines over real housing announcements, public tabular data, and housing statutes. The public release contains QA labels, evidence locators, deterministic predicates, answerability labels, split and provider/region metadata, and long-context-bundle references. It does not redistribute raw PDF/HWP/HWPX documents, bundle text, API keys, or hidden gold answers. v0.8 is a human-review repair build (1,997 QA): all positional-cloze probes were regenerated into natural source-grounded questions or removed, and location/answer errors were fixed (LLM-assisted gpt-5.4 + Claude cross-model review).

Files

comsa33/kr-housing-longrag-bench-v0.9.zip

Files (1.2 MB)

Name Size Download all
md5:337bf49560a4f5d0c4ff45b826dfd6c4
1.2 MB Preview Download

Additional details

Related works