Published May 6, 2025 | Version v8

Touché-25-Advertisement-in-Retrieval-Augmented-Generation

  • 1. ROR icon Leipzig University
  • 2. Friedrich-Schiller-Universität Jena
  • 3. ROR icon University of Kassel
  • 4. hessian.AI
  • 5. ScaDS.AI

Description

Dataset for Sub-Task 1 (Generation) of the Touché 2025 Task 4. The goal of this task is to research advertisements in retrieval augmented generation (RAG). Towards this goal, the dataset provides queries from the Webis Generated Native Ads 2024 dataset and corresponding document segments from the segmented version of MS MARCO V2.1.

Files

README.md

Files (129.5 MB)

Name Size Download all
md5:e27d6cf04ad3b0ebc8b0aced3e34bc83
177 Bytes Download
md5:3d0573af6f9d311452137521c3711f1e
341.8 kB Download
md5:e9d5a72ae7d84aa9ab31f9d0174bed14
412.7 kB Download
md5:355ba3cca724f27da19bc51c2deca4eb
43.6 MB Download
md5:486f3e543af5882dc022b35ab6a8bc96
18.8 kB Download
md5:9baf2104c914184718ec74e3600675dc
85.1 MB Download
md5:9ff452db0d5524534de8cec7d8bb5d13
8.3 kB Preview Download

Additional details

Related works

Is derived from
Dataset: arXiv:1611.09268 (arXiv)
Dataset: 10.5281/zenodo.10802427 (DOI)

Funding

European Commission
OpenWebSearch.EU - Piloting a Cooperative Open Web Search Infrastructure to Support Europe's Digital Sovereignty 101070014