There is a newer version of the record available.

Published April 17, 2026 | Version v4
Dataset Open

Webis Generated Native Ads 2025

Description

Dataset Summary

This dataset was created to train ad blocking systems on the task of identifying advertisements in the responses of large language models (LLMs) and search engines that use retrieval-augmented generation (RAG). It is the successor of the Webis Generated Native Ads 2024 dataset.

 

Citation

@misc{heineking:2025,
  author =                   {Sebastian Heineking and Ines Zelch and Wilhelm Pertsch and Christian Deubel and Matthias Hagen and Martin Potthast},
  title =                    {{Webis Generated Native Ads 2025}},
  doi =                      {10.5281/zenodo.16941607},
  year =                     2025
}

Files

README.md

Files (329.3 MB)

Name Size Download all
md5:2caacd3a77194b8a65d88a417fc4f558
31.2 kB Download
md5:c19e75152ef7539b1fdd1e38502b6b26
38.8 kB Download
md5:5352711d6336c81e7b68d4f79c38565e
3.5 MB Download
md5:fa1f1a36abebf9e1cf806cf855176eac
3.7 kB Preview Download
md5:69cdbd8fc32256950da5b591169e397b
21.6 kB Preview Download
md5:c7f3f80164f3aae125313a7d3f521395
881.4 kB Download
md5:7490e68890c551220bdddabc3371551e
10.4 MB Download
md5:89cb3d4acff62a692255c0c9bfdca142
4.7 MB Download
md5:d36850c5345b89e4b5a431ba09a77b50
54.2 MB Download
md5:df7a039b2fb90bec6d639c1bf6bde9b0
825.3 kB Download
md5:3e9fb5fc0a7c4aab1940f0898728c35c
9.7 MB Download
md5:a5a976bbdc8a06c3e74d9c4c02cc0f09
1.7 MB Download
md5:d072e33be9197dd2e08501d54a94a1a8
7.2 MB Download
md5:6d2be4179bea0c1ab91d9ed1612afc8b
9.4 MB Download
md5:dbbf725f7b0d7f1c75b51aca1200b60e
38.4 MB Download
md5:70d702989ef2eafb831542e109433981
1.7 MB Download
md5:488878a29a616c4b25cdaecd1261c7d2
6.8 MB Download
md5:dd4f6eed44aa53a5b2c4e9b5bad592dd
9.5 MB Download
md5:3c7b9b243ef51467c3efc1706bc7b034
15.6 MB Download
md5:51685882eb934231eede0ad69ed8d654
49.8 MB Download
md5:28b2a284778ce81e74ffbd354b2a261d
81.3 MB Download
md5:49a40f4aecb015f516c3f8bdda4a23f4
8.9 MB Download
md5:3e6f98e2857a3e8126d258aa19755635
14.5 MB Download

Additional details

Related works

Is new version of
Dataset: 10.5281/zenodo.15270283 (DOI)

Funding

European Commission
OpenWebSearch.EU - Piloting a Cooperative Open Web Search Infrastructure to Support Europe's Digital Sovereignty 101070014

Dates

Created
2025-08-25