Published April 24, 2026 | Version v6
Dataset Open

Webis Generated Native Ads 2025

Description

Dataset Summary

This dataset was created to train ad-blocking systems to identify advertisements in the responses of large language models (LLMs) and of search engines that use retrieval-augmented generation (RAG). It is the successor to the Webis Generated Native Ads 2024 dataset.

Citation

@misc{heineking:2025,
  author =                   {Sebastian Heineking and Ines Zelch and Wilhelm Pertsch and Christian Deubel and Matthias Hagen and Martin Potthast},
  title =                    {{Webis Generated Native Ads 2025}},
  doi =                      {10.5281/zenodo.16941607},
  year =                     2025
}

Files

README.md

Files (329.3 MB)

Checksum Size
md5:a74ca7a327bc2afa9ff3b9be5cdf53f7 5.1 kB
md5:57b58f8e4392dce1d7016d4343b5fa29 40.6 kB
md5:2caacd3a77194b8a65d88a417fc4f558 31.2 kB
md5:50cfd90741907ac46f1a0b0b7a120b4f 40.4 kB
md5:5352711d6336c81e7b68d4f79c38565e 3.5 MB
md5:fa1f1a36abebf9e1cf806cf855176eac 3.7 kB
md5:7aabd4e9541b5827b50acc9fd1c0295d 21.7 kB
md5:c7f3f80164f3aae125313a7d3f521395 881.4 kB
md5:7490e68890c551220bdddabc3371551e 10.4 MB
md5:89cb3d4acff62a692255c0c9bfdca142 4.7 MB
md5:d36850c5345b89e4b5a431ba09a77b50 54.2 MB
md5:df7a039b2fb90bec6d639c1bf6bde9b0 825.3 kB
md5:3e9fb5fc0a7c4aab1940f0898728c35c 9.7 MB
md5:a5a976bbdc8a06c3e74d9c4c02cc0f09 1.7 MB
md5:d072e33be9197dd2e08501d54a94a1a8 7.2 MB
md5:6d2be4179bea0c1ab91d9ed1612afc8b 9.4 MB
md5:dbbf725f7b0d7f1c75b51aca1200b60e 38.4 MB
md5:70d702989ef2eafb831542e109433981 1.7 MB
md5:488878a29a616c4b25cdaecd1261c7d2 6.8 MB
md5:dd4f6eed44aa53a5b2c4e9b5bad592dd 9.5 MB
md5:3c7b9b243ef51467c3efc1706bc7b034 15.6 MB
md5:51685882eb934231eede0ad69ed8d654 49.8 MB
md5:28b2a284778ce81e74ffbd354b2a261d 81.3 MB
md5:49a40f4aecb015f516c3f8bdda4a23f4 8.9 MB
md5:3e6f98e2857a3e8126d258aa19755635 14.5 MB
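
The MD5 checksums listed for the files can be used to check that a download arrived intact. The following is a minimal sketch, not part of the dataset itself: the function names `md5_of_file` and `matches_listed_checksum` are hypothetical helpers introduced here, and the local file path in the usage note is a placeholder, since the listing above does not pair checksums with file names.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path: Path, listed: str) -> bool:
    """Compare a file's MD5 with a listed value such as 'md5:<hex>' or '<hex>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call might look like `matches_listed_checksum(Path("path/to/downloaded_file"), "md5:<checksum from the list>")`, where both arguments are placeholders to be filled in by the reader. MD5 is suitable here only for detecting accidental corruption, not for protecting against deliberate tampering.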

Additional details

Related works

Is new version of
Dataset: 10.5281/zenodo.15270283 (DOI)

Funding

European Commission
OpenWebSearch.EU - Piloting a Cooperative Open Web Search Infrastructure to Support Europe's Digital Sovereignty (grant 101070014)

Dates

Created
2025-08-25