Published January 15, 2024 | Version v2
Dataset Open

A dataset to assess Microsoft Copilot Answers in the Context of Swiss, Bavarian and Hesse Elections.

  • 1. Ai Forensics
  • 2. AID4So, IN(3), Universitat Obierta Catalinya
  • 3. Università degli Studi di Milano-Bicocca
  • 4. Fondazione ISI - Istituto per l'lnterscambio Scientifico
  • 5. Pompeu Fabra University

Description

This dataset allows to assess the emerging challenges posed by Generative Artificial Intelligence, when doing Active Retrieval  Augmented Generation (RAG), especially when summarizing trustworthy sources on the internet. As a case study, we focus on Microsoft Copilot, an innovative software that integrates Large Language Models (LLMs) and Search Engines (SE) making advanced AI accessible to the general public. The core contribution of this paper is the presentation of the largest public database to date of RAG responses to user prompts, collected during the 2023 electoral campaigns in the Swiss, Bavaria and Hesse. This dataset was compiled with the assistance of a group of experts who posed realistic voter questions and conducted fact-checking of Microsoft Copilot's responses. It contains prompts and answers in English, German, French and Italian. All the collection happened during the electoral  campaign, between 21 August 2023 and 2 October 2023. The paper makes available the full set of 5,561 pairs of prompts and answers, including the quoted URLs for the source referenced in the answers. In addition to the dataset itself,  we provide 1374 answers labeled by a group of experts who rated the accuracy of the answers in providing factual information. This resource is intended to facilitate further research into the performance of LLMs in the context of elections, defined as "high-risk scenario" by the Digital Service Act (DSA).

Files

Microsof-Copilot-Answers_in-Swiss-Bavarian-Hess-Elections.csv

Files (9.6 MB)

Name Size Download all
md5:e3e9c88ea2d8cdb136c4f67f22a46c0c
9.6 MB Preview Download
md5:f571d6db188c6128567ee9bbbaf70174
7.1 kB Download

Additional details

Identifiers

Other
1

Related works

Is derived from
Report: https://aiforensics.org//uploads/AIF_AW_Bing_Chat_Elections_Report_ca7200fe8d.pdf (Other)

Dates

Collected
2023-09-21
Data collection start
Collected
2023-10-02
Data collection end

References