GENERAL INFORMATION

Title of Dataset: 
Chatbots: (S)elected Moderation 

Subtitle: 
Measuring the Moderation of Election-Related Content Across Chatbots, Languages and Electoral Contexts


Date of data collection: 
from 2024-07-17 to 2024-07-19.

Geographic location of data collection: 
Netherlands

SHARING/ACCESS INFORMATION

Licenses/restrictions placed on the data: 
This publication is licensed under a Creative Commons Attribution 4.0 International License.
https://creativecommons.org/licenses/by/4.0/deed.en

Links to publications that cite or use the data: 
https://aiforensics.org/work/chatbots-moderation

Links to other publicly accessible locations of the data: 
NA

Links/relationships to ancillary data sets: 
NA

Was data derived from another source? 
No

METHODOLOGICAL INFORMATION
Please see the methodology described at https://aiforensics.org/work/chatbots-moderation.


Methods for processing the data: 
The process involved analyzing the Document Object Model of the web pages that were accessed. During this examination, key metadata were identified and extracted from the HTML structure. Once this information was successfully extracted, the rest of the HTML page, which primarily consisted of code and elements not pertinent to the needed information, was discarded. This approach ensured that only the most relevant and useful data was retained, while all unnecessary and extraneous HTML components were efficiently removed, streamlining the data collection and analysis process.

Instrument- or software-specific information needed to interpret the data: 
NA

Standards and calibration information, if appropriate: 
NA

Environmental/experimental conditions: 
NA


Variable List:
prompt - (str) Text of the prompt.
answer - (str) Text of the answer.
country - (str) Country of the IP address used by the automated browser.
language - (object) Language of the question.
template_slug - (str) Identifier of the prompt template (possibly containing one placeholder) used to instantiate the 'prompt'.
origin_rev - (str) Revision of the 'template_slug' used to instantiate the 'prompt'.
user_action_set - (str) Identifier of the 'prompt' (instantiation of a template).
experiment_slug - (str) Identifier of the experiment group.
sample_date - (str) Start time of the interaction with the chatbot.
week - (int64) Week number.
attributions - (str) Link quoted in the answer.
attribution_links - (str) Links for attributions.
search_query - (str) Search query used by the chatbot: not available.

Missing data codes:
NA

Specialized formats or other abbreviations used: 
NA
