Published March 2, 2023 | Version 1.0.0
Dataset Restricted

AuTexTification

Description

Training datasets for the AuTexTification shared task at IberLEF 2023. This task aims to boost research on the detection of text generated automatically by text generation models. Participants must develop models that exploit clues about linguistic form and meaning to distinguish automatically generated text from human text.

It consists of two tasks, both for English and Spanish: 

1) Generated or Human: determine whether the text has been automatically generated or not.

2) Model Attribution: determine what language model generated a text.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

Let us know what you want to use the dataset for, e.g., participating on the AuTexTification shared task.

Please, include your name/team name, also include your institution and supervisor, if any. 

You are currently not logged in. Do you have an account? Log in here