There is a newer version of the record available.

Published August 12, 2025 | Version v1
Dataset Open

Spanish Future Tense Dataset for Evaluating LLM Choice: Morphological vs. Periphrastic

  • 1. Universidad Autónoma de Madrid
  • 2. Universidad Politécnica de Madrid
  • 3. Universidad Complutense de Madrid
  • 4. New York University

Description

The dataset contains test questions for evaluating how LLMs choose between future tense forms in Spanish:

  • 100 questions concern the Spanish future with prospective meaning. Each question offers two options (morphological future and periphrastic future), and the LLM must choose one. Both options are acceptable in Spanish.
  • 65 questions concern the Spanish future with epistemic meaning. Each question offers the same two options (morphological future and periphrastic future), and the LLM must choose the correct one. Only the morphological future is possible in these contexts.
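
The two question groups call for different readings of the same statistic, which the following minimal Python sketch illustrates. It is not the authors' evaluation code: the labels `"prospective"`, `"epistemic"`, `"morphological"` and `"periphrastic"`, the `(meaning, choice)` pair layout, and the function name `summarize_choices` are all assumptions made for illustration, since the record does not describe the file's format.

```python
from collections import Counter

# Assumed labels; the record does not specify how options are encoded.
MORPHOLOGICAL = "morphological"
PERIPHRASTIC = "periphrastic"


def summarize_choices(items):
    """Return the share of morphological-future choices per meaning.

    `items` is an iterable of (meaning, choice) pairs, where meaning is
    "prospective" or "epistemic" (a hypothetical layout).
    """
    counts = {"prospective": Counter(), "epistemic": Counter()}
    for meaning, choice in items:
        counts[meaning][choice] += 1

    summary = {}
    for meaning, counter in counts.items():
        total = sum(counter.values())
        # None signals that no answers were recorded for this group.
        summary[meaning] = counter[MORPHOLOGICAL] / total if total else None
    return summary


# Hypothetical model choices, invented purely for illustration.
example = [
    ("prospective", PERIPHRASTIC),
    ("prospective", MORPHOLOGICAL),
    ("epistemic", MORPHOLOGICAL),
    ("epistemic", PERIPHRASTIC),
]
print(summarize_choices(example))
```

For the epistemic group the morphological share can be read as accuracy, because only the morphological future is possible there. For the prospective group it is a preference rate rather than a score, because both forms are acceptable.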

Some sentences were drawn from the Corpus de Referencia del Español Actual (CREA) of the Real Academia Española and subsequently adapted for inclusion in the dataset.

Files

Files (24.1 kB)

Name Size
md5:8a5b9e1d78ef0eeb99b927eed83e7b44
24.1 kB
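
A downloaded copy can be checked against the MD5 checksum listed above. The sketch below uses only Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_record` are hypothetical, and the local file path must be supplied by the user, since the file name is not shown in this listing.

```python
import hashlib

# Checksum as listed in the file table of this record.
EXPECTED_MD5 = "8a5b9e1d78ef0eeb99b927eed83e7b44"


def md5_of_file(path, chunk_size=65536):
    """Compute the hexadecimal MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path):
    """Return True if the file's MD5 digest equals the listed checksum."""
    return md5_of_file(path) == EXPECTED_MD5
```

Calling `matches_record` with the path of the downloaded file should return `True` if the download is intact and corresponds to this version of the record.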

Additional details

Funding

Agencia Estatal de Investigación
FUN4DATE PID2022-136684OB-C21/C22
European Commission
SMARTY - Scalable and Quantum Resilient Heterogeneous Edge Computing enabling Trustworthy AI 101140087
OpenAI (United States)
Researcher Access Program