Published March 8, 2023 | Version v2
Dataset Open

Open Source Conversational LLMs knowledge about Spanish words (100 words)

  • 1. Universidad Politécnica de Madrid
  • 2. Carlos III University of Madrid
  • 3. University of Valladolid

Description

Dataset containing a sample of 100 Spanish words, together with the definition and two usage examples that each LLM gave for every word.

  • Frequencies_100_words_CREA.xlsx: usage frequency of the words in Spanish
  • llm_*.xlsx: the LLMs' responses on whether they know each word, its definition, and two example sentences using it. Each file includes a manual validation by an expert in the Spanish language. LLMs tested:
    • Llama-2-7b-chat-hf (quantization 32 bits)
    • Llama-2-13b-chat-hf (quantization 16 bits)
    • Llama-2-70b-chat-hf (quantization 4 bits)
    • Mistral-7b-Instruct (quantization 32 bits)
    • Mixtral-8x7b-Instruct (quantization 4 bits)
    • Gemma-7b-it (quantization 32 bits)
    • SOLAR-10.7b-Instruct (quantization 16 bits)
    • Yi-6b-Chat (quantization 32 bits)
    • Yi-34b-Chat (quantization 8 bits)
    • Bloomz-7b1 (quantization 32 bits)
    • FLOR-6.3b-Instructed (quantization 32 bits)
    • Bertin-6b (quantization 32 bits)
  • validation_*.xlsx: automatic validation of the llm_responses_100_words by GPT-3.5 and GPT-4
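
As a rough illustration of how the three file groups described above could be separated after download, the following Python sketch sorts file names by the patterns given in the description. The helper names (`group_dataset_files`, `load_workbooks`) are hypothetical, and the sheet and column layout of the workbooks is not documented here, so the loader simply returns whatever `pandas.read_excel` finds in the first sheet (this assumes `pandas` and `openpyxl` are installed).

```python
from fnmatch import fnmatchcase
from pathlib import Path

# File-name patterns copied from the dataset description.
PATTERNS = {
    "frequencies": "Frequencies_100_words_CREA.xlsx",
    "llm_responses": "llm_*.xlsx",
    "validation": "validation_*.xlsx",
}


def group_dataset_files(names):
    """Sort file names into the three groups named in the description.

    Names that match none of the patterns are ignored.
    """
    groups = {key: [] for key in PATTERNS}
    for name in sorted(names):
        base = Path(name).name
        for key, pattern in PATTERNS.items():
            if fnmatchcase(base, pattern):
                groups[key].append(name)
                break
    return groups


def load_workbooks(directory):
    """Read every matching workbook in `directory` into a DataFrame.

    Assumption: the first sheet of each workbook holds the data; the
    actual layout is not specified in the description.
    """
    import pandas as pd  # imported lazily; needs pandas and openpyxl

    directory = Path(directory)
    groups = group_dataset_files(p.name for p in directory.glob("*.xlsx"))
    return {
        key: {name: pd.read_excel(directory / name) for name in names}
        for key, names in groups.items()
    }
```

Calling `load_workbooks("path/to/download")` (a placeholder path) would return one dictionary per group, keyed by file name.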

Files (1.4 MB)

Checksum Size
md5:d262f6e1a9b8fc7f259302f46c6fe376 11.8 kB
md5:f772bfbd131afb82e5348dd607c067c3 26.3 kB
md5:c6d10f53442e10a2ad757d2f896c6539 19.6 kB
md5:b70abbe289d92c06c33e2600584d5775 66.7 kB
md5:35bf2e464c99eb04a1867f63af864df0 26.5 kB
md5:7b30969df82024422be9e533e70edfbe 112.5 kB
md5:f5dbfe1e354d04be28dea0f81f3abd7a 115.2 kB
md5:55902eac087b00b84dceeaeab684885c 114.1 kB
md5:87df4fee2a0e818aadf9219c328f498f 34.8 kB
md5:abb6e4ad3f5b152c0cbc9d95c68b49c3 123.6 kB
md5:f7eb298b478fde6ac23fcbb90884ee4e 119.8 kB
md5:14180070ab09aec341b388ef80414234 99.3 kB
md5:bc9dc49fdda59bcaa26babbb8b7f0f6d 115.7 kB
md5:334ada793dd08376e23acb643e8edc06 190.4 kB
md5:9da5fdfa5105cc6e9938b9bce4fa8f4d 190.3 kB
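
The md5 values listed above can be used to check that a downloaded file arrived intact. The sketch below is one minimal way to do that with Python's standard `hashlib` module. The helper names (`md5_of_file`, `matches_checksum`) are hypothetical, and because the listing does not show which file name belongs to which checksum, the path passed in is left to the reader.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`.

    The file is read in chunks so large files need not fit in memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_checksum("some_file.xlsx", "md5:d262f6e1a9b8fc7f259302f46c6fe376")` returns `True` only if `some_file.xlsx` (a placeholder name) is the file that checksum belongs to and was downloaded without corruption.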

Additional details

Funding

Fun4Date PID2022-136684OB-C21/C22
Agencia Estatal de Investigación