Published October 23, 2020 | Version 1.0
Dataset Open

Zipf's laws of meaning in Catalan Datasets

  • 1. Neus
  • 2. Jaume
  • 3. Ramon
  • 4. Lluís
  • 5. Antoni

Description

This are the datasets referenced in the paper "Zipf’s laws of meaning in Catalan". The datasets are:

DIEC2_CTILC_senseCG. It contains the following information:

  • lema: lemma in Catalan
  • n_sentits: number of meanings for that lemma,
  • freq: frequency for that lemma.
  • rank of that lemma.

Those lemmas are contained in both of the following corpuses:

CTILC: https://ctilc.iec.cat.

DIEC: Institut d’Estudis Catalans. Diccionari de la llengua catalana [en línia]. 2a ed. Barcelona: Edicions 62: Enciclopèdia Catalana, 2007. [1a ed., 1995] <https://dlc.iec.cat/> [Consulta: October 2020]

DIEC2_GLISSANDO_senseCG.

It contains the following information:

  • lema: lemma in Catalan
  • n_sentits: number of meanings for that lemma,
  • freq: frequency for that lemma.
  • rank of that lemma.

Those lemmas are contained in both of the following corpuses:

Glissando: http://catalog.elra.info/en-us/repository/browse/ELRA-S0407/

DIEC: Institut d’Estudis Catalans. Diccionari de la llengua catalana [en línia]. 2a ed. Barcelona: Edicions 62: Enciclopèdia Catalana, 2007. [1a ed., 1995] <https://dlc.iec.cat/> [Consulta: October 2020]

 

Notes

This work has been funded by the project PRO2020-S03 (RCO03080449Ling ̈u ́ıstica Quantitativa, Institut d'Estudis Catalans). JB, RFC and AHF450are also funded by the grant TIN2017-89244-R from Ministerio de Econo-451mia, Industria y Competitividad (Gobierno de Espa ̃na) and the recognition4522017SGR-856 (MACDA) from AGAUR (Generalitat de Catalunya).

Files

DIEC2_CTILC_senseCG.zip

Files (450.8 kB)

Name Size Download all
md5:533aa51cd6289b6998d1a4f1a6bd4e9b
427.3 kB Preview Download
md5:302ceb640a68042b72437c9aabdab41e
23.5 kB Preview Download

Additional details

References

  • Institut d'Estudis Catalans. Diccionari de la llengua catalana [en línia]. 2a ed. Barcelona: Edicions 62: Enciclopèdia Catalana, 2007. [1a ed., 1995] <https://dlc.iec.cat/> [Consulta: October 2020]