francescocabiddu/transformers-child-wsd: v1.0 Initial Release

francescocabiddu

doi:10.5281/zenodo.8200803

Published July 31, 2023 | Version v1.0

Software Open

This is the first official release of our Word Sense Disambiguation project: "Comparing children and large language models in word sense disambiguation: Insights and challenges." In this version, we include the entire codebase necessary to reproduce the results of our paper and appendix.

Key features include:

- Scripts to generate training and test stimuli.
- Scripts to run 45 different Transformer models.
- Scripts to randomly initialize model weights and downsample the training sets.
- Scripts to compute sense prototypes and evaluate model performance.
- The complete list of Python library requirements in an environment.yml file.
- A detailed directory structure with all the data, results, and downloaded BabyBERTa models.
- A Python script to process the Spoken British National Corpus and save it to the R project directory.
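To make the "sense prototypes" feature more concrete, here is a minimal sketch of one common way such an evaluation can work: each sense prototype is the mean of the embeddings of training examples labelled with that sense, and a test item is assigned the sense whose prototype is closest by cosine similarity. This is an assumption about the general technique, not a copy of the repository's scripts; the function names, sense labels, and three-dimensional toy vectors are hypothetical stand-ins for real Transformer embeddings.

```python
import math
from collections import defaultdict


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def build_prototypes(labelled_embeddings):
    """Map each sense label to the mean of its training embeddings."""
    by_sense = defaultdict(list)
    for sense, vector in labelled_embeddings:
        by_sense[sense].append(vector)
    return {sense: mean_vector(vectors) for sense, vectors in by_sense.items()}


def predict_sense(embedding, prototypes):
    """Return the sense whose prototype is most similar to the embedding."""
    return max(prototypes, key=lambda sense: cosine(embedding, prototypes[sense]))


def accuracy(test_items, prototypes):
    """Proportion of test items assigned their gold sense."""
    correct = sum(predict_sense(vec, prototypes) == sense for sense, vec in test_items)
    return correct / len(test_items)


# Hypothetical toy data: (sense label, embedding) pairs for the word "bat".
train = [
    ("bat_animal", [0.9, 0.1, 0.0]),
    ("bat_animal", [0.8, 0.2, 0.1]),
    ("bat_object", [0.1, 0.9, 0.2]),
    ("bat_object", [0.0, 0.8, 0.3]),
]
test = [
    ("bat_animal", [0.7, 0.3, 0.0]),
    ("bat_object", [0.2, 0.7, 0.1]),
]

prototypes = build_prototypes(train)
print(accuracy(test, prototypes))
```

In a real run the vectors would come from a model's contextual embeddings of the target word; the scripts in the release remain the reference for how the paper's results were actually computed.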

Please refer to the README for detailed setup and usage instructions.

Files

Name	Size	MD5
francescocabiddu/transformers-child-wsd-v1.0.zip	319.2 MB	59cf25abca3e2431546ee782c187b0b3
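
The MD5 checksum listed for the archive can be used to check that a download completed without corruption. The following is a minimal sketch using only the Python standard library; the local file name in the usage comment is an assumption about where the archive was saved, and the helper names are hypothetical.

```python
import hashlib

# Checksum as published on the record page for the v1.0 archive.
EXPECTED_MD5 = "59cf25abca3e2431546ee782c187b0b3"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


# Example usage (assumes the archive was saved under this name):
# print(verify_archive("transformers-child-wsd-v1.0.zip"))
```

Reading in chunks keeps memory use low for a file of this size. A mismatch usually means the download was interrupted and should be repeated.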

Additional details

Is supplement to: https://github.com/francescocabiddu/transformers-child-wsd/tree/v1.0

