HJ-Ky-0.1: fastText Models

Alekseev, Anton; Kabaeva, Gulnara

doi:10.5281/zenodo.14544743

Published December 22, 2024 | Version 0.1

Model Open

HJ-Ky-0.1: fastText Models

Here we release fastText embeddings described in the paper as follows:

[...] we trained fastText embeddings on the Leipzig Corpus data. The training scheme was alsoSkip-Gram Negative Sampling, with 10 epochs, vector dimensions of 100 and 300, a window size of 5,and 10 negative samples. Character n-grams of 3 to 6 characters and 2,000,000 hashing buckets wereused for the hashing trick.

Files

Files (3.7 GB)

Name	Size	Download all
fasttext-tok.skipgram.100.ws5.bin.7z md5:c1fee3674299bcb1bc1aa6213f78318b	930.5 MB	Download
fasttext-tok.skipgram.300.ws5.bin.7z md5:6f8e53b4e742321215ae1f8ff4f89869	2.8 GB	Download

Additional details

Is supplement to: Journal article: 10.56634/16948335.2023.4.1723-1731 (DOI); Preprint: arXiv:2411.10724 (arXiv)

Views

Downloads

Show more details

	All versions	This version
Views	55	55
Downloads	74	74
Data volume	149.6 GB	149.6 GB

More info on how stats are collected....

DOI

Resource type

Model

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: December 22, 2024
Modified: December 22, 2024

HJ-Ky-0.1: fastText Models

Creators

Description

Files

Files (3.7 GB)

Additional details

Related works