Published August 6, 2020
| Version 1.0.0
Dataset
Open
Word2Vec model - Czech legislation
Creators
- 1. Faculty of Information Technology, Czech Technical University in Prague + Faculty of Mathematics and Physics, Charles University
- 2. Faculty of Mathematics and Physics, Charles University
Description
Word2Vec embedding model trained on Czech legislation (from April 2020) corpus using gensim implementation with the following parameters in addition to default settings:
- vector dimension = \(400\),
- window size = \(10\),
- word minimum count = \(10\),
- sample = \(10^{-5}\).