There is a newer version of the record available.

Published July 13, 2020 | Version v2.3.2

Software Open

explosion/spaCy: v2.3.2: Improved Korean tokenizer speed, experimental character-based pretraining and bug fixes

1. Founder @explosion
2. Explosion & OxyKodit
3. LogMeIn, Meltwater
4. Cotonoha
5. German Autolabs
6. @kouchtv
7. @explosion
8. @PyThaiNLP
9. @codecentric
10. PKSHA Technology
11. @Semantics3
12. BotXO
13. SUNY Binghamton - Computer Science

✨ New features and improvements

Improve Korean tokenizer speed.
Add experimental character-based pretraining.

🔴 Bug fixes

Fix issue #5728: Fix French lemmatizer.
Fix issue #5729: Fix lemmatizer for python 2.7.
Fix issue #5751: Fix meta serialization in train CLI.

👥 Contributors

Thanks to @graue70, @mikeizbicki, @jbesomi, @gandersen101 and @DeNeutoy for the pull requests and contributions.

Files

explosion/spaCy-v2.3.2.zip

Files (5.9 MB)

Name	Size	Download all
explosion/spaCy-v2.3.2.zip md5:4abcb084fc2d351ef34356da91294f71	5.9 MB	Preview Download

Additional details

Is supplement to: https://github.com/explosion/spaCy/tree/v2.3.2 (URL)

23K

Views

701

Downloads

Show more details

	All versions	This version
Views	22,767	168
Downloads	701	8
Data volume	15.1 GB	47.4 MB

More info on how stats are collected....

DOI

Resource type

Software

Publisher

Zenodo

Technical metadata

Created: July 13, 2020
Modified: October 5, 2023