Published July 13, 2020
| Version v2.3.2
Software
Open
explosion/spaCy: v2.3.2: Improved Korean tokenizer speed, experimental character-based pretraining and bug fixes
Creators
- Ines Montani1
- Matthew Honnibal1
- Matthew Honnibal1
- Sofie Van Landeghem2
- Henning Peters
- Adriane Boyd
- Maxim Samsonov
- Jim Geovedi
- Jim Regan
- György Orosz3
- Paul O'Leary McCann4
- Søren Lind Kristiansen
- Duygu Altinok5
- Roman6
- Leander Fiedler
- Grégory Howard
- Explosion Bot7
- Sam Bozek
- Wannaphong Phatthiyaphaibun8
- Mark Amery
- Björn Böing9
- Pradeep Kumar Tippa
- Yohei Tamura10
- Leif Uwe Vogelsang
- Ramanan Balakrishnan11
- Vadim Mazaev
- GregDubbin
- jeannefukumaru
- Jens Dahl Møllerhøj12
- Avadh Patel13
- 1. Founder @explosion
- 2. Explosion & OxyKodit
- 3. LogMeIn, Meltwater
- 4. Cotonoha
- 5. German Autolabs
- 6. @kouchtv
- 7. @explosion
- 8. @PyThaiNLP
- 9. @codecentric
- 10. PKSHA Technology
- 11. @Semantics3
- 12. BotXO
- 13. SUNY Binghamton - Computer Science
Description
✨ New features and improvements
- Improve Korean tokenizer speed.
- Add experimental character-based pretraining.
- Fix issue #5728: Fix French lemmatizer.
- Fix issue #5729: Fix lemmatizer for python 2.7.
- Fix issue #5751: Fix meta serialization in train CLI.
Thanks to @graue70, @mikeizbicki, @jbesomi, @gandersen101 and @DeNeutoy for the pull requests and contributions.
Files
explosion/spaCy-v2.3.2.zip
Files
(5.9 MB)
Name | Size | Download all |
---|---|---|
md5:4abcb084fc2d351ef34356da91294f71
|
5.9 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/explosion/spaCy/tree/v2.3.2 (URL)