Published August 1, 2019
| Version v2.1.7
Software
Open
explosion/spaCy: v2.1.7: Improved evaluation, better language factories and bug fixes
Creators
- Matthew Honnibal1
- Ines Montani1
- Matthew Honnibal1
- Henning Peters
- Sofie Van Landeghem2
- Maxim Samsonov
- Jim Geovedi
- Jim Regan
- György Orosz3
- Søren Lind Kristiansen
- Paul O'Leary McCann
- Duygu Altinok4
- Roman5
- Grégory Howard
- Sam Bozek
- Explosion Bot6
- Mark Amery
- Wannaphong Phatthiyaphaibun7
- Leif Uwe Vogelsang
- Björn Böing
- Pradeep Kumar Tippa
- jeannefukumaru
- GregDubbin
- Vadim Mazaev
- Ramanan Balakrishnan8
- Jens Dahl Møllerhøj9
- wbwseeker
- Magnus Burton
- thomasO
- Avadh Patel10
- 1. Founder @explosion
- 2. OxyKodit
- 3. LogMeIn, Meltwater
- 4. German Autolabs
- 5. @kouchtv
- 6. @explosion
- 7. @PyThaiNLP
- 8. @Semantics3
- 9. mollerhoj
- 10. SUNY Binghamton - Computer Science
Description
✨ New features and improvements
- Add
Token.tensor
andSpan.tensor
attributes. - Support simple training format of
(text, annotations)
instead of only(doc, gold)
fornlp.evaluate
. - Add support for
"lang_factory"
setting in modelmeta.json
(see #4031). - Also support
"requirements"
inmeta.json
to define packages for setup'sinstall_requires
. - Improve
Pipe
base class methods and make them less presumptuous. - Improve Danish and Korean tokenization.
- Improve error messages when deserializing model fails.
- Fix issue #3669, #3962: Fix dependency copy in
Span.as_doc
that could cause segfault. - Fix issue #3968: Fix bug in per-entity scores.
- Fix issue #4000: Improve entity linking API.
- Fix issue #4022: Fix error when Korean text contains special characters.
- Fix issue #4030: Handle edge case when calling
TextCategorizer.predict
with emptyDoc
. - Fix issue #4045: Correct
Span.sent
docs. - Fix issue #4048: Fix
init-model
command if there's no vocab. - Fix issue #4052: Improve per-type scoring of NER.
- Fix issue #4054: Ensure the
lang
ofnlp
andnlp.vocab
stay consistent. - Fix bugs in
Token.similarity
andSpan.similarity
when called via hook.
- Add documentation for
gold.align
helper. - Add more explicit section on processing text.
- Improve documentation on disabling pipeline components.
- Fix various typos and inconsistencies.
Thanks to @sorenlind, @pmbaumgartner, @svlandeg, @FallakAsad, @BreakBB, @adrianeboyd, @polm, @b1uec0in, @mdaudali and @ejarkm for the pull requests and contributions.
Files
explosion/spaCy-v2.1.7.zip
Files
(31.2 MB)
Name | Size | Download all |
---|---|---|
md5:34056422d41252a964b1a7e4515c6d95
|
31.2 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/explosion/spaCy/tree/v2.1.7 (URL)