Published August 20, 2021 | Version v3.1.2
Software
Open
explosion/spaCy: v3.1.2: Improved spancat component and various bugfixes
Creators
- Ines Montani (1)
- Matthew Honnibal (1)
- Sofie Van Landeghem (2)
- Adriane Boyd
- Henning Peters
- Maxim Samsonov
- Paul O'Leary McCann (3)
- Jim Geovedi
- Jim Regan
- György Orosz (4)
- Duygu Altinok (5)
- Søren Lind Kristiansen
- Roman (6)
- Explosion Bot (7)
- Leander Fiedler
- Grégory Howard
- Wannaphong Phatthiyaphaibun (8)
- Yohei Tamura (9)
- Sam Bozek
- murat
- Mark Amery
- Björn Böing (10)
- Pradeep Kumar Tippa
- Leif Uwe Vogelsang
- Ramanan Balakrishnan (11)
- Vadim Mazaev
- GregDubbin
- jeannefukumaru
- Walter Henry
- 1. Founder @explosion
- 2. Explosion & OxyKodit
- 3. Cotonoha
- 4. LogMeIn, Meltwater
- 5. German Autolabs
- 6. @kouchtv
- 7. @explosion
- 8. @PyThaiNLP
- 9. PKSHA Technology
- 10. @codecentric
- 11. @Semantics3
Description
✨ New features and improvements
- NEW: Provide scores for the `SpanCategorizer` predictions.
- NEW: Broader compatibility with type checkers thanks to `.pyi` stub files.
- NEW: Auto-detect package dependencies in `spacy package`.
- New `INTERSECTS` operator for the Matcher.
- More debugging info for `spacy project` `push` and `pull` commands.
- Allow passing in a precomputed array for speeding up multiple `Span.as_doc` calls.
- The default `da` transformer is now the same as the one from the trained pipelines (`Maltehb/danish-bert-botxo`).
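As an illustration of the `INTERSECTS` operator listed above, here is a minimal sketch that matches tokens whose morphological features share at least one value with a given list. It assumes spaCy v3.1.2 or later is installed. The example words, the hand-set `MORPH` values, and the rule name `PLURAL_OR_PAST` are illustrative assumptions, not taken from the release notes.

```python
from spacy.matcher import Matcher
from spacy.tokens import Doc
from spacy.vocab import Vocab

vocab = Vocab()
doc = Doc(vocab, words=["dogs", "barked", "loudly"])

# Morphological features are set by hand for this sketch; a trained
# pipeline would normally assign them.
doc[0].set_morph("Number=Plur")
doc[1].set_morph("Tense=Past|VerbForm=Fin")

matcher = Matcher(vocab)
# A token matches if its MORPH features and the listed values have
# at least one element in common.
pattern = [{"MORPH": {"INTERSECTS": ["Number=Plur", "Tense=Past"]}}]
matcher.add("PLURAL_OR_PAST", [pattern])  # hypothetical rule name

matched = [doc[start:end].text for _, start, end in matcher(doc)]
print(matched)
```

Tokens without any of the listed feature values, such as the unannotated third token here, should not match.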
- Fix issue #8767: Fix offsets of empty and out-of-bounds spans.
- Fix issue #8774: Ensure `debug data` runs correctly with a custom tokenizer.
- Fix issue #8784: Fix incorrect `ISSUBSET` and `ISSUPERSET` in schema and docs.
- Fix issue #8796: Respect the `no_skip` value for `spacy project run`.
- Fix issue #8810: Make `ConsoleLogger` flush after each logging line.
- Fix issue #8819: Pass `exclude` when serializing the vocab.
- Fix issue #8830: Avoid adding sourced vectors hashes if not necessary.
- Fix issue #8970: Fix `allow_overlap` default for span categorizer scoring.
- Fix issue #8982: Add glossary entry for `_SP`.
- Fix issue #9007: Fix span categorizer training on nested entities.
- New developer documentation covering spaCy's internals and code conventions.
- Added a documentation section on preparing training data in spaCy's binary format.
- Updated some error/log messages to be more informative.
- Various updates to the documentation.
- A few new additions to the spaCy universe.
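The `Span.as_doc` improvement listed among the new features can be sketched as follows: the array representation of the parent `Doc` is computed once and then passed to each `as_doc` call. This is a hedged sketch that assumes spaCy v3.1.2 or later. It relies on the keyword arguments `array_head` and `array` and on the private helper `Doc._get_array_attrs()`, which may change between versions. The example words and span boundaries are made up for illustration.

```python
from spacy.tokens import Doc
from spacy.vocab import Vocab

doc = Doc(Vocab(), words=["one", "two", "three", "four", "five", "six"])
spans = [doc[0:2], doc[2:4], doc[4:6]]

# Compute the array form of the parent Doc once.
# _get_array_attrs() is a private helper and may change between versions.
array_head = doc._get_array_attrs()
array = doc.to_array(array_head)

# Reuse the precomputed array for every span instead of letting each
# as_doc call rebuild it from the parent Doc.
sub_docs = [span.as_doc(array_head=array_head, array=array) for span in spans]
words = [[token.text for token in sub_doc] for sub_doc in sub_docs]
print(words)
```

The saving grows with the number of spans taken from the same document, since each call otherwise converts the whole parent `Doc` to an array again.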
@adrianeboyd, @bbieniek, @DuyguA, @ezorita, @HLasse, @honnibal, @ines, @kabirkhan, @kevinlu1248, @ldorigo, @Ledenel, @nsorros, @polm, @svlandeg, @swfarnsworth, @themrmax, @thomashacker
Files (10.7 MB)
Name | Size |
---|---|
explosion/spaCy-v3.1.2.zip (md5:cb9e3b4bc3464a2f7ad3c7700d9de69e) | 10.7 MB |
Additional details
Related works
- Is supplement to: https://github.com/explosion/spaCy/tree/v3.1.2 (URL)