There is a newer version of the record available.

Published August 2, 2021 | Version v0.0.3
Software Open

bigdata-ustc/EduNLP: EduNLP v0.0.3

Description

1. update formula ast: supporting more symbols and functions defined in katex
2. add tokens to vector tools, including word2vec and doc2vec using gensim
3. sci4sif support tokenization grouped by segments
4. add special tokens: \SIFTag and \SIFSep
5. add item to vector tools
6. add interface for getting pretrained models, where the supported model names can be accessed by `edunlp i2v` in the command console

Files

bigdata-ustc/EduNLP-v0.0.3.zip

Files (831.2 kB)

Name Size Download all
md5:e619f685de151d9a39a6a92b3640f46b
831.2 kB Preview Download

Additional details

Related works