Published May 21, 2023 | Version 1.3.1
Dataset
Open
PaRuS
Description
PaRuS is a morphologically tagged and dependency-parsed corpus of Russian sentences containing 2.5 billion tokens. It consists of more than 150 million isolated sentences taken from open-source texts. The annotation scheme follows that of the SynTagRus corpus.
Files
(32.0 GB)
Checksum | Size |
---|---|
md5:6866af01c53e286ce5762dd912d3c992 | 15.2 GB |
md5:8005513e005eea933939b8994514684b | 16.8 GB |
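
Because each archive is over 15 GB, it may be worth checking a finished download against the MD5 digests listed in the file table. The following is a minimal sketch using only the Python standard library; the function names `md5_of_file` and `is_listed_file` are hypothetical helpers introduced for illustration, and the file names are not given on this page, so the path must be supplied by the user.

```python
import hashlib

# MD5 digests copied from the file table on this page.
EXPECTED_MD5 = {
    "6866af01c53e286ce5762dd912d3c992",
    "8005513e005eea933939b8994514684b",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use constant, which matters for
    multi-gigabyte archives.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_listed_file(path):
    """Return True if the file's MD5 matches one of the listed digests."""
    return md5_of_file(path) in EXPECTED_MD5
```

Calling `is_listed_file` with the path of a downloaded archive should return `True` if the download is intact; a `False` result suggests a truncated or corrupted transfer, assuming the listed digests are current for this version.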