Published March 31, 2023
| Version 1.3.0
Dataset
Open
PaRuS
Description
PaRuS is a morphologically tagged and dependency-parsed 2.5 B token corpus of Russian sentences. It consists of more than 150 M isolated sentences taken from open-source texts. The annotation scheme is that of the SynTagRus corpus.
Notes
Files
Files
(32.0 GB)
Name | Size | Download all |
---|---|---|
md5:e91414e22aabc255aa8ca5885d242a8f
|
15.2 GB | Download |
md5:22574e33cc443ecbd228b505d1d758d5
|
16.8 GB | Download |