Published May 9, 2024 | Version 1.3.4
Dataset
Open
PaRuS
Description
PaRuS is a morphologically tagged and dependency-parsed corpus of Russian sentences containing 2.5 billion tokens. It consists of more than 150 million isolated sentences taken from open-source texts. The annotation follows the scheme of the SynTagRus corpus.
Files (32.0 GB)
MD5 checksum | Size |
---|---|
md5:2fc360b2b828a52c8d13d44ca18969fc | 15.2 GB |
md5:3d6976649845a4cb8c9778dc485984f8 | 16.8 GB |
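Because the two archives are large, it can be worth checking a download against the MD5 checksums listed in the file table before unpacking it. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `is_listed_archive` are illustrative and not part of the dataset, and the local file path is whatever name the archive was saved under.

```python
import hashlib

# MD5 checksums copied from the file table of this record.
EXPECTED_MD5 = {
    "2fc360b2b828a52c8d13d44ca18969fc",  # 15.2 GB file
    "3d6976649845a4cb8c9778dc485984f8",  # 16.8 GB file
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a multi-gigabyte archive never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel keeps calling fh.read until it returns b"".
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_listed_archive(path):
    """True if the file's MD5 matches one of the checksums listed above."""
    return md5_of_file(path) in EXPECTED_MD5
```

Calling `is_listed_archive` on a downloaded file returns `False` if the download was truncated or corrupted, in which case the file should be fetched again.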
Additional details
Dates
- Created: 2024-05