Published May 20, 2020 | Version v1
Dataset Open

RuBQ

  • 1. JetBrains Research

Description

We present RuBQ (pronounced [`rubik]) -- Russian Knowledge Base Questions, a KBQA dataset that consists of 1,500 Russian questions of varying complexity along with their English machine translations, corresponding SPARQL queries, answers, as well as a subset of Wikidata covering entities with Russian labels. To the best of our knowledge, this is the first Russian KBQA and semantic parsing dataset.

The dataset is intended to be used as development and test sets in cross-lingual transfer, few-shot learning, or learning with synthetic data scenarios. Detailed information about RuBQ can be found on the GitHub page.

Files

RuBQ.zip

Files (179.5 kB)

Name: RuBQ.zip
Size: 179.5 kB
md5:b19e82af34ed9651fb0e98639838d1d1
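
The md5 checksum listed for the archive can be used to check that a download is intact. The following is a minimal sketch, assuming the archive has been saved locally as `RuBQ.zip` in the current directory; the function names `md5_of_file` and `matches_checksum` are illustrative and not part of the dataset or its tooling.

```python
import hashlib
from pathlib import Path

# Checksum as listed on this record for RuBQ.zip.
EXPECTED_MD5 = "md5:b19e82af34ed9651fb0e98639838d1d1"


def md5_of_file(path, chunk_size=65536):
    """Return the hex md5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's md5 with an expected value, with or without an 'md5:' prefix."""
    expected_hex = expected.lower().split(":", 1)[-1]
    return md5_of_file(path) == expected_hex


if __name__ == "__main__":
    # Assumed local path of the downloaded archive (hypothetical location).
    archive = Path("RuBQ.zip")
    if archive.exists():
        ok = matches_checksum(archive, EXPECTED_MD5)
        print("checksum OK" if ok else "checksum mismatch")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually indicates an incomplete or corrupted download, in which case downloading the archive again is the simplest remedy.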