Dataset Open Access

PrevDistro - Preverb Distributions in Hungarian

Kalivoda, Ágnes


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Kalivoda, Ágnes</dc:creator>
  <dc:date>2021-06-21</dc:date>
  <dc:description>PrevDistro (Preverb Distributions) is an open-source dataset containing 41.5 million corpus occurrences of 49 preverb-verb construction types. It consists of the following columns:


	1 sid: ID
	2 constype: construction type
	3 subtype: construction subtype
	4 prevpos: preverb position
	5 prev: preverb
	6 verb: verb lemma
	7 intervening: intervening words (as lemmas)
	8 actform: actual form (the same content as in column 10, but this column is lowercase)
	9 left: left context
	10 kwic: keyword in context
	11 right: right context
	12 docid: document ID from the Hungarian Gigaword Corpus
	13 title: document title
	14 style: document style (e.g. official, press, ...)
	15 region: document region (e.g. Transylvania, Subcarpathia, ...)
	16 year: year of publication (sometimes several years can be found in one document)


The first row stands for the header. If a cell's value is unspecified, it is marked with underscore (_).</dc:description>
  <dc:description>PrevDistro 1.0.0 (deprecated) can be found at https://science-data.hu/dataset.xhtml?persistentId=doi:10.5072/FK2/TRSD50
In PrevDistro 2.0.0, several new columns were added and the already existing data has undergone some fixes as well.</dc:description>
  <dc:identifier>https://zenodo.org/record/6349410</dc:identifier>
  <dc:identifier>10.5281/zenodo.6349410</dc:identifier>
  <dc:identifier>oai:zenodo.org:6349410</dc:identifier>
  <dc:language>hun</dc:language>
  <dc:relation>doi:10.15774/PPKE.BTK.2021.019</dc:relation>
  <dc:relation>doi:10.5281/zenodo.6349409</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://opensource.org/licenses/GPL-3.0</dc:rights>
  <dc:subject>linguistics</dc:subject>
  <dc:subject>Hungarian</dc:subject>
  <dc:subject>preverb constructions</dc:subject>
  <dc:subject>preverb</dc:subject>
  <dc:subject>verbal prefix</dc:subject>
  <dc:subject>verbal particle</dc:subject>
  <dc:subject>construction</dc:subject>
  <dc:title>PrevDistro - Preverb Distributions in Hungarian</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
39
3
views
downloads
All versions This version
Views 3939
Downloads 33
Data volume 39.7 GB39.7 GB
Unique views 2828
Unique downloads 33

Share

Cite as