Dataset Open Access

SemFi - Finnish Semantic Database with Syntactic Relations

Hämäläinen, Mika


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.1463685</identifier>
  <creators>
    <creator>
      <creatorName>Hämäläinen, Mika</creatorName>
      <givenName>Mika</givenName>
      <familyName>Hämäläinen</familyName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-9315-1278</nameIdentifier>
      <affiliation>University of Helsinki</affiliation>
    </creator>
  </creators>
  <titles>
    <title>SemFi - Finnish Semantic Database with Syntactic Relations</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2018</publicationYear>
  <subjects>
    <subject>Finnish</subject>
    <subject>Computational creativity</subject>
    <subject>Poem generation</subject>
    <subject>Semantics</subject>
    <subject>Meaning</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2018-10-16</date>
  </dates>
  <language>fi</language>
  <resourceType resourceTypeGeneral="Dataset"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/1463685</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsReferencedBy">10.5281/zenodo.1454650</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.1137733</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/zenodo</relatedIdentifier>
  </relatedIdentifiers>
  <version>2.1</version>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by-sa/4.0/legalcode">Creative Commons Attribution Share Alike 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;SemFi is a semantic database for Finnish in which words are linked to each other by syntactic relations, together with the frequency of each relation in a large corpus.&lt;/p&gt;

&lt;p&gt;SemFi is based on the syntactic bigrams of the Finnish Internet Parsebank provided by the University of Turku.&lt;/p&gt;

&lt;p&gt;The semfi.db file is an SQLite database and is the file most users should work with. The results_json.zip file is mainly intended for those interested in working with SemUr, a translated version of SemFi.&lt;/p&gt;

&lt;p&gt;The previous version of this dataset has been used successfully in the hard AI task of generating Finnish poetry automatically.&amp;nbsp;That data still powers the computationally creative system, &lt;a href="http://runokone.cs.helsinki.fi/"&gt;Poem Machine&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;More information and an online UI for browsing the data&amp;nbsp;are available at&amp;nbsp;&lt;a href="https://mikakalevi.com/semfi"&gt;https://mikakalevi.com/semfi/&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Cite as&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;H&amp;auml;m&amp;auml;l&amp;auml;inen, Mika. (2018).&amp;nbsp;&lt;a href="https://helda.helsinki.fi//bitstream/handle/10138/282733/paper9.pdf?sequence=1"&gt;Extracting a Semantic Database with Syntactic Relations for Finnish to Boost Resources for Endangered Uralic Languages&lt;/a&gt;. In Proceedings of Logic and Engineering of Natural Language Semantics 15 (LENLS15).&lt;/p&gt;</description>
  </descriptions>
</resource>
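
For readers who want to use this export programmatically, the following is a minimal sketch of reading a few fields from a DataCite record with Python's standard library. The SAMPLE string is a trimmed copy of the record above, and the summarize function and the keys of the dictionary it returns are names chosen for illustration only; they are not part of DataCite or Zenodo.

```python
import xml.etree.ElementTree as ET

# Default namespace declared on the <resource> element of the export.
# The "dc" prefix is an arbitrary local alias used only in the paths below.
NS = {"dc": "http://datacite.org/schema/kernel-4"}

# Trimmed copy of the record above, holding only the fields read below.
SAMPLE = """<resource xmlns="http://datacite.org/schema/kernel-4">
  <identifier identifierType="DOI">10.5281/zenodo.1463685</identifier>
  <creators>
    <creator>
      <creatorName>Hämäläinen, Mika</creatorName>
    </creator>
  </creators>
  <titles>
    <title>SemFi - Finnish Semantic Database with Syntactic Relations</title>
  </titles>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.1137733</relatedIdentifier>
  </relatedIdentifiers>
  <version>2.1</version>
</resource>"""


def summarize(xml_bytes):
    """Return a small dict of fields from a DataCite kernel-4 record.

    Hypothetical helper: the function name and the dict keys are
    illustrative assumptions, not an established API.
    """
    root = ET.fromstring(xml_bytes)
    # Map each relationType attribute to the identifier it points at.
    related = {
        el.get("relationType"): el.text
        for el in root.iterfind("dc:relatedIdentifiers/dc:relatedIdentifier", NS)
    }
    return {
        "doi": root.findtext("dc:identifier", namespaces=NS),
        "title": root.findtext("dc:titles/dc:title", namespaces=NS),
        "creators": [
            el.text
            for el in root.iterfind("dc:creators/dc:creator/dc:creatorName", NS)
        ],
        "version": root.findtext("dc:version", namespaces=NS),
        "related": related,
    }


# Passing bytes lets the parser handle the encoding itself.
record = summarize(SAMPLE.encode("utf-8"))
print(record)
```

Every element in the export lives in the DataCite namespace, so each path segment needs the namespace prefix; an unprefixed path such as "identifier" would match nothing.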

                   All versions   This version
Views              1,176          995
Downloads          221            181
Data volume        416.2 GB       352.8 GB
Unique views       970            876
Unique downloads   145            122