Dataset Open Access

SemFi - Finnish Semantic Database with Syntactic Relations

Hämäläinen, Mika


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">fin</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Finnish</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Computational creativity</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Poem generation</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Semantics</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Meaning</subfield>
  </datafield>
  <controlfield tag="005">20200124192518.0</controlfield>
  <controlfield tag="001">1463685</controlfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">309980977</subfield>
    <subfield code="z">md5:612cec6dc3ff172f8a742fd59f89c953</subfield>
    <subfield code="u">https://zenodo.org/record/1463685/files/results.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">3765992448</subfield>
    <subfield code="z">md5:a0f0da3b2ebea99fd3192ba8c551449c</subfield>
    <subfield code="u">https://zenodo.org/record/1463685/files/semfi.db</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">8043</subfield>
    <subfield code="z">md5:3a63551e47b27075f38901c12a1a1e5f</subfield>
    <subfield code="u">https://zenodo.org/record/1463685/files/semfyier.py</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2018-10-16</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-zenodo</subfield>
    <subfield code="o">oai:zenodo.org:1463685</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">University of Helsinki</subfield>
    <subfield code="0">(orcid)0000-0001-9315-1278</subfield>
    <subfield code="a">Hämäläinen, Mika</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">SemFi - Finnish Semantic Database with Syntactic Relations</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-zenodo</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by-sa/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution Share Alike 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;SemFi is a semantic database for Finnish in which the words are linked to each other by the syntactic relations and their frequency in a big corpus.&lt;/p&gt;

&lt;p&gt;SemFi is based on the syntactic bigrams of The Finnish Internet Parsebank provided by Turku University.&lt;/p&gt;

&lt;p&gt;The semfi.db file is an SQLite database and it is the one that should be used. The results_json.zip is mainly intended for those who are interested in working with SemUr which is a translated version of SemFi.&lt;/p&gt;

&lt;p&gt;The previous version of this dataset has successfully been used in the hard AI task of creating Finnish poetry automatically.&amp;nbsp;That data still powers the computationally creative system,&lt;a href="http://runokone.cs.helsinki.fi/"&gt; Poem Machine&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;More information and an online UI to browse the data&amp;nbsp;is available on&amp;nbsp;&lt;a href="https://mikakalevi.com/semfi"&gt;https://mikakalevi.com/semfi/&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Cite as&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;H&amp;auml;m&amp;auml;l&amp;auml;inen, Mika. (2018).&amp;nbsp;&lt;a href="https://helda.helsinki.fi//bitstream/handle/10138/282733/paper9.pdf?sequence=1"&gt;Extracting a Semantic Database with Syntactic Relations for Finnish to Boost Resources for Endangered Uralic Languages&lt;/a&gt;. In The Proceedings of Logic and Engineering of Natural Language Semantics 15 (LENLS15)&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isReferencedBy</subfield>
    <subfield code="a">10.5281/zenodo.1454650</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.1137733</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.1463685</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
1,176
221
views
downloads
All versions This version
Views 1,176995
Downloads 221181
Data volume 416.2 GB352.8 GB
Unique views 970876
Unique downloads 145122

Share

Cite as