Dataset Open Access
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd"> <identifier identifierType="DOI">10.5281/zenodo.1463685</identifier> <creators> <creator> <creatorName>Hämäläinen, Mika</creatorName> <givenName>Mika</givenName> <familyName>Hämäläinen</familyName> <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-9315-1278</nameIdentifier> <affiliation>University of Helsinki</affiliation> </creator> </creators> <titles> <title>SemFi - Finnish Semantic Database with Syntactic Relations</title> </titles> <publisher>Zenodo</publisher> <publicationYear>2018</publicationYear> <subjects> <subject>Finnish</subject> <subject>Computational creativity</subject> <subject>Poem generation</subject> <subject>Semantics</subject> <subject>Meaning</subject> </subjects> <dates> <date dateType="Issued">2018-10-16</date> </dates> <language>fi</language> <resourceType resourceTypeGeneral="Dataset"/> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/1463685</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsReferencedBy">10.5281/zenodo.1454650</relatedIdentifier> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.1137733</relatedIdentifier> <relatedIdentifier relatedIdentifierType="URL" relationType="IsPartOf">https://zenodo.org/communities/zenodo</relatedIdentifier> </relatedIdentifiers> <version>2.1</version> <rightsList> <rights rightsURI="https://creativecommons.org/licenses/by-sa/4.0/legalcode">Creative Commons Attribution Share Alike 4.0 International</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract"><p>SemFi is a semantic database for Finnish in which the words are linked to each other by the syntactic relations and their frequency in a big corpus.</p> <p>SemFi is based on the syntactic bigrams of The Finnish Internet Parsebank provided by Turku University.</p> <p>The semfi.db file is an SQLite database and it is the one that should be used. The results_json.zip is mainly intended for those who are interested in working with SemUr which is a translated version of SemFi.</p> <p>The previous version of this dataset has successfully been used in the hard AI task of creating Finnish poetry automatically.&nbsp;That data still powers the computationally creative system,<a href="http://runokone.cs.helsinki.fi/"> Poem Machine</a>.</p> <p>More information and an online UI to browse the data&nbsp;is available on&nbsp;<a href="https://mikakalevi.com/semfi">https://mikakalevi.com/semfi/</a>.</p> <p><strong>Cite as</strong></p> <p>H&auml;m&auml;l&auml;inen, Mika. (2018).&nbsp;<a href="https://helda.helsinki.fi//bitstream/handle/10138/282733/paper9.pdf?sequence=1">Extracting a Semantic Database with Syntactic Relations for Finnish to Boost Resources for Endangered Uralic Languages</a>. In The Proceedings of Logic and Engineering of Natural Language Semantics 15 (LENLS15)</p></description> </descriptions> </resource>
All versions | This version | |
---|---|---|
Views | 1,329 | 1,061 |
Downloads | 258 | 212 |
Data volume | 478.7 GB | 398.9 GB |
Unique views | 1,061 | 925 |
Unique downloads | 174 | 147 |