Dataset Open Access

FinMeter models

Hämäläinen, Mika; Alnajjar, Khalid


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "fin", 
    "@type": "Language", 
    "name": "Finnish"
  }, 
  "description": "<p>This record contains the data files needed for FinMeter.</p>\n\n<p>The data is complementary to the FinMeter Python library described in:</p>\n\n<p>Mika H&auml;m&auml;l&auml;inen and Khalid Alnajjar (2019). Let&#39;s FACE it. Finnish Poetry Generation with Aesthetics and Framing. In <em>Proceedings of the 12th International Conference on Natural Language Generation</em>.</p>\n\n<p>Sources:</p>\n\n<p>The pretrained vectors for Finnish (es.bin, despite the file name) and English (en.bin) are from E. Grave, P. Bojanowski, P. Gupta, A. Joulin, T. Mikolov, <em><a href=\"https://arxiv.org/abs/1802.06893\">Learning Word Vectors for 157 Languages</a></em>. Creative Commons Attribution-Share-Alike License 3.0. See <a href=\"https://fasttext.cc/docs/en/crawl-vectors.html\">https://fasttext.cc/docs/en/crawl-vectors.html</a></p>\n\n<p>The word2vec model trained on the Finnish Internet ParseBank is from Kanerva, Jenna; Luotolahti, Juhani; Laippala, Veronika; Ginter, Filip: Syntactic N-gram Collection from a Large-Scale Corpus of Internet Finnish. Proceedings of the Sixth International Conference Baltic HLT. 2014. <a href=\"http://ebooks.iospress.nl/volumearticle/38025\">paper</a>. Creative Commons Attribution-ShareAlike 4.0 International License. See <a href=\"http://bionlp.utu.fi/finnish-internet-parsebank.html\">http://bionlp.utu.fi/finnish-internet-parsebank.html</a></p>\n\n<p>The Finnish concreteness data has been automatically translated from Brysbaert, Marc, Amy Beth Warriner, and Victor Kuperman. &quot;<a href=\"http://crr.ugent.be/papers/Brysbaert_Warriner_Kuperman_BRM_Concreteness_ratings.pdf\">Concreteness ratings for 40 thousand generally known English word lemmas.</a>&quot; <em>Behavior research methods</em> 46.3 (2014): 904-911. Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. See <a href=\"http://crr.ugent.be/archives/1330\">http://crr.ugent.be/archives/1330</a></p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "University of Helsinki", 
      "@id": "https://orcid.org/0000-0001-9315-1278", 
      "@type": "Person", 
      "name": "H\u00e4m\u00e4l\u00e4inen, Mika"
    }, 
    {
      "affiliation": "University of Helsinki", 
      "@id": "https://orcid.org/0000-0002-7986-2994", 
      "@type": "Person", 
      "name": "Alnajjar, Khalid"
    }
  ], 
  "url": "https://zenodo.org/record/3473456", 
  "datePublished": "2019-10-04", 
  "version": "1.0.0", 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/en.bin", 
      "encodingFormat": "bin", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/es.bin", 
      "encodingFormat": "bin", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/fi_concreteness.txt", 
      "encodingFormat": "txt", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/fin-word2vec-lemma.bin", 
      "encodingFormat": "bin", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/rel_matrix_n_csr.hkl", 
      "encodingFormat": "hkl", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/unigrams_sorted_5k.txt", 
      "encodingFormat": "txt", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.3473456", 
  "@id": "https://doi.org/10.5281/zenodo.3473456", 
  "@type": "Dataset", 
  "name": "FinMeter models"
}
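
The `distribution` array in the record above is the machine-readable list of files. As a minimal sketch, assuming the export has been saved as text, the download URLs could be collected as shown below. The `RECORD` string is an abbreviated copy with two of the six entries, and `list_downloads` is a hypothetical helper name, not part of FinMeter.

```python
import json
from posixpath import basename
from urllib.parse import urlparse

# Abbreviated copy of the record above: two of the six distribution entries.
RECORD = """
{
  "@type": "Dataset",
  "name": "FinMeter models",
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/en.bin",
      "encodingFormat": "bin",
      "@type": "DataDownload"
    },
    {
      "contentUrl": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/fi_concreteness.txt",
      "encodingFormat": "txt",
      "@type": "DataDownload"
    }
  ]
}
"""


def list_downloads(record_text):
    """Return (file name, URL) pairs for every DataDownload entry.

    Hypothetical helper for illustration only.
    """
    record = json.loads(record_text)
    downloads = []
    for item in record.get("distribution", []):
        # Skip anything that is not a downloadable file description.
        if item.get("@type") != "DataDownload":
            continue
        url = item["contentUrl"]
        # The file name is the last path segment of the URL.
        downloads.append((basename(urlparse(url).path), url))
    return downloads


if __name__ == "__main__":
    for name, url in list_downloads(RECORD):
        print(name, url)
```

The resulting URLs can be handed to any HTTP client. Whether the `api/files` links still resolve is not something the record itself guarantees.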
Statistic          All versions   This version
Views              50             50
Downloads          163            163
Data volume        210.1 GB       210.1 GB
Unique views       45             45
Unique downloads   48             48
