Dataset Open Access

FinMeter models

Hämäläinen, Mika; Alnajjar, Khalid


Citation Style Language JSON Export

{
  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.3473456", 
  "language": "fin", 
  "title": "FinMeter models", 
  "issued": {
    "date-parts": [
      [
        2019, 
        10, 
        4
      ]
    ]
  }, 
  "abstract": "<p>This contains data files needed for FinMeter.</p>\n\n<p>This data is complementary for FinMeter Python library described in:</p>\n\n<p>Mika H&auml;m&auml;l&auml;inen and Khalid Alnajjar (2019).&nbsp;Let&#39;s FACE it. Finnish Poetry Generation with Aesthetics and Framing. In <em>the Proceedings of The 12th International Conference on Natural Language Generation</em>.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p>Sources:</p>\n\n<p>The pretrained vectors for Finnish (es - I know) and English (en) are from&nbsp;E. Grave, P. Bojanowski, P. Gupta, A. Joulin, T. Mikolov,&nbsp;<em><a href=\"https://arxiv.org/abs/1802.06893\">Learning Word Vectors for 157 Languages</a>&nbsp;.&nbsp;Creative Commons Attribution-Share-Alike License 3.0</em>. See&nbsp;<a href=\"https://fasttext.cc/docs/en/crawl-vectors.html\">https://fasttext.cc/docs/en/crawl-vectors.html</a></p>\n\n<p>The word2vec model trained on the Finnish Internet ParseBank is from&nbsp;Kanerva, Jenna; Luotolahti, Juhani; Laippala, Veronika; Ginter, Filip: Syntactic N-gram Collection from a Large-Scale Corpus of Internet Finnish. Proceedings of the Sixth International Conference Baltic HLT. 2014.&nbsp;<a href=\"http://ebooks.iospress.nl/volumearticle/38025\">paper</a>.&nbsp;&nbsp;Creative Commons Attribution-ShareAlike 4.0 International License. See&nbsp;<a href=\"http://bionlp.utu.fi/finnish-internet-parsebank.html\">http://bionlp.utu.fi/finnish-internet-parsebank.html</a></p>\n\n<p>The Finnish concreteness data has been&nbsp;automatically translated from&nbsp;Brysbaert, Marc, Amy Beth Warriner, and Victor Kuperman. &quot;<a href=\"http://crr.ugent.be/papers/Brysbaert_Warriner_Kuperman_BRM_Concreteness_ratings.pdf\">Concreteness ratings for 40 thousand generally known English word lemmas.</a>&quot;&nbsp;<em>Behavior research methods</em>&nbsp;46.3 (2014): 904-911.&nbsp;Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. see&nbsp;<a href=\"http://crr.ugent.be/archives/1330\">http://crr.ugent.be/archives/1330</a></p>", 
  "author": [
    {
      "family": "H\u00e4m\u00e4l\u00e4inen, Mika"
    }, 
    {
      "family": "Alnajjar, Khalid"
    }
  ], 
  "version": "1.0.0", 
  "type": "dataset", 
  "id": "3473456"
}
50
163
views
downloads
All versions This version
Views 5050
Downloads 163163
Data volume 210.1 GB210.1 GB
Unique views 4545
Unique downloads 4848

Share

Cite as