Dataset Open Access

FinMeter models

Hämäläinen, Mika; Alnajjar, Khalid


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/en.bin"
      }, 
      "checksum": "md5:d72ddc55d7f32e26dcb11e2f2b5c138d", 
      "bucket": "01f7b561-3f93-46b1-9391-488c8911abac", 
      "key": "en.bin", 
      "type": "bin", 
      "size": 5393166296
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/es.bin"
      }, 
      "checksum": "md5:4c1d1570e1f7456f3a48d92868f0fa62", 
      "bucket": "01f7b561-3f93-46b1-9391-488c8911abac", 
      "key": "es.bin", 
      "type": "bin", 
      "size": 1497954280
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/fi_concreteness.txt"
      }, 
      "checksum": "md5:836745563679b08550de13bb7713e227", 
      "bucket": "01f7b561-3f93-46b1-9391-488c8911abac", 
      "key": "fi_concreteness.txt", 
      "type": "txt", 
      "size": 1824059
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/fin-word2vec-lemma.bin"
      }, 
      "checksum": "md5:882670227a07af80d23852f9051b61cf", 
      "bucket": "01f7b561-3f93-46b1-9391-488c8911abac", 
      "key": "fin-word2vec-lemma.bin", 
      "type": "bin", 
      "size": 2681551329
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/rel_matrix_n_csr.hkl"
      }, 
      "checksum": "md5:549ef9dfec64d5e6febedcf7e19ba1f3", 
      "bucket": "01f7b561-3f93-46b1-9391-488c8911abac", 
      "key": "rel_matrix_n_csr.hkl", 
      "type": "hkl", 
      "size": 663652524
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac/unigrams_sorted_5k.txt"
      }, 
      "checksum": "md5:40199a8b76838f5faaf295f1832dd747", 
      "bucket": "01f7b561-3f93-46b1-9391-488c8911abac", 
      "key": "unigrams_sorted_5k.txt", 
      "type": "txt", 
      "size": 801539
    }
  ], 
  "owners": [
    40509
  ], 
  "doi": "10.5281/zenodo.3473456", 
  "stats": {
    "version_unique_downloads": 48.0, 
    "unique_views": 45.0, 
    "views": 50.0, 
    "version_views": 50.0, 
    "unique_downloads": 48.0, 
    "version_unique_views": 45.0, 
    "volume": 210061176170.0, 
    "version_downloads": 163.0, 
    "downloads": 163.0, 
    "version_volume": 210061176170.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.3473456", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.3473455", 
    "bucket": "https://zenodo.org/api/files/01f7b561-3f93-46b1-9391-488c8911abac", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.3473455.svg", 
    "html": "https://zenodo.org/record/3473456", 
    "latest_html": "https://zenodo.org/record/3473456", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.3473456.svg", 
    "latest": "https://zenodo.org/api/records/3473456"
  }, 
  "conceptdoi": "10.5281/zenodo.3473455", 
  "created": "2019-10-05T16:51:25.655491+00:00", 
  "updated": "2020-01-24T19:25:14.306650+00:00", 
  "conceptrecid": "3473455", 
  "revision": 4, 
  "id": 3473456, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.3473456", 
    "description": "<p>This contains data files needed for FinMeter.</p>\n\n<p>This data is complementary for FinMeter Python library described in:</p>\n\n<p>Mika H&auml;m&auml;l&auml;inen and Khalid Alnajjar (2019).&nbsp;Let&#39;s FACE it. Finnish Poetry Generation with Aesthetics and Framing. In <em>the Proceedings of The 12th International Conference on Natural Language Generation</em>.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p>Sources:</p>\n\n<p>The pretrained vectors for Finnish (es - I know) and English (en) are from&nbsp;E. Grave, P. Bojanowski, P. Gupta, A. Joulin, T. Mikolov,&nbsp;<em><a href=\"https://arxiv.org/abs/1802.06893\">Learning Word Vectors for 157 Languages</a>&nbsp;.&nbsp;Creative Commons Attribution-Share-Alike License 3.0</em>. See&nbsp;<a href=\"https://fasttext.cc/docs/en/crawl-vectors.html\">https://fasttext.cc/docs/en/crawl-vectors.html</a></p>\n\n<p>The word2vec model trained on the Finnish Internet ParseBank is from&nbsp;Kanerva, Jenna; Luotolahti, Juhani; Laippala, Veronika; Ginter, Filip: Syntactic N-gram Collection from a Large-Scale Corpus of Internet Finnish. Proceedings of the Sixth International Conference Baltic HLT. 2014.&nbsp;<a href=\"http://ebooks.iospress.nl/volumearticle/38025\">paper</a>.&nbsp;&nbsp;Creative Commons Attribution-ShareAlike 4.0 International License. See&nbsp;<a href=\"http://bionlp.utu.fi/finnish-internet-parsebank.html\">http://bionlp.utu.fi/finnish-internet-parsebank.html</a></p>\n\n<p>The Finnish concreteness data has been&nbsp;automatically translated from&nbsp;Brysbaert, Marc, Amy Beth Warriner, and Victor Kuperman. &quot;<a href=\"http://crr.ugent.be/papers/Brysbaert_Warriner_Kuperman_BRM_Concreteness_ratings.pdf\">Concreteness ratings for 40 thousand generally known English word lemmas.</a>&quot;&nbsp;<em>Behavior research methods</em>&nbsp;46.3 (2014): 904-911.&nbsp;Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. \nsee&nbsp;<a href=\"http://crr.ugent.be/archives/1330\">http://crr.ugent.be/archives/1330</a></p>", 
    "language": "fin", 
    "title": "FinMeter models", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3473455"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "3473456"
          }
        }
      ]
    }, 
    "version": "1.0.0", 
    "publication_date": "2019-10-04", 
    "creators": [
      {
        "orcid": "0000-0001-9315-1278", 
        "affiliation": "University of Helsinki", 
        "name": "H\u00e4m\u00e4l\u00e4inen, Mika"
      }, 
      {
        "orcid": "0000-0002-7986-2994", 
        "affiliation": "University of Helsinki", 
        "name": "Alnajjar, Khalid"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.3473449", 
        "relation": "isSupplementedBy", 
        "resource_type": "dataset"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.3473455", 
        "relation": "isVersionOf"
      }
    ]
  }
}
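
The "files" entries in the export above pair each file name with an MD5 checksum, which can be used to check that a download completed intact (the largest file is over 5 GB). The following is a minimal sketch of such a check, not part of FinMeter itself: the function names `md5_of_file` and `verify_downloads` and the idea of a single local download directory are assumptions made for illustration, while the checksums are copied from the record.

```python
import hashlib
from pathlib import Path

# Checksums copied from the "files" entries of the JSON export
# (the "md5:" prefix is dropped).
EXPECTED_MD5 = {
    "en.bin": "d72ddc55d7f32e26dcb11e2f2b5c138d",
    "es.bin": "4c1d1570e1f7456f3a48d92868f0fa62",
    "fi_concreteness.txt": "836745563679b08550de13bb7713e227",
    "fin-word2vec-lemma.bin": "882670227a07af80d23852f9051b61cf",
    "rel_matrix_n_csr.hkl": "549ef9dfec64d5e6febedcf7e19ba1f3",
    "unigrams_sorted_5k.txt": "40199a8b76838f5faaf295f1832dd747",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to True (match), False (mismatch),
    or None (file not found in the directory)."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        results[name] = md5_of_file(path) == checksum if path.is_file() else None
    return results
```

Calling `verify_downloads("path/to/downloads")` (a hypothetical directory) returns a dictionary that can be inspected for `False` or `None` values before the files are handed to FinMeter.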