Dataset Open Access

Palmetto position storing Lucene index of Dutch Wikipedia

van der Zwaan, Janneke M.; Marx, Maarten; Kamps, Jaap


JSON-LD (schema.org) Export

{
  "description": "<p>Dutch language resource for calculating topic coherence with Palmetto [1, 2]. The dataset is a position storing Lucene index of the Dutch Wikipedia [3]. It was created in the context of the Netherlands eScience Center Dilipad project [4]. The pdf file contains the results of a case study that shows best topic coherence measure for topics consisting of Dutch nouns is NPMI.</p>\n\n<p>More details can be found in the README.</p>\n\n<p>[1] M. Roeder, A. Both, and A. Hinneburg. Exploring the space of topic coherence measures. In <em>Proceedings of the Eighth ACM International Conference on Web Search and Data Mining</em>, pages 399&ndash;408, 2015.</p>\n\n<p>[2] http://aksw.org/Projects/Palmetto.html</p>\n\n<p>[3] https://dumps.wikimedia.org/nlwiki/20151102/</p>\n\n<p>[4] https://www.esciencecenter.nl/project/dilipad</p>", 
  "license": "http://creativecommons.org/licenses/by-sa/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Netherlands eScience Center", 
      "@type": "Person", 
      "name": "van der Zwaan,  Janneke M."
    }, 
    {
      "affiliation": "University of Amsterdam", 
      "@type": "Person", 
      "name": "Marx, Maarten"
    }, 
    {
      "affiliation": "University of Amsterdam", 
      "@type": "Person", 
      "name": "Kamps, Jaap"
    }
  ], 
  "url": "https://zenodo.org/record/46377", 
  "datePublished": "2016-02-22", 
  "keywords": [
    "topic modeling", 
    "topic coherence", 
    "Palmetto", 
    "Dutch", 
    "Wikipedia"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/e9e57e1b-1c51-4141-beba-a2e9441be923/case_study.pdf", 
      "@type": "DataDownload", 
      "fileFormat": "pdf"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e9e57e1b-1c51-4141-beba-a2e9441be923/README.md", 
      "@type": "DataDownload", 
      "fileFormat": "md"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/e9e57e1b-1c51-4141-beba-a2e9441be923/nlwiki-palmetto.tar.gz", 
      "@type": "DataDownload", 
      "fileFormat": "gz"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.46377", 
  "@id": "https://doi.org/10.5281/zenodo.46377", 
  "@type": "Dataset", 
  "name": "Palmetto position storing Lucene index of Dutch Wikipedia"
}
3,252
49
views
downloads
All versions This version
Views 3,2523,252
Downloads 4949
Data volume 6.6 GB6.6 GB
Unique views 3,2393,239
Unique downloads 3737

Share

Cite as