Dataset Open Access

Palmetto position storing Lucene index of Dutch Wikipedia

van der Zwaan, Janneke M.; Marx, Maarten; Kamps, Jaap


Citation Style Language JSON Export

{
  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.46377", 
  "title": "Palmetto position storing Lucene index of Dutch Wikipedia", 
  "issued": {
    "date-parts": [
      [
        2016, 
        2, 
        22
      ]
    ]
  }, 
  "abstract": "<p>Dutch language resource for calculating topic coherence with Palmetto [1, 2]. The dataset is a position storing Lucene index of the Dutch Wikipedia [3]. It was created in the context of the Netherlands eScience Center Dilipad project [4]. The pdf file contains the results of a case study that shows best topic coherence measure for topics consisting of Dutch nouns is NPMI.</p>\n\n<p>More details can be found in the README.</p>\n\n<p>[1] M. Roeder, A. Both, and A. Hinneburg. Exploring the space of topic coherence measures. In <em>Proceedings of the Eighth ACM International Conference on Web Search and Data Mining</em>, pages 399&ndash;408, 2015.</p>\n\n<p>[2] http://aksw.org/Projects/Palmetto.html</p>\n\n<p>[3] https://dumps.wikimedia.org/nlwiki/20151102/</p>\n\n<p>[4] https://www.esciencecenter.nl/project/dilipad</p>", 
  "author": [
    {
      "given": "Janneke M.", 
      "family": "van der Zwaan"
    }, 
    {
      "given": "Maarten", 
      "family": "Marx"
    }, 
    {
      "given": "Jaap", 
      "family": "Kamps"
    }
  ], 
  "type": "dataset", 
  "id": "46377"
}
3,695
66
views
downloads
All versions This version
Views 3,6953,696
Downloads 6666
Data volume 8.6 GB8.6 GB
Unique views 3,6813,682
Unique downloads 4848

Share

Cite as