Dataset Open Access
van der Zwaan, Janneke M.; Marx, Maarten; Kamps, Jaap
Dutch language resource for calculating topic coherence with Palmetto [1, 2]. The dataset is a position storing Lucene index of the Dutch Wikipedia . It was created in the context of the Netherlands eScience Center Dilipad project . The pdf file contains the results of a case study that shows best topic coherence measure for topics consisting of Dutch nouns is NPMI.
More details can be found in the README.
 M. Roeder, A. Both, and A. Hinneburg. Exploring the space of topic coherence measures. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, pages 399–408, 2015.