Published October 1, 2019 | Version v1
Conference paper Open

Communities of related terms in Karst terminology co-occurrence network

  • 1. Jožef Stefan Institute
  • 2. University of Ljubljana, Ljubljana, Slovenia
  • 3. Jožef Stefan Institute, Usher Institute of Population Health Sciences and Informatics, Edinburgh Medical School, Edinburgh, UK

Description

Karst science is an attractive field of interdisciplinary research with rich terminology. This study was performed as part of a project that aims to develop novel approaches to terminology extraction and visualization, in line with an understanding of knowledge, as represented in texts, as conceptually dynamic and linguistically varied. The aim of this paper is to investigate how powerful graph-based methods can be used for visualizing and analysing domain terminology. To detect communities in karst terminology, we analyse frequently co-occurring karst terms in a scientific corpus of karstological literature. The most frequent co-occurrence pairs, those that co-occur ten or more times in the whole corpus, are passed as input to the Louvain community detection algorithm and visualized as a domain graph. The resulting data was evaluated by domain experts, who found that the detected term groups are meaningful and correspond to different types of karst phenomena. The results are further discussed in relation to more standard topic modelling approaches, using the Latent Dirichlet Allocation and Non-negative Matrix Factorization algorithms.
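The pipeline described above (count term co-occurrences, keep pairs seen ten or more times, run Louvain on the resulting graph) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `cooccurrence_edges`, the toy documents, and the choice of one token list per co-occurrence context are assumptions made for illustration, since the description does not state the context unit used. The Louvain step relies on `louvain_communities` from networkx (available in version 2.8 and later) and is skipped if that library is not installed.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(documents, terms, min_count=10):
    """Count how often each pair of terms appears in the same text unit
    and keep only pairs seen at least `min_count` times.

    `documents` is an iterable of token lists; treating each list as one
    co-occurrence context is an assumption of this sketch.
    """
    vocabulary = set(terms)
    pair_counts = Counter()
    for tokens in documents:
        # Each term counts once per context; sorting gives stable pair keys.
        present = sorted(vocabulary.intersection(tokens))
        pair_counts.update(combinations(present, 2))
    return {pair: n for pair, n in pair_counts.items() if n >= min_count}


# Toy data, invented for illustration (not taken from the karst corpus).
docs = (
    [["cave", "stalactite", "stalagmite"]] * 12
    + [["doline", "polje", "uvala"]] * 11
    + [["cave", "doline"]] * 3
)
terms = ["cave", "stalactite", "stalagmite", "doline", "polje", "uvala"]
edges = cooccurrence_edges(docs, terms, min_count=10)

# Optional Louvain step, run only when a recent networkx is available.
try:
    import networkx as nx

    graph = nx.Graph()
    graph.add_weighted_edges_from((a, b, n) for (a, b), n in edges.items())
    communities = nx.community.louvain_communities(graph, weight="weight", seed=0)
except (ImportError, AttributeError):
    communities = None
```

In this toy input the pair ("cave", "doline") co-occurs only three times, so it falls below the threshold and the two term groups end up as separate components of the graph before community detection is applied.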

Files

Miljkovic_eLex_2019_20.pdf (587.2 kB)
md5:45cb16c63b9a598871e4a373402a35dc

Additional details

Funding

European Commission: EMBEDDIA – Cross-Lingual Embeddings for Less-Represented Languages in European News Media (grant 825153)