Communities of related terms in Karst terminology co-occurrence network
- 1. Jožef Stefan Institute
- 2. University of Ljubljana, Ljubljana, Slovenia
- 3. Jožef Stefan Institute, Usher Institute of Population Health Sciences and Informatics, Edinburgh Medical School, Edinburgh, UK
Description
Karst science is an attractive field of interdisciplinary research with rich terminology. This study was performed as part of a project aiming at developing novel approaches to terminology extraction and visualization, in line with the understanding of knowledge, as represented in texts, as conceptually dynamic and linguistically varied. The aim of this paper is to investigate how powerful graph-based methods can be used for visualizing and analysing domain terminology. In order to detect communities in karst terminology, we analyse the frequently cooccurring karst terms in a scientific corpus of karstologic literature. The most frequent cooccurrence pairs, which included ten or more co-occurrences within the whole corpus, are delivered as input to the Louvain community detection algorithm and visualized as a domain graph. The resulting data was evaluated by domain experts who found that the detected term groups are meaningful and correspond to different types of karst phenomena. The results are further discussed in relation to more standard topic modelling approaches, using Latent Dirichlet Allocation and Non-negative Matrix Factorization algorithms.
Files
Miljkovic_eLex_2019_20.pdf
Files
(587.2 kB)
Name | Size | Download all |
---|---|---|
md5:45cb16c63b9a598871e4a373402a35dc
|
587.2 kB | Preview Download |