Published June 20, 2022 | Version v2
Software Open

WikiDoMiner: Wikipedia Domain-specific Miner

  • 1. University of Luxembourg
  • 2. University of Ottawa

Description

We introduce WikiDoMiner --  a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers acquire an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Having the possibility to build a such a corpus is important since domain-specific datasets are scarce. WikiDoMiner generates the corpus by first extracting a set of domain-specific keywords from the RS, and then querying Wikipedia for these keywords. 
The output of WikiDoMiner is a set of Wikipedia articles that are relevant to the domain of the input RS.
Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering essential tasks, e.g., ambiguity handling, consistency checking, and question answering. 

Files

WikiDoMiner-main.zip

Files (12.0 kB)

Name Size Download all
md5:ad109482b371dbab0d04471fd887f9ec
12.0 kB Preview Download