Conference paper Open Access

Using NLP to support terminology extraction and domain scoping: report on the H2020 DESIRA project

Bacco Manlio; Brunori Gianluca; Dell'Orletta Felice; Ferrari Alessio

The ongoing phenomenon of digitisation is changing social and work life, with tangible effects on the socio-economic context. Understanding the impact, opportunities, and threats of digital transformation requires the identication of viewpoints from a large diversity of stakeholders, from policy makers to domain experts, and from engineers to common citizens. The DESIRA (Digitisation: Economic and Social Impacts in Rural Areas) EU H2020 project1 considers rural areas, with a strong focus on agricultural and forestry activities, and aims at assessing the impact of digital technologies in those domains by involving a large number of stakeholders, all across Europe, around 20 focal questions. Given the involvement of stakeholders with diverse background and skills, a primary goal of the project is to develop domain-specic and interactive reference taxonomies (i.e., structured classications of terms) to facilitate common understanding of technologies in use in each domain at today. The taxonomies, which aims at easing the learning of the meaning of technical and domain-specic terms, are going to be exploited by the stakeholders in 20 Living Labs built around the focal questions. This report paper focuses on the semi-automatic development of the taxonomies through natural language processing (NLP) techniques based on context-specic term extraction. Furthermore, we crawl Wikipedia to enrich the taxonomies with additional categories and denitions. We plan to validate the taxonomies through fieeld studies within the Living Labs.

Files (513.4 kB)
Name Size
NLP4RE-paper5[1].pdf
md5:9d3d451a29c92c4c5c35d81657d83bbd
513.4 kB Download
43
31
views
downloads
All versions This version
Views 4343
Downloads 3131
Data volume 15.9 MB15.9 MB
Unique views 3737
Unique downloads 3030

Share

Cite as