Published November 1, 2020 | Version pre-published version
Conference paper Open

Leveraging Linguistic Linked Data for Cross-Lingual Model Transfer in the Pharmaceutical Domain

  • 1. University of Zaragoza
  • 2. Goethe University Frankfurt
  • 3. Semalytix GmbH

Description

We describe the use of linguistic linked data to support a cross-lingual transfer framework for sentiment analysis in the pharmaceutical domain. The proposed system dynamically gathers translations from the Linked Open Data (LOD) cloud, particularly from Apertium RDF, in order to project a deep learning-based sentiment classifier from one language to another, thus enabling scalability and avoiding the need of model re-training when transferred across languages. We describe the whole pipeline traversed by the multilingual data, from their conversion into RDF based on a new dynamic and flexible transformation framework, through their linking and publication as linked data, and finally their exploitation in the particular use case. Based on experiments on projecting a sentiment classifier from English to Spanish, we demonstrate how linked data techniques are able to enhance the multilingual capabilities of a deep learning-based approach in a dynamic and scalable way, in a real application scenario from the pharmaceutical domain.

Files

PaL_Apertium_RDF_pipeline.pdf

Files (721.0 kB)

Name Size Download all
md5:24c25cc5c94b00ccd86c05fb1e0b40d3
721.0 kB Preview Download

Additional details

Funding

Pret-a-LLOD – Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors 825182
European Commission