Published December 12, 2018 | Version v1
Conference paper Open

Challenges of Terminology Extraction from Legal Spanish Corpora

  • 1. Ontology Engineering Group, Universidad Politécnica de Madrid

Description

Untangling the complexities of legal documentation is a pressing need for those outside the legal profession. The terminology used in the domain is complex and usually requires expert knowledge to be fully understood, since the legal framework is constantly updated and the meaning of terms varies accordingly. Non-proprietary Automatic Terminology Extraction (ATE) tools are required in this domain, in which documents contain private and sensitive data. This paper describes methods for obtaining accurate legal terms from labour law corpora, overcoming the difficulties present in the area, and also analyses the peculiarities of legal jargon, specifically in the Spanish language. The experiments, performed with JATE, a well-known open-source library in the ATE literature, are still preliminary but promising.

Files

07.pdf (1.8 MB)
md5:84eb0158553a958dbe6d6093bae78d31

Additional details

Funding

European Commission
Lynx - Building the Legal Knowledge Graph for Smart Compliance Services in Multilingual Europe (grant no. 780602)