Adapting Term Recognition to an Under-Resourced Language: the Case of Irish

John P. McCrae; Adrian Doyle

doi:10.18653/v1/w19-6907

Published August 19, 2019 | Version v1

Conference paper | Open access

1. National University of Ireland Galway

Automatic Term Recognition (ATR) is an important method for the summarization and analysis of large corpora, and normally requires a significant amount of linguistic input, in particular the use of part-of-speech taggers. For an under-resourced language such as Irish, the necessary resources may be scarce or entirely absent. We evaluate two methods for the automatic extraction of terms, based on the small part-of-speech-tagged corpora that are available for Irish and on a large terminology list, and show that both methods can produce viable term extractors. We evaluate both methods on a newly constructed corpus, the first available corpus for term extraction in Irish. Our results shed some light on the challenge of adapting natural language processing systems to under-resourced scenarios.

Files

Name	Size
mccrae2019adapting.pdf (md5:2a814cbb8d94471956d282cb20575060)	225.0 kB

Additional details

Funding

European Commission
ELEXIS - European Lexicographic Infrastructure 731015

Resource type

Conference paper

Publisher

Zenodo

Conference

Celtic Language Technology Workshop 2019

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: August 28, 2019
Modified: July 22, 2024
