Cross-lingual Inflection as a Data Augmentation Method for Parsing

Muñoz-Ortiz, Alberto; Gómez-Rodríguez, Carlos; Vilares, David

doi:10.5281/zenodo.11400554

Published May 30, 2022 | Version v1

Conference paper Open

Cross-lingual Inflection as a Data Augmentation Method for Parsing

1. Universidade da Coruña

Abstract (English)

We propose a morphology-based method for low-resource (LR) dependency parsing. We train a morphological inflector for target LR languages, and apply it to related rich-resource (RR) treebanks to create cross-lingual (x-inflected) treebanks that resemble the target LR language. We use such inflected treebanks to train parsers in zero- (training on x-inflected treebanks) and few-shot (training on x-inflected and target language treebanks) setups. The results show that the method sometimes improves the baselines, but not consistently.

Other (English)

This work is supported by a 2020 Leonardo Grant for Researchers and Cultural Creators from the FBBVA,3 as well as by the European Research Council (ERC), under the European Union’s Horizon 2020 research and innovation programme (FASTPARSE, grant agreement No 714150). The work is also supported by ERDF/MICINN-AEI (SCANNER-UDC, PID2020-113230RB-C21), by Xunta de Galicia (ED431C 2020/11), and by Centro de Investigación de Galicia “CITIC” which is funded by Xunta de Galicia, Spain and the European Union (ERDF - Galicia 2014–2020 Program), by grant ED431G 2019/01.

Files

MuñozOrtiz_2022_Cross_lingual_inflection_data_augmentation_method_parsing.pdf

Files (198.5 kB)

Name	Size	Download all
MuñozOrtiz_2022_Cross_lingual_inflection_data_augmentation_method_parsing.pdf md5:62c8cee738a7a48697a0165b3795a830	198.5 kB	Preview Download

Additional details

Handle: 2183/36647

Views

Downloads

Show more details

	All versions	This version
Views	49	49
Downloads	23	23
Data volume	4.8 MB	4.8 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Association for Computational Linguistics

Imprint

Proceedings of the Third Workshop on Insights from Negative Results in NLP, 54-61.

Conference

Third Workshop on Insights from Negative Results in NLP (Insights 2022), Dublin, Ireland, 26 May 2022

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: May 31, 2024
Modified: May 31, 2024

Cross-lingual Inflection as a Data Augmentation Method for Parsing

Authors/Creators

Abstract (English)

Other (English)

Files

MuñozOrtiz_2022_Cross_lingual_inflection_data_augmentation_method_parsing.pdf

Files (198.5 kB)

Additional details

Identifiers