Learning Simplifications for Specific Target Audiences

doi:10.5281/zenodo.1410314

Published July 15, 2018 | Version v1

Conference paper Open

Learning Simplifications for Specific Target Audiences

1. University of Sheffield

Text simplification (TS) is a monolingual text-to-text transformation task where an original (complex) text is transformed into a target (simpler) text. Most recent work is based on sequence-to-sequence neural models similar to those used for machine translation (MT). Different from MT, TS data comprises more elaborate transformations, such as sentence splitting. It can also contain multiple simplifications of the same original text targeting different audiences, such as school grade levels. We explore these two features of TS to build models tailored for specific grade levels. Our approach uses a standard sequence-to-sequence architecture where the original sequence is annotated with information about the target audience and/or the (predicted) type of simplification operation. We show that it outperforms state-of-the-art TS approaches (up to 3 and 12 BLEU and SARI points, respectively), including when training data for the specific complex-simple combination of grade levels is not available, i.e. zero-shot learning.

Files

document.pdf

Files (176.7 kB)

Name	Size	Download all
document.pdf md5:0112e8264562b63b88b86f612a47b55a	176.7 kB	Preview Download

Additional details

SIMPATICO – SIMplifying the interaction with Public Administration Through Information technology for Citizens and cOmpanies 692819: European Commission

	All versions	This version
Views	168	168
Downloads	103	103
Data volume	18.9 MB	18.9 MB

Learning Simplifications for Specific Target Audiences

Creators

Description

Files

document.pdf

Files (176.7 kB)

Additional details

Funding