Conference paper Open Access

Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs

Fernando Alva-Manchego; Joachim Bingel; Gustavo Henrique Paetzold; Carolina Scarton; Lucia Specia


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>Current research in text simplification has been hampered by two central problems: (i) the small amount of high-quality parallel simplification data available, and (ii) the lack of explicit annotations of simplification operations, such as deletions or substitutions, on existing data. While the recently introduced Newsela corpus&nbsp;has alleviated the first problem, simplifications still need to be learned directly from parallel text using black-box, end-to-end approaches rather than from explicit annotations. These complex-simple parallel sentence pairs often differ to such a high degree that generalization becomes difficult. &nbsp;End-to-end models also make it hard to interpret what is actually learned from data. &nbsp;We propose a method that decomposes the task of TS into its sub-problems. We devise a way to automatically identify operations in a parallel corpus and introduce a sequence-labeling approach based on these annotations. Finally, we provide insights on the types of transformations that different approaches can model.</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "University of Sheffield", 
      "@type": "Person", 
      "name": "Fernando Alva-Manchego"
    }, 
    {
      "affiliation": "University of Copenhagen", 
      "@type": "Person", 
      "name": "Joachim Bingel"
    }, 
    {
      "affiliation": "University of Sheffield", 
      "@type": "Person", 
      "name": "Gustavo Henrique Paetzold"
    }, 
    {
      "affiliation": "University", 
      "@type": "Person", 
      "name": "Carolina Scarton"
    }, 
    {
      "affiliation": "Univer", 
      "@type": "Person", 
      "name": "Lucia Specia"
    }
  ], 
  "headline": "Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs", 
  "image": "https://zenodo.org/static/img/logos/zenodo-gradient-round.svg", 
  "datePublished": "2017-11-27", 
  "url": "https://zenodo.org/record/1042505", 
  "@context": "https://schema.org/", 
  "identifier": "https://doi.org/10.5281/zenodo.1042505", 
  "@id": "https://doi.org/10.5281/zenodo.1042505", 
  "@type": "ScholarlyArticle", 
  "name": "Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs"
}
36
28
views
downloads
All versions This version
Views 3636
Downloads 2828
Data volume 5.0 MB5.0 MB
Unique views 3030
Unique downloads 2828

Share

Cite as