Published January 28, 2019 | Version v1
Dataset Open

Common20LS: A Lexical Simplification Dataset with Demographic Information

  • 1. Universidade Tecnológica Federal do Paraná - Toledo
  • 2. University of Sheffield

Description

Common20LS is a dataset for the task of Lexical Simplification that contains demographic information about the annotators. It consists on 20 Lexical Simplification problems annotated by 262 people. Each annotated instance is composed of a sentence, a target complex word or phrase, and a set of simplifications suggested by humans ranked by simplicity.

Files

Common20LS.txt

Files (12.0 MB)

Name Size Download all
md5:2613e84ab1b05f8723813e385b2fe59d
12.0 MB Preview Download
md5:fdde8863e3c9fff327253f0191a2cbe2
590 Bytes Preview Download

Additional details

Funding

SIMPATICO – SIMplifying the interaction with Public Administration Through Information technology for Citizens and cOmpanies 692819
European Commission