Common20LS: A Lexical Simplification Dataset with Demographic Information

doi:10.5281/zenodo.2551474

Simpatico project community

Published January 28, 2019 | Version v1

Dataset Open

Common20LS: A Lexical Simplification Dataset with Demographic Information

1. Universidade Tecnológica Federal do Paraná - Toledo
2. University of Sheffield

Common20LS is a dataset for the task of Lexical Simplification that contains demographic information about the annotators. It consists on 20 Lexical Simplification problems annotated by 262 people. Each annotated instance is composed of a sentence, a target complex word or phrase, and a set of simplifications suggested by humans ranked by simplicity.

Files

Common20LS.txt

Files (12.0 MB)

Name	Size	Download all
Common20LS.txt md5:2613e84ab1b05f8723813e385b2fe59d	12.0 MB	Preview Download
README.txt md5:fdde8863e3c9fff327253f0191a2cbe2	590 Bytes	Preview Download

Additional details

SIMPATICO – SIMplifying the interaction with Public Administration Through Information technology for Citizens and cOmpanies 692819: European Commission

296

Views

Downloads

Show more details

	All versions	This version
Views	296	296
Downloads	45	45
Data volume	420.1 MB	420.1 MB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: January 28, 2019
Modified: January 24, 2020

Common20LS: A Lexical Simplification Dataset with Demographic Information

Creators

Description

Files

Common20LS.txt

Files (12.0 MB)

Additional details

Funding