Published May 19, 2024 | Version v1
Journal article Open

The English Language Learner Insight, Proficiency and Skills Evaluation (ELLIPSE) Corpus

Description

This paper introduces the open-source English Language Learning Insight, Proficiency and Skills Evaluation (ELLIPSE) corpus, which comprises ~6,500 essays written by English Language Learners (ELLs). All essays were written during state-wide standardized annual testing in the United States. The essays were written on 29 different independent prompts. Individual difference information is made available for each essay including economic status, gender, grade level (8-12), and race/ethnicity. Each essay was scored by two trained human raters for English language proficiency including an overall score of English proficiency and analytic scores for cohesion, syntax, vocabulary, phraseology, grammar, and conventions. The paper provides reliability on the human judgments of proficiency reported for the corpus. The ELLIPSE corpus addresses many of the concerns found in existing learner corpora including unique holistic and analytic scores for each ELL essay. The corpus also includes limited demographic and individual difference data for each ELL.

Files

ellipse_pre_print.pdf

Files (706.7 kB)

Name Size Download all
md5:e7f51f2c69d908a28d13381b11526d6c
706.7 kB Preview Download

Additional details