The English Language Learner Insight, Proficiency and Skills Evaluation (ELLIPSE) Corpus
Creators
Description
This paper introduces the open-source English Language Learning Insight, Proficiency and Skills Evaluation (ELLIPSE) corpus, which comprises ~6,500 essays written by English Language Learners (ELLs). All essays were written during state-wide standardized annual testing in the United States. The essays were written on 29 different independent prompts. Individual difference information is made available for each essay including economic status, gender, grade level (8-12), and race/ethnicity. Each essay was scored by two trained human raters for English language proficiency including an overall score of English proficiency and analytic scores for cohesion, syntax, vocabulary, phraseology, grammar, and conventions. The paper provides reliability on the human judgments of proficiency reported for the corpus. The ELLIPSE corpus addresses many of the concerns found in existing learner corpora including unique holistic and analytic scores for each ELL essay. The corpus also includes limited demographic and individual difference data for each ELL.
Files
ellipse_pre_print.pdf
Files
(706.7 kB)
Name | Size | Download all |
---|---|---|
md5:e7f51f2c69d908a28d13381b11526d6c
|
706.7 kB | Preview Download |