Published July 12, 2025
Version v1
Conference paper
Open
Nonstandard English and the Automated Scoring of Open-Ended Math Problems
Affiliations
- 1. University of Minnesota, USA
- 2. Weizmann Institute of Science, Israel
- 3. CNR-ITD, Italy
- 4. University of Palermo, Italy
- 5. University of Illinois at Urbana-Champaign, USA
Description
Recent advances in AI have opened the door for the automated scoring of open-ended math problems, which were previously much more difficult to assess at scale. However, we know that biases still remain in some of these algorithms. For example, recent research on the automated scoring of student essays has shown that certain varieties of English are more strongly penalized for non-standard English than they are for other differences that reduce the quality of students' writing. This study examines that issue in a new domain, investigating the potential for large language models to accurately grade open-ended math problems produced by students who speak and write in non-standard English. Specifically, we look at four features of African American Vernacular English (AAVE), which range in the degree to which they are unique to AAVE or are common in other non-standard dialects. We then compare the scoring of answers that were produced by students using these dialect features to a control group of synthetic data--where we converted all non-standard dialect features to standard English. Results show that minor changes in the number of dialect features per student response do not impact GPTs automated scoring, but prompt engineering efforts did.
Files

| Name | Size | Checksum |
|---|---|---|
| 2025.EDM.long-papers.195.pdf | 4.2 MB | md5:a94dabe0e6244c7362d23f0fd5d0b184 |