There is a newer version of the record available.

Published January 15, 2024 | Version 1.0
Dataset Open

Automated Programming Exercise Generation in the Era of Large Language Models

  • 1. ROR icon University of Stuttgart

Description

Lecturers are increasingly attempting to use large language models (LLMs) to simplify and make the creation of exercises for students more efficient. Efforts are also being made to automate the exercise creation process in software engineering (SE) education. This study explores the use of advanced LLMs, including GPT-4 and LaMDA, for automated programming exercise creation in higher education and compares the results with related work using GPT-3.5-turbo. Utilizing applications such as ChatGPT, Bing AI Chat, and Google Bard, we identify LLMs capable of initiating different exercise designs. However, manual refinement is crucial for accuracy. Common error patterns across LLMs highlight challenges in complex programming concepts, while specific strengths in various topics showcase model distinctions. This research underscores LLMs' value in exercise generation, emphasizing the critical role of human supervision in refining these processes. Our concise insights cater to educators, practitioners, and other researchers seeking to enhance SE education through LLM applications.

Files

LLMs-Programming-Exercises.zip

Files (31.4 MB)

Name Size Download all
md5:01f1ce75188628197ad75229b94ed1ef
31.4 MB Preview Download