Automated Programming Exercise Generation in the Era of Large Language Models

Niklas Meißner; Sandro Speth; Steffen Becker

doi:10.5281/zenodo.8298490

Published January 15, 2024 | Version 1.0

Dataset Open

1. University of Stuttgart

Lecturers are increasingly trying to use large language models (LLMs) to make the creation of exercises for students simpler and more efficient, and efforts are also under way to automate exercise creation in software engineering (SE) education. This study explores the use of advanced LLMs, including GPT-4 and LaMDA, for automated programming exercise creation in higher education and compares the results with related work using GPT-3.5-turbo. Using applications such as ChatGPT, Bing AI Chat, and Google Bard, we identify LLMs capable of initiating different exercise designs; however, manual refinement remains crucial for accuracy. Error patterns common to all LLMs point to difficulties with complex programming concepts, while topic-specific strengths distinguish the individual models. This research underscores the value of LLMs in exercise generation and the critical role of human supervision in refining these processes. The insights are intended for educators, practitioners, and researchers seeking to improve SE education through LLM applications.

Files

LLMs-Programming-Exercises.zip

Files (31.4 MB)

Name	Checksum	Size
LLMs-Programming-Exercises.zip	md5:01f1ce75188628197ad75229b94ed1ef	31.4 MB

	All versions	This version
Views	644	260
Downloads	59	12
Data volume	2.0 GB	376.4 MB
