Leveraging Large Language Models (LLM) for Python Unit Test

Medlen, Jiri; Bari, Emese; Tank, Devarshi

doi:10.5281/zenodo.19170305

Published 2025 | Version v1

Publication | Open access

Abstract

This study evaluates how well six state-of-the-art Large Language Models (LLMs), namely Perplexity AI, Claude Sonnet 4.5, Gemini 2.5 Pro, ChatGPT (GPT-5), DeepSeek-V3.2-Exp, and Llama-4-Maverick, generate production-quality Python code with comprehensive unit tests.

Files (72.7 kB)

Name	Checksum	Size
Leveraging Large Language Models (LLM) for Python Unit Test Updated 1.docx	md5:1681fb62a38f0da9c5880d850413e74b	72.7 kB

	All versions	This version
Views	26	26
Downloads	2	2
Data volume	145.4 kB	145.4 kB

Resource type: Publication
Publisher: Zenodo
Languages: English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: March 22, 2026
Modified: March 22, 2026
