AI Coding Proficiency
Authors/Creators
Description
We conduct the first comprehensive empirical study examining AI proficiency across 170 third-party libraries and 61 task scenarios, evaluating six widely used LLMs. Our findings reveal that libraries with similar functionalities can exhibit up to 84% differences in the quality score of LLM-generated code, while different models also exhibit quality gaps among their generation results using the same library
This repository contains the prototype of both the dataset construction pipeline and the evaluation scripts, as well as our datasets and analysis results.
Files
AICodingProficiency-main.zip
Files
(10.3 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:acc5de9dfd8462594bc5f445d3c063b0
|
10.3 MB | Preview Download |