Published January 26, 2026 | Version v5
Dataset Open

Let's Make Every Pull Request Meaningful: An Empirical Analysis of Developer and Agentic Pull Requests

Authors/Creators

Description

In recent years, the automatic generation of pull requests (PRs) using AI agents has become increasingly common.
Although AI-generated PRs are fast and easy to create, their merge rates tend to be lower than those created by humans.
In this study, we conduct a large-scale empirical analysis of 40,214 PRs collected from the AIDev dataset.
We extract 64 features spanning six families, and fit statistical regression models to analyze and compare the features associated with PR merge outcomes for human and agentic PRs, as well as across three prominent AI agents.
The findings of this study provide insights into improving PR quality through human–AI collaboration.

Files

MSR-mining challenge-online-appendix.pdf

Files (95.8 MB)

Name Size Download all
md5:59a1e8a53d16c69c10692f79d68b6eae
285.3 kB Preview Download
md5:609999d455545b8a82bcf260d0a30f78
95.5 MB Preview Download