APIBench: A Benchmark Dataset for Evaluating API Recommendation Approaches in Python and Java

Peng, Yun; Li, Shuqing; Gu, Wenwei; Li, Yichen; Wang, Wenxuan; Gao, Cuiyun; Lyu, Michael

doi:10.5281/zenodo.5797297

Published December 22, 2021 | Version v1.0

Dataset Open

APIBench: A Benchmark Dataset for Evaluating API Recommendation Approaches in Python and Java

1. The Chinese University of Hong Kong
2. Harbin Institute of Technology, Shenzhen

APIBench is the benchmark dataset APIBench released in the paper "Revisiting, Benchmarking and Exploring APIRecommendation: How Far Are We?".

APIBench contains two sub-dataset for evaluating the performance of query-based and code-based API recommendation approaches, namely APIBench-Q and APIBench-C. Each sub-dataset has a Java version and a Python version.

APIBench-Q contains 4,309 Python queries and 6,563 Java queries collected from Stack Overflow posts generated from Aug 2008 to Feb 2021 and tutorial websites Geeks4Geeks, Java2s, and Kode Java in April 2021.

APIBench-C contains 2,361 Python projects and 1,477 Java projects mined from GitHub in April 2021.

Please read the README.md file for detailed information about the benchmark.

The evaluation results of existing API recommendation approaches can be found in this GitHub repository.

Files

APIBench.zip

Files (7.8 GB)

Name	Size	Download all
APIBench.zip md5:56bf7573a7b26b26613d91bfa63f340f	7.8 GB	Preview Download
README.md md5:9382056c539c10c5a8d8e2d5db5c9192	91.6 kB	Preview Download

Additional details

Is identical to: Journal article: 10.1109/TSE.2022.3197063 (DOI)

Citations

Oops! Something went wrong while fetching results.

	All versions	This version
Views	2,334	1,876
Downloads	1,174	988
Data volume	11.5 TB	10.8 TB

APIBench: A Benchmark Dataset for Evaluating API Recommendation Approaches in Python and Java

Creators

Description

Files

APIBench.zip

Files (7.8 GB)

Additional details

Related works