Dataset Open Access

A Longitudinal Analysis of Bloated Java Dependencies

César, Soto-Valero; Thomas Durieux; Benoit Baudry

A Longitudinal Analysis of Bloated Java Dependencies

This repository contains the data and script for the paper "A Longitudinal Analysis of Bloated Java Dependencies"

Repository Structure

- dataset
  - projects.csv            # list of 500 projects used in the paper
  - commits.csv             # list of commits that are analyzed
  - project_dependabot.json # dependabot commits for each project
  - project_releases.json   # commits associated to a release for each project
- dependency_usage_tree
  - <project>
    - <commit>
      - depclean.json       # the dependency usage tree extracted by Deplean
      -     # Maven compilation log
      -    # Deplean log
- script
  - create_dataset.js       # ceate projects.csv and commits.csv based on project_releases.json and project_dependabot.json
  - read_dependency_usage_tree.js # extract the information from dependency_usage_tree and generate a csv file
  -             # read dependency_usage_tree.csv and generate the macro and table for the paper
Files (3.2 GB)
Name Size
3.2 GB Download
All versions This version
Views 5450
Downloads 66
Data volume 19.0 GB19.0 GB
Unique views 5148
Unique downloads 55


Cite as