Published August 30, 2024 | Version 2024-08-30 & metrics
Dataset Open

Goblin: Neo4J Maven Central dependency graph

  • 1. SAP, LIP6, Sorbonne University

Description

This repository contains a Neo4j dump of  Maven Central dependency graph generated using goblinDependencyMiner.
To import this graph into neo4j, please use a version 4.x.

Our dependency graph structure and metamodel are shown in images "goblin_dg_structure" and "metamodel".

The latest available version dates from August 30, 2024, contains 15,117,217 nodes (658,078 libraries and 14,459,139 releases) and 134,119,545 edges (119,660,406 dependencies and 14,459,139 versioning edges).

This repository contains two dump of the database:

  • goblin_maven_30_08_24.dump: This dataset contains the entire Maven Central dependency graph.
  • with_metrics_goblin_maven_30_08_24.dump: This dataset is the same as the previous one, but enriched with new “AddedValue” nodes (44,035,495 new nodes) representing the following metrics: CVE (dated September 4, 2024), freshness, popularity and speed. More information in this tutorial.

More details in the dedicated paper: Goblin: A Framework For Enriching And Querying the Maven Central Dependency Graph (https://doi.org/10.1145/3643991.3644879) - 21st International Conference on Mining Software Repositories (MSR'24).
If you use it, please cite this paper: https://dl.acm.org/doi/10.1145/3643991.3644879

⚠️ This dataset is the subject of the Mining Challenge at the MSR 2025 conference, more information here.

Files

goblin_dg_structure.png

Files (15.3 GB)

Name Size Download all
md5:13e7922b1ec6d95b15fff38f31bb6cfa
27.3 kB Preview Download
md5:703624adc0d9b21a2edac5648969bcc7
4.8 GB Download
md5:ae500ebc6efb629c4ccc96b3c7f4e152
28.5 kB Preview Download
md5:09fc5de0e87fb1b06e373f30fdc1b91a
10.6 GB Download

Additional details

Dates

Updated
2024-08-30