Goblin: Neo4J Maven Central dependency graph
Description
This repository contains a Neo4j dump of Maven Central dependency graph generated using goblinDependencyMiner.
To import this graph into neo4j, please use a version 4.x.
Our dependency graph structure and metamodel are shown in images "goblin_dg_structure" and "metamodel".
The latest available version dates from August 30, 2024, contains 15,117,217 nodes (658,078 libraries and 14,459,139 releases) and 134,119,545 edges (119,660,406 dependencies and 14,459,139 versioning edges).
This repository contains two dump of the database:
- goblin_maven_30_08_24.dump: This dataset contains the entire Maven Central dependency graph.
- with_metrics_goblin_maven_30_08_24.dump: This dataset is the same as the previous one, but enriched with new “AddedValue” nodes (44,035,495 new nodes) representing the following metrics: CVE (dated September 4, 2024), freshness, popularity and speed. More information in this tutorial.
More details in the dedicated paper: Goblin: A Framework For Enriching And Querying the Maven Central Dependency Graph (https://doi.org/10.1145/3643991.3644879) - 21st International Conference on Mining Software Repositories (MSR'24).
If you use it, please cite this paper: https://dl.acm.org/doi/10.1145/3643991.3644879
⚠️ This dataset is the subject of the Mining Challenge at the MSR 2025 conference, more information here.
Files
goblin_dg_structure.png
Additional details
Dates
- Updated
-
2024-08-30
Software
- Repository URL
- https://github.com/Goblin-Ecosystem/goblinDependencyMiner