Goblin: Neo4J Maven Central dependency graph
Description
This repository contains a Neo4j dump of Maven Central dependency graph generated using goblinDependencyMiner.
To import this graph into neo4j, please use a version 4.x.
Our dependency graph structure and metamodel are shown in images "goblin_dg_structure" and "metamodel".
The latest available version dates from April 20, 2025, contains 16,939,391 nodes (712,509 libraries and 16,226,882 releases) and 152,434,085 edges (136,207,203 dependencies and 16,226,882 versioning edges).
This repository contains two dump of the database:
- goblin_maven_20_04_25.dump: This dataset contains the entire Maven Central dependency graph.
- with_metrics_goblin_maven_20_04_25.dump: This dataset is the same as the previous one, but enriched with new “AddedValue” nodes (49,393,155 new nodes) representing the following metrics: CVE (dated may 13, 2025), freshness, popularity and speed. More information in this tutorial.
More details in the dedicated paper: Goblin: A Framework For Enriching And Querying the Maven Central Dependency Graph (https://doi.org/10.1145/3643991.3644879) - 21st International Conference on Mining Software Repositories (MSR'24).
If you use it, please cite this paper: https://dl.acm.org/doi/10.1145/3643991.3644879
⚠️ This dataset is the subject of the Mining Challenge at the MSR 2025 conference, more information here.
Files
graphStructure.png
Files
(15.1 GB)
Name | Size | Download all |
---|---|---|
md5:ca3c5188626d0f089b7cb1fde11ad084
|
6.2 GB | Download |
md5:fcacfd99147b1c108f05aaaa68bec9e6
|
74.4 kB | Preview Download |
md5:a94b8f66d5987cddafa63b8a8ebe8096
|
8.9 GB | Download |
Additional details
Dates
- Updated
-
2025-04-20
Software
- Repository URL
- https://github.com/Goblin-Ecosystem/goblinDependencyMiner