Published April 20, 2025 | Version 2025-04-20 & metrics
Dataset Open

Goblin: Neo4J Maven Central dependency graph

  • 1. SAP, LIP6, Sorbonne University
  • 2. ROR icon Université Paris Nanterre

Description

This repository contains a Neo4j dump of  Maven Central dependency graph generated using goblinDependencyMiner.
To import this graph into neo4j, please use a version 4.x.

Our dependency graph structure and metamodel are shown in images "goblin_dg_structure" and "metamodel".

The latest available version dates from April 20, 2025, contains 16,939,391 nodes (712,509 libraries and 16,226,882 releases) and 152,434,085 edges (136,207,203 dependencies and 16,226,882 versioning edges).

This repository contains two dump of the database:

  • goblin_maven_20_04_25.dump: This dataset contains the entire Maven Central dependency graph.
  • with_metrics_goblin_maven_20_04_25.dump: This dataset is the same as the previous one, but enriched with new “AddedValue” nodes (49,393,155 new nodes) representing the following metrics: CVE (dated may 13, 2025), freshness, popularity and speed. More information in this tutorial.

More details in the dedicated paper: Goblin: A Framework For Enriching And Querying the Maven Central Dependency Graph (https://doi.org/10.1145/3643991.3644879) - 21st International Conference on Mining Software Repositories (MSR'24).
If you use it, please cite this paper: https://dl.acm.org/doi/10.1145/3643991.3644879

⚠️ This dataset is the subject of the Mining Challenge at the MSR 2025 conference, more information here.

Files

graphStructure.png

Files (15.1 GB)

Name Size Download all
md5:ca3c5188626d0f089b7cb1fde11ad084
6.2 GB Download
md5:fcacfd99147b1c108f05aaaa68bec9e6
74.4 kB Preview Download
md5:a94b8f66d5987cddafa63b8a8ebe8096
8.9 GB Download

Additional details

Dates

Updated
2025-04-20