Dataset Open Access

Maven central dependency graph

Amine Benelallam; Nicolas Harrand; César Soto Valero; Benoit Baudry; Olivier Barais


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Maven Repository</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Mining OSS</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Dependency Graph</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Maven Central</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">JVM</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Open-source</subfield>
  </datafield>
  <controlfield tag="005">20200124192514.0</controlfield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">The Maven dependency graph is the fruit of a collaboration between the DiverSE team (Inria Rennes, France) and CASTOR project (KTH, Sweden). Instructions on how to use and reproduce the dataset can be found in the dataset's repository on [Github](https://github.com/diverse-project/maven-miner). A complete description of the dataset and usages can be found in the accompanying [paper] (https://arxiv.org/abs/1901.05392).</subfield>
  </datafield>
  <controlfield tag="001">1489120</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">KTH, Sweden</subfield>
    <subfield code="0">(orcid)0000-0002-2491-2771</subfield>
    <subfield code="a">Nicolas Harrand</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">KTH, Sweden</subfield>
    <subfield code="0">(orcid)0000-0003-0541-6411</subfield>
    <subfield code="a">César Soto Valero</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">KTH, Sweden</subfield>
    <subfield code="0">(orcid)0000-0002-4015-4640</subfield>
    <subfield code="a">Benoit Baudry</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Diverse team, Inria, Univ Rennes, CNRS, IRISA, France</subfield>
    <subfield code="0">(orcid)0000-0002-4551-8562</subfield>
    <subfield code="a">Olivier Barais</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">71985268</subfield>
    <subfield code="z">md5:f95582c55246e826cf2f8bc009746b7c</subfield>
    <subfield code="u">https://zenodo.org/record/1489120/files/maven-data.csv.tar.xz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">2756433408</subfield>
    <subfield code="z">md5:247fb32f6d431b59c21c6bd2504b3222</subfield>
    <subfield code="u">https://zenodo.org/record/1489120/files/maven-data.docker.tar</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">282856820</subfield>
    <subfield code="z">md5:e34db6419bd1541b1eda86002ff15267</subfield>
    <subfield code="u">https://zenodo.org/record/1489120/files/maven-data.raw.tar.xz</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2018-11-15</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-msr</subfield>
    <subfield code="o">oai:zenodo.org:1489120</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="4">
    <subfield code="c">1--5</subfield>
    <subfield code="n">Computer Science - Software Engineering</subfield>
    <subfield code="p">The Maven Dependency Graph: a Temporal Graph-based Representation of Maven Central</subfield>
    <subfield code="v">arXiv e-prints</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Diverse team, Inria, Univ Rennes, CNRS, IRISA, France</subfield>
    <subfield code="0">(orcid)0000-0003-3064-8302</subfield>
    <subfield code="a">Amine Benelallam</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Maven central dependency graph</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-msr</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">731529</subfield>
    <subfield code="a">Software Testing AMPlification</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by-sa/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution Share Alike 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;The Maven dependency graph is an open dataset of Maven Central artifacts, their dependencies, as well as other relationships. Its main intent is to domesticate the wild within and around the Maven central ecosystem, in particular, and JVM-based libraries at large, making it more harnessable to both academics and industry. It is intended to answer high-level research questions concerning artifacts releases, evolution, and usage trends over time. It can also be used to assist researchers in selecting relevant datasets, among the mass of existing software artifact, for assessing particular empirical software engineering challenges. The complexity of these questions can range from simple pattern matching to advanced big data analysis and machine learning techniques.&lt;br&gt;
&lt;br&gt;
The accompanying paper to this dataset is has been accepted for publication in the proceedings of the International Conference on Mining Software Repositories 2019 and has received the MSR 2019 Data Showcase Award. This paper is available for download&amp;nbsp;on &lt;a href="https://arxiv.org/abs/1901.05392"&gt;arXiv&lt;/a&gt;.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.1489119</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.1489120</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
2,496
2,063
views
downloads
All versions This version
Views 2,4962,496
Downloads 2,0632,063
Data volume 3.9 TB3.9 TB
Unique views 2,2522,252
Unique downloads 581581

Share

Cite as