There is a newer version of this record available.

Dataset Open Access Open Source Repository and Dependency Metadata

Andrew Nesbitt; Benjamin Nickolls

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="">
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">software</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">dependencies</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">open source</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">package managers</subfield>
  </datafield>
  <controlfield tag="005">20200213211013.0</controlfield>
  <controlfield tag="001">808273</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Benjamin Nickolls</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">5894601554</subfield>
    <subfield code="z">md5:be8015f7e70481da6b43af63372be626</subfield>
    <subfield code="u"></subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2017-06-15</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o"></subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Andrew Nesbitt</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Open Source Repository and Dependency Metadata</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Creative Commons Attribution Share Alike 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2"></subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;&lt;strong&gt;What is in this release?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;In this release you will find data about software distributed and/or crafted publicly on the Internet. You will find information about its development, its distribution and its relationship with other software included as a dependency. You will not find any information about the individuals who create and maintain these projects.&lt;/p&gt;

&lt;p&gt;Further information and documentation on this data set can be found at&lt;/p&gt;

&lt;p&gt;For enquiries please contact&lt;/p&gt;

&lt;p&gt;This dataset contains seven csv files:&lt;/p&gt;


&lt;p&gt;&lt;strong&gt;Projects&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;A project is a piece of software available on any one of the 33 supported package managers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Versions&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;A version is an immutable published version of a Project from a package manager. Not all package managers have a concept of publishing versions; some rely directly on tags or branches from a revision control tool.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Tags&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;A tag is equivalent to a tag in a revision control system. Tags are sometimes used instead of Versions where a package manager does not use the concept of versions. Tags are often semantic version numbers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Dependencies&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Dependencies describe the relationship between a project and the software it builds upon. Dependencies belong to a Version. Each Version can have a different set of dependencies. Dependencies point at a specific Version or range of versions of other projects.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Repositories&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;A repository represents a publicly accessible source code repository from one of several code hosting sites. Repositories are distinct from Projects: they are not distributed via a package manager and are typically an application for end users rather than a component to build upon.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Repository dependencies&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;A repository dependency is a dependency upon a Version from a package manager that has been specified in a manifest file, either as a manually added dependency committed by a user or as a generated dependency listed in a lockfile that was automatically generated by a package manager and committed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Projects with related Repository fields&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This is an alternative projects export that denormalizes a project's related source code repository inline to reduce the need to join between two data sets.&lt;/p&gt;


&lt;p&gt;This dataset is released under the Creative Commons Attribution-ShareAlike 4.0 International Licence.&lt;/p&gt;

&lt;p&gt;This licence provides the user with the freedom to use, adapt and redistribute this data. In return the user must publish any derivative work under a similarly open licence, attributing as a data source. The full text of the licence is included in the data.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Access, Attribution and Citation&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The dataset is available to download from Zenodo.&lt;/p&gt;

&lt;p&gt;Please attribute as a data source by including the words ‘Includes data from’ and reference the Digital Object Identifier: 10.5281/Zenodo.808273.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.808272</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.808273</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
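
A record like the one above can be read programmatically. The following is a minimal Python sketch, not part of the original record, that uses only the standard library to pull subfields out of a MARC21 XML record and to check a downloaded file against the md5 checksum stored in field 856. The helper names (`local`, `subfields`, `md5_matches`) and the trimmed `SAMPLE` record are illustrative assumptions, and the sketch assumes the record is well-formed XML with closed `datafield` elements.

```python
import hashlib
import xml.etree.ElementTree as ET

# Trimmed, hypothetical copy of the record, limited to the fields used below.
SAMPLE = """<record>
  <controlfield tag="001">808273</controlfield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">software</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">dependencies</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">5894601554</subfield>
    <subfield code="z">md5:be8015f7e70481da6b43af63372be626</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.808273</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
</record>"""


def local(tag):
    """Drop a namespace prefix such as '{uri}datafield' from an element tag."""
    return tag.rsplit("}", 1)[-1]


def subfields(record, tag, code):
    """Return the text of every subfield `code` in datafields numbered `tag`."""
    values = []
    for field in record.iter():
        if local(field.tag) != "datafield" or field.get("tag") != tag:
            continue
        for sub in field:
            if local(sub.tag) == "subfield" and sub.get("code") == code and sub.text:
                values.append(sub.text.strip())
    return values


def md5_matches(path, expected, chunk_size=1 << 20):
    """Compare a file's MD5 digest with a value written as 'md5:<hex>'."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte archive is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected.split(":", 1)[-1].lower()


record = ET.fromstring(SAMPLE)
print("keywords:", subfields(record, "653", "a"))
print("doi:", subfields(record, "024", "a"))
print("checksum:", subfields(record, "856", "z"))
```

After downloading the archive, a call such as `md5_matches(local_path, subfields(record, "856", "z")[0])` would confirm that the file matches the checksum in the record, where `local_path` is wherever the file was saved.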