Published January 29, 2020 | Version v1
Presentation Open

The SWH-ID: a digital fingerprint identifying software source code

  • 1. Université Paris Diderot - Paris 7, Inria
  • 2. Inria

Description

The Software Heritage universal archive of software source code relies on
well-established techniques from software development communities: it
identifies the more than 20 billion code artifacts it preserves by
cryptographic hashes, organized in a Merkle DAG data structure.
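
As an illustration of how hash-based identification in a Merkle DAG works, here is a minimal Python sketch. It assumes that content and directory identifiers follow the git-compatible object hashing (SHA-1 over a typed, length-prefixed payload) that SWH-IDs are commonly described as using. The function names are hypothetical and are not the swh-model API. The directory case is deliberately simplified to a flat directory of regular files with mode 100644.

```python
import hashlib
from typing import Dict


def git_object_hash(obj_type: str, payload: bytes) -> str:
    """SHA-1 of a git-style header ("<type> <length>" plus a NUL byte) and the payload."""
    header = f"{obj_type} {len(payload)}".encode("ascii") + b"\0"
    return hashlib.sha1(header + payload).hexdigest()


def content_swhid(data: bytes) -> str:
    """Identifier of a file content (a leaf of the Merkle DAG). Hypothetical helper."""
    return "swh:1:cnt:" + git_object_hash("blob", data)


def directory_swhid(entries: Dict[str, bytes]) -> str:
    """Identifier of a flat directory given as {file name: file bytes}.

    Simplifying assumption: regular files only, no subdirectories,
    symlinks, or executables. Each entry embeds the hash of its child,
    so the directory hash depends on every file below it.
    """
    payload = b""
    for name in sorted(entries, key=lambda n: n.encode("utf-8")):
        child_hash = bytes.fromhex(git_object_hash("blob", entries[name]))
        payload += b"100644 " + name.encode("utf-8") + b"\0" + child_hash
    return "swh:1:dir:" + git_object_hash("tree", payload)


if __name__ == "__main__":
    files = {"README": b"hello\n", "main.c": b"int main(void) { return 0; }\n"}
    print(content_swhid(files["README"]))
    print(directory_swhid(files))
```

Because a directory's identifier is computed from the hashes of its entries, changing a single byte in any file changes the identifier of every directory above it, while identical files and subtrees share one identifier and are stored once.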

Files

2020-01-29-Pidapalooza-swh-id.pdf (4.0 MB)
md5:a8a527fa6a16f506167df64746957051

Additional details

References

  • Jean-François Abramatic, Roberto Di Cosmo, Stefano Zacchiroli. Building the Universal Archive of Source Code. Communications of the ACM, October 2018 (doi:10.1145/3183558)
  • Roberto Di Cosmo, Morane Gruenpeter, Stefano Zacchiroli. Referencing Source Code Artifacts: a Separate Concern in Software Citation. Computing in Science and Engineering, IEEE, pp. 1-9 (doi:10.1109/MCSE.2019.2963148) (hal-02446202)
  • The Software Heritage team. swh-model, version 0.0.52, 2019, software (swh:1:dir:742a129f510ecfcf3859de289bfd623384a37eb0;origin=https://pypi.org/project/swh.model/)