Presentation Open Access

The SWH-ID: a digital fingerprint identifying software source code

Di Cosmo, Roberto; Gruenpeter, Morane

The Software Heritage universal archive of software source code relies on
well-established techniques from software development communities to
identify the more than 20 billion code artifacts it preserves: each artifact
is identified by a cryptographic hash within a Merkle DAG data structure.
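
As a rough illustration of the fingerprint idea, the sketch below computes an identifier for a single file's content. It assumes the content identifier follows Git's blob hashing (SHA-1 over a `blob <length>\0` header followed by the file bytes) and the `swh:1:cnt:` prefix form; the function name `swhid_for_content` is hypothetical and is not the swh-model API.

```python
import hashlib


def swhid_for_content(data: bytes) -> str:
    """Return a 'swh:1:cnt:' style identifier for raw file content.

    Assumption: the digest is a Git-style blob hash, i.e. SHA-1 over
    the header b"blob <length>\\0" followed by the content bytes.
    """
    # Git-style blob header: object type, payload length, NUL separator.
    header = b"blob %d\0" % len(data)
    digest = hashlib.sha1(header + data).hexdigest()
    return f"swh:1:cnt:{digest}"


if __name__ == "__main__":
    # Identical bytes always yield the same identifier; changing a
    # single byte yields a different one.
    print(swhid_for_content(b""))
    print(swhid_for_content(b"int main(void) { return 0; }\n"))
```

Because the identifier depends only on the bytes themselves, anyone holding a copy of the file can recompute it and check it without consulting a registry. In a Merkle DAG, the same principle is applied upward: the identifier of a directory is derived from the identifiers of its entries, so a change anywhere below changes every identifier above it.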




  • Jean-François Abramatic, Roberto Di Cosmo, Stefano Zacchiroli. Building the Universal Archive of Source Code. Communications of the ACM, October 2018. (doi:10.1145/3183558)

  • Roberto Di Cosmo, Morane Gruenpeter, Stefano Zacchiroli. Referencing Source Code Artifacts: a Separate Concern in Software Citation. Computing in Science and Engineering, IEEE, pp. 1-9. (doi:10.1109/MCSE.2019.2963148) (hal-02446202)

  • The Software Heritage team, swh-model, 0.0.52, 2019, software (swh:1:dir:742a129f510ecfcf3859de289bfd623384a37eb0;origin=
