Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published July 30, 2019 | Version v3
Dataset Open

Wikimedia Commons photos by prominent users and their usage across the web

  • 1. Wikimedia Italia

Description

Extract from the Wikimedia Commons database containing a list of users selected by the community for having uploaded high quality photos; list of 310k photos of theirs and of the subset of 59k photos sent to Infringement.Report for matching; list of domains whose matches were ignored as not useful for copyleft license enforcement. Domains were then matched for their rank in the Tranco list and the number of image usages found, and ranked by a mix of the two criteria.

Files

2019-Commons-ImageMatchedDomains.csv

Files (91.0 MB)

Name Size Download all
md5:9e8ab06c4511f715929994d5939f1ca9
1.0 MB Preview Download
md5:3258a7d6bc738caabdac0f9c0baf9587
29.6 MB Download
md5:ec5821d5413af83b5bdd4b624e5a5dfe
552.8 kB Download
md5:cd840068d72f8af5fba518331af1798e
1.7 MB Download
md5:bfd2e76110d0479498e06f442a2a3db9
58.1 MB Download
md5:1ceebf5bc771ffec7280403dbe8406d2
550 Bytes Download
md5:a56718fc1c605cb6bad880e48bde6c6a
817 Bytes Preview Download

Additional details

Related works

Cites
Dataset: 10.14722/ndss.2019.23386 (DOI)

References

  • Victor Le Pochat, Tom Van Goethem, Samaneh Tajalizadehkhoob, Maciej KorczyƄski, and Wouter Joosen. 2019. "Tranco: A Research-Oriented Top Sites Ranking Hardened Against Manipulation," Proceedings of the 26th Annual Network and Distributed System Security Symposium (NDSS 2019). https://doi.org/10.14722/ndss.2019.23386