There is a newer version of the record available.

Published July 30, 2019 | Version v2
Dataset Open

Wikimedia Commons photos by prominent users and their usage across the web

  • 1. Wikimedia Italia

Description

Extract from the Wikimedia Commons database containing a list of users selected by the community for having uploaded high quality photos; list of 310k photos of theirs and of the subset of 59k photos sent to Infringement.Report for matching; list of domains whose matches were ignored as not useful for copyleft license enforcement. Domains were then matched for their rank in the Tranco list and the number of image usages found, and ranked by a mix of the two criteria.

Notes

CC-0 does not apply to the Tranco list itself, for which see its website.

Files

PhotoClaimIgnoredDomains.txt

Files (107.0 MB)

Name Size Download all
md5:3b1ef44a268cadf06735d1e698438280
17.0 MB Download
md5:3258a7d6bc738caabdac0f9c0baf9587
29.6 MB Download
md5:ec5821d5413af83b5bdd4b624e5a5dfe
552.8 kB Download
md5:cd840068d72f8af5fba518331af1798e
1.7 MB Download
md5:bfd2e76110d0479498e06f442a2a3db9
58.1 MB Download
md5:1ceebf5bc771ffec7280403dbe8406d2
550 Bytes Download
md5:a56718fc1c605cb6bad880e48bde6c6a
817 Bytes Preview Download

Additional details

References

  • Victor Le Pochat, Tom Van Goethem, Samaneh Tajalizadehkhoob, Maciej KorczyƄski, and Wouter Joosen. 2019. "Tranco: A Research-Oriented Top Sites Ranking Hardened Against Manipulation," Proceedings of the 26th Annual Network and Distributed System Security Symposium (NDSS 2019). https://doi.org/10.14722/ndss.2019.23386