Dataset Restricted Access
Maass, Max;
Schwär, Stephan;
Hollick, Matthias
<?xml version='1.0' encoding='utf-8'?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:adms="http://www.w3.org/ns/adms#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dct="http://purl.org/dc/terms/" xmlns:dctype="http://purl.org/dc/dcmitype/" xmlns:dcat="http://www.w3.org/ns/dcat#" xmlns:duv="http://www.w3.org/ns/duv#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:frapo="http://purl.org/cerif/frapo/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:gsp="http://www.opengis.net/ont/geosparql#" xmlns:locn="http://www.w3.org/ns/locn#" xmlns:org="http://www.w3.org/ns/org#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:prov="http://www.w3.org/ns/prov#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:schema="http://schema.org/" xmlns:skos="http://www.w3.org/2004/02/skos/core#" xmlns:vcard="http://www.w3.org/2006/vcard/ns#" xmlns:wdrs="http://www.w3.org/2007/05/powder-s#"> <rdf:Description rdf:about="https://doi.org/10.5281/zenodo.6509751"> <rdf:type rdf:resource="http://www.w3.org/ns/dcat#Dataset"/> <dct:type rdf:resource="http://purl.org/dc/dcmitype/Dataset"/> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">https://doi.org/10.5281/zenodo.6509751</dct:identifier> <foaf:page rdf:resource="https://doi.org/10.5281/zenodo.6509751"/> <dct:creator> <rdf:Description rdf:about="http://orcid.org/0000-0001-9346-8486"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">0000-0001-9346-8486</dct:identifier> <foaf:name>Maass, Max</foaf:name> <foaf:givenName>Max</foaf:givenName> <foaf:familyName>Maass</foaf:familyName> <org:memberOf> <foaf:Organization> <foaf:name>Secure Mobile Networking Lab, TU Darmstadt</foaf:name> </foaf:Organization> </org:memberOf> </rdf:Description> </dct:creator> <dct:creator> <rdf:Description rdf:about="http://orcid.org/0000-0002-9728-3937"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> 
<dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">0000-0002-9728-3937</dct:identifier> <foaf:name>Schwär, Stephan</foaf:name> <foaf:givenName>Stephan</foaf:givenName> <foaf:familyName>Schwär</foaf:familyName> <org:memberOf> <foaf:Organization> <foaf:name>Secure Mobile Networking Lab, TU Darmstadt</foaf:name> </foaf:Organization> </org:memberOf> </rdf:Description> </dct:creator> <dct:creator> <rdf:Description rdf:about="http://orcid.org/0000-0002-9163-5989"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> <dct:identifier rdf:datatype="http://www.w3.org/2001/XMLSchema#string">0000-0002-9163-5989</dct:identifier> <foaf:name>Hollick, Matthias</foaf:name> <foaf:givenName>Matthias</foaf:givenName> <foaf:familyName>Hollick</foaf:familyName> <org:memberOf> <foaf:Organization> <foaf:name>Secure Mobile Networking Lab, TU Darmstadt</foaf:name> </foaf:Organization> </org:memberOf> </rdf:Description> </dct:creator> <dct:title>A Dataset of over 500.000 commercial email newsletters, as collected by PrivacyMail.info</dct:title> <dct:publisher> <foaf:Agent> <foaf:name>Zenodo</foaf:name> </foaf:Agent> </dct:publisher> <dct:issued rdf:datatype="http://www.w3.org/2001/XMLSchema#gYear">2022</dct:issued> <dct:issued rdf:datatype="http://www.w3.org/2001/XMLSchema#date">2022-06-12</dct:issued> <owl:sameAs rdf:resource="https://zenodo.org/record/6509751"/> <adms:identifier> <adms:Identifier> <skos:notation rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">https://zenodo.org/record/6509751</skos:notation> <adms:schemeAgency>url</adms:schemeAgency> </adms:Identifier> </adms:identifier> <dct:isVersionOf rdf:resource="https://doi.org/10.5281/zenodo.6509750"/> <dct:description><p>This dataset contains the data from roughly two years of operating PrivacyMail.info, an Open Source Email privacy measurement platform. It contains slightly over 500.000 commercial newsletters, as crowdsourced by users of PrivacyMail.info. 
You can find the methodology discussed in our paper: Max Maass, Stephan Schw&auml;r, and Matthias Hollick. &quot;Towards transparency in email tracking.&quot; Annual Privacy Forum, 2019. The source code can be found on github.com/privacymail/privacymail</p> <p>Please note that, due to its crowdsourced nature, this dataset is a sample of opportunity - it is not representative of all newsletters on the Internet, and likely contains biases based on how it was collected. Notably, German-language newsletters will likely be heavily over-represented.</p> <p><strong>Dataset Structure</strong><br> The dataset is structured as follows: At the top level are folders, one for each website that newsletters belong to. Inside each website folder are subfolders, one for each identity that was registered for that website. Inside each identity folder is a series of .eml files that represent the received email messages.</p> <p><strong>Copyright and Licensing</strong><br> This dataset is set to non-public due to copyright concerns: The contents of the email messages are (presumably) protected by copyright in most jurisdictions. Most copyright doctrines contain exceptions for non-commercial research use - thus, we feel it is appropriate and acceptable to share the data on a case-by-case basis, the same way we did before shutting down PrivacyMail.info. When requesting access to the data, please briefly describe what research you want to conduct with it, and we will grant you access.</p> <p>We thus do not put any explicit license on this dataset. Please do not share the raw data publicly. 
We request that you cite the above-mentioned paper and this dataset in any publications that result from it.</p></dct:description> <dct:accessRights rdf:resource="http://publications.europa.eu/resource/authority/access-right/RESTRICTED"/> <dct:accessRights> <dct:RightsStatement rdf:about="info:eu-repo/semantics/restrictedAccess"> <rdfs:label>Restricted Access</rdfs:label> </dct:RightsStatement> </dct:accessRights> <dcat:distribution> <dcat:Distribution> <dcat:accessURL rdf:resource="https://doi.org/10.5281/zenodo.6509751"/> </dcat:Distribution> </dcat:distribution> </rdf:Description> </rdf:RDF>
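The folder layout described under "Dataset Structure" (website folder, then identity folder, then .eml files) could be traversed roughly as in the following Python sketch. It is an illustration under assumptions, not part of the dataset or of the PrivacyMail code base: the names `NewsletterMessage` and `iter_messages` are hypothetical, and the sketch assumes that the folder names can be used directly as website and identity labels, which the description does not spell out.

```python
from email import policy
from email.parser import BytesParser
from pathlib import Path
from typing import Iterator, NamedTuple


class NewsletterMessage(NamedTuple):
    """One received email, labelled by its position in the folder tree (hypothetical type)."""
    website: str   # name of the top-level folder
    identity: str  # name of the subfolder inside the website folder
    path: Path     # location of the .eml file
    subject: str   # decoded Subject header, empty if the header is missing


def iter_messages(root: Path) -> Iterator[NewsletterMessage]:
    """Yield every .eml file found at <root>/<website>/<identity>/<file>.eml."""
    for eml_path in sorted(root.glob("*/*/*.eml")):
        identity_dir = eml_path.parent
        website_dir = identity_dir.parent
        with eml_path.open("rb") as handle:
            # Only the headers are parsed here; the bodies can be large.
            message = BytesParser(policy=policy.default).parse(handle, headersonly=True)
        yield NewsletterMessage(
            website=website_dir.name,
            identity=identity_dir.name,
            path=eml_path,
            subject=str(message.get("Subject", "")),
        )
```

Calling `iter_messages(Path("path/to/extracted/dataset"))` (a placeholder path) would yield one record per message, which could then be grouped by `website` or `identity` for further analysis.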
Metric | All versions | This version |
---|---|---|
Views | 199 | 199 |
Downloads | 12 | 12 |
Data volume | 86.9 GB | 86.9 GB |
Unique views | 167 | 167 |
Unique downloads | 10 | 10 |