Conference paper Open Access

Extracting Provenance Metadata from Privacy Policies

Pandit, Harshvardhan J.; O'Sullivan, Declan; Lewis, Dave

Privacy policies are legal documents that describe activities over personal data such as its collection, usage, processing, sharing, and storage. Expressing this information as provenance metadata can aid in legal accountability as well as modelling of data usage in real-world use-cases. In this paper, we describe our early work on identification, extraction, and representation of provenance information within privacy policies. We discuss the adoption of entity extraction approaches using concepts and keywords defined by the GDPRtEXT resource along with using annotated privacy policy corpus from the UsablePrivacy project. We use the previously published GDPRov ontology (an extension of PROV-O) to model provenance model extracted from privacy policies.

Files (183.4 kB)
Name Size
preprint.pdf
md5:44b26b61641b0fd93d7bb319f5ef7c69
183.4 kB Download
3
3
views
downloads
All versions This version
Views 33
Downloads 33
Data volume 550.3 kB550.3 kB
Unique views 33
Unique downloads 33

Share

Cite as