Journal article Open Access

Frictionless Data and Data Packages

Jo Barratt; Serah Rono; Paul Walsh

There is significant friction in the acquisition, sharing, and reuse of research data. It is estimated that eighty percent of data analysis is invested in the cleaning and mapping of data. This friction hampers researchers not well versed in data processing techniques from reusing an ever-increasing amount of research data available on the web and within scientific data repositories. Frictionless Data is an ongoing project at Open Knowledge International focused on removing this friction in a variety of circumstances. We are doing this by developing a set of tools, specifications, and best practices for describing, publishing, and validating data. The heart of this project is the “Data Package”, a containerization format for data based on existing practices for publishing open-source software.

Preprint submitted to RO2018 workshop at IEEE eScience Conference 2018
Files (49.9 kB)
Name Size
RO2018FrictionlessDataandDataPackages.html
md5:1deb39c1fb1c5e4b964275edcd694e2e
49.9 kB Download
60
22
views
downloads
All versions This version
Views 6060
Downloads 2222
Data volume 1.1 MB1.1 MB
Unique views 5757
Unique downloads 2121

Share

Cite as