Published November 8, 2024 | Version v1
Conference paper Open

Purity: a New Dimension for Measuring Data Centralization Quality

  • 1. ROR icon Tecnalia
  • 2. ROR icon Universidad de Deusto
  • 3. DeustoTech - Deusto Institute of Technology

Description

Data has become an asset for companies, originating from various sources, such as IoT paradigms. It is crucial to safeguard its life cycle using suitable, scalable, and effective technologies, like those enabled by cloud computing models. However, in order to extract value from this data, complementary processes of collection, refinement, cleaning, or modeling, among many others, are required. Furthermore, organizations greatly vary in their methodologies and approaches to handling data, which further emphasizes the need for standardized techniques. In this regard, data management methodologies promote the adoption of the various dimensions of
data quality in order to ensure the reliability of data across different systems and processes. The main contribution of this manuscript is the proposal of a new data quality dimension, coined purity, to measure the importance of the data in a processing pipeline topology. As a result, organizations can better guarantee the quality of their datasets in order to raise the success of data-driven endeavors within organizations. The proposed methodology is validated in an urban mobility use case.

Files

3694860.3694862.pdf

Files (649.6 kB)

Name Size Download all
md5:421a331b0380124c898316667b3d82e1
649.6 kB Preview Download

Additional details

Funding

DATA Monetization, Interoperability, Trading & Exchange Grant agreement ID: 101092989
European Commission