Published August 30, 2019 | Version 1.0
Conference paper Open

Implementing big data lake for heterogeneous data sources

  • 1. University of Oulu
  • 2. *University of Oulu, Finland
  • 3. Dell EMC, Ireland
  • 4. Draxis Environmental S.A

Description

Modern connected cities increasingly leverage advances in ICT to improve their services and the quality of life of their inhabitants. Data generated from different sources, such as environmental sensors, social networking platforms, and traffic counters, are harnessed to achieve these goals. However, collecting, integrating, and analyzing all the heterogeneous data sources available from the cities is a challenge. This article suggests a data lake approach, built on Big Data technologies, that gathers all the data together for further analysis. The platform described here enables data collection, storage, and integration, as well as further analysis and visualization of the results. This solution is the first attempt to integrate a diverse set of data sources from four pilot cities as part of the CUTLER project (Coastal urban development through the lenses of resiliency). The design and implementation details, as well as usage scenarios, are presented in this paper.

Notes

Hassan Mehmood, Ekaterina Gilman, Marta Cortes, Panos Kostakos, Andrew Byrne, Katerina Valta, Stavros Tekes and Jukka Riekki, "Implementing big data lake for heterogeneous data sources", in Proc. International Workshop on Data-Driven Smart Cities (DASC 2019), Macau SAR, China, April 2019. (DOI: https://doi.org/10.1109/ICDEW.2019.00-37)

Files

Implementing big data lake for heterogeneous data sources.pdf (613.1 kB)

Additional details

Funding

CUTLER – Coastal Urban developmenT through the LEnses of Resiliency 770469
European Commission