Preprint Open Access
Roman, Dumitru; Nikolov, Nikolay; Elvesæter, Brian; Soylu, Ahmet; Radu, Prodnan; Kimovski, Dragi; Marrella, Andrea; Leotta, Francesco; Benvenuti, Dario; Matskin, Mihhail; Ledakis, Giannis; Simonet-Boulogne, Anthony; Perales, Fernando; Kharlamov, Evgeny; Ulisses, Alexandre; Solberg, Arnor; Ceccarelli, Raffaele
With the recent developments of Internet of Things (IoT) and cloud-based technologies, massive amounts of data are generated by heterogeneous sources and stored through dedicated cloud solutions. Often organizations generate much more data than they are able to interpret, and current Cloud Computing technologies cannot fully meet the requirements of the Big Data processing applications and their data transfer overheads. Many data are stored for compliance purposes only but not used and turned into value, thus becoming Dark Data, which are not only an untapped value, but also pose a risk for organizations. To guarantee a better exploitation of Dark Data, the DataCloud project aims to realize novel methods and tools for effective and efficient management of the Big Data Pipeline lifecycle encompassing the Computing Continuum.