Report Open Access
Panova, Denitsa Ivova; Martin Marquez, Manuel; Romero Marin, Antonio
Nowadays the world is over flooded with information. With the rise of Big Data, huge amounts of information, diverging in their variety, velocity and volume, a new data analytics branch has developed. It has emerged from the need to offer to the researchers an easier way, an intuitive software with which to analyze the accumulated pile of information.
The purpose of this project is to investigate some of these data analytics platforms and evaluate them for the unique requirements of CERN environment and use cases. The question which needs to be answered is - are those platforms able to facilitate the users to conduct analysis without the necessity to acquire new skill set. Initial point of the report is to give context of the project and to specify why the use of Big Data technologies is required at CERN. Additionally, it gives an overview of the Big Data tools which are used in the evaluated platforms. The core of the report consists of the mechanism behind each of three data analytics platforms and their evaluation in terms of their merits and downsides.