Published September 1, 2014 | Version v1
Report Open

Making sense of data streams: Complex Event Processing for Controls Applications

  • 1. CERN openlab Summer Student
  • 2. Summer Student Supervisor

Description

Project Specification

CERN is currently investigating the usage of data analysis technologies to study the behaviour of the industrial control systems. An activity related to these analysis is using Complex Event Processing (CEP) tools to classify a real abnormal behaviour from one generated by a human intervention on the system. In this study the Complex Event Processing classification is run over signals produced by variety of sensors and simulators. The selected tools is Esper. Presented here work consists of installing chosen tool, and developing the classification system with it to address given above merit.

Abstract

This project aims at building a tool to process live streams of data produced by various sensors and artificial generators. To this end, a Java code is written, which uses Esper Complex Event Processing package to: receive data feeds, apply user defined rules and filters, and pass the resulting information to a clustering framework. The last step employs Affinity Propagation based clustering algorithm, which choice is motivated by its dynamic adaptation to number of clusters in the data. This is key feature in data streaming scenario as the number of clusters can evolve with time. Furthermore, [Zhang et al., 2013] have documented overall good performance of Affinity Propagation in cases of live data analysis. Finally, presented here approach is compared and contrasted against static clustering algorithms applied to data gathered from streams incoming over one run of the program, followed by in-depth results analysis.

Files

CERN_openlab_Kacper_Sokol_Blazej.pdf

Files (2.6 MB)

Name Size Download all
md5:1ea26edc0a5c3ca61c41cc13dc07c5bf
2.6 MB Preview Download