Published July 1, 2010 | Version v1
Figure Open

Data format figures-DATA MINING LEARNING MODELS AND ALGORITHMS ON A SCADA SYSTEM DATA REPOSITORY

  • 1. Eds. B. Iantovics

Description

The original data set included noisy, missing and inconsistent data. Data
preprocessing improved the quality of the data and facilitated e±cient data
mining tasks.
Before the experiment, we prepared data suitable to next operation as
following steps:
² Delete or replace missing values;
² Delete redundant properties (columns);
² Data Transformation;
² Data Discretization;
² Export data to a required .ar® or .csv format ¯le [11].
The original and modi¯ed formats of data set are shown in Figure 1 and
Figure 2.
Data visualization is also a very useful technique because it helps to deter-
mine the di±culty of the learning problem. We visualized with Weka single
attributes (1-d) and pairs of attributes (2-d). The ¯gure 3 shows the variation
of the temperature in time.

Notes

https://www.edusoft.ro/brain/index.php/brain/issue/view/7

Files

data format.jpg

Files (172.1 kB)

Name Size Download all
md5:def9767a73ba281b24cb8be0a61f3819
172.1 kB Preview Download