Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published March 1, 2019 | Version Open Access
Conference paper Open

Eliminating noise in the matrix profile

  • 1. Universiteit Gent

Description

As companies are increasingly measuring their products and services, the amount of time series data is rising
and techniques to extract usable information are needed. One recently developed data mining technique for
time series is the Matrix Profile. It consists of the smallest z-normalized Euclidean distance of each subsequence
of a time series to all other subsequences of another series. It has been used for motif and discord
discovery, for segmentation and as building block for other techniques. One side effect of the z-normalization
used is that small fluctuations on flat signals are upscaled. This can lead to high and unintuitive distances for
very similar subsequences from noisy data. We determined an analytic method to estimate and remove the
effects of this noise, adding only a single, intuitive parameter to the calculation of the Matrix Profile. This
paper explains our method and demonstrates it by performing discord discovery on the Numenta Anomaly
Benchmark and by segmenting the PAMAP2 activity dataset. We find that our technique results in a more
intuitive Matrix Profile and provides improved results in both usecases for series containing many flat, noisy
subsequences. Since our technique is an extension of the Matrix Profile, it can be applied to any o

Files

8605190.pdf

Files (872.0 kB)

Name Size Download all
md5:88aa462b716759067f2dd4ecaaa224ec
872.0 kB Preview Download

Additional details

Funding

Z-BRE4K – Strategies and Predictive Maintenance models wrapped around physical systems for Zero-unexpected-Breakdowns and increased operating life of Factories 768869
European Commission