Published March 21, 2017 | Version v1
Conference paper Open

In-DBMS Sampling-based Sub-trajectory Clustering

  • 1. Dept. of Statistics and Ins. Science University of Piraeus Piraeus, Greece
  • 2. ept. of Informatics University of Piraeus Piraeus, Greece
  • 3. Dept. of Informatics University of Piraeus Piraeus, Greece
  • 4. Dept. of Business Administration TEI of Crete Agios Nikolaos, Crete, Greece

Description

In this paper, we propose an efficient in-DBMS solution for the problem of sub-trajectory clustering and outlier detection in large moving object datasets. The method relies on a two-phase process: a voting-and-segmentation phase that segments trajectories according to a local density criterion and trajectory similarity criteria, followed by a sampling-and-clustering phase that selects the most representative sub-trajectories to be used as seeds for the clustering process. Our proposal, called S 2 T-Clustering (for Sampling-based Sub-Trajectory Clustering) is novel since it is the first, to our knowledge, that addresses the pure spatiotemporal sub-trajectory clustering and outlier detection problem in a real-world setting (by ‘pure’ we mean that the entire spatiotemporal information of trajectories is taken into consideration). Moreover, our proposal can be efficiently registered as a database query operator in the context of extensible DBMS (namely, PostgreSQL in our current implementation). The effectiveness and the efficiency of the proposed algorithm are experimentally validated over synthetic and real-world trajectory datasets, demonstrating that S 2 T-Clustering outperforms an off-the-shelf in-DBMS solution using PostGIS by several orders of magnitude.

Files

paper-262.pdf

Files (3.4 MB)

Name Size Download all
md5:8c46e7ce745cc1a78f3e75cbf6165cd7
3.4 MB Preview Download

Additional details

Funding

European Commission
datACRON - Big Data Analytics for Time Critical Mobility Forecasting 687591