Dataset Open Access

Synthetic Dataset for Outlier Detection

Koncar, Philipp

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Koncar, Philipp</dc:creator>
  <dc:description>This synthetically generated dataset can be used to evaluate outlier detection algorithms. It has 10 attributes and 1,000 observations, of which 100 are labeled as outliers. Pairs of consecutive attributes form differently shaped two-dimensional clusters:

	Attribute 0 &amp; Attribute 1: Two circular clusters
	Attribute 2 &amp; Attribute 3: Two banana-shaped clusters
	Attribute 4 &amp; Attribute 5: Three point clouds
	Attribute 6 &amp; Attribute 7: Two point clouds with differing variances
	Attribute 8 &amp; Attribute 9: Three anisotropic clusters

The "outlier" column states whether an observation is an outlier. The .zip file also contains 10 stratified, randomized train/test splits (70% train, 30% test).</dc:description>
  <dc:subject>outlier detection</dc:subject>
  <dc:title>Synthetic Dataset for Outlier Detection</dc:title>
</oai_dc:dc>
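The description mentions 10 stratified, randomized 70/30 train/test splits. As a rough illustration of what such a split looks like, the following is a minimal sketch in pure Python. It does not read the archive: the file names and the exact procedure used to produce the shipped splits are not stated in this record, so the `labels` list is a stand-in built only from the stated counts (900 inliers, 100 outliers), and the 0/1 label encoding, the `stratified_split` helper, and the seeds are assumptions made for illustration.

```python
import random


def stratified_split(labels, test_fraction=0.3, seed=0):
    """Return (train_idx, test_idx) so that each class keeps its share in both parts.

    Hypothetical helper for illustration; not the procedure used to build the archive.
    """
    rng = random.Random(seed)
    train_idx, test_idx = [], []
    for cls in sorted(set(labels)):
        # Collect and shuffle the indices of one class at a time.
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        n_test = round(len(idx) * test_fraction)
        test_idx.extend(idx[:n_test])
        train_idx.extend(idx[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Stand-in labels matching the described counts (assumed encoding: 1 = outlier).
labels = [0] * 900 + [1] * 100

# Ten randomized splits, one per seed.
splits = [stratified_split(labels, test_fraction=0.3, seed=s) for s in range(10)]

train_idx, test_idx = splits[0]
n_test_outliers = sum(labels[i] for i in test_idx)
print(len(train_idx), len(test_idx), n_test_outliers)
```

Because the split is done per class, every test part keeps the 10% outlier rate of the full dataset, which matters when outliers are this rare.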