Published August 25, 2024 | Version 1.0.0
Dataset Open

Three Annotated Anomaly Detection Datasets for Line-Scan Algorithms

  • 1. ROR icon University of Sydney

Description

Summary

This dataset contains two hyperspectral and one multispectral anomaly detection images, and their corresponding binary pixel masks. They were initially used for real-time anomaly detection in line-scanning, but they can be used for any anomaly detection task.

They are in .npy file format (will add tiff or geotiff variants in the future), with the image datasets being in the order of (height, width, channels). The SNP dataset was collected using sentinelhub, and the Synthetic dataset was collected from AVIRIS. The Python code used to analyse these datasets can be found at: https://github.com/WiseGamgee/HyperAD

How to Get Started

All that is needed to load these datasets is Python (preferably 3.8+) and the NumPy package. Example code for loading the Beach Dataset if you put it in a folder called "data" with the python script is:

import numpy as np

# Load image file
hsi_array = np.load("data/beach_hsi.npy")
n_pixels, n_lines, n_bands = hsi_array.shape
print(f"This dataset has {n_pixels} pixels, {n_lines} lines, and {n_bands}.")

# Load image mask
mask_array = np.load("data/beach_mask.npy")
m_pixels, m_lines = mask_array.shape
print(f"The corresponding anomaly mask is {m_pixels} pixels by {m_lines} lines.")

Citing the Datasets

If you use any of these datasets, please cite the following paper:

@article{garske2024erx,
  title={ERX - a Fast Real-Time Anomaly Detection Algorithm for Hyperspectral Line-Scanning},
  author={Garske, Samuel and Evans, Bradley and Artlett, Christopher and Wong, KC},
  journal={arXiv preprint arXiv:2408.14947},
  year={2024},
}
If you use the beach dataset please cite the following paper as well (original source):
@article{mao2022openhsi,
  title={OpenHSI: A complete open-source hyperspectral imaging solution for everyone},
  author={Mao, Yiwei and Betters, Christopher H and Evans, Bradley and Artlett, Christopher P and Leon-Saval, Sergio G and Garske, Samuel and Cairns, Iver H and Cocks, Terry and Winter, Robert and Dell, Timothy},
  journal={Remote Sensing},
  volume={14},
  number={9},
  pages={2244},
  year={2022},
  publisher={MDPI}
}

Files

beach_rgb.png

Files (1.7 GB)

Name Size Download all
md5:07da5b9825f60acd4ef57d85f4cd99b9
599.9 MB Download
md5:a07e4746f598d42a7e4e89208c831ea6
5.6 MB Download
md5:918909ec126bf25a5644ea26643cb7a4
24.4 kB Preview Download
md5:34b679714500155b2f552149c1cd2778
1.8 MB Preview Download
md5:06649fc2773d9321a18feb997aa39195
11.2 MB Download
md5:111c16d97cd250cacf3241bc9611e2c5
38.0 kB Preview Download
md5:82c334b3c092bd1d47c1f5abd4a038e3
72.5 MB Download
md5:9471c5c4480e681c7a4bf7ff8ed427d7
3.1 MB Preview Download
md5:c0625dd6d3d07872bf7760e2c6c48f04
1.0 GB Download
md5:11540f6cab5a1e4686e1af55b7427056
1.4 MB Download
md5:4181fa311f316c3b6c041537d1f8c291
11.7 kB Preview Download
md5:3a2fa8cd2a376cac8a58a198a08756c1
2.8 MB Preview Download

Additional details

Related works

Is published in
Preprint: 10.48550/arXiv.2408.14947 (DOI)
Is variant form of
Journal article: 10.3390/rs14092244 (DOI)

Funding

Australian Research Council
Industrial Transformation Training Centre Grant IC170100023

Software

Repository URL
https://github.com/WiseGamgee/HyperAD
Programming language
Python
Development Status
Active