Published August 16, 2023 | Version v1.0
Dataset Open

HawaiiCoast_GT: Curated AIS for Hawaii's coast correlated with ground truth incidents

Authors/Creators

  • 1. Sandia National Laboratories

Description

Because of the high-risk nature of emergencies and illegal activities at sea, it is critical that algorithms designed to detect anomalies from maritime traffic data be robust. However, there exist no publicly available maritime traffic datasets with real-world labelled anomalies. As a result, most anomaly detection algorithms for maritime traffic are validated without ground truth. We introduce the HawaiiCoast_GT dataset, the first ever publicly available automatic identification system dataset with a large corresponding set of true anomalous incidents. This dataset—cleaned and curated from Bureau of Ocean Energy Management (BOEM) and National Oceanic and Atmospheric Administration (NOAA) automatic identification system (AIS) data--covers Hawaii’s coastal waters for four years (2017-2020) and contains 88,749,176 AIS points for a total of 2,622 unique vessels. 208 tracks are labelled corresponding to 154 labelled real-world incidents. The codebase used to curate the original AIS data is being made openly available on GitHub.

Notes

Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-NA0003525. The views expressed in the article do not necessarily represent the view of the U.S. DOE or the United States Government.

Files

HawaiiCoast_GT.zip

Files (3.0 GB)

Name Size Download all
md5:e7dd0951d1512357af7ca50cad710ebe
3.0 GB Preview Download

Additional details

Related works

Is described by
Journal article: 10.1007/s44289-023-00001-6 (DOI)

Software

Repository URL
https://github.com/sandialabs/HawaiiCoast_GT_Code_Generation
Programming language
Python
Development Status
Active

References