Published July 30, 2025 | Version 1.6.0
Dataset Open

Identifying Astrophysical Anomalies in 99.6 Million Source Cutouts from the Hubble Legacy Archive Using AnomalyMatch

  • 1. ROR icon European Space Astronomy Centre

Description

This repository contains the data released with the paper "Identifying Anomalies in 99.6 Million Source Cutouts from the Hubble Legacy Archive Using AnomalyMatch" (DOI: to be added on publication; arXiv: 2505.03508).

We release the images and descriptions of every anomalous object found in this paper (1,176 unique objects across 18 different anomaly classifications). The main result of the paper, the catalogue of anomalous objects, is contained in DOR_PG+2025_anomalies.csv. This catalogue contains the SourceID, Right Ascension, Declination, classification and the "Status" of each object. The status shows whether an object has appeared in the astrophysical literature prior to this work. This main catalogue describes every object we found.
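As a rough illustration of how the main catalogue might be inspected, the sketch below reads the CSV with Python's standard library and counts objects per classification. The header name "classification" is an assumption (the exact column headers are not stated here, so check the first line of the file), and the function names are hypothetical helpers, not part of any released code.

```python
import csv
from collections import Counter


def read_catalogue(lines):
    """Parse CSV text (an open file or any iterable of lines) into a list of dicts."""
    return list(csv.DictReader(lines))


def count_by_column(rows, column="classification"):
    """Count catalogue rows per value of `column`.

    The default header name "classification" is an assumption; check the
    first line of the CSV for the real column names.
    """
    return dict(Counter(row[column].strip() for row in rows))


def load_and_count(path="DOR_PG+2025_anomalies.csv", column="classification"):
    """Read the catalogue from disk and return the per-class counts."""
    with open(path, newline="", encoding="utf-8") as handle:
        return count_by_column(read_catalogue(handle), column)
```

Calling load_and_count() in the directory holding the downloaded CSV should return a dictionary keyed by classification, which can be compared against the per-class source counts listed further down this page.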

We also include the images corresponding to the catalogue entries. These are contained in the .zip files in the repository, one per classification. The individual filenames are the SourceIDs of the corresponding objects in the main CSV file.
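Since the image filenames are the SourceIDs, a zip can be cross-matched against the catalogue by filename. The following is a minimal sketch under stated assumptions: the SourceID is taken to be the member's file name with any directory and extension removed (the image extension is not stated here, and a SourceID containing a dot would need different handling), and both function names are hypothetical.

```python
import zipfile
from pathlib import PurePosixPath


def index_zip_by_source_id(zip_source):
    """Map SourceID -> archive member name for one classification zip.

    `zip_source` may be a path or a binary file object. The SourceID is
    assumed to be the member's file name without directory or extension.
    """
    with zipfile.ZipFile(zip_source) as archive:
        return {
            PurePosixPath(info.filename).stem: info.filename
            for info in archive.infolist()
            if not info.is_dir()
        }


def split_by_availability(source_ids, index):
    """Return (found, missing) lists of SourceIDs relative to a zip index."""
    found = [sid for sid in source_ids if sid in index]
    missing = [sid for sid in source_ids if sid not in index]
    return found, missing
```

For example, index_zip_by_source_id("unknown.zip") would index the unknown.zip file mentioned in the version history; passing the SourceIDs of the matching catalogue rows to split_by_availability then shows whether every catalogued object has an image.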

Please note: These images have been saved as 150x150 grayscale images using only the F814W filter of the HST. Images with other dimensions, or multi-band photometry, must be obtained by the user.

Please also note: Each sub-category of object has only been classified visually. Therefore, unreferenced objects should be considered candidates.

The anomaly classifications we have found are listed below:

  1. agn - Zip of sources containing an Active Galactic Nucleus (8 sources).
  2. arc - Zip of sources containing a gravitational arc (cluster lensing; 39 sources).
  3. candidate strong lens - Zip of sources containing a gravitational lens (galaxy-galaxy lensing; 140 sources).
  4. clumpy - Zip containing images of a clumpy galaxy (11 sources).
  5. collisional ring - Zip containing a collisional ring galaxy (2 sources).
  6. edge-on protoplanetary disk - Zip of sources which contain edge-on protoplanetary disks (2 sources).
  7. globular cluster - Zip containing sources showing a globular cluster (1 source).
  8. high-z - Zip of sources containing high redshift galaxies (28 sources).
  9. jellyfish - Zip of sources containing a jellyfish galaxy (37 sources).
  10. jet - Zip of sources containing a galaxy with a jet (13 sources).
  11. lensed quasar - Zip containing images of a (quadruply-) lensed quasar (5 sources).
  12. merger - Zip of sources containing an interacting or merging galaxy (629 sources).
  13. odd - Zip containing sources with odd morphology (43 sources).
  14. overlap - Zip of sources which contain two galaxies that overlap in projection but are not interacting (39 sources).
  15. ring - Zip of sources which contain ring galaxies (12 sources).
  16. submillimetre - Zip of sources which contain submillimetre or dusty galaxies (1 source).
  17. supernova - Zip of sources which contain a supernova remnant (1 source).
  18. unknown - Zip of sources which could not be classified visually (43 sources).

Version History:

  1. Version 1: Pre-release of paper and catalogue on arXiv.
  2. Version 1.5: Catalogue and paper submitted to A&A (Changes: revised strong lensing candidate catalogue based on feedback from community).
  3. Version 1.6: Updated catalogue and paper after referee comments (Changes: updated paper, and included unknown.zip file in release).

Files

DOR_PG+2025_anomalies.csv

Files (12.5 MB)

Checksum (MD5)  Size
md5:10e7c18d03abee6adc749336371249f2  66.3 kB
md5:0714e0e820faefc9fc0edf73f5b7cffe  383.4 kB
md5:b1d72e874c4a08ff58a3da7060c310ab  1.5 MB
md5:2d5a68a2d100a084e50777d867d51eb2  135.5 kB
md5:132b28b018712f10617c7b21a84cfce6  21.7 kB
md5:1e5e7a4812f59da8b0c21a42e0b9b6ab  59.6 kB
md5:542c9e61261837599d9c4a89fd9483fa  17.3 kB
md5:7afc4e6a8aca037135258c480ca8e8fe  6.2 kB
md5:9f69fe92f3216e47a2d94b225dcea58d  430.8 kB
md5:90838f675b99bf54e1218bcb7c89cafa  391.2 kB
md5:7b33fd9d5cb9f82dd0fa3da5ad89aee2  102.3 kB
md5:2aceecae9f34df5650d40005df872f91  36.0 kB
md5:2101c8912b0bce2776e15286e71c44e1  6.6 MB
md5:5352fd2e0e7d12cc93610d288b5aa82b  1.8 MB
md5:4b380dda3bda8384aaf0f41a8b37a2bc  381.0 kB
md5:7c185eac29a66d7d2b7f5d09a847e6dc  134.1 kB
md5:0d86d866c0b1d81d2e48a3fe13eb1037  19.3 kB
md5:47652e872c1f585b373070f94ff6159c  466.1 kB
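
The MD5 checksums listed for the files can be used to confirm that a download is intact. Below is a minimal sketch using Python's hashlib; the helper names are hypothetical, and which checksum belongs to which file must be taken from the repository page itself, since that pairing is not reproduced here.

```python
import hashlib


def md5_hexdigest(chunks):
    """Return the MD5 hex digest of an iterable of byte chunks."""
    digest = hashlib.md5()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def file_chunks(path, chunk_size=1 << 20):
    """Yield the contents of a file in chunks so large zips need not fit in memory."""
    with open(path, "rb") as handle:
        while chunk := handle.read(chunk_size):
            yield chunk


def matches_listed_md5(path, listed):
    """Compare a downloaded file against a listed value such as 'md5:10e7...'."""
    expected = listed.removeprefix("md5:").strip().lower()
    return md5_hexdigest(file_chunks(path)) == expected
```

A call such as matches_listed_md5(local_path, "md5:<value from the list>") returns True when the local copy matches the published checksum; a False result suggests the file should be downloaded again.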

Additional details

Related works

Is described by
Publication: arXiv:2505.03508 (arXiv)

Software

Programming language
Python
Development Status
Active