Published November 6, 2025 | Version v1
Dataset Open

AstroML-Datasets: Introductory Machine Learning Tasks Across Physics and Astronomy

Authors/Creators

  • 1. ROR icon Clemson University

Description

AstroML-Dataset is an open educational initiative designed to enable introductory-level machine learning tasks across physics and astronomy domains.
It provides curated, ML-ready datasets that enable learners and researchers to explore real astronomical data through accessible problems, such as galaxy redshift predictionphotometric classification, and source separation, using data from surveys like SDSS and HSC.

Each dataset includes clearly defined input features and target variables, supporting both teaching and research use in astroinformatics, data science, and machine learning.

For detailed documentation, data structure, and example notebooks, please visit the project repository:
🔗 https://github.com/srinadh99/AstroML-Datasets/tree/main

Files

PhotoZ_SDSS.csv

Files (148.4 MB)

Name Size Download all
md5:6d9e6c2e3d027dc609d34e34af42acc1
148.4 MB Preview Download

Additional details

Dates

Collected
2025-11-06