Published July 28, 2025 | Version 1.0.0
Dataset Open

TransferZ: a photometric dataset for machine learning

  • 1. ROR icon University of California, Los Angeles
  • 2. ROR icon Southern Oregon University
  • 3. ROR icon University of Richmond

Description

Overview

TransferZ is a machine learning ready dataset containing Hyper-Suprime Cam (HSC) PDR2 grizy photometry and COSMOS2020 derived photometric redshifts for 116,335 galaxies in the 2 sq. deg. COSMOS field. The dataset is associated with the paper "Improving Generalization and Uncertainty Quantification of Photometric Redshift Models" by Soriano et al. (2025). It is designed for machine learning applications in astrophysics, particularly for redshift estimation. Soriano et al. (2025) used TransferZ complimentary to GalaxiesML (doi) to test methods of generalizing redshift models. We provide the same train test splits used in the paper. In addition, we provide a "conformal" set which was used in the application of conformal prediction. 

Features

  • Photometry for 116,335 galaxies in five photometric bands (g,r,i,z,y)
  • Photometric redshifts for each galaxy derived from 35-band photometry (see Weaver+22). Redshifts range from 0.01 to 4

Citations

If you make use of any of these products, please cite this repository and Soriano et al. (2025). In addition, for COSMOS2020 data products, please cite Weaver et al. (2022); for HSC PDR2 data, please cite Aihara et al. (2019); for GalaxiesML data products, please cite Do et al (2024).

References:

  1. Aihara et al. (2019). PASJ, 71, 6. [arXiv:1905.12221][doi]
  2. Weaver et al. (2022).ApJS, 258, 11. [arXiv:2110.13923][doi]
  3. Do et al. (2024). [arXiv:2410.00271]
  4. Soriano et al. (2025). ApJ, submitted. 

Files

README.txt

Files (132.0 MB)

Name Size Download all
md5:58cef8c4dd850490994ac79c7ddc481d
7.3 MB Preview Download
md5:09e4c2ef0c5f6b4dfb0cd78c268998f7
7.3 MB Preview Download
md5:66f520da93e3667e35da6b5eedd3f7b2
51.2 MB Preview Download
md5:956c267232f32a810b4f18d2abe3d6e1
7.3 MB Preview Download
md5:da648e5ce4b69101f68f4856522066ae
3.7 MB Preview Download
md5:230f63086435b1a55f525842a1e4e86c
3.7 MB Preview Download
md5:5996eb4a0a88778614b3de3805f1eefd
26.2 MB Preview Download
md5:1b340404efa2a31d8ffa26da724bcc87
3.7 MB Preview Download
md5:6e81fe9554a31e3bddc73b67a0d2247c
4.8 kB Preview Download
md5:f1b7fd296eb9db87c8b92e59c4bfaccb
2.1 MB Preview Download
md5:c4690abaae2aabf3abf3cd249c85e304
2.1 MB Preview Download
md5:a2c006c9475b012b72c6ad30674808ff
15.0 MB Preview Download
md5:345a2a4237dde3f40459504a2f3a706f
2.1 MB Preview Download

Additional details

Related works

Cites
Journal article: 10.1093/pasj/psz103 (DOI)
Preprint: 10.48550/ARXIV.2410.00271 (DOI)
Dataset: 10.5281/ZENODO.11117527 (DOI)
Journal article: 10.3847/1538-4365/ac3078 (DOI)

References

  • Aihara, H., AlSayyad, Y., Ando, M., Armstrong, R., Bosch, J., Egami, E., Furusawa, H., Furusawa, J., Goulding, A., Harikane, Y., Hikage, C., Ho, P. T. P., Hsieh, B.-C., Huang, S., Ikeda, H., Imanishi, M., Ito, K., Iwata, I., Jaelani, A. T., … Yamada, Y. (2019). Second data release of the Hyper Suprime-Cam Subaru Strategic Program. Publications of the Astronomical Society of Japan, 71(6). https://doi.org/10.1093/pasj/psz103
  • Do, T., Boscoe, B., Jones, E., Li, Y. Q., & Alfaro, K. (2024). GalaxiesML: a dataset of galaxy images, photometry, redshifts, and structural parameters for machine learning (Version 1). arXiv. https://doi.org/10.48550/ARXIV.2410.00271
  • Do, T., Jones, E., Boscoe, B., Li, Y. Q., & Alfaro, K. (2024). GalaxiesML: an imaging and photometric dataset of galaxies for machine learning (Version v6.1) [Dataset]. Zenodo. https://doi.org/10.5281/ZENODO.11117527
  • Weaver, J. R., Kauffmann, O. B., Ilbert, O., McCracken, H. J., Moneti, A., Toft, S., Brammer, G., Shuntov, M., Davidzon, I., Hsieh, B. C., Laigle, C., Anastasiou, A., Jespersen, C. K., Vinther, J., Capak, P., Casey, C. M., McPartland, C. J. R., Milvang-Jensen, B., Mobasher, B., … Zamorani, G. (2022). COSMOS2020: A Panchromatic View of the Universe to z ∼ 10 from Two Complementary Catalogs. The Astrophysical Journal Supplement Series, 258(1), 11. https://doi.org/10.3847/1538-4365/ac3078