Published November 29, 2023 | Version v1
Output management plan Open

Selection of powerful radio galaxies with machine learning

  • 1. ROR icon Institute of Astrophysics and Space Sciences
  • 2. ROR icon University of Lisbon
  • 3. ROR icon Western Sydney University
  • 4. ROR icon Commonwealth Scientific and Industrial Research Organisation
  • 5. ROR icon European Southern Observatory
  • 6. ROR icon University of Porto
  • 7. ROR icon Atacama Large Millimeter Submillimeter Array
  • 8. ROR icon Instituto de Radioastronomía Milimétrica
  • 9. ROR icon Closer Consultoria (Portugal)

Description

Supplementary material to the article "Selection of powerful radio galaxies with machine learning" from Carvajal et al., 2023. Preprint version can be obtained from https://arxiv.org/abs/2309.11652

Included files are:

  • predicted_rAGN_HETDEX.parquet: Dataset from HETDEX Spring field. It includes initial properties from sources as well as predicted values. Description of columns in Appendix G of Carvajal et al., 2023.
  • predicted_rAGN_S82.parquet: Dataset from Stripe 82 field. It includes initial properties from sources as well as predicted values. Description of columns in Appendix G of Carvajal et al., 2023.
  • classification_AGN_galaxy.pkl: Model for classification between AGN and galaxies. It takes as input features described in article. It delivers uncalibrated scores.
  • classification_radio_detection.pkl: Model for classification between radio detectable and radio non-detectable AGN. It takes as input features described in article. It delivers uncalibrated scores.
  • regression_redshift.pkl: Model for prediction of redshift for radio-detectable AGN. It takes as input features described in article.
  • cal_classification_AGN_galaxy.joblib: Calibrated model for classification between AGN and galaxies. It takes as input scores from uncalibrated model. It delivers calibrated probabilities.
  • cal_classification_radio_detection.joblib: Calibrated model for classification between radio detectable and radio non-detectable AGN. It takes as input scores from uncalibrated model. It delivers calibrated probabilities.
  • datasets_description.txt: Description of columns in parquet files.

Files with parquet extension were generated with python using the pandas package (v.1.4.2) and the engine fastparquet.

Files with pkl extension were generated with python using pycaret (v.2.3.10).

Files with joblib extension were generated with python using scikit-learn (v.0.23.2).

 

An example on how to use these files can be found in https://github.com/racarvajal/ML_prediction_pipeline_run

 

Files

datasets_description.txt

Files (1.7 GB)

Name Size Download all
md5:abc9ec16f1e370ed684273dfa3769262
1.0 kB Download
md5:cc85b8a4f51c9d789bcf1b139fec14e5
1.0 kB Download
md5:53be399fd4581c97fd27aba7e6387c9b
178.6 MB Download
md5:c6cb0330aec8c70d6298a4fd77946a16
182.5 MB Download
md5:cc8493df2796894534e9a1dfd002df44
2.9 kB Preview Download
md5:0715310d28baaf513d85fb9c9575fa69
963.1 MB Download
md5:c229bae3e425c9d8cbf918096667da25
247.1 MB Download
md5:95faa0df282eadeab764ece0291acbd4
87.1 MB Download

Additional details

Related works

Is supplement to
Journal article: 10.1051/0004-6361/202245770 (DOI)

Funding

The first Radio Galaxies in the Universe PD/BD/150455/2019
Fundação para a Ciência e Tecnologia
IdEaS with ALMA - Identifying the Earliest Supermassive Black Holes with ALMA PTDC/FIS-AST/29245/2017
Fundação para a Ciência e Tecnologia
Encontrando emissores de Lyman-alpha através de aprendizagem máquina EXPL/FIS-AST/1085/2021
Fundação para a Ciência e Tecnologia
IA 2019 - Financiamento a Unidades de I&D 2019: Instituto de Astrofísica e Ciências do Espaço UID/FIS/04434/2019
Fundação para a Ciência e Tecnologia
IA base 2020+ - Financiamento Plurianual de Unidade de I&D 2020-2023 - Financiamento Base UIDB/04434/2020
Fundação para a Ciência e Tecnologia
IA base 2020+ - Financiamento Plurianual de Unidade de I&D 2020-2023 - Financiamento Base UIDP/04434/2020
Fundação para a Ciência e Tecnologia