Published July 1, 2024 | Version v1
Video/Audio Open

STFC Astronomy and Artificial Intelligence Case Studies

  • 1. ROR icon The Open University
  • 2. ROR icon Canterbury Christ Church University
  • 3. ROR icon Queen's University Belfast
  • 4. ROR icon University of Edinburgh
  • 5. ROR icon University of Toronto
  • 6. ROR icon University College London

Description

Context

As part of the Epistemic Insight Initiative of the LASAR (Learning about Science and Religion) research and outreach centre, an STFC Astronomy and Artificial Intelligence summer school with public engagement was held in July 2024. As part of this, an additional free Astronomy and AI Online Event day was held the week before on Wednesday 3rd July 2024 to provide panel discussions and example use cases of AI in astronomy, organised by Prof Berry Billingsley and Dr James Pearson. This event hosted two talks by Prof Berry Billingsley and Dr Marc Sarzi, as well as panels of astronomy PhD students and researchers with experience in using AI and deep learning, to discuss the broader questions about AI. The panellists for the event were Ingo Waldmann (University College London), Xinyue Sheng (Queen’s University Belfast), Benjamin Joachimi (University College London), Josh Wilde (The Open University), Weiguang Cui (University of Edinburgh), and Kevin Walsh (Westminster School). The event aimed to reach a wider diversity of students than the summer school: while STFC-funded and self-funded students took priority, any astronomy PhD students could apply, as well as final year undergraduate & master's students in physics and computer science.

Alongside this event, we release here a number of recorded case studies showcasing how researchers in astronomy are using AI and deep learning in their own work. These case studies cover a variety of research topics and types of AI, and were kindly provided by a number of researchers and PhD students - more details are given below. As well as their individual topics, many provide tips and explanations for AI, machine learning and deep learning techniques, so we encourage you to watch them all!

In addition to the individual case studies, recordings of the online event itself have been made available at the following page, as well as a short hightlight video covering some of the event and case studies: https://doi.org/10.5281/zenodo.12674686

Production Coordinator: Dr James Pearson (The Open University)
Video Editor: Dr James Pearson (The Open University)
Principal Investigator: Prof Berry Billingsley (Epistemic Insight Initiative)

Case Studies

Ruby Pearce-Casey (The Open University) - Using cGANs for Anomaly Detection: Hunting for Gravitational Lensing Systems in Euclid

Gravitational lensing is a powerful tool that directly probes all clustering components in the universe through their gravitational effect on light from distant background sources. The problem arises in finding gravitational lenses, and, with the accelerated growth in data volume and complexity in astronomy, machine-learning-aided lens searches have proven successful. We present a proof of concept for an alternative method of strong gravitational lens finding using a conditional Generative Adversarial Network (cGAN). We use Early Release Observation (ERO) images of the Perseus Cluster from Euclid, covering 0.57 sq. degrees on the sky, and the network is based on the pix2pix architecture with an adapted U-Net generator. We train our model to predict Euclid’s NISP-H band flux (1.54-2.00µm) from a combination of the filters NISP-J, NISP-Y and VIS band (0.55-1.54µm) in 40,000 cut-outs from the Perseus Cluster which are 20×20 arcseconds in size. We test the cGAN on 5,000 cut-outs from the Perseus cluster, 10% of which contain a simulated strong gravitational lens painted into the cut-out based on a Singular Isothermal Ellipsoid model. Candidate gravitational lenses and cut-outs with a gravitational lens painted in were deliberated excluded from the model’s training data set such that gravitational lensing systems remain unknown to the network. We find that the cGAN can accurately predict the NISP-H band flux of the cut-outs from the Perseus cluster. However, the model fails to predict the NISP-H band flux of the cut-outs containing the simulated gravitational lenses, with a larger difference between the model’s prediction and ground truth for lenses with extended arcs and Einstein rings, suggesting that the cGAN can be used as an anomaly detector for an alternative method of lens finding.

Xinyue Sheng (Queen’s University Belfast) - NEural Engine for Discovering Luminous Events (NEEDLE): identifying rare transient candidates in real time from host galaxy images

The work is to develop a machine learning classifier (NEEDLE) for identifying superluminous-supernovae and tidal disruption events at their early stage, using a few detections and the context information. It is currently applied on Lasair broker digesting ZTF alerts, and in the future, it will deal with LSST alerts.

Weiguang Cui (University of Edinburgh) - How AI helps with constraining galaxy cluster mass

Machine learning has been widely used in different astronomy researches recently. Some of its models, especially the image to property feature, make it the perfect tool for solving some particular problems in astronomy, such as estimating the cluster mass from observation images. In this showcase, I present how the different ML models are adopted to estimate the cluster mass in different dimensions — from a signal cluster mass M_500 (0d data point, see de Andres et al. 2022), to cluster mass within different radii (1D density profile, Ferragamo et al. 2023) and projected cluster mass map (2D image, de Andres et al. 2024). With this powerful tool, we are able to probe the mystery Universe, deeply, completely and swiftly!

Mike Walmsley (University of Toronto) - Computer vision for galaxy images

Modern telescopes take far more images than humans could ever look through, and so we rely on automated methods to measure what galaxies look like. AI is an excellent tool for solving this visual recognition problem. I’ll talk about how AI models have changed over the last few years to learn efficiently and quickly adapt to new measurement tasks.

Josh Wilde (The Open University) - Using machine learning to find gravitational lenses & how it can go wrong

I will discuss the uses of CNNs to identify gravitational lenses in simulated and real data, discussing common flaws that machine learning models suffer from and what can cause these issues. I will discuss the use of interpretability methods to identify what features within the image the machine learning techniques associate with gravitational lensing. I will explain how the need of interpretability in machine learning has led me to use U-nets to help locate gravitational lenses in images.

Benjamin Joachimi (University College London) - Forward-modelling the Universe with neural density estimation

I provide an overview of recent work constraining cosmological parameters from two of the largest galaxy imaging surveys to date, the Dark Energy Survey and the Kilo-Degree Survey. Both analyses use a novel approach to inference: machine-learning the likelihood using neural density estimators and large suites of large-scale structure simulations. I’ll include brief introductions to weak gravitational lensing, on which the analyses are based, and to the concepts of simulation-based/likelihood-free inference.

Ingo Waldmann (University College London) - Machine learning in exoplanet characterisation

The upcoming decade promises remarkable progress in our grasp of exoplanet formation, evolution, and an in-depth assessment of their climates and possible habitability. With the James Webb Space Telescope now in orbit, we’re at the threshold of high-precision atmospheric studies of these distant worlds. This milestone, combined with forthcoming endeavours like the ELTs and the Ariel space mission, positions us in an era rich in high-precision exoplanetary data. This vast influx of precise data holds immense discovery prospects but also introduces challenges in handling both model intricacy and data volume. As machine and deep learning techniques gain traction across various scientific and industrial domains, they also find their footing in exoplanetary and solar system research. Such techniques offer a refreshing edge in modelling intricate, non-linear data over conventional methods. In my talk’s initial segment, I’ll explore the use of simulation-based inference, deep surrogate models, and Explainable AI in refining time-series data and leveraging inverse modelling to extract exoplanet attributes from observations.

In the second part, I’ll delve into our machine-learning-centric data challenges. It’s noteworthy that many challenges in exoplanet data analysis resonate with issues keenly explored in the machine learning realm. Yet, interdisciplinary collaboration has often been stymied by language barriers and domain unfamiliarity. Addressing this, as part of the ESA Ariel Space mission, we have organised four machine learning challenges, hosted at prestigious AI conferences like NeurIPS and ECML, attracting hundreds of participating teams annually. I will discuss how such challenges can catalyse rapid, sturdy AI advancements in this budding domain and pave the way for more resilient AI solutions in the future.

Files

ruby_pearce-casey_cGANs_anomaly_detection_euclid_lenses.mp4