Published August 1, 2024 | Version v1
Model Open

The machine-learning flare identification models for the article "Scalable, Advanced Machine Learning-based Approaches for Stellar Flare Identification: Application to TESS short-cadence Data and Analysis of a New Flare Catalogue"

  • 1. ROR icon National Central University
  • 2. ROR icon University of Arizona
  • 1. National Central University
  • 2. ROR icon University of Arizona
  • 3. ROR icon National Solar Observatory

Description

This repository contains the machine learning methods for our multi-algorithm approach of flare identification in light curve data.

Models

We trained three models using three different algorithms:

  1. Deep Neural Network (DNN)
  2. Random Forest (RF)
  3. XGBoost

These models are designed to identify flares in TESS short-cadence light curve data but can theoretically be applied to light curve data observed at different cadences. The models are user-friendly and can run on standard machines. You can find them in the "ML_models" directory of this repository:

DNNClassifier_flare_classification-by-Lin.keras

RandomForestClassifier_flare_classification-by-Lin.pkl

XGBoostClassifier_flare_classification-by-Lin.json

 

Tutorial

A comprehensive tutorial on how to effectively use these models is provided in "Tutorial.ipynb". This tutorial will guide you step-by-step on:

  • Collecting flare candidates: How to gather flare candidates from the TESS light curve data.
  • Feature extraction: How to determine the features of these flare candidates.
  • Identifying true flares: How to use our machine learning models to identify "True Flares" among these candidates.

Installation and Dependencies

To run the models and tutorial, ensure you have all standard/wide-used Python packages (i.e., numpy, scipy, matplotlib, etc.) and the following dependencies installed:

  • Python 3.x
  • TensorFlow or PyTorch (for DNN)
  • Scikit-learn (for Random Forest)
  • XGBoost

You can install these dependencies using the following command:

pip install tensorflow scikit-learn xgboost

Usage

To learn how to use the models, follow these steps:

  1. Clone the repository:
    git clone https://github.com/CLL-Lin/MLsFlares.git
    cd MLsFlares
  2. Follow the instructions in "Tutorial.ipynb" to start.

Citation

If you find our models useful, please cite the article "Scalable, Advanced Machine Learning-based Approaches for Stellar Flare Identification: Application to TESS short-cadence Data and Analysis of a New Flare Catalogue".

Note: This article is still under review. We will update the DOI and the announcement when the paper is published in Atronomical Journal.

Files

Tutorial.ipynb

Files (25.6 MB)

Name Size Download all
md5:90c4f0110114ec37ff7c12fa5ac395e7
2.4 MB Download
md5:51312ff1fe498c5f240913bd22231e72
19.9 MB Download
md5:d2d9237f26cb0f54e8f4202007b27d58
1.5 MB Preview Download
md5:7e3c367d8fed11be2593904adafd2e11
1.8 MB Preview Download

Additional details

Funding

National Science and Technology Council
Overseas Project for Graduate Research 112-2917-I-008-001
National Aeronautics and Space Administration
Alien Earths 80NSSC21K0593

Dates

Submitted
2024-08-02

Software

Repository URL
https://github.com/CLL-Lin/MLsFlares.git
Programming language
Python, Jupyter Notebook
Development Status
Active