Published May 18, 2023 | Version v1
Dataset Open

Active region magnetograms for solar flare prediction: Reduced resolution dataset

  • 1. New Mexico State University

Description

In this dataset, we provide a comprehensive collection of magnetograms from the National Aeronautics and Space Administration's (NASA's) Solar Dynamics Observatory (SDO).  The dataset incorporates data from three sources and provides SDO Helioseismic and Magnetic Imager (HMI) magnetograms of solar active regions as well as labels of corresponding flaring activity.  This dataset will be useful for image analysis or solar physics research related to magnetic structure, its evolution over time, and its relation to solar flares.  The dataset will be of interest to those researchers investigating automated solar flare prediction methods, including supervised and unsupervised machine learning (classical and deep), binary and multi-class classification, and regression.  This dataset is a minimally processed, user configurable dataset of consistently sized images of solar active regions that can serve as a benchmark dataset for solar flare prediction research.  This dataset consists of reduced resolution images (see usage notes below).

Notes

Image data are provided in supplementary files available on zenodo (see link under Related works) as .png files which can be opened with any common image manipulation software.  All other files included here are text files that can be opened with any standard text manipulation software.  We do note, however, that many text files are very large (~1M lines), and may take a while to load. 

This is one of three datasets related to the same study:

Reduced resolution dataset (this dataset): Reduced resolution images (950,047 images, each of which is 224x224 pixels and 8-bit depth resolution), https://doi.org/10.5061/dryad.jq2bvq898.

Full resolution dataset: Full resolution images (950,047 images, each of which is 600x600 pixels and 17-bit depth resolution), https://doi.org/10.5061/dryad.dv41ns23n.

Extra images dataset: Images that were excluded from the main analyses in the first and second datasets (421,957 images that were excluded for latitude, longitude, and/or NaN pixels), https://doi.org/10.5061/dryad.qjq2bvqmj.  Researchers wishing to work with the entire dataset (all 1,357,004 images) must combine the files from the full resolution preconfigured dataset (https://doi.org/10.5061/dryad.dv41ns23n) and the extra images dataset (https://doi.org/10.5061/dryad.qjq2bvqmj) by moving/copying the subdirectories to a common base directory, e.g., active_regions/.

Files

C1.0_24hr_224_png_Labels.txt

Files (584.5 MB)

Name Size Download all
md5:c07787bd9def72d0519a49c02cb86f7c
57.6 MB Preview Download
md5:95963729b452c90e3d0c3c1ec8822ba4
465.1 MB Preview Download
md5:6f73d64b63429ee26876a156a734fa12
785 Bytes Preview Download
md5:0b8b0b42eba839007bd40f163fc94455
6.3 kB Preview Download
md5:fa192a397d0b582d372270af9db26393
785 Bytes Preview Download
md5:ec8fa56a306a2e70862332dd94efb849
10.4 kB Preview Download
md5:d6f0abaca77628def3462366e0937639
6.2 MB Preview Download
md5:58442b71ebc3741a19cba18c75061017
49.4 MB Preview Download
md5:2f4ec494748861ed110edf23b8bdf636
6.2 MB Preview Download

Additional details

Related works