SuryaBench is a standardized test benchmark for evaluating the performance of foundational AI models in Heliophysics and Space Weather Science across various tasks.
The complete dataset along with input files and application-specific instances is available through the Hugging Face platform at nasa-impact/SuryaBench.
SuryaBench contains application datasets related to six key tasks that are structured as prediction, segmentation, or reconstruction. These tasks are related to various aspects of solar and heliospheric events ecosystem and are listed below:
- Active Region Segmentation
- Active Region Emergence Prediction
- Coronal Field Extrapolation
- Solar Flare Forecasting
- Solar Wind Forecasting
- Solar EUV Spectra Modeling
This collection includes segmentation maps of active regions with magnetic polarity inversion lines (also referred to as ARPILs). The ARPIL data generation is applied on line of sight component of the solar magnetic field.
The code is designed to process magnetogram data efficiently in C++. Various libraries including OpenCV, CUDA for efficient image processing, and CFITS10 and HDF5 for data I/O. Refer to the application's README for more details.
Building early forecasts of the manifestation of solar activity, specifically active regions is crucial to mitigate the impact of space weather disturbances. For this application, 50 active region emergence instances that appear on the solar surface within 30 deg of central meridian are selected. The selected active region instances are persisted for more than four days and include helioseismic and magnetic information (magnetic flux, Doppler velocity, and continuum intensity). Refer to the application's README for more details.
The solar corona is a highly magnetic environment, characterized by complex and time-varying magnetic fields. A common task for scientists studying the Sun is to model the 3D magnetic field. This application aims to evaluate the effectiveness of AI foundation models to emulate the physics-based ADAPT-WSA PFSS model. The parameters to predict are spherical harmonic coefficients which represent the magnetic potential for a domain between the photosphere and te source surface (set to 2.51 Rs). Refer to the application's README for more details.
This application focuses on a key space weather prediction task -- solar flare forecasting. The provided script generates labeled solar flare time-series data using a rolling window approach. The output includes binary labels based on both the maximum flare class and cumulative flare intensity within a given time window. Refer to the application's README for more details.
The solar wind is a stream of particles that emanates from the Sun. The interaction of solar wind with Earth's magnetic field results in the formation of near-Earth space weather. Solar wind interactions are well-known to drive geomagnetic storms, wherein the Earth's magnetic field is perturbed, causing induction of currents into systems including satellites, power grids, oil pipelines, leading to potential socio-economic impacts. This application contains generation code for solar wind forecasting labels, specifically scripts for downloading the solar wind data from OMNI and provide splits. Refer to the application's README for more details.
The solar Extreme Ultraviolet (EUV) spectral irradiance is one of the most influential radiative component emitted from the chromosphere and corona, with wavelengths ranging from approximately 5 to 120 nm. These emissions play an important role in controlling the thermodynamic state of the Earth's upper atmosphere, governing processes such as photoionization, photodissociation, and heating in the thermosphere and ionosphere. Fluctuations in EUV irradiance, whether driven by the 11-year solar cycle, 27-day solar rotation, or impulsive flare events, introduce significant variability in space weather, influencing satellite drag, GPS signal accuracy, and communications infrastructure. This application contains generation code for high-energy EUV spectra from SDO/EVE (MEGS-A) with a 10s cadence. Refer to the application's README for more details.
BibTeX:
@misc{suryabench,
title={SuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction},
author={ Sujit Roy and
Dinesha Vasanta Hegde and
Johannes Schmude and
Amy Lin and
Vishal Gaur and
Talwinder Singh and
Rohit Lal and
Andrés Muñoz-Jaramillo and
Kang Yang and
Chetraj Pandey and
Jinsu Hong and
Berkay Aydin and
Ryan McGranaghan and
Spiridon Kasapis and
Vishal Upendran and
Shah Bahauddin and
Daniel da Silva and
Marcus Freitag and
Nikolai Pogorelov and
Manil Maskey and
Rahul Ramachandran},
year={2025}
}
- Sujit Roy*
- Dinesha Vasanta Hegde*
- Johannes Schmude
- Amy Lin
- Vishal Gaur
- Talwinder Singh
- Rohit Lal
- Andrés Muñoz-Jaramillo
- Kang Yang
- Chetraj Pandey
- Jinsu Hong
- Berkay Aydin
- Ryan McGranaghan
- Spiridon Kasapis
- Vishal Upendran
- Shah Bahauddin
- Daniel da Silva
- Marcus Freitag
- Nikolai Pogorelov
- Manil Maskey
- Rahul Ramachandran
For each task, please contact: