We present a long short-term memory (LSTM) network for predicting whether an active region (AR) would produce a -class flare within the next 24 hr. We consider three classes, namely ≥M5.0 class, ≥M ...class, and ≥C class, and build three LSTM models separately, each corresponding to a class. Each LSTM model is used to make predictions of its corresponding -class flares. The essence of our approach is to model data samples in an AR as time series and use LSTMs to capture temporal information of the data samples. Each data sample has 40 features including 25 magnetic parameters obtained from the Space-weather HMI Active Region Patches and related data products as well as 15 flare history parameters. We survey the flare events that occurred from 2010 May to 2018 May, using the Geostationary Operational Environmental Satellite X-ray flare catalogs provided by the National Centers for Environmental Information (NCEI), and select flares with identified ARs in the NCEI flare catalogs. These flare events are used to build the labels (positive versus negative) of the data samples. Experimental results show that (i) using only 14-22 most important features including both flare history and magnetic parameters can achieve better performance than using all 40 features together; (ii) our LSTM network outperforms related machine-learning methods in predicting the labels of the data samples. To our knowledge, this is the first time that LSTMs have been used for solar-flare prediction.
Adverse space-weather effects can often be traced to solar flares, the prediction of which has drawn significant research interests. The Helioseismic and Magnetic Imager (HMI) produces full-disk ...vector magnetograms with continuous high cadence, while flare prediction efforts utilizing this unprecedented data source are still limited. Here we report results of flare prediction using physical parameters provided by the Space-weather HMI Active Region Patches (SHARP) and related data products. We survey X-ray flares that occurred from 2010 May to 2016 December and categorize their source regions into four classes (B, C, M, and X) according to the maximum GOES magnitude of flares they generated. We then retrieve SHARP-related parameters for each selected region at the beginning of its flare date to build a database. Finally, we train a machine-learning algorithm, called random forest (RF), to predict the occurrence of a certain class of flares in a given active region within 24 hr, evaluate the classifier performance using the 10-fold cross-validation scheme, and characterize the results using standard performance metrics. Compared to previous works, our experiments indicate that using the HMI parameters and RF is a valid method for flare forecasting with fairly reasonable prediction performance. To our knowledge, this is the first time that RF has been used to make multiclass predictions of solar flares. We also find that the total unsigned quantities of vertical current, current helicity, and flux near the polarity inversion line are among the most important parameters for classifying flaring regions into different classes.
Abstract
Solar flares are explosions on the Sun. They happen when energy stored in magnetic fields around solar active regions (ARs) is suddenly released. Solar flares and accompanied coronal mass ...ejections are sources of space weather, which negatively affects a variety of technologies at or near Earth, ranging from blocking high-frequency radio waves used for radio communication to degrading power grid operations. Monitoring and providing early and accurate prediction of solar flares is therefore crucial for preparedness and disaster risk management. In this article, we present a transformer-based framework, named SolarFlareNet, for predicting whether an AR would produce a
$$\gamma$$
γ
-class flare within the next 24 to 72 h. We consider three
$$\gamma$$
γ
classes, namely the
$$\ge$$
≥
M5.0 class, the
$$\ge$$
≥
M class and the
$$\ge$$
≥
C class, and build three transformers separately, each corresponding to a
$$\gamma$$
γ
class. Each transformer is used to make predictions of its corresponding
$$\gamma$$
γ
-class flares. The crux of our approach is to model data samples in an AR as time series and to use transformers to capture the temporal dynamics of the data samples. Each data sample consists of magnetic parameters taken from Space-weather HMI Active Region Patches (SHARP) and related data products. We survey flare events that occurred from May 2010 to December 2022 using the Geostationary Operational Environmental Satellite X-ray flare catalogs provided by the National Centers for Environmental Information (NCEI), and build a database of flares with identified ARs in the NCEI flare catalogs. This flare database is used to construct labels of the data samples suitable for machine learning. We further extend the deterministic approach to a calibration-based probabilistic forecasting method. The SolarFlareNet system is fully operational and is capable of making near real-time predictions of solar flares on the Web.
Abstract
Solar energetic particles (SEPs) are an essential source of space radiation, and are hazardous for humans in space, spacecraft, and technology in general. In this paper, we propose a ...deep-learning method, specifically a bidirectional long short-term memory (biLSTM) network, to predict if an active region (AR) would produce an SEP event given that (i) the AR will produce an M- or X-class flare and a coronal mass ejection (CME) associated with the flare, or (ii) the AR will produce an M- or X-class flare regardless of whether or not the flare is associated with a CME. The data samples used in this study are collected from the Geostationary Operational Environmental Satellite's X-ray flare catalogs provided by the National Centers for Environmental Information. We select M- and X-class flares with identified ARs in the catalogs for the period between 2010 and 2021, and find the associations of flares, CMEs, and SEPs in the Space Weather Database of Notifications, Knowledge, Information during the same period. Each data sample contains physical parameters collected from the Helioseismic and Magnetic Imager on board the Solar Dynamics Observatory. Experimental results based on different performance metrics demonstrate that the proposed biLSTM network is better than related machine-learning algorithms for the two SEP prediction tasks studied here. We also discuss extensions of our approach for probabilistic forecasting and calibration with empirical evaluation.
We propose a new machine-learning approach to Stokes inversion based on a convolutional neural network (CNN) and the Milne-Eddington (ME) method. The Stokes measurements used in this study were taken ...by the Near InfraRed Imaging Spectropolarimeter (NIRIS) on the 1.6 m Goode Solar Telescope (GST) at the Big Bear Solar Observatory. By learning the latent patterns in the training data prepared by the physics-based ME tool, the proposed CNN method is able to infer vector magnetic fields from the Stokes profiles of GST/NIRIS. Experimental results show that our CNN method produces smoother and cleaner magnetic maps than the widely used ME method. Furthermore, the CNN method is four to six times faster than the ME method and able to produce vector magnetic fields in nearly real time, which is essential to space weather forecasting. Specifically, it takes ∼50 s for the CNN method to process an image of 720 × 720 pixels comprising Stokes profiles of GST/NIRIS. Finally, the CNN-inferred results are highly correlated to the ME-calculated results and closer to the ME's results with the Pearson product-moment correlation coefficient (PPMCC) being closer to 1, on average, than those from other machine-learning algorithms, such as multiple support vector regression and multilayer perceptrons (MLP). In particular, the CNN method outperforms the current best machine-learning method (MLP) by 2.6%, on average, in PPMCC according to our experimental study. Thus, the proposed physics-assisted deep learning-based CNN tool can be considered as an alternative, efficient method for Stokes inversion for high-resolution polarimetric observations obtained by GST/NIRIS.
Abstract
Interplanetary magnetic flux ropes (MFRs) are commonly observed structures in the solar wind, categorized as magnetic clouds (MCs) and small-scale MFRs (SMFRs) depending on whether they are ...associated with coronal mass ejections. We apply machine learning to systematically compare SMFRs, MCs, and ambient solar wind plasma properties. We construct a data set of 3-minute averaged sequential data points of the solar wind’s instantaneous bulk fluid plasma properties using about 20 years of measurements from Wind. We label samples by the presence and type of MFRs containing them using a catalog based on Grad–Shafranov (GS) automated detection for SMFRs and NASA's catalog for MCs (with samples in neither labeled non-MFRs). We apply the random forest machine learning algorithm to find which categories can be more easily distinguished and by what features. MCs were distinguished from non-MFRs with an area under the receiver-operator curve (AUC) of 94% and SMFRs with an AUC of 89%, and had distinctive plasma properties. In contrast, while SMFRs were distinguished from non-MFRs with an AUC of 86%, this appears to rely solely on the 〈
B
〉 > 5 nT threshold applied by the GS catalog. The results indicate that SMFRs have virtually the same plasma properties as the ambient solar wind, unlike the distinct plasma regimes of MCs. We interpret our findings as additional evidence that most SMFRs at 1 au are generated within the solar wind. We also suggest that they should be considered a salient feature of the solar wind’s magnetic structure rather than transient events.
Transfer learning (TL) algorithms aim to improve the prediction performance in a target task (e.g. the prediction of cisplatin sensitivity in triple-negative breast cancer patients) via transferring ...knowledge from auxiliary data of a related task (e.g. the prediction of docetaxel sensitivity in breast cancer patients), where the distribution and even the feature space of the data pertaining to the tasks can be different. In real-world applications, we sometimes have a limited training set in a target task while we have auxiliary data from a related task. To obtain a better prediction performance in the target task, supervised learning requires a sufficiently large training set in the target task to perform well in predicting future test examples of the target task. In this paper, we propose a TL approach for cancer drug sensitivity prediction, where our approach combines three techniques. First, we shift the representation of a subset of examples from auxiliary data of a related task to a representation closer to a target training set of a target task. Second, we align the shifted representation of the selected examples of the auxiliary data to the target training set to obtain examples with representation aligned to the target training set. Third, we train machine learning algorithms using both the target training set and the aligned examples. We evaluate the performance of our approach against baseline approaches using the Area Under the receiver operating characteristic (ROC) Curve (AUC) on real clinical trial datasets pertaining to multiple myeloma, nonsmall cell lung cancer, triple-negative breast cancer, and breast cancer. Experimental results show that our approach is better than the baseline approaches in terms of performance and statistical significance.
Abstract Small-scale interplanetary magnetic flux ropes (SMFRs) are similar to ICMEs in magnetic structure, but are smaller and do not exhibit coronal mass ejection plasma signatures. We present a ...computationally efficient and GPU-powered version of the single-spacecraft automated SMFR detection algorithm based on the Grad–Shafranov (GS) technique. Our algorithm can process higher resolution data, eliminates selection bias caused by a fixed 〈 B 〉 threshold, has improved detection criteria demonstrated to have better results on an MHD simulation, and recovers full 2.5D cross sections using GS reconstruction. We used it to detect 512,152 SMFRs from 27 yr (1996–2022) of 3 s cadence Wind measurements. Our novel findings are the following: (1) the SMFR filling factor (∼ 35%) is independent of solar activity, distance to the heliospheric current sheet, and solar wind plasma type, although the minority of SMFRs with diameters greater than ∼0.01 au have a strong solar activity dependence; (2) SMFR diameters follow a log-normal distribution that peaks below the resolved range (≳10 4 km), although the filling factor is dominated by SMFRs between 10 5 and 10 6 km; (3) most SMFRs at 1 au have strong field-aligned flows like those from Parker Solar Probe measurements; (4) the radial density (generally ∼1 detected per 10 6 km) and axial magnetic flux density of SMFRs are higher in faster solar wind types, suggesting that they are more compressed. Implications for the origin of SMFRs and switchbacks are briefly discussed. The new algorithm and SMFR dataset are made freely available.
Abstract
We present a new deep-learning method, named FibrilNet, for tracing chromospheric fibrils in H
α
images of solar observations. Our method consists of a data preprocessing component that ...prepares training data from a threshold-based tool, a deep-learning model implemented as a Bayesian convolutional neural network for probabilistic image segmentation with uncertainty quantification to predict fibrils, and a post-processing component containing a fibril-fitting algorithm to determine fibril orientations. The FibrilNet tool is applied to high-resolution H
α
images from an active region (AR 12665) collected by the 1.6 m Goode Solar Telescope (GST) equipped with high-order adaptive optics at the Big Bear Solar Observatory (BBSO). We quantitatively assess the FibrilNet tool, comparing its image segmentation algorithm and fibril-fitting algorithm with those employed by the threshold-based tool. Our experimental results and major findings are summarized as follows. First, the image segmentation results (i.e., the detected fibrils) of the two tools are quite similar, demonstrating the good learning capability of FibrilNet. Second, FibrilNet finds more accurate and smoother fibril orientation angles than the threshold-based tool. Third, FibrilNet is faster than the threshold-based tool and the uncertainty maps produced by FibrilNet not only provide a quantitative way to measure the confidence on each detected fibril, but also help identify fibril structures that are not detected by the threshold-based tool but are inferred through machine learning. Finally, we apply FibrilNet to full-disk H
α
images from other solar observatories and additional high-resolution H
α
images collected by BBSO/GST, demonstrating the tool’s usability in diverse data sets.
Predicting the response, or sensitivity, of a clinical drug to a specific cancer type is an important research problem. By predicting the clinical drug response correctly, clinicians are able to ...understand patient-to-patient differences in drug sensitivity outcomes, which in turn results in lesser time spent and lower cost associated with identifying effective drug candidates. Although technological advances in high-throughput drug screening in cells led to the generation of a substantial amount of relevant data, the analysis of such data would be a challenging task. There is a critical need for advanced machine learning (ML) algorithms to generate accurate predictions of clinical drug response. A major goal of this work is to provide advanced ML tools to data analysts, who would in turn build prediction calculators to be incorporated into intelligent clinical decision support systems. Such innovative tools could be used to enhance patient-care, among other uses. To achieve this goal, we develop new ML techniques, including a transfer learning approach coupled with or without a boosting technique. Experimental results on real clinical data pertaining to breast cancer, multiple myeloma, and triple-negative cancer patients demonstrate the effectiveness and superiority of the proposed approaches compared to baseline approaches, including existing transfer learning methods.
•We propose two transfer learning approaches for predicting clinical drug response.•We compare the proposed approaches against baseline approaches including existing transfer learning methods.•We present experimental results for several prediction tasks of clinical drug response.•Our results demonstrate the effectiveness and superiority of the proposed approaches over the baselines.