We introduce a hybrid approach to solar flare prediction, whereby a supervised regularization method is used to realize feature importance and an unsupervised clustering method is used to realize the binary flare/no-flare decision. The approach is validated against NOAA SWPC data.
All three components of the current density are required to compute the heating rate due to free magnetic energy dissipation. Here we present a first test of a new model developed to determine if the times of increases in the resistive heating rate i
n active region (AR) photospheres are correlated with the subsequent occurrence of M and X flares in the corona. A data driven, 3 D, non-force-free magnetohydrodynamic model restricted to the near-photospheric region is used to compute time series of the complete current density and the resistive heating rate per unit volume $(Q(t))$ in each pixel in neutral line regions (NLRs) of 14 ARs. The model is driven by time series of the magnetic field ${bf B}$ measured by the Helioseismic & Magnetic Imager on the Solar Dynamics Observatory (SDO) satellite. Spurious Doppler periods due to SDO orbital motion are filtered out of the time series for ${bf B}$ in every AR pixel. For each AR, the cumulative distribution function (CDF) of the values of the NLR area integral $Q_i(t)$ of $Q(t)$ is found to be a scale invariant power law distribution essentially identical to the observed CDF for the total energy released in coronal flares. This suggests that coronal flares and the photospheric $Q_i$ are correlated, and powered by the same process. The model predicts spikes in $Q_i$ with values orders of magnitude above background values. These spikes are driven by spikes in the non-force free component of the current density. The times of these spikes are plausibly correlated with times of subsequent M or X flares a few hours to a few days later. The spikes occur on granulation scales, and may be signatures of heating in horizontal current sheets. It is also found that the times of relatively large values of the rate of change of the NLR unsigned magnetic flux are also plausibly correlated with the times of subsequent M and X flares, and spikes in $Q_i$.
Solar flare prediction plays an important role in understanding and forecasting space weather. The main goal of the Helioseismic and Magnetic Imager (HMI), one of the instruments on NASAs Solar Dynamics Observatory, is to study the origin of solar va
riability and characterize the Suns magnetic activity. HMI provides continuous full-disk observations of the solar vector magnetic field with high cadence data that lead to reliable predictive capability; yet, solar flare prediction effort utilizing these data is still limited. In this paper, we present a machine-learning-as-a-service (MLaaS) framework, called DeepSun, for predicting solar flares on the Web based on HMIs data products. Specifically, we construct training data by utilizing the physical parameters provided by the Space-weather HMI Active Region Patches (SHARP) and categorize solar flares into four classes, namely B, C, M, X, according to the X-ray flare catalogs available at the National Centers for Environmental Information (NCEI). Thus, the solar flare prediction problem at hand is essentially a multi-class (i.e., four-class) classification problem. The DeepSun system employs several machine learning algorithms to tackle this multi-class prediction problem and provides an application programming interface (API) for remote programming users. To our knowledge, DeepSun is the first MLaaS tool capable of predicting solar flares through the Internet.
In this paper, we are going to find meaning of words based on distinct situations. Word Sense Disambiguation is used to find meaning of words based on live contexts using supervised and unsupervised approaches. Unsupervised approaches use online dict
ionary for learning, and supervised approaches use manual learning sets. Hand tagged data are populated which might not be effective and sufficient for learning procedure. This limitation of information is main flaw of the supervised approach. Our proposed approach focuses to overcome the limitation using learning set which is enriched in dynamic way maintaining new data. Trivial filtering method is utilized to achieve appropriate training data. We introduce a mixed methodology having Modified Lesk approach and Bag-of-Words having enriched bags using learning methods. Our approach establishes the superiority over individual Modified Lesk and Bag-of-Words approaches based on experimentation.
We developed a flare prediction model using machine learning, which is optimized to predict the maximum class of flares occurring in the following 24 h. Machine learning is used to devise algorithms that can learn from and make decisions on a huge am
ount of data. We used solar observation data during the period 2010-2015, such as vector magnetogram, ultraviolet (UV) emission, and soft X-ray emission taken by the Solar Dynamics Observatory and the Geostationary Operational Environmental Satellite. We detected active regions from the full-disk magnetogram, from which 60 features were extracted with their time differentials, including magnetic neutral lines, the current helicity, the UV brightening, and the flare history. After standardizing the feature database, we fully shuffled and randomly separated it into two for training and testing. To investigate which algorithm is best for flare prediction, we compared three machine learning algorithms: the support vector machine (SVM), k-nearest neighbors (k-NN), and extremely randomized trees (ERT). The prediction score, the true skill statistic (TSS), was higher than 0.9 with a fully shuffled dataset, which is higher than that for human forecasts. It was found that k-NN has the highest performance among the three algorithms. The ranking of the feature importance showed that the previous flare activity is most effective, followed by the length of magnetic neutral lines, the unsigned magnetic flux, the area of UV brightening, and the time differentials of features over 24 h, all of which are strongly correlated with the flux emergence dynamics in an active region.
We developed a solar flare prediction model using a deep neural network (DNN), named Deep Flare Net (DeFN). The model can calculate the probability of flares occurring in the following 24 h in each active region, which is used to determine the most l
ikely maximum classes of flares via a binary classification (e.g., >=M class versus <M class or >=C class versus <C class). From 3x10^5 observation images taken during 2010-2015 by Solar Dynamic Observatory, we automatically detected sunspots and calculated 79 features for each region, to which flare occurrence labels of X-, M-, and C-class were attached. We adopted the features used in Nishizuka et al. (2017) and added some features for operational prediction: coronal hot brightening at 131 A (T>=10^7 K) and the X-ray and 131 A intensity data 1 and 2 h before an image. For operational evaluation, we divided the database into two for training and testing: the dataset in 2010-2014 for training and the one in 2015 for testing. The DeFN model consists of deep multilayer neural networks, formed by adapting skip connections and batch normalizations. To statistically predict flares, the DeFN model was trained to optimize the skill score, i.e., the true skill statistic (TSS). As a result, we succeeded in predicting flares with TSS=0.80 for >=M-class flares and TSS=0.63 for >=C-class flares. Note that in usual DNN models, the prediction process is a black box. However, in the DeFN model, the features are manually selected, and it is possible to analyze which features are effective for prediction after evaluation.