Finding rare objects and building pure samples: Probabilistic quasar classification from low resolution Gaia spectra

164 0 0.0 ( 0 )

تحميل البحث استخدام كمرجع

نشر من قبل Coryn Bailer-Jones

تاريخ النشر 2008

مجال البحث فيزياء

والبحث باللغة English

تأليف C.A.L. Bailer-Jones

تحليل البيانات والإحصاءات والاحتمال التعلم الالي

قم بزيارة صفحتنا على فيسبوك

‎Shamra Academia - شمرا أكاديميا‎

اسأل ChatGPT حول البحث

الملخص بالعربية الملخص بالإنكليزية

We develop and demonstrate a probabilistic method for classifying rare objects in surveys with the particular goal of building very pure samples. It works by modifying the output probabilities from a classifier so as to accommodate our expectation (priors) concerning the relative frequencies of different classes of objects. We demonstrate our method using the Discrete Source Classifier, a supervised classifier currently based on Support Vector Machines, which we are developing in preparation for the Gaia data analysis. DSC classifies objects using their very low resolution optical spectra. We look in detail at the problem of quasar classification, because identification of a pure quasar sample is necessary to define the Gaia astrometric reference frame. By varying a posterior probability threshold in DSC we can trade off sample completeness and contamination. We show, using our simulated data, that it is possible to achieve a pure sample of quasars (upper limit on contamination of 1 in 40,000) with a completeness of 65% at magnitudes of G=18.5, and 50% at G=20.0, even when quasars have a frequency of only 1 in every 2000 objects. The star sample completeness is simultaneously 99% with a contamination of 0.7%. Including parallax and proper motion in the classifier barely changes the results. We further show that not accounting for class priors in the target population leads to serious misclassifications and poor predictions for sample completeness and contamination. (Truncated)

قيم البحث

81 - J.M. Carrasco , M. Weiler , C. Jordi 2021

The full third Gaia data release will provide the calibrated spectra obtained with the blue and red Gaia slit-less spectrophotometers. The main challenge when facing Gaia spectral calibration is that no lamp spectra or flat fields are available durin g the mission. Also, the significant size of the line spread function with respect to the dispersion of the prisms produces alien photons contaminating neighbouring positions of the spectra. This makes the calibration special and different from standard approaches. This work gives a detailed description of the internal calibration model to obtain the spectrophotometric data in the Gaia catalogue. The main purpose of the internal calibration is to bring all the epoch spectra onto a common flux and pixel (pseudo-wavelength) scale, taking into account variations over the focal plane and with time, producing a mean spectrum from all the observations of the same source. In order to describe all observations in a common mean flux and pseudo-wavelength scale, we construct a suitable representation of the internally calibrated mean spectra via basis functions and we describe the transformation between non calibrated epoch spectra and calibrated mean spectra via a discrete convolution, parametrising the convolution kernel to recover the relevant coefficients. The model proposed here is able to combine all observations into a mean instrument to allow the comparison of different sources and observations obtained with different instrumental conditions along the mission and the generation of mean spectra from a number of observations of the same source. The output of this model provides the internal mean spectra, not as a sampled function (flux and wavelength), but as a linear combination of basis functions, although sampled spectra can easily be derived from them.

الأجهزة والأساليب للزيئات الفيزياء الفلكية

Jet Flavour Classification Using DeepJet

62 - Emil Bols , Jan Kieseler , Mauro Verzetti 2020

Jet flavour classification is of paramount importance for a broad range of applications in modern-day high-energy-physics experiments, particularly at the LHC. In this paper we propose a novel architecture for this task that exploits modern deep lear ning techniques. This new model, called DeepJet, overcomes the limitations in input size that affected previous approaches. As a result, the heavy flavour classification performance improves, and the model is extended to also perform quark-gluon tagging.

فيزياء الطاقة العالية - التجربة تحليل البيانات والإحصاءات والاحتمال التعلم الالي

Finding nonlinear system equations and complex network structures from data: a sparse optimization approach

122 - Ying-Cheng Lai 2020

In applications of nonlinear and complex dynamical systems, a common situation is that the system can be measured but its structure and the detailed rules of dynamical evolution are unknown. The inverse problem is to determine the system equations an d structure based solely on measured time series. Recently, methods based on sparse optimization have been developed. For example, the principle of exploiting sparse optimization such as compressive sensing to find the equations of nonlinear dynamical systems from data was articulated in 2011 by the Nonlinear Dynamics Group at Arizona State University. This article presents a brief review of the recent progress in this area. The basic idea is to expand the equations governing the dynamical evolution of the system into a power series or a Fourier series of a finite number of terms and then to determine the vector of the expansion coefficients based solely on data through sparse optimization. Examples discussed here include discovering the equations of stationary or nonstationary chaotic systems to enable prediction of dynamical events such as critical transition and system collapse, inferring the full topology of complex networks of dynamical oscillators and social networks hosting evolutionary game dynamics, and identifying partial differential equations for spatiotemporal dynamical systems. Situations where sparse optimization is effective and those in which the method fails are discussed. Comparisons with the traditional method of delay coordinate embedding in nonlinear time series analysis are given and the recent development of model-free, data driven prediction framework based on machine learning is briefly introduced.

النظم الديناميكية تحليل البيانات والإحصاءات والاحتمال

SNIascore: Deep Learning Classification of Low-Resolution Supernova Spectra

87 - Christoffer Fremling , Xander J. Hall , Michael W. Coughlin 2021

We present SNIascore, a deep-learning based method for spectroscopic classification of thermonuclear supernovae (SNe Ia) based on very low-resolution (R $sim100$) data. The goal of SNIascore is fully automated classification of SNe Ia with a very low false-positive rate (FPR) so that human intervention can be greatly reduced in large-scale SN classification efforts, such as that undertaken by the public Zwicky Transient Facility (ZTF) Bright Transient Survey (BTS). We utilize a recurrent neural network (RNN) architecture with a combination of bidirectional long short-term memory and gated recurrent unit layers. SNIascore achieves a $<0.6%$ FPR while classifying up to $90%$ of the low-resolution SN Ia spectra obtained by the BTS. SNIascore simultaneously performs binary classification and predicts the redshifts of secure SNe Ia via regression (with a typical uncertainty of $<0.005$ in the range from $z = 0.01$ to $z = 0.12$). For the magnitude-limited ZTF BTS survey ($approx70%$ SNe Ia), deploying SNIascore reduces the amount of spectra in need of human classification or confirmation by $approx60%$. Furthermore, SNIascore allows SN Ia classifications to be automatically announced in real-time to the public immediately following a finished observation during the night.

الأجهزة والأساليب للزيئات الفيزياء الفلكية ظاهرة عالية الطاقة الفيزياء الفيزيائية

Low-Resolution Fault Localization Using Phasor Measurement Units with Community Detection

61 - Mahdi Jamei Arizonan State University 2018

A significant portion of the literature on fault localization assumes (more or less explicitly) that there are sufficient reliable measurements to guarantee that the system is observable. While several heuristics exist to break the observability barr ier, they mostly rely on recognizing spatio-temporal patterns, without giving insights on how the performance are tied with the system features and the sensor deployment. In this paper, we try to fill this gap and investigate the limitations and performance limits of fault localization using Phasor Measurement Units (PMUs), in the low measurements regime, i.e., when the system is unobservable with the measurements available. Our main contribution is to show how one can leverage the scarce measurements to localize different type of distribution line faults (three-phase, single-phase to ground, ...) at the level of sub-graph, rather than with the resolution of a line. We show that the resolution we obtain is strongly tied with the graph clustering notion in network science.

أنظمة وتحكم تحليل البيانات والإحصاءات والاحتمال

سجل دخول لتتمكن من نشر تعليقات

التعليقات

جاري جلب التعليقات

سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها

جامعة وهران احمد بن بله

تفاصيل إضافية المزيد من الجامعات

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Finding rare objects and building pure samples: Probabilistic quasar classification from low resolution Gaia spectra

اسأل ChatGPT حول البحث

ﻻ يوجد ملخص باللغة العربية

اقرأ أيضاً