Photometric classification of emission line galaxies with Machine Learning methods

592 0 0.0 ( 0 )

تحميل البحث استخدام كمرجع

نشر من قبل Stefano Cavuoti

تاريخ النشر 2013

مجال البحث فيزياء

والبحث باللغة English

تأليف Stefano Cavuoti - Massimo Brescia - Raffaele DAbrusco

علم الكونيات والفيزياء الفلكية Nongalactic

قم بزيارة صفحتنا على فيسبوك

‎Shamra Academia - شمرا أكاديميا‎

اسأل ChatGPT حول البحث

الملخص بالعربية الملخص بالإنكليزية

In this paper we discuss an application of machine learning based methods to the identification of candidate AGN from optical survey data and to the automatic classification of AGNs in broad classes. We applied four different machine learning algorithms, namely the Multi Layer Perceptron (MLP), trained respectively with the Conjugate Gradient, Scaled Conjugate Gradient and Quasi Newton learning rules, and the Support Vector Machines (SVM), to tackle the problem of the classification of emission line galaxies in different classes, mainly AGNs vs non-AGNs, obtained using optical photometry in place of the diagnostics based on line intensity ratios which are classically used in the literature. Using the same photometric features we discuss also the behavior of the classifiers on finer AGN classification tasks, namely Seyfert I vs Seyfert II and Seyfert vs LINER. Furthermore we describe the algorithms employed, the samples of spectroscopically classified galaxies used to train the algorithms, the procedure followed to select the photometric parameters and the performances of our methods in terms of multiple statistical indicators. The results of the experiments show that the application of self adaptive data mining algorithms trained on spectroscopic data sets and applied to carefully chosen photometric parameters represents a viable alternative to the classical methods that employ time-consuming spectroscopic observations.

قيم البحث

76 - Andrew S. Leung 2015

We present a Bayesian approach to the redshift classification of emission-line galaxies when only a single emission line is detected spectroscopically. We consider the case of surveys for high-redshift Lyman-alpha-emitting galaxies (LAEs), which have traditionally been classified via an inferred rest-frame equivalent width (EW) greater than 20 angstrom. Our Bayesian method relies on known prior probabilities in measured emission-line luminosity functions and equivalent width distributions for the galaxy populations, and returns the probability that an object in question is an LAE given the characteristics observed. This approach will be directly relevant for the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX), which seeks to classify ~10^6 emission-line galaxies into LAEs and low-redshift [O II] emitters. For a simulated HETDEX catalog with realistic measurement noise, our Bayesian method recovers 86% of LAEs missed by the traditional EW > 20 angstrom cutoff over 2 < z < 3, outperforming the EW cut in both contamination and incompleteness. This is due to the methods ability to trade off between the two types of binary classification error by adjusting the stringency of the probability requirement for classifying an observed object as an LAE. In our simulations of HETDEX, this method reduces the uncertainty in cosmological distance measurements by 14% with respect to the EW cut, equivalent to recovering 29% more cosmological information. Rather than using binary object labels, this method enables the use of classification probabilities in large-scale structure analyses. It can be applied to narrowband emission-line surveys as well as upcoming large spectroscopic surveys including Euclid and WFIRST.

الأجهزة والأساليب للزيئات الفيزياء الفلكية علم الكونيات والفيزياء الفلكية Nongalactic الفيزياء الفلكية من المجرات

Photometric Supernova Classification With Machine Learning

71 - Michelle Lochner , Jason D. McEwen , Hiranya V. Peiris 2016

Automated photometric supernova classification has become an active area of research in recent years in light of current and upcoming imaging surveys such as the Dark Energy Survey (DES) and the Large Synoptic Survey Telescope, given that spectroscop ic confirmation of type for all supernovae discovered will be impossible. Here, we develop a multi-faceted classification pipeline, combining existing and new approaches. Our pipeline consists of two stages: extracting descriptive features from the light curves and classification using a machine learning algorithm. Our feature extraction methods vary from model-dependent techniques, namely SALT2 fits, to more independent techniques fitting parametric models to curves, to a completely model-independent wavelet approach. We cover a range of representative machine learning algorithms, including naive Bayes, k-nearest neighbors, support vector machines, artificial neural networks and boosted decision trees (BDTs). We test the pipeline on simulated multi-band DES light curves from the Supernova Photometric Classification Challenge. Using the commonly used area under the curve (AUC) of the Receiver Operating Characteristic as a metric, we find that the SALT2 fits and the wavelet approach, with the BDTs algorithm, each achieves an AUC of 0.98, where 1 represents perfect classification. We find that a representative training set is essential for good classification, whatever the feature set or algorithm, with implications for spectroscopic follow-up. Importantly, we find that by using either the SALT2 or the wavelet feature sets with a BDT algorithm, accurate classification is possible purely from light curve data, without the need for any redshift information.

الأجهزة والأساليب للزيئات الفيزياء الفلكية علم الكونيات والفيزياء الفلكية Nongalactic

Star Formation Rates for photometric samples of galaxies using machine learning methods

96 - M. Delli Veneri , S. Cavuoti , M. Brescia 2019

Star Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFRs are usually estimated via spectroscopic observations requiring large amounts of telescope time. We explore an alternative approach based on the pho tometric estimation of global SFRs for large samples of galaxies, by using methods such as automatic parameter space optimisation, and supervised Machine Learning models. We demonstrate that, with such approach, accurate multi-band photometry allows to estimate reliable SFRs. We also investigate how the use of photometric rather than spectroscopic redshifts, affects the accuracy of derived global SFRs. Finally, we provide a publicly available catalogue of SFRs for more than 27 million galaxies extracted from the Sloan Digital Sky survey Data Release 7. The catalogue is available through the Vizier facility at the following link ftp://cdsarc.u-strasbg.fr/pub/cats/J/MNRAS/486/1377.

الأجهزة والأساليب للزيئات الفيزياء الفلكية الفيزياء الفلكية من المجرات

Photometric classification of HSC transients using machine learning

99 - Ichiro Takahashi , Nao Suzuki , Naoki Yasuda 2020

The advancement of technology has resulted in a rapid increase in supernova (SN) discoveries. The Subaru/Hyper Suprime-Cam (HSC) transient survey, conducted from fall 2016 through spring 2017, yielded 1824 SN candidates. This gave rise to the need fo r fast type classification for spectroscopic follow-up and prompted us to develop a machine learning algorithm using a deep neural network (DNN) with highway layers. This machine is trained by actual observed cadence and filter combinations such that we can directly input the observed data array into the machine without any interpretation. We tested our model with a dataset from the LSST classification challenge (Deep Drilling Field). Our classifier scores an area under the curve (AUC) of 0.996 for binary classification (SN Ia or non-SN Ia) and 95.3% accuracy for three-class classification (SN Ia, SN Ibc, or SN II). Application of our binary classification to HSC transient data yields an AUC score of 0.925. With two weeks of HSC data since the first detection, this classifier achieves 78.1% accuracy for binary classification, and the accuracy increases to 84.2% with the full dataset. This paper discusses the potential use of machine learning for SN type classification purposes.

الأجهزة والأساليب للزيئات الفيزياء الفلكية ظاهرة عالية الطاقة الفيزياء الفيزيائية

Photometric redshifts and clustering of emission line galaxies selected jointly by DES and eBOSS

68 - S. Jouvel , T. Delubac , J. Comparat 2015

We present the results of the first test plates of the extended Baryon Oscillation Spectroscopic Survey. This paper focuses on the emission line galaxies (ELG) population targetted from the Dark Energy Survey (DES) photometry. We analyse the success rate, efficiency, redshift distribution, and clustering properties of the targets. From the 9000 spectroscopic redshifts targetted, 4600 have been selected from the DES photometry. The total success rate for redshifts between 0.6 and 1.2 is 71% and 68% respectively for a bright and faint, on average more distant, samples including redshifts measured from a single strong emission line. We find a mean redshift of 0.8 and 0.87, with 15 and 13% of unknown redshifts respectively for the bright and faint samples. In the redshift range 0.6<z<1.2, for the most secure spectroscopic redshifts, the mean redshift for the bright and faint sample is 0.85 and 0.9 respectively. Star contamination is lower than 2%. We measure a galaxy bias averaged on scales of 1 and 10~Mpc/h of 1.72 pm 0.1 for the bright sample and of 1.78 pm 0.12 for the faint sample. The error on the galaxy bias have been obtained propagating the errors in the correlation function to the fitted parameters. This redshift evolution for the galaxy bias is in agreement with theoretical expectations for a galaxy population with MB-5log h < -21.0. We note that biasing is derived from the galaxy clustering relative to a model for the mass fluctuations. We investigate the quality of the DES photometric redshifts and find that the outlier fraction can be reduced using a comparison between template fitting and neural network, or using a random forest algorithm.

علم الكونيات والفيزياء الفلكية Nongalactic

سجل دخول لتتمكن من نشر تعليقات