
Photometric Classifications of Evolved Massive Stars: Preparing for the Era of Webb and Roman with Machine Learning

Published by: Trevor Dorn-Wallenstein
Publication date: 2021
Research field: Physics
Language: English





In the coming years, next-generation space-based infrared observatories will significantly increase our samples of rare massive stars, representing a tremendous opportunity to leverage modern statistical tools and methods to test massive stellar evolution in entirely new environments. Such work is only possible if the observed objects can be reliably classified. Because spectroscopic observations are infeasible for more distant targets, we wish to determine whether machine learning methods can classify massive stars using broadband infrared photometry. We find that a Support Vector Machine classifier is capable of coarsely classifying massive stars with labels corresponding to hot, cool, and emission line stars with high accuracy, while rejecting contaminating low mass giants. Remarkably, 76% of emission line stars can be recovered without the need for narrowband or spectroscopic observations. We classify a sample of $\sim 2500$ objects with no existing labels, and identify fourteen candidate emission line objects. Unfortunately, despite the high precision of the photometry in our sample, the heterogeneous origins of the labels for the stars in our sample severely inhibit our classifier from distinguishing classes of stars with more granularity. Ultimately, no large and homogeneously labeled sample of massive stars currently exists. Without significant efforts to robustly classify evolved massive stars (which is feasible given existing data from large all-sky spectroscopic surveys), shortcomings in the labeling of existing data sets will hinder efforts to leverage the next generation of space observatories.
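To make the classification idea concrete, here is a minimal sketch of a linear Support Vector Machine trained by stochastic subgradient descent on the hinge loss. It is an illustration only and not the authors' pipeline: the abstract does not state the kernel, features, or hyperparameters used, so the binary "cool" versus "hot" labels, the two standardised colour features, the synthetic data, and the function names `train_linear_svm` and `predict` are all assumptions.

```python
import random


def train_linear_svm(X, y, lam=0.01, eta=0.01, epochs=200, seed=0):
    """Fit (w, b) by stochastic subgradient descent on the
    L2-regularised hinge loss. Labels y must be -1 or +1."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    b = 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            score = sum(wj * xj for wj, xj in zip(w, X[i])) + b
            # The regulariser always shrinks the weights slightly.
            w = [wj - eta * lam * wj for wj in w]
            # Points inside the margin (or misclassified) update w and b.
            if y[i] * score < 1.0:
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
                b += eta * y[i]
    return w, b


def predict(w, b, x):
    """Return +1 or -1 depending on the side of the decision boundary."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Synthetic, standardised "colours": made up for illustration, not real photometry.
rng = random.Random(42)
cool = [[rng.gauss(1.0, 0.3), rng.gauss(1.0, 0.3)] for _ in range(30)]
hot = [[rng.gauss(-1.0, 0.3), rng.gauss(-1.0, 0.3)] for _ in range(30)]
X = cool + hot
y = [1] * 30 + [-1] * 30  # +1 = "cool", -1 = "hot" (hypothetical labels)

w, b = train_linear_svm(X, y)
accuracy = sum(predict(w, b, xi) == yi for xi, yi in zip(X, y)) / len(X)
print(f"training accuracy: {accuracy:.2f}")
```

A real application would need more than two classes (hot, cool, emission line, and contaminants), held-out data for evaluation, and probably a non-linear kernel. The sketch only shows how a maximum-margin boundary is learned from photometric features.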




Read also

We present a machine-learning photometric redshift analysis of the Kilo-Degree Survey Data Release 3, using two neural-network based techniques: ANNz2 and MLPQNA. Despite limited coverage of spectroscopic training sets, these ML codes provide photo-zs of quality comparable to, if not better than, those from the BPZ code, at least up to zphot<0.9 and r<23.5. At the bright end of r<20, where very complete spectroscopic data overlapping with KiDS are available, the performance of the ML photo-zs clearly surpasses that of BPZ, currently the primary photo-z method for KiDS. Using the Galaxy And Mass Assembly (GAMA) spectroscopic survey as calibration, we furthermore study how photo-zs improve for bright sources when photometric parameters additional to magnitudes are included in the photo-z derivation, as well as when VIKING and WISE infrared bands are added. While the fiducial four-band ugri setup gives a photo-z bias $\delta z=-2\times10^{-4}$ and scatter $\sigma_z<0.022$ at mean z = 0.23, combining magnitudes, colours, and galaxy sizes reduces the scatter by ~7% and the bias by an order of magnitude. Once the ugri and IR magnitudes are joined into 12-band photometry spanning up to 12 $\mu$m, the scatter decreases by more than 10% over the fiducial case. Finally, using the 12 bands together with optical colours and linear sizes gives $\delta z<4\times10^{-5}$ and $\sigma_z<0.019$. This paper also serves as a reference for two public photo-z catalogues accompanying KiDS DR3, both obtained using the ANNz2 code. The first one, of general purpose, includes all the 39 million KiDS sources with four-band ugri measurements in DR3. The second dataset, optimized for low-redshift studies such as galaxy-galaxy lensing, is limited to r<20, and provides photo-zs of much better quality than in the full-depth case thanks to incorporating optical magnitudes, colours, and sizes in the GAMA-calibrated photo-z derivation.
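The bias and scatter quoted above summarise the residuals between photometric and spectroscopic redshifts. The sketch below shows one common way to compute such summaries, assuming the bias is the mean of $\delta z = z_{\rm phot} - z_{\rm spec}$ and the scatter is the scaled median absolute deviation of $\delta z$. The abstract does not state the exact estimators or normalisation used, so these definitions, the function name `photoz_bias_and_scatter`, and the sample values are assumptions.

```python
from statistics import mean, median


def photoz_bias_and_scatter(z_phot, z_spec):
    """Return (bias, scatter) of the residuals dz = z_phot - z_spec.

    bias    : mean of dz (assumed definition)
    scatter : 1.4826 * median(|dz - median(dz)|), the scaled MAD,
              which matches the standard deviation for Gaussian residuals
    """
    if len(z_phot) != len(z_spec):
        raise ValueError("z_phot and z_spec must have the same length")
    dz = [zp - zs for zp, zs in zip(z_phot, z_spec)]
    centre = median(dz)
    mad = median([abs(d - centre) for d in dz])
    return mean(dz), 1.4826 * mad


# Made-up redshifts for illustration only.
z_spec = [0.10, 0.20, 0.30, 0.40, 0.50]
z_phot = [0.12, 0.19, 0.33, 0.38, 0.50]

bias, scatter = photoz_bias_and_scatter(z_phot, z_spec)
print(f"bias = {bias:.4f}, scatter = {scatter:.4f}")
```

The scaled MAD is used here because it is robust to catastrophic outliers. An analysis that normalises residuals by $1 + z_{\rm spec}$ would divide each `dz` entry accordingly before summarising.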
The intermediate-mass pre-main sequence Herbig Ae/Be stars are key to understanding the differences in formation mechanisms between low- and high-mass stars. The study of the general properties of these objects is hampered by the fact that few and mostly serendipitously discovered sources are known. Our goal is to identify new Herbig Ae/Be candidates to create a homogeneous and well defined catalogue of these objects. We have applied machine learning techniques to 4,150,983 sources with data from Gaia DR2, 2MASS, WISE, and IPHAS or VPHAS+. Several observables were chosen to identify new Herbig Ae/Be candidates based on our current knowledge of this class, which is characterised by infrared excesses, photometric variability, and H$\alpha$ emission lines. Classical techniques are not efficient for identifying new Herbig Ae/Be stars mainly because of their similarity with classical Be stars, with which they share many characteristics. By focusing on disentangling these two types of objects, our algorithm has also identified new classical Be stars. We have obtained a large catalogue of 8470 new pre-main sequence candidates and another catalogue of 693 new classical Be candidates with a completeness of $78.8\pm1.4\%$ and $85.5\pm1.2\%$, respectively. Of the catalogue of pre-main sequence candidates, at least 1361 sources are potentially new Herbig Ae/Be candidates according to their position in the Hertzsprung-Russell diagram. In this study we present the methodology used, evaluate the quality of the catalogues, and perform an analysis of their flaws and biases. For this assessment, we make use of observables that have not been accounted for by the algorithm and hence are selection-independent, such as coordinates and parallax based distances. The catalogue of new Herbig Ae/Be stars that we present here increases the number of known objects of the class by an order of magnitude.
The second $Gaia$ Data Release (DR2) contains astrometric and photometric data for more than 1.6 billion objects with mean $Gaia$ $G$ magnitude $<$20.7, including many Young Stellar Objects (YSOs) in different evolutionary stages. In order to explore the YSO population of the Milky Way, we combined the $Gaia$ DR2 database with WISE and Planck measurements and made an all-sky probabilistic catalogue of YSOs using machine learning techniques, such as Support Vector Machines, Random Forests, or Neural Networks. Our input catalogue contains 103 million objects from the DR2xAllWISE cross-match table. We classified each object into four main classes: YSOs, extragalactic objects, main-sequence stars and evolved stars. At a 90% probability threshold we identified 1,129,295 YSO candidates. To demonstrate the quality and potential of our YSO catalogue, here we present two applications of it. (1) We explore the 3D structure of the Orion A star forming complex and show that the spatial distribution of the YSOs classified by our procedure is in agreement with recent results from the literature. (2) We use our catalogue to classify published $Gaia$ Science Alerts. As $Gaia$ measures the sources at multiple epochs, it can efficiently discover transient events, including sudden brightness changes of YSOs caused by dynamic processes of their circumstellar disk. However, in many cases the physical nature of the published alert sources is not known. A cross-check with our new catalogue shows that about 30% more of the published $Gaia$ alerts can most likely be attributed to YSO activity. The catalogue can also be useful to identify YSOs among future $Gaia$ alerts.
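Selecting candidates from a probabilistic catalogue amounts to thresholding the per-class probabilities. The following sketch assumes each object carries one probability per class and that the 90% cut is inclusive. The source identifiers, class keys, probabilities, and the function name `select_candidates` are hypothetical and do not reflect the catalogue's actual schema.

```python
def select_candidates(class_probs, target="YSO", threshold=0.9):
    """Return the ids of objects whose probability for `target`
    is at least `threshold` (inclusive cut, an assumption)."""
    return [
        obj_id
        for obj_id, probs in class_probs.items()
        if probs.get(target, 0.0) >= threshold
    ]


# Hypothetical classifier output over the four classes named in the abstract:
# YSO, extragalactic (EG), main-sequence (MS), and evolved (EV) stars.
class_probs = {
    "src_a": {"YSO": 0.95, "EG": 0.02, "MS": 0.02, "EV": 0.01},
    "src_b": {"YSO": 0.60, "EG": 0.10, "MS": 0.25, "EV": 0.05},
    "src_c": {"YSO": 0.05, "EG": 0.90, "MS": 0.03, "EV": 0.02},
    "src_d": {"YSO": 0.90, "EG": 0.03, "MS": 0.05, "EV": 0.02},
}

candidates = select_candidates(class_probs)
print(candidates)
```

Raising the threshold trades completeness for purity: fewer true YSOs are kept, but fewer contaminants pass the cut.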
We present a Bayesian method to cross-match 5,827,988 high proper motion Gaia sources ($\mu>40$ mas yr$^{-1}$) to various photometric surveys: 2MASS, AllWISE, GALEX, RAVE, SDSS and Pan-STARRS. To efficiently associate these objects across catalogs, we develop a technique that compares the multidimensional distribution of all sources in the vicinity of each Gaia star to a reference distribution of random field stars obtained by extracting all sources in a region on the sky displaced 2$^\prime$. This offset preserves the local field stellar density and magnitude distribution allowing us to characterize the frequency of chance alignments. The resulting catalog with Bayesian probabilities $>$95% has a marginally higher match rate than current internal Gaia DR2 matches for most catalogs. However, a significant improvement is found with Pan-STARRS, where $\sim$99.8% of the sample within the Pan-STARRS footprint is recovered, as compared to a low $\sim$20.8% in Gaia DR2. Using these results, we train a Gaussian Process Regressor to calibrate two photometric metallicity relationships. For dwarfs of $3500<T_{\rm eff}<5280$ K, we use metallicity values of 4,378 stars from APOGEE and Hejazi et al. (2020) to calibrate the relationship, producing results with a $1\sigma$ precision of 0.12 dex and few systematic errors. We then indirectly infer the metallicity of 4,018 stars with $2850<T_{\rm eff}<3500$ K, that are wide companions of primaries whose metallicities are estimated with our first regressor, to produce a relationship with a $1\sigma$ precision of 0.21 dex and significant systematic errors. Additional work is needed to better remove unresolved binaries from this sample to reduce these systematic errors.
We present the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project. CAMELS is a suite of 4,233 cosmological simulations of $(25~h^{-1}{\rm Mpc})^3$ volume each: 2,184 state-of-the-art (magneto-)hydrodynamic simulations run with the AREPO and GIZMO codes, employing the same baryonic subgrid physics as the IllustrisTNG and SIMBA simulations, and 2,049 N-body simulations. The goal of the CAMELS project is to provide theory predictions for different observables as a function of cosmology and astrophysics, and it is the largest suite of cosmological (magneto-)hydrodynamic simulations designed to train machine learning algorithms. CAMELS contains thousands of different cosmological and astrophysical models by way of varying $\Omega_m$, $\sigma_8$, and four parameters controlling stellar and AGN feedback, following the evolution of more than 100 billion particles and fluid elements over a combined volume of $(400~h^{-1}{\rm Mpc})^3$. We describe the simulations in detail and characterize the large range of conditions represented in terms of the matter power spectrum, cosmic star formation rate density, galaxy stellar mass function, halo baryon fractions, and several galaxy scaling relations. We show that the IllustrisTNG and SIMBA suites produce roughly similar distributions of galaxy properties over the full parameter space but significantly different halo baryon fractions and baryonic effects on the matter power spectrum. This emphasizes the need for marginalizing over baryonic effects to extract the maximum amount of information from cosmological surveys. We illustrate the unique potential of CAMELS using several machine learning applications, including non-linear interpolation, parameter estimation, symbolic regression, data generation with Generative Adversarial Networks (GANs), dimensionality reduction, and anomaly detection.