We describe two new open source tools written in Python for performing extreme deconvolution Gaussian mixture modeling (XDGMM) and using a conditioned model to re-sample observed supernova and host galaxy populations. XDGMM is new program for using Gaussian mixtures to do density estimation of noisy data using extreme deconvolution (XD) algorithms that has functionality not available in other XD tools. It allows the user to select between the AstroML (Vanderplas et al. 2012; Ivezic et al. 2015) and Bovy et al. (2011) fitting methods and is compatible with scikit-learn machine learning algorithms (Pedregosa et al. 2011). Most crucially, it allows the user to condition a model based on the known values of a subset of parameters. This gives the user the ability to produce a tool that can predict unknown parameters based on a model conditioned on known values of other parameters. EmpiriciSN is an example application of this functionality that can be used for fitting an XDGMM model to observed supernova/host datasets and predicting likely supernova parameters using on a model conditioned on observed host properties. It is primarily intended for simulating realistic supernovae for LSST data simulations based on empirical galaxy properties.
Many processes in chemistry and physics take place on timescales that cannot be explored using standard molecular dynamics simulations. This renders the use of enhanced sampling mandatory. Here we introduce an enhanced sampling method that is based on constructing a model probability density from which a bias potential is derived. The model relies on the fact that in a physical system most of the configurations visited can be grouped into isolated metastable islands. To each island we associate a distribution that is fitted to a Gaussian mixture. The different distributions are linearly combined together with coefficients that are computed self consistently. Remarkably, from this biased dynamics, rates of transition between different metastable states can be straightforwardly computed.
The red sequence is an important feature of galaxy clusters and plays a crucial role in optical cluster detection. Measurement of the slope and scatter of the red sequence are affected both by selection of red sequence galaxies and measurement errors. In this paper, we describe a new error corrected Gaussian Mixture Model for red sequence galaxy identification. Using this technique, we can remove the effects of measurement error and extract unbiased information about the intrinsic properties of the red sequence. We use this method to select red sequence galaxies in each of the 13,823 clusters in the maxBCG catalog, and measure the red sequence ridgeline location and scatter of each. These measurements provide precise constraints on the variation of the average red galaxy populations in the observed frame with redshift. We find that the scatter of the red sequence ridgeline increases mildly with redshift, and that the slope decreases with redshift. We also observe that the slope does not strongly depend on cluster richness. Using similar methods, we show that this behavior is mirrored in a spectroscopic sample of field galaxies, further emphasizing that ridgeline properties are independent of environment.
We present a new framework to detect various types of variable objects within massive astronomical time-series data. Assuming that the dominant population of objects is non-variable, we find outliers from this population by using a non-parametric Bayesian clustering algorithm based on an infinite GaussianMixtureModel (GMM) and the Dirichlet Process. The algorithm extracts information from a given dataset, which is described by six variability indices. The GMM uses those variability indices to recover clusters that are described by six-dimensional multivariate Gaussian distributions, allowing our approach to consider the sampling pattern of time-series data, systematic biases, the number of data points for each light curve, and photometric quality. Using the Northern Sky Variability Survey data, we test our approach and prove that the infinite GMM is useful at detecting variable objects, while providing statistical inference estimation that suppresses false detection. The proposed approach will be effective in the exploration of future surveys such as GAIA, Pan-Starrs, and LSST, which will produce massive time-series data.
We present a novel Bayesian method, referred to as Blobby3D, to infer gas kinematics that mitigates the effects of beam smearing for observations using Integral Field Spectroscopy (IFS). The method is robust for regularly rotating galaxies despite substructure in the gas distribution. Modelling the gas substructure within the disk is achieved by using a hierarchical Gaussian mixture model. To account for beam smearing effects, we construct a modelled cube that is then convolved per wavelength slice by the seeing, before calculating the likelihood function. We show that our method can model complex gas substructure including clumps and spiral arms. We also show that kinematic asymmetries can be observed after beam smearing for regularly rotating galaxies with asymmetries only introduced in the spatial distribution of the gas. We present findings for our method applied to a sample of 20 star-forming galaxies from the SAMI Galaxy Survey. We estimate the global H$alpha$ gas velocity dispersion for our sample to be in the range $bar{sigma}_v sim $[7, 30] km s$^{-1}$. The relative difference between our approach and estimates using the single Gaussian component fits per spaxel is $Delta bar{sigma}_v / bar{sigma}_v = - 0.29 pm 0.18$ for the H$alpha$ flux-weighted mean velocity dispersion.
In this paper we address the problem of building a class of robust factorization algorithms that solve for the shape and motion parameters with both affine (weak perspective) and perspective camera models. We introduce a Gaussian/uniform mixture model and its associated EM algorithm. This allows us to address robust parameter estimation within a data clustering approach. We propose a robust technique that works with any affine factorization method and makes it robust to outliers. In addition, we show how such a framework can be further embedded into an iterative perspective factorization scheme. We carry out a large number of experiments to validate our algorithms and to compare them with existing ones. We also compare our approach with factorization methods that use M-estimators.
Thomas W.-S. Holoien
,Philip J. Marshall
,
.
(2016)
.
"EmpiriciSN: Re-sampling Observed Supernova/Host Galaxy Populations using an XD Gaussian Mixture Model"
.
Thomas Holoien
هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا