No Arabic abstract
Cosmological parameter estimation is entering a new era. Large collaborations need to coordinate high-stakes analyses using multiple methods; furthermore such analyses have grown in complexity due to sophisticated models of cosmology and systematic uncertainties. In this paper we argue that modularity is the key to addressing these challenges: calculations should be broken up into interchangeable modular units with inputs and outputs clearly defined. We present a new framework for cosmological parameter estimation, CosmoSIS, designed to connect together, share, and advance development of inference tools across the community. We describe the modules already available in CosmoSIS, including CAMB, Planck, cosmic shear calculations, and a suite of samplers. We illustrate it using demonstration code that you can run out-of-the-box with the installer available at http://bitbucket.org/joezuntz/cosmosis
The ability to obtain reliable point estimates of model parameters is of crucial importance in many fields of physics. This is often a difficult task given that the observed data can have a very high number of dimensions. In order to address this problem, we propose a novel approach to construct parameter estimators with a quantifiable bias using an order expansion of highly compressed deep summary statistics of the observed data. These summary statistics are learned automatically using an information maximising loss. Given an observation, we further show how one can use the constructed estimators to obtain approximate Bayes computation (ABC) posterior estimates and their corresponding uncertainties that can be used for parameter inference using Gaussian process regression even if the likelihood is not tractable. We validate our method with an application to the problem of cosmological parameter inference of weak lensing mass maps. We show in that case that the constructed estimators are unbiased and have an almost optimal variance, while the posterior distribution obtained with the Gaussian process regression is close to the true posterior and performs better or equally well than comparable methods.
Cosmological large-scale structure analyses based on two-point correlation functions often assume a Gaussian likelihood function with a fixed covariance matrix. We study the impact on cosmological parameter estimation of ignoring the parameter dependence of this covariance matrix, focusing on the particular case of joint weak-lensing and galaxy clustering analyses. Using a Fisher matrix formalism (calibrated against exact likelihood evaluation in particular simple cases), we quantify the effect of using a parameter dependent covariance matrix on both the bias and variance of the parameters. We confirm that the approximation of a parameter-independent covariance matrix is exceptionally good in all realistic scenarios. The information content in the covariance matrix (in comparison with the two point functions themselves) does not change with the fractional sky coverage. Therefore the increase in information due to the parameter dependent covariance matrix becomes negligible as the number of modes increases. Even for surveys covering less than $1%$ of the sky, this effect only causes a bias of up to ${cal O}(10%)$ of the statistical uncertainties, with a misestimation of the parameter uncertainties at the same level or lower. The effect will only be smaller with future large-area surveys. Thus for most analyses the effect of a parameter-dependent covariance matrix can be ignored both in terms of the accuracy and precision of the recovered cosmological constraints.
The galaxy catalogs generated from low-resolution emission line surveys often contain both foreground and background interlopers due to line misidentification, which can bias the cosmological parameter estimation. In this paper, we present a method for correcting the interloper bias by using the joint-analysis of auto- and cross-power spectra of the main and the interloper samples. In particular, we can measure the interloper fractions from the cross-correlation between the interlopers and survey galaxies, because the true cross-correlation must be negligibly small. The estimated interloper fractions, in turn, remove the interloper bias in the cosmological parameter estimation. For example, in the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) low-redshift ($z<0.5$) [O II] $lambda3727${AA} emitters contaminate high-redshift ($1.9<z<3.5$) Lyman-$alpha$ line emitters. We demonstrate that the joint-analysis method yields a high signal-to-noise ratio measurement of the interloper fractions while only marginally increasing the uncertainties in the cosmological parameters relative to the case without interlopers. We also show the same is true for the high-latitude spectroscopic survey of Wide-Field Infrared Survey Telescope (WFIRST) mission where contamination occurs between the Balmer-$alpha$ line emitters at lower redshifts ($1.1<z<1.9$) and Oxygen ([O III] $lambda5007${AA}) line emitters at higher redshifts ($1.7<z<2.8$).
We perform a model independent reconstruction of the cosmic expansion rate based on type Ia supernova data. Using the Union 2.1 data set, we show that the Hubble parameter behaviour allowed by the data without making any hypothesis about cosmological model or underlying gravity theory is consistent with a flat LCDM universe having H_0 = 70.43 +- 0.33 and Omega_m=0.297 +- 0.020, weakly dependent on the choice of initial scatter matrix. This is in closer agreement with the recently released Planck results (H_0 = 67.3 +- 1.2, Omega_m = 0.314 +- 0.020) than other standard analyses based on type Ia supernova data. We argue this might be an indication that, in order to tackle subtle deviations from the standard cosmological model present in type Ia supernova data, it is mandatory to go beyond parametrized approaches.
To further our knowledge of the complex physical process of galaxy formation, it is essential that we characterize the formation and evolution of large databases of galaxies. The spectral synthesis STARLIGHT code of Cid Fernandes et al. (2004) was designed for this purpose. Results of STARLIGHT are highly dependent on the choice of input basis of simple stellar population (SSP) spectra. Speed of the code, which uses random walks through the parameter space, scales as the square of the number of basis spectra, making it computationally necessary to choose a small number of SSPs that are coarsely sampled in age and metallicity. In this paper, we develop methods based on diffusion map (Lafon & Lee, 2006) that, for the first time, choose appropriate bases of prototype SSP spectra from a large set of SSP spectra designed to approximate the continuous grid of age and metallicity of SSPs of which galaxies are truly composed. We show that our techniques achieve better accuracy of physical parameter estimation for simulated galaxies. Specifically, we show that our methods significantly decrease the age-metallicity degeneracy that is common in galaxy population synthesis methods. We analyze a sample of 3046 galaxies in SDSS DR6 and compare the parameter estimates obtained from different basis choices.