
Inadequacy of internal covariance estimation for super-sample covariance

Published by Fabien Lacasa
Publication date: 2017
Research field: Physics
Language: English





We give an analytical interpretation of how subsample-based internal covariance estimators lead to biased estimates of the covariance, due to underestimating the super-sample covariance (SSC). This includes the jackknife and bootstrap methods as estimators for the full survey area, and subsampling as an estimator of the covariance of subsamples. The limitations of the jackknife covariance have been previously presented in the literature because it is effectively a rescaling of the covariance of the subsample area. However, we point out that subsampling is also biased, but for a different reason: the subsamples are not independent, and the corresponding lack of power results in SSC underprediction. We develop the formalism in the case of cluster counts that allows the bias of each covariance estimator to be exactly predicted. We find significant effects for a small-scale area or when a low number of subsamples is used, with auto-redshift biases ranging from 0.4% to 15% for subsampling and from 5% to 75% for jackknife covariance estimates. The cross-redshift covariance is even more affected; biases range from 8% to 25% for subsampling and from 50% to 90% for jackknife. Owing to the redshift evolution of the probe, the covariances cannot be debiased by a simple rescaling factor, and an exact debiasing has the same requirements as the full SSC prediction. These results thus disfavour the use of internal covariance estimators on the data itself or on a single simulation, leaving analytical prediction and simulation suites as possible SSC predictors.
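The statement that the jackknife covariance is effectively a rescaling of the subsample covariance can be illustrated with a minimal sketch. It assumes equal-area subsamples and that the observable is the total cluster count per redshift bin (the sum over subsamples). The function names and the toy Poisson counts are hypothetical illustrations, not the paper's code, and the toy data contain no super-sample modes.

```python
import numpy as np

def subsample_covariance(counts):
    """Sample covariance of the counts across subsamples.

    counts: array of shape (n_sub, n_bins), counts per subsample and redshift bin.
    Estimates the covariance at the scale of ONE subsample area.
    """
    counts = np.asarray(counts, dtype=float)
    return np.cov(counts, rowvar=False, ddof=1)

def jackknife_covariance(counts):
    """Delete-one jackknife covariance of the full-survey total counts."""
    counts = np.asarray(counts, dtype=float)
    n_sub = counts.shape[0]
    total = counts.sum(axis=0)
    # Leave-one-out totals, rescaled from (n_sub - 1) patches to the full area
    loo = (total - counts) * n_sub / (n_sub - 1)
    diff = loo - loo.mean(axis=0)
    return (n_sub - 1) / n_sub * (diff.T @ diff)

# Toy data (assumption): 16 equal-area patches, 3 redshift bins
rng = np.random.default_rng(0)
counts = rng.poisson(50, size=(16, 3))

cov_sub = subsample_covariance(counts)
cov_jk = jackknife_covariance(counts)

# For a summed observable, the jackknife result equals n_sub times
# the subsample covariance: a pure rescaling of the small-area estimate.
print(np.allclose(cov_jk, counts.shape[0] * cov_sub))
```

Because the jackknife estimate here is an algebraic rescaling of the subsample estimate, it cannot contain any fluctuation on scales larger than the survey that the subsample covariance does not already contain.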




Read also

We present a numerically cheap approximation to super-sample covariance (SSC) of large scale structure cosmological probes, first in the case of angular power spectra. It necessitates no new elements besides those used for the prediction of the considered probes, thus relieving analysis pipelines from having to develop a full SSC modeling, and reducing the computational load. The approximation is asymptotically exact for fine redshift bins $\Delta z \rightarrow 0$. We furthermore show how it can be implemented at the level of a Gaussian likelihood or a Fisher matrix forecast, as a fast correction to the Gaussian case without needing to build large covariance matrices. Numerical application to a Euclid-like survey shows that, compared to a full SSC computation, the approximation recovers nicely the signal-to-noise ratio as well as Fisher forecasts on cosmological parameters of the $w$CDM cosmological model. Moreover, it allows for a fast prediction of which parameters are going to be the most affected by SSC and at which level. In the case of photometric galaxy clustering with Euclid-like specifications, we find that $\sigma_8$, $n_s$ and the dark energy equation of state $w$ are particularly heavily affected. We finally show how to generalize the approximation for probes other than angular spectra (correlation functions, number counts and bispectra), and at the likelihood level, allowing for the latter to be non-Gaussian if need be. We publicly release a Python module that implements the SSC approximation, as well as a notebook reproducing the plots of the article, at https://github.com/fabienlacasa/PySSC
The covariance matrix $\boldsymbol{\Sigma}$ of non-linear clustering statistics that are measured in current and upcoming surveys is of fundamental interest for comparing cosmological theory and data and a crucial ingredient for the likelihood approximations underlying widely used parameter inference and forecasting methods. The extreme number of simulations needed to estimate $\boldsymbol{\Sigma}$ to sufficient accuracy poses a severe challenge. Approximating $\boldsymbol{\Sigma}$ using inexpensive but biased surrogates introduces model error with respect to full simulations, especially in the non-linear regime of structure growth. To address this problem we develop a matrix generalization of Convergence Acceleration by Regression and Pooling (CARPool) to combine a small number of simulations with fast surrogates and obtain low-noise estimates of $\boldsymbol{\Sigma}$ that are unbiased by construction. Our numerical examples use CARPool to combine GADGET-III $N$-body simulations with fast surrogates computed using COmoving Lagrangian Acceleration (COLA). Even at the challenging redshift $z=0.5$, we find variance reductions of at least $\mathcal{O}(10^1)$ and up to $\mathcal{O}(10^4)$ for the elements of the matter power spectrum covariance matrix on scales $8.9\times 10^{-3}<k_\mathrm{max}<1.0$ $h\,{\rm Mpc^{-1}}$. We demonstrate comparable performance for the covariance of the matter bispectrum, the matter correlation function and probability density function of the matter density field. We compare eigenvalues, likelihoods, and Fisher matrices computed using the CARPool covariance estimate with the standard sample covariance estimators and generally find considerable improvement except in cases where $\Sigma$ is severely ill-conditioned.
Photometric galaxy surveys probe the late-time Universe where the density field is highly non-Gaussian. A consequence is the emergence of the super-sample covariance (SSC), a non-Gaussian covariance term that is sensitive to fluctuations on scales larger than the survey window. In this work, we study the impact of the survey geometry on the SSC and, subsequently, on cosmological parameter inference. We devise a fast SSC approximation that accounts for the survey geometry and compare its performance to the common approximation of rescaling the results by the fraction of the sky covered by the survey, $f_\mathrm{SKY}$, dubbed full-sky approximation. To gauge the impact of our new SSC recipe, dubbed partial-sky, we perform Fisher forecasts on the parameters of the $(w_0,w_a)$-CDM model in a 3x2-point analysis, varying the survey area, the geometry of the mask and the galaxy distribution inside our redshift bins. The differences in the marginalised forecast errors, with the full-sky approximation performing poorly for small survey areas but excellently for stage-IV-like areas, are found to be absorbed by the marginalisation on galaxy bias nuisance parameters. For large survey areas, the unmarginalised errors are underestimated by about 10% for all probes considered. This is a hint that, even for stage-IV-like surveys, the partial-sky method introduced in this work will be necessary if tight priors are applied on these nuisance parameters.
We propose a novel pilot structure for covariance matrix estimation in massive multiple-input multiple-output (MIMO) systems in which each user transmits two pilot sequences, with the second pilot sequence multiplied by a random phase-shift. The covariance matrix of a particular user is obtained by computing the sample cross-correlation of the channel estimates obtained from the two pilot sequences. This approach relaxes the requirement that all the users transmit their uplink pilots over the same set of symbols. We derive expressions for the achievable rate and the mean-squared error of the covariance matrix estimate when the proposed method is used with staggered pilots. The performance of the proposed method is compared with existing methods through simulations.
Aims. We investigate the contribution of shot-noise and sample variance to the uncertainty of cosmological parameter constraints inferred from cluster number counts in the context of the Euclid survey. Methods. By analysing 1000 Euclid-like light-cones, produced with the PINOCCHIO approximate method, we validate the analytical model of Hu & Kravtsov (2003) for the covariance matrix, which takes into account both sources of statistical error. Then, we use this covariance to define the likelihood function that better extracts cosmological information from cluster number counts at the level of precision that will be reached by the future Euclid photometric catalogs of galaxy clusters. We also study the impact of the cosmology dependence of the covariance matrix on the parameter constraints. Results. The analytical covariance matrix reproduces the variance measured from simulations within the 10 per cent level; this difference has no sizeable effect on the error of cosmological parameter constraints at this level of statistics. Also, we find that the Gaussian likelihood with cosmology-dependent covariance is the only model that provides an unbiased inference of cosmological parameters without underestimating the errors.