Accurate photometric redshifts are a lynchpin for many future experiments to pin down the cosmological model and for studies of galaxy evolution. In this study, a novel sparse regression framework for photometric redshift estimation is presented. Simulated and real data from SDSS DR12 were used to train and test the proposed models. We show that approaches which include careful data preparation and model design offer a significant improvement in comparison with several competing machine learning algorithms. Standard implementations of most regression algorithms have as their objective the minimization of the sum of squared errors. For redshift inference, however, this induces a bias in the posterior mean of the output distribution, which can be problematic. In this paper we directly target minimizing $\Delta z = (z_\textrm{s} - z_\textrm{p})/(1+z_\textrm{s})$ and address the bias problem via a distribution-based weighting scheme, incorporated as part of the optimization objective. The results are compared with other machine learning algorithms in the field such as Artificial Neural Networks (ANN), Gaussian Processes (GPs) and sparse GPs. The proposed framework reaches a mean absolute $\Delta z = 0.0026(1+z_\textrm{s})$ over the redshift range $0 \le z_\textrm{s} \le 2$ on the simulated data, and $\Delta z = 0.0178(1+z_\textrm{s})$ over the entire redshift range on the SDSS DR12 survey, outperforming the standard ANNz used in the literature. We also investigate how the relative size of the training set affects the photometric redshift accuracy. We find that a training set of $>30$ per cent of the total sample size provides little additional constraint on the photometric redshifts, and note that our GP formalism strongly outperforms ANNz in the sparse data regime for the simulated data set.
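As an illustration of the metric quoted above, the normalised residual $(z_\textrm{s} - z_\textrm{p})/(1+z_\textrm{s})$ and its mean absolute value can be computed as in the minimal sketch below. The function names and the toy redshifts are assumptions made for this example, and the distribution-based weighting scheme is not reproduced because the abstract does not specify it.

```python
from statistics import mean


def normalized_residuals(z_spec, z_phot):
    """Return (z_s - z_p) / (1 + z_s) for each galaxy."""
    if len(z_spec) != len(z_phot):
        raise ValueError("z_spec and z_phot must have the same length")
    return [(zs - zp) / (1.0 + zs) for zs, zp in zip(z_spec, z_phot)]


def mean_absolute_dz(z_spec, z_phot):
    """Mean absolute normalised residual (the accuracy figure quoted above)."""
    return mean(abs(d) for d in normalized_residuals(z_spec, z_phot))


def mean_bias(z_spec, z_phot):
    """Signed mean of the normalised residuals (a simple bias estimate)."""
    return mean(normalized_residuals(z_spec, z_phot))


# Toy values, invented for illustration only (not survey data).
z_spec = [0.0, 1.0, 3.0]
z_phot = [0.1, 0.8, 3.4]

mad = mean_absolute_dz(z_spec, z_phot)
bias = mean_bias(z_spec, z_phot)
print(f"mean |dz| = {mad:.4f}, bias = {bias:.4f}")
```

Dividing by $1+z_\textrm{s}$ keeps errors at high redshift from dominating the average, which is why the residual is normalised before averaging.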
This paper aims to put constraints on the transition redshift $z_t$, which determines the onset of cosmic acceleration, in cosmological-model independent frameworks. In order to do that, we use the non-parametric Gaussian Process method with $H(z)$ and SNe Ia data. The deceleration parameter reconstruction from $H(z)$ data yields $z_t=0.59^{+0.12}_{-0.11}$. The reconstruction from SNe Ia data assumes spatial flatness and yields $z_t=0.683^{+0.11}_{-0.082}$. These results were found with a Gaussian kernel and we show that they are consistent with two other kernel choices.
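To make the definition of $z_t$ concrete: the deceleration parameter can be written as $q(z) = (1+z)H'(z)/H(z) - 1$, and the transition redshift is the point where $q$ changes sign. The sketch below locates that zero crossing on a tabulated $H(z)$ curve using finite differences. It is not the Gaussian Process reconstruction described above; the flat $\Lambda$CDM curve with $\Omega_m = 0.3$ and $H_0 = 70$ is an assumed toy input used only to check the procedure against the known analytic answer, and all function names are hypothetical.

```python
import math


def deceleration(z_grid, h_grid):
    """q(z) = (1+z) H'(z)/H(z) - 1 at interior grid points (central differences)."""
    q = []
    for i in range(1, len(z_grid) - 1):
        dh_dz = (h_grid[i + 1] - h_grid[i - 1]) / (z_grid[i + 1] - z_grid[i - 1])
        q.append((z_grid[i], (1.0 + z_grid[i]) * dh_dz / h_grid[i] - 1.0))
    return q


def transition_redshift(z_grid, h_grid):
    """First zero crossing of q(z), by linear interpolation; None if absent."""
    q = deceleration(z_grid, h_grid)
    for (z0, q0), (z1, q1) in zip(q, q[1:]):
        if q0 == 0.0:
            return z0
        if q0 * q1 < 0.0:
            return z0 - q0 * (z1 - z0) / (q1 - q0)
    return None


def h_flat_lcdm(z, h0=70.0, omega_m=0.3):
    """Toy expansion rate for flat LCDM (assumed parameters, for testing only)."""
    return h0 * math.sqrt(omega_m * (1.0 + z) ** 3 + 1.0 - omega_m)


z_grid = [0.01 * i for i in range(201)]
h_grid = [h_flat_lcdm(z) for z in z_grid]
z_t = transition_redshift(z_grid, h_grid)

# Analytic value for flat LCDM: z_t = (2 * Omega_Lambda / Omega_m)^(1/3) - 1
z_t_exact = (2.0 * 0.7 / 0.3) ** (1.0 / 3.0) - 1.0
print(f"numerical z_t = {z_t:.4f}, analytic z_t = {z_t_exact:.4f}")
```

With real data the tabulated curve would come from a smooth reconstruction of the measurements rather than from a model, since differentiating noisy points directly amplifies the noise.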
In modern galaxy surveys, photometric redshifts play a central role in a broad range of studies, from gravitational lensing and dark matter distribution to galaxy evolution. Using a dataset of about 25,000 galaxies from the second data release of the Kilo Degree Survey (KiDS), we obtain photometric redshifts with five different methods: (i) Random Forest, (ii) Multi Layer Perceptron with Quasi Newton Algorithm, (iii) Multi Layer Perceptron with an optimization network based on the Levenberg-Marquardt learning rule, (iv) the Bayesian Photometric Redshift model (BPZ) and (v) a classical SED template fitting procedure (Le Phare). We show how SED fitting techniques can provide useful information on the galaxy spectral type, which can be used to improve the capability of machine learning methods by constraining systematic errors and reducing the occurrence of catastrophic outliers. We use this classification to train specialized regression estimators, demonstrating that such a hybrid approach, involving SED fitting and machine learning in a single collaborative framework, is capable of improving the overall prediction accuracy of photometric redshifts.
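The hybrid idea can be sketched as follows: galaxies are first grouped by the spectral class assigned by template fitting, and a separate regressor is then trained for each class. In this minimal sketch a one-feature least-squares line stands in for the Random Forest and Multi Layer Perceptron models named above, and the class labels "E" and "S", the single colour feature, and the toy numbers are all assumptions made for illustration.

```python
from collections import defaultdict


def fit_line(xs, ys):
    """Ordinary least-squares line y = slope * x + intercept."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx > 0 else 0.0
    return slope, my - slope * mx


def train_per_class(colours, classes, z_spec):
    """Fit one regressor per spectral class (class labels come from SED fitting)."""
    grouped = defaultdict(lambda: ([], []))
    for colour, label, z in zip(colours, classes, z_spec):
        grouped[label][0].append(colour)
        grouped[label][1].append(z)
    return {label: fit_line(xs, ys) for label, (xs, ys) in grouped.items()}


def predict(models, colour, spectral_class):
    """Route a galaxy to the regressor trained on its own spectral class."""
    slope, intercept = models[spectral_class]
    return slope * colour + intercept


# Toy training set, invented for illustration only.
colours = [1.0, 2.0, 3.0, 1.0, 2.0, 3.0]
classes = ["E", "E", "E", "S", "S", "S"]
z_spec = [0.1, 0.2, 0.3, 0.3, 0.5, 0.7]

models = train_per_class(colours, classes, z_spec)
z_e = predict(models, 2.5, "E")
z_s = predict(models, 2.5, "S")
print(f"E-type: {z_e:.3f}, S-type: {z_s:.3f}")
```

The same colour maps to different redshifts depending on the class, which is the degeneracy a single global regressor would have to average over.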
We show that mid-infrared data from the all-sky WISE survey can be used as a robust photometric redshift indicator for powerful radio AGN, in the absence of other spectroscopic or multi-band photometric information. Our work is motivated by a desire to extend the well-known K-z relation for radio galaxies to the wavelength range covered by the all-sky WISE mid-infrared survey. Using the LARGESS radio spectroscopic sample as a training set, and the mid-infrared colour information to classify radio sources, we generate a set of redshift probability distributions for the hosts of high-excitation and low-excitation radio AGN. We test the method using spectroscopic data from several other radio AGN studies, and find good agreement between our WISE-based redshift estimates and published spectroscopic redshifts out to z ~ 1 for galaxies and z ~ 3-4 for radio-loud QSOs. Our chosen method is also compared against other classification methods and found to perform reliably. This technique is likely to be particularly useful in the analysis of upcoming large-area radio surveys with SKA pathfinder telescopes, and our code is publicly available. As a consistency check, we show that our WISE-based redshift estimates for sources in the 843 MHz SUMSS survey reproduce the redshift distribution seen in the CENSORS study up to z ~ 2. We also discuss two specific applications of our technique for current and upcoming radio surveys: an interpretation of large-scale HI absorption surveys, and a determination of whether low-frequency peaked spectrum sources lie at high redshift.
Obtaining accurate photometric redshift estimates is an important aspect of cosmology and remains a prerequisite of many analyses. In creating novel methods to produce redshift estimates, there has been a shift towards using machine learning techniques. However, there has been less focus on how well different machine learning methods scale or perform with the ever-increasing amounts of data being produced. Here, we introduce a benchmark designed to analyse the performance and scalability of different supervised machine learning methods for photometric redshift estimation. Making use of the Sloan Digital Sky Survey (SDSS DR12) dataset, we analysed a variety of the most used machine learning algorithms. By scaling the number of galaxies used to train and test the algorithms up to one million, we obtained several metrics demonstrating the algorithms' performance and scalability for this task. Furthermore, by introducing a new optimisation method, time-considered optimisation, we were able to demonstrate how a small concession of error can allow for a great improvement in efficiency. From the algorithms tested we found that the Random Forest performed best in terms of error, with a mean squared error of MSE = 0.0042; however, as other algorithms such as Boosted Decision Trees and k-Nearest Neighbours performed very similarly, we used our benchmarks to demonstrate how different algorithms could be superior in different scenarios. We believe benchmarks such as this will become even more vital with upcoming surveys, such as LSST, which will capture billions of galaxies requiring photometric redshifts.
The calibration of modern radio interferometers is a significant challenge, especially at low frequencies. In this context, we propose a novel iterative calibration algorithm, which employs the popular sparse representation framework, in the regime where the propagation conditions shift the apparent directions of the sources by different amounts. More precisely, our algorithm is designed to estimate the apparent directions of the calibration sources, their powers, the directional and undirectional complex gains of the array elements and their noise powers, with a reasonable computational complexity. Numerical simulations reveal that the proposed scheme is statistically efficient at low SNR, even in the presence of additional non-calibration sources at unknown directions.