No Arabic abstract
We develop a machine learning-based framework to predict the HI content of galaxies using more straightforwardly observable quantities such as optical photometry and environmental parameters. We train the algorithm on z=0-2 outputs from the Mufasa cosmological hydrodynamic simulation, which includes star formation, feedback, and a heuristic model to quench massive galaxies that yields a reasonable match to a range of survey data including HI. We employ a variety of machine learning methods (regressors), and quantify their performance using the root mean square error ({sc rmse}) and the Pearson correlation coefficient (r). Considering SDSS photometry, 3$^{rd}$ nearest neighbor environment and line of sight peculiar velocities as features, we obtain r $> 0.8$ accuracy of the HI-richness prediction, corresponding to {sc rmse}$<0.3$. Adding near-IR photometry to the features yields some improvement to the prediction. Compared to all the regressors, random forest shows the best performance, with r $>0.9$ at $z=0$, followed by a Deep Neural Network with r $>0.85$. All regressors exhibit a declining performance with increasing redshift, which limits the utility of this approach to $zla 1$, and they tend to somewhat over-predict the HI content of low-HI galaxies which might be due to Eddington bias in the training sample. We test our approach on the RESOLVE survey data. Training on a subset of RESOLVE data, we find that our machine learning method can reasonably well predict the HI-richness of the remaining RESOLVE data, with {sc rmse}$sim0.28$. When we train on mock data from Mufasa and test on RESOLVE, this increases to {sc rmse}$sim0.45$. Our method will be useful for making galaxy-by-galaxy survey predictions and incompleteness corrections for upcoming HI 21cm surveys such as the LADUMA and MIGHTEE surveys on MeerKAT, over regions where photometry is already available.
We use observations made with the Giant Metrewave Radio Telescope (GMRT) to probe the neutral hydrogen (HI) gas content of field galaxies in the VIMOS VLT Deep Survey (VVDS) 14h field at $z approx 0.32$. Because the HI emission from individual galaxies is too faint to detect at this redshift, we use an HI spectral stacking technique using the known optical positions and redshifts of the 165 galaxies in our sample to co-add their HI spectra and thus obtain the average HI mass of the galaxies. Stacked HI measurements of 165 galaxies show that 95 per cent of the neutral gas is found in blue, star-forming galaxies. Among these galaxies, those having lower stellar mass are more gas-rich than more massive ones. We apply a volume correction to our HI measurement to evaluate the HI gas density at $z approx 0.32$ as $Omega_{HI}=(0.50pm0.18) times 10^{-3}$ in units of the cosmic critical density. This value is in good agreement with previous results at z < 0.4, suggesting no evolution in the neutral hydrogen gas density over the last $sim 4$ Gyr. However the $z approx 0.32$ gas density is lower than that at $z sim 5$ by at least a factor of two.
We examine the global HI properties of galaxies in quarter-billion particle cosmological simulations using Gadget-2, focusing on how galactic outflows impact HI content. We consider four outflow models, including a new one (ezw) motivated by recent interstellar medium simulations in which the wind speed and mass loading factor scale as expected for momentum-driven outflows for larger galaxies and energy-driven outflows for dwarfs (sigma<75 km/s). To obtain predicted HI masses, we employ a simple but effective local correction for particle self-shielding, and an observationally-constrained transition from neutral to molecular hydrogen. Our ezw simulation produces an HI mass function whose faint-end slope of -1.3 agrees well with observations from the ALFALFA survey; other models agree less well. Satellite galaxies have a bimodal distribution in HI fraction versus halo mass, with smaller satellites and/or those in larger halos more often being HI-deficient. At a given stellar mass, HI content correlates with star formation rate and inversely correlates with metallicity, as expected if driven by stochasticity in the accretion rate. To higher redshifts, massive HI galaxies disappear and the mass function steepens. The global cosmic HI density conspires to remain fairly constant from z~5-0, but the relative contribution from smaller galaxies increases with redshift.
Measurements of the neutral hydrogen gas content of a sample of 93 post-merger galaxies are presented, from a combination of matches to the ALFALFA.40 data release and new Arecibo observations. By imposing completeness thresholds identical to that of the ALFALFA survey, and by compiling a mass-, redshift- and environment-matched control sample from the public ALFALFA.40 data release, we calculate gas fraction offsets (Delta f_gas) for the post-mergers, relative to the control sample. We find that the post-mergers have HI gas fractions that are consistent with undisturbed galaxies. However, due to the relative gas richness of the ALFALFA.40 sample, from which we draw our control sample, our measurements of gas fraction enhancements are likely to be conservative lower limits. Combined with comparable gas fraction measurements by Fertig et al. in a sample of galaxy pairs, who also determine gas fraction offsets consistent with zero, we conclude that there is no evidence for significant neutral gas consumption throughout the merger sequence. From a suite of 75 binary merger simulations we confirm that star formation is expected to decrease the post-merger gas fraction by only 0.06 dex, even several Gyr after the merger. Moreover, in addition to the lack of evidence for gas consumption from gas fraction offsets, the observed HI detection fraction in the complete sample of post-mergers is twice as high as the controls, which suggests that the post-merger gas fractions may actually be enhanced. We demonstrate that a gas fraction enhancement in post-mergers, relative to a stellar mass-matched control sample, would indeed be the natural result of merging randomly drawn pairs from a parent population which exhibits a declining gas fraction with increasing stellar mass.
Dust plays an important role in shaping a galaxys spectral energy distribution (SED). It absorbs ultraviolet (UV) to near-infrared (NIR) radiation and re-emits this energy in the far-infrared (FIR). The FIR is essential to understand dust in galaxies. However, deep FIR observations require a space mission, none of which are still active today. We aim to infer the FIR emission across six Herschel bands, along with dust luminosity, mass, and effective temperature, based on the available UV to mid-infrared (MIR) observations. We also want to estimate the uncertainties of these predictions, compare our method to energy balance SED fitting, and determine possible limitations of the model. We propose a machine learning framework to predict the FIR fluxes from 14 UV-MIR broadband fluxes. We used a low redshift sample by combining DustPedia and H-ATLAS, and extracted Bayesian flux posteriors through SED fitting. We trained shallow neural networks to predict the far-infrared fluxes, uncertainties, and dust properties. We evaluated them on a test set using a root mean square error (RMSE) in log-space. Our results (RMSE = 0.19 dex) significantly outperform UV-MIR energy balance SED fitting (RMSE = 0.38 dex), and are inherently unbiased. We can identify when the predictions are off, for example when the input has large uncertainties on WISE 22, or when the input does not resemble the training set. The galaxies for which we have UV-FIR observations can be used as a blueprint for galaxies that lack FIR data. This results in a virtual FIR telescope, which can be applied to large optical-MIR galaxy samples. This helps bridge the gap until the next FIR mission.
Understanding the star-formation properties of galaxies as a function of cosmic epoch is a critical exercise in studies of galaxy evolution. Traditionally, stellar population synthesis models have been used to obtain best fit parameters that characterise star formation in galaxies. As multiband flux measurements become available for thousands of galaxies, an alternative approach to characterising star formation using machine learning becomes feasible. In this work, we present the use of deep learning techniques to predict three important star formation properties -- stellar mass, star formation rate and dust luminosity. We characterise the performance of our deep learning models through comparisons with outputs from a standard stellar population synthesis code.