No Arabic abstract
We present an innovative and widely applicable approach for the detection and classification of stellar clusters, developed for the PHANGS-HST Treasury Program, an $NUV$-to-$I$ band imaging campaign of 38 spiral galaxies. Our pipeline first generates a unified master source list for stars and candidate clusters, to enable a self-consistent inventory of all star formation products. To distinguish cluster candidates from stars, we introduce the Multiple Concentration Index (MCI) parameter, and measure inner and outer MCIs to probe morphology in more detail than with a single, standard concentration index (CI). We improve upon cluster candidate selection, jointly basing our criteria on expectations for MCI derived from synthetic cluster populations and published cluster catalogues, yielding model and empirical selection regions (respectively). Selection purity (confirmed clusters versus candidates, assessed via human-based classification) is high (up to 70%) for moderately luminous sources in the empirical selection region, and somewhat lower overall (outside the region or fainter). The number of candidates rises steeply with decreasing luminosity, but pipeline-integrated Machine Learning (ML) classification prevents this from being problematic. We quantify the performance of our PHANGS-HST methods in comparison to LEGUS for a sample of four galaxies in common to both surveys, finding overall agreement with 50-75% of human verified star clusters appearing in both catalogues, but also subtle differences attributable to specific choices adopted by each project. The PHANGS-HST ML-classified Class 1 or 2 catalogues reach $sim1$ magnitude fainter, $sim2times$ lower stellar mass, and are $2{-}5times$ larger in number, than attained in the human classified samples.
The sensitivity and angular resolution of photometric surveys executed by the Hubble Space Telescope (HST) enable studies of individual star clusters in galaxies out to a few tens of megaparsecs. The fitting of spectral energy distributions (SEDs) of star clusters is essential for measuring their physical properties and studying their evolution. We report on the use of the publicly available Code Investigating GALaxy Emission (CIGALE) SED fitting package to derive ages, stellar masses, and reddenings for star clusters identified in the Physics at High Angular resolution in Nearby GalaxieS-HST (PHANGS-HST) survey. Using samples of star clusters in the galaxy NGC 3351, we present results of benchmark analyses performed to validate the code and a comparison to SED fitting results from the Legacy ExtraGalactic Ultraviolet Survey (LEGUS). We consider procedures for the PHANGS-HST SED fitting pipeline, e.g., the choice of single stellar population models, the treatment of nebular emission and dust, and the use of fluxes versus magnitudes for the SED fitting. We report on the properties of clusters in NGC 3351 and find, on average, the clusters residing in the inner star-forming ring of NGC 3351 are young ($< 10$ Myr) and massive ($10^{5} M_{odot}$) while clusters in the stellar bulge are significantly older. Cluster mass function fits yield $beta$ values around -2, consistent with prior results with a tendency to be shallower at the youngest ages. Finally, we explore a Bayesian analysis with additional physically-motivated priors for the distribution of ages and masses and analyze the resulting cluster distributions.
We present the results of a proof-of-concept experiment which demonstrates that deep learning can successfully be used for production-scale classification of compact star clusters detected in HST UV-optical imaging of nearby spiral galaxies (D < 20 Mpc) in the PHANGS-HST survey. Given the relatively small nature of existing, human-labelled star cluster samples, we transfer the knowledge of state-of-the-art neural network models for real-object recognition to classify star clusters candidates into four morphological classes. We perform a series of experiments to determine the dependence of classification performance on: neural network architecture (ResNet18 and VGG19-BN); training data sets curated by either a single expert or three astronomers; and the size of the images used for training. We find that the overall classification accuracies are not significantly affected by these choices. The networks are used to classify star cluster candidates in the PHANGS-HST galaxy NGC 1559, which was not included in the training samples. The resulting prediction accuracies are 70%, 40%, 40-50%, 50-70% for class 1, 2, 3 star clusters, and class 4 non-clusters respectively. This performance is competitive with consistency achieved in previously published human and automated quantitative classification of star cluster candidate samples (70-80%, 40-50%, 40-50%, and 60-70%). The methods introduced herein lay the foundations to automate classification for star clusters at scale, and exhibit the need to prepare a standardized dataset of human-labelled star cluster classifications, agreed upon by a full range of experts in the field, to further improve the performance of the networks introduced in this study.
We present PHANGS-ALMA, the first survey to map CO J=2-1 line emission at ~1 ~ 100pc spatial resolution from a representative sample of 90 nearby (d<~20 Mpc) galaxies that lie on or near the z=0 main sequence of star-forming galaxies. CO line emission traces the bulk distribution of molecular gas, which is the cold, star-forming phase of the interstellar medium. At the resolution achieved by PHANGS-ALMA, each beam reaches the size of a typical individual giant molecular cloud (GMC), so that these data can be used to measure the demographics, life-cycle, and physical state of molecular clouds across the population of galaxies where the majority of stars form at z=0. This paper describes the scientific motivation and background for the survey, sample selection, global properties of the targets, ALMA observations, and characteristics of the delivered ALMA data and derived data products. As the ALMA sample serves as the parent sample for parallel surveys with VLT/MUSE, HST, AstroSat, VLA, and other facilities, we include a detailed discussion of the sample selection. We detail the estimation of galaxy mass, size, star formation rate, CO luminosity, and other properties, compare estimates using different systems and provide best-estimate integrated measurements for each target. We also report the design and execution of the ALMA observations, which combine a Cycle~5 Large Program, a series of smaller programs, and archival observations. Finally, we present the first 1 resolution atlas of CO emission from nearby galaxies and describe the properties and contents of the first PHANGS-ALMA public data release.
When completed, the PHANGS-HST project will provide a census of roughly 50,000 compact star clusters and associations, as well as human morphological classifications for roughly 20,000 of those objects. These large numbers motivated the development of a more objective and repeatable method to help perform source classifications. In this paper we consider the results for five PHANGS-HST galaxies (NGC 628, NGC 1433, NGC 1566, NGC 3351, NGC 3627) using classifications from two convolutional neural network architectures (RESNET and VGG) trained using deep transfer learning techniques. The results are compared to classifications performed by humans. The primary result is that the neural network classifications are comparable in quality to the human classifications with typical agreement around 70 to 80$%$ for Class 1 clusters (symmetric, centrally concentrated) and 40 to 70$%$ for Class 2 clusters (asymmetric, centrally concentrated). If Class 1 and 2 are considered together the agreement is 82 $pm$ 3$%$. Dependencies on magnitudes, crowding, and background surface brightness are examined. A detailed description of the criteria and methodology used for the human classifications is included along with an examination of systematic differences between PHANGS-HST and LEGUS. The distribution of data points in a colour-colour diagram is used as a figure of merit to further test the relative performances of the different methods. The effects on science results (e.g., determinations of mass and age functions) of using different cluster classification methods are examined and found to be minimal.
The PHANGS program is building the first dataset to enable the multi-phase, multi-scale study of star formation across the nearby spiral galaxy population. This effort is enabled by large Treasury programs with ALMA, VLT/MUSE, and HST, with which we have obtained CO(2-1) imaging, optical spectroscopic mapping, and high resolution UV-optical imaging, respectively. Here, we present PHANGS-HST, which is obtaining five band NUV-U-B-V-I imaging of the disks of 38 spiral galaxies at distances of 4-23 Mpc, and parallel V and I band imaging of their halos, to provide a census of tens of thousands of compact star clusters and associations. The combination of HST, ALMA, and VLT/MUSE observations will yield an unprecedented joint catalog of the observed and physical properties of $sim$100,000 star clusters, associations, HII regions, and molecular clouds. With these basic units of star formation, PHANGS will systematically chart the evolutionary cycling between gas and stars, across a diversity of galactic environments found in nearby galaxies. We discuss the design of the PHANGS-HST survey, and provide an overview of the HST data processing pipeline and first results, highlighting new methods for selecting star cluster candidates, morphological classification of candidates with convolutional neural networks, and identification of stellar associations over a range of physical scales with a watershed algorithm. We describe the cross-observatory imaging, catalogs, and software products to be released, which will seed a broad range of community science, in particular, upcoming JWST study of dust embedded star formation and ISM physics.