Over the past decade, the study of extrasolar planets has evolved rapidly from plain detection and identification to comprehensive categorization and characterization of exoplanet systems and their atmospheres. Atmospheric retrieval, the inverse modeling technique used to determine an exoplanetary atmosphere's temperature structure and composition from an observed spectrum, is both time-consuming and compute-intensive, requiring complex algorithms that compare thousands to millions of atmospheric models to the observational data to find the most probable values and associated uncertainties for each model parameter. For rocky, terrestrial planets, the retrieved atmospheric composition can give insight into the surface fluxes of gaseous species necessary to maintain the stability of that atmosphere, which may in turn provide insight into the geological and/or biological processes active on the planet. These atmospheres contain many molecules, some of them biosignatures, spectral fingerprints indicative of biological activity, which will become observable with the next generation of telescopes. Runtimes of traditional retrieval models scale with the number of model parameters, so as more molecular species are considered, runtimes can become prohibitively long. Recent advances in machine learning (ML) and computer vision offer new ways to reduce the time to perform a retrieval by orders of magnitude, given a sufficient data set to train with. Here we present an ML-based retrieval framework called Intelligent exoplaNet Atmospheric RetrievAl (INARA) that consists of a Bayesian deep learning model for retrieval and a data set of 3,000,000 synthetic rocky exoplanetary spectra generated using the NASA Planetary Spectrum Generator. Our work represents the first ML retrieval model for rocky, terrestrial exoplanets and the first synthetic data set of terrestrial spectra generated at this scale.
Machine learning is now used in many areas of astrophysics, from detecting exoplanets in Kepler transit signals to removing telescope systematics. Recent work demonstrated the potential of using machine learning algorithms for atmospheric retrieval by implementing a random forest to perform retrievals in seconds that are consistent with the traditional, computationally expensive nested-sampling retrieval method. We expand upon their approach by presenting a new machine learning model, \texttt{plan-net}, based on an ensemble of Bayesian neural networks that yields more accurate inferences than the random forest for the same data set of synthetic transmission spectra. We demonstrate that an ensemble provides greater accuracy and more robust uncertainties than a single model. In addition to being the first to use Bayesian neural networks for atmospheric retrieval, we also introduce a new loss function for Bayesian neural networks that learns correlations between the model outputs. Importantly, we show that designing machine learning models to explicitly incorporate domain-specific knowledge both improves performance and provides additional insight by inferring the covariance of the retrieved atmospheric parameters. We apply \texttt{plan-net} to the Hubble Space Telescope Wide Field Camera 3 transmission spectrum for WASP-12b and retrieve an isothermal temperature and water abundance consistent with the literature. We highlight that our method is flexible and can be expanded to higher-resolution spectra and a larger number of atmospheric parameters.
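As an illustration of how a loss function can learn correlations between model outputs, the sketch below evaluates a multivariate Gaussian negative log-likelihood whose precision matrix is built from a Cholesky factor. This is one common construction and is only an assumption here; the abstract does not specify the form of the loss used in plan-net. The function names, the parameterization, and the example values are all hypothetical.

```python
import numpy as np

def build_cholesky(raw, dim):
    """Pack dim*(dim+1)/2 raw outputs into a lower-triangular matrix L with a
    positive diagonal, so that L @ L.T is a valid precision matrix."""
    L = np.zeros((dim, dim))
    L[np.tril_indices(dim)] = raw
    diag = np.diag_indices(dim)
    L[diag] = np.exp(L[diag])  # exponentiate to keep the diagonal positive
    return L

def correlated_gaussian_nll(y, mu, raw, dim):
    """Negative log-likelihood of y under N(mu, (L L^T)^-1).
    Off-diagonal entries of L let the outputs be correlated."""
    L = build_cholesky(raw, dim)
    z = L.T @ (y - mu)                        # z @ z = r^T (L L^T) r
    log_det_precision = 2.0 * np.sum(np.log(np.diag(L)))
    return 0.5 * (z @ z) - 0.5 * log_det_precision + 0.5 * dim * np.log(2.0 * np.pi)

# Hypothetical example: two normalized retrieved parameters
# (for instance a temperature and a log water abundance).
y = np.array([0.2, -0.5])           # true values
mu = np.array([0.0, 0.0])           # predicted means
raw = np.array([0.1, -0.3, 0.2])    # predicted Cholesky entries (3 for dim=2)

loss = correlated_gaussian_nll(y, mu, raw, dim=2)
L = build_cholesky(raw, 2)
covariance = np.linalg.inv(L @ L.T)  # inferred covariance of the two outputs
```

In a real network, `mu` and `raw` would both be outputs of the final layer, and the recovered `covariance` is what would express the degeneracy between retrieved parameters.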
Deep learning algorithms are growing in popularity in the field of exoplanetary science due to their ability to model highly non-linear relations and solve interesting problems in a data-driven manner. Several works have attempted to perform fast retrievals of atmospheric parameters with the use of machine learning algorithms like deep neural networks (DNNs). Yet, despite their high predictive power, DNNs are also infamous for being black boxes. It is their apparent lack of explainability that makes the astrophysics community reluctant to adopt them. What are their predictions based on? How confident should we be in them? When are they wrong and how wrong can they be? In this work, we present a number of general evaluation methodologies that can be applied to any trained model and answer questions like these. In particular, we train three different popular DNN architectures to retrieve atmospheric parameters from exoplanet spectra and show that all three achieve good predictive performance. We then present an extensive analysis of the predictions of DNNs, which can inform us, among other things, of the credibility limits for atmospheric parameters for a given instrument and model. Finally, we perform a perturbation-based sensitivity analysis to identify the features of the spectrum to which the outcome of the retrieval is most sensitive. We conclude that, for different molecules, the wavelength ranges to which the DNNs' predictions are most sensitive indeed coincide with their characteristic absorption regions. The methodologies presented in this work help to improve the evaluation of DNNs and to grant interpretability to their predictions.
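A perturbation-based sensitivity analysis of the kind described above can be sketched as follows: add small random noise to one wavelength window at a time and record how far the prediction moves. This is a minimal sketch under stated assumptions, not the procedure used in the paper; the window size, noise scale, and the toy `toy_predict` function (which stands in for a trained DNN) are hypothetical.

```python
import numpy as np

def perturbation_sensitivity(predict, spectrum, window=5, scale=0.01,
                             n_draws=50, seed=0):
    """Mean absolute change in the prediction when Gaussian noise is added
    to one wavelength window at a time. Returns one value per spectral bin."""
    rng = np.random.default_rng(seed)
    baseline = np.asarray(predict(spectrum))
    n_bins = spectrum.shape[0]
    sensitivity = np.zeros(n_bins)
    for start in range(0, n_bins, window):
        stop = min(start + window, n_bins)
        shifts = []
        for _ in range(n_draws):
            perturbed = spectrum.copy()
            perturbed[start:stop] += rng.normal(0.0, scale, size=stop - start)
            shifts.append(np.abs(np.asarray(predict(perturbed)) - baseline).mean())
        sensitivity[start:stop] = np.mean(shifts)
    return sensitivity

def toy_predict(spectrum):
    """Hypothetical stand-in for a trained model: the 'abundance' depends
    only on bins 20-29, mimicking a single absorption band."""
    return np.array([spectrum[20:30].sum()])

spectrum = np.ones(50)
sensitivity = perturbation_sensitivity(toy_predict, spectrum)
```

For the toy model, the sensitivity is non-zero only inside the band that the prediction actually depends on, which is the behaviour one would look for when comparing a DNN's sensitive wavelength ranges against known molecular absorption regions.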
One of the principal bottlenecks to atmosphere characterisation in the era of all-sky surveys is the availability of fast, autonomous and robust atmospheric retrieval methods. We present a new approach using unsupervised machine learning to generate informed priors for retrieval of exoplanetary atmosphere parameters from transmission spectra. We use principal component analysis (PCA) to efficiently compress the information content of a library of transmission spectra forward models generated using the PLATON package. We then apply a $k$-means clustering algorithm in PCA space to segregate the library into discrete classes. We show that our classifier is almost always able to instantaneously place a previously unseen spectrum into the correct class, for low-to-moderate spectral resolutions, $R$, in the range $R~=~30-300$ and noise levels up to $10$~per~cent of the peak-to-trough spectrum amplitude. The distribution of physical parameters for all members of the class therefore provides an informed prior for standard retrieval methods such as nested sampling. We benchmark our informed-prior approach against a standard uniform-prior nested sampler, finding that our approach is up to a factor of two faster, with negligible reduction in accuracy. We demonstrate the application of this method to existing and near-future observatories, and show that it is suitable for real-world application. Our general approach is not specific to transmission spectroscopy and should be more widely applicable to cases that involve repetitive fitting of trusted high-dimensional models to large data catalogues, including beyond exoplanetary science.
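The pipeline described above (compress a spectral library with PCA, cluster it with $k$-means, then use the parameter range of the matched class as a prior) can be sketched in plain NumPy. This is a toy illustration under stated assumptions: the library is a hypothetical single-band model whose depth scales with temperature rather than PLATON output, and the helper names, the farthest-point initialization, and the min/max prior are choices made here, not details taken from the paper.

```python
import numpy as np

def fit_pca(library, n_components):
    """Return the library mean and the leading principal axes (via SVD)."""
    mean = library.mean(axis=0)
    _, _, vt = np.linalg.svd(library - mean, full_matrices=False)
    return mean, vt[:n_components]

def project(spectra, mean, components):
    """Coordinates of one or more spectra in PCA space."""
    return (spectra - mean) @ components.T

def kmeans(points, k, n_iter=50):
    """Lloyd's algorithm with a deterministic farthest-point initialization."""
    centroids = [points[0]]
    for _ in range(1, k):
        dist = np.min([np.linalg.norm(points - c, axis=1) for c in centroids], axis=0)
        centroids.append(points[np.argmax(dist)])
    centroids = np.array(centroids)
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None] - centroids[None], axis=2)
        labels = np.argmin(dists, axis=1)
        updated = np.array([points[labels == j].mean(axis=0)
                            if np.any(labels == j) else centroids[j]
                            for j in range(k)])
        if np.allclose(updated, centroids):
            break
        centroids = updated
    return centroids, labels

# Hypothetical toy library: 30 cool and 30 hot models, one absorption band.
rng = np.random.default_rng(0)
wavelength = np.linspace(1.0, 2.0, 40)
band = np.exp(-((wavelength - 1.4) / 0.1) ** 2)
temperature = np.concatenate([rng.uniform(500, 800, 30),
                              rng.uniform(1500, 2000, 30)])
library = (temperature[:, None] / 1000.0) * band
library += rng.normal(0.0, 0.01, size=library.shape)

mean, components = fit_pca(library, n_components=2)
centroids, labels = kmeans(project(library, mean, components), k=2)

# Classify an unseen spectrum and read off an informed prior on temperature.
new_spectrum = 1.7 * band
new_point = project(new_spectrum, mean, components)
new_label = int(np.argmin(np.linalg.norm(centroids - new_point, axis=1)))
members = temperature[labels == new_label]
prior_low, prior_high = members.min(), members.max()
```

The resulting `(prior_low, prior_high)` interval would then replace a broad uniform prior in a sampler such as nested sampling; a real application would likely use the full distribution of class members rather than only its bounds.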
We present an improved, hybrid CPU-GPU atmospheric retrieval code, Helios-r2, which is applicable to medium-resolution emission spectra of brown dwarfs, in preparation for precision atmospheric spectroscopy in the era of the James Webb Space Telescope. The model is available as open-source code on the Exoclimes Simulation Platform. We subject Helios-r2 to a battery of tests of varying difficulty. The simplest test involves a mock retrieval on a forward model generated using the same radiative transfer technique, the same implementation of opacities, and the same chemistry model. The least trivial test involves a mock retrieval on synthetic spectra from the Sonora model grid, which uses a different radiative transfer technique, a different implementation of opacities, and a different chemistry model. A calibration factor, which is included to capture uncertainties in the brown dwarf radius, distance to the brown dwarf and flux calibration of the spectrum, may compensate, sometimes erroneously, for discrepancies in modeling choices and implementation. We analyze spectra of the benchmark brown dwarf GJ 570 D and the binary brown dwarf companions in the Epsilon Indi system. The retrieved surface gravities are consistent with previous studies and/or values inferred from dynamical masses (for Epsilon Indi Ba and Bb only). There remains no clear criterion on how to reject unphysical values of the retrieved brown dwarf radii. The inferred radii and corresponding masses should be taken with great caution. The retrieved carbon-to-oxygen ratios and metallicity depend on whether chemical equilibrium is assumed.
This brief review focuses on methods and applications of modeling exoplanetary atmospheres. We discuss various kinds of state-of-the-art self-consistent and retrieval models in 1D and multi-D, with a focus on open questions and short- and long-term goals in the field. Expertise previously developed in modeling cool stellar atmospheres and in modeling solar system planetary atmospheres has proven valuable to the field, and will continue to do so. We describe upcoming opportunities for making progress in our understanding of atmospheres, and close with what we see as the field's challenges.