No Arabic abstract
The Kepler DR25 planet candidate catalog was produced using an automated method of planet candidate identification based on various tests. These tests were tuned to obtain a reasonable but arbitrary balance between catalog completeness and reliability. We produce new catalogs with differing balances of completeness and reliability by varying these tests, and study the impact of these alternative catalogs on occurrence rates. We find that if there is no correction for reliability, different catalogs give statistically inconsistent occurrence rates, while if we correct for both completeness and reliability, we get statistically consistent occurrence rates. This is a strong indication that correction for completeness and reliability is critical for the accurate computation of occurrence rates. Additionally, we find that this result is the same whether using Bayesian Poisson likelihood MCMC or Approximate Bayesian Computation methods. We also examine the use of a Robovetter disposition score cut as an alternative to reliability correction, and find that while a score cut does increase the reliability of the catalog, it is not as accurate as performing a full reliability correction. We get the same result when performing a reliability correction with and without a score cut. Therefore removing low-score planets removes data without providing any advantage, and should be avoided when possible. We make our alternative catalogs publicly available, and propose that these should be used as a test of occurrence rate methods, with the requirement that a method should provide statistically consistent occurrence rates for all these catalogs.
Exoplanet catalogs produced by surveys suffer from a lack of completeness (not every planet is detected) and less than perfect reliability (not every planet in the catalog is a true planet), particularly near the surveys detection limit. Exoplanet occurrence rate studies based on such a catalog must be corrected for completeness and reliability. The final Kepler data release, DR25, features a uniformly vetted planet candidate catalog and data products that facilitate corrections. We present a new probabilistic approach to the characterization of Kepler completeness and reliability, making full use of the Kepler DR25 products. We illustrate the impact of completeness and reliability corrections with a Poisson-likelihood occurrence rate method, using a recent stellar properties catalog that incorporates Gaia stellar radii and essentially uniform treatment of the stellar population. Correcting for reliability has a significant impact: the exoplanet occurrence rate for orbital period and radius within 20% of Earths around GK dwarf stars, corrected for reliability, is 0.015+0.011-0.007, whereas not correcting results in 0.034+0.018-0.012 - correcting for reliability reduces this occurrence rate by more than a factor of two. We further show that using Gaia-based vs. DR25 stellar properties impacts the same occurrence rate by a factor of two. We critically examine the the DR25 catalog and the assumptions behind our occurrence rate method. We propose several ways in which confidence in both the Kepler catalog and occurrence rate calculations can be improved. This work provides an example of how the community can use the DR25 completeness and reliability products.
We infer the number of planets-per-star as a function of orbital period and planet size using $Kepler$ archival data products with updated stellar properties from the $Gaia$ Data Release 2. Using hierarchical Bayesian modeling and Hamiltonian Monte Carlo, we incorporate planet radius uncertainties into an inhomogeneous Poisson point process model. We demonstrate that this model captures the general features of the outcome of the planet formation and evolution around GK stars, and provides an infrastructure to use the $Kepler$ results to constrain analytic planet distribution models. We report an increased mean and variance in the marginal posterior distributions for the number of planets per $GK$ star when including planet radius measurement uncertainties. We estimate the number of planets-per-$GK$ star between 0.75 and 2.5 $R_{oplus}$ and 50 to 300 day orbital periods to have a $68%$ credible interval of $0.49$ to $0.77$ and a posterior mean of $0.63$. This posterior has a smaller mean and a larger variance than the occurrence rate calculated in this work and in Burke et al. (2015) for the same parameter space using the $Q1-Q16$ (previous $Kepler$ planet candidate and stellar catalog). We attribute the smaller mean to many of the instrumental false positives at longer orbital periods being removed from the $DR25$ catalog. We find that the accuracy and precision of our hierarchical Bayesian model posterior distributions are less sensitive to the total number of planets in the sample, and more so on the characteristics of the catalog completeness and reliability and the span of the planet parameter space.
We present a new framework to characterize the occurrence rates of planet candidates identified by Kepler based on hierarchical Bayesian modeling, Approximate Bayesian Computing (ABC), and sequential importance sampling. For this study we adopt a simple 2-D grid in planet radius and orbital period as our model and apply our algorithm to estimate occurrence rates for Q1-Q16 planet candidates orbiting around solar-type stars. We arrive at significantly increased planet occurrence rates for small planet candidates ($R_p<1.25 R_{oplus}$) at larger orbital periods ($P>80$d) compared to the rates estimated by the more common inverse detection efficiency method. Our improved methodology estimates that the occurrence rate density of small planet candidates in the habitable zone of solar-type stars is $1.6^{+1.2}_{-0.5}$ per factor of 2 in planet radius and orbital period. Additionally, we observe a local minimum in the occurrence rate for strong planet candidates marginalized over orbital period between 1.5 and 2$R_{oplus}$ that is consistent with previous studies. For future improvements, the forward modeling approach of ABC is ideally suited to incorporating multiple populations, such as planets, astrophysical false positives and pipeline false alarms, to provide accurate planet occurrence rates and uncertainties. Furthermore, ABC provides a practical statistical framework for answering complex questions (e.g., frequency of different planetary architectures) and providing sound uncertainties, even in the face of complex selection effects, observational biases, and follow-up strategies. In summary, ABC offers a powerful tool for accurately characterizing a wide variety of astrophysical populations.
Correlations between the occurrence rate of exoplanets and their host star properties provide important clues about the planet formation processes. We studied the dependence of the observed properties of exoplanets (radius, mass, and orbital period) as a function of their host star metallicity. We analyzed the planetary radii and orbital periods of over 2800 $Kepler$ candidates from the latest $Kepler$ data release DR25 (Q1-Q17) with revised planetary radii based on $Gaia$~DR2 as a function of host star metallicity (from the Q1-Q17 (DR25) stellar and planet catalog). With a much larger sample and improved radius measurements, we are able to reconfirm previous results in the literature. We show that the average metallicity of the host star increases as the radius of the planet increases. We demonstrate this by first calculating the average host star metallicity for different radius bins and then supplementing these results by calculating the occurrence rate as a function of planetary radius and host star metallicity. We find a similar trend between host star metallicity and planet mass: the average host star metallicity increases with increasing planet mass. This trend, however, reverses for masses $> 4.0, M_mathrm{J}$: host star metallicity drops with increasing planetary mass. We further examined the correlation between the host star metallicity and the orbital period of the planet. We find that for planets with orbital periods less than 10 days, the average metallicity of the host star is higher than that for planets with periods greater than 10 days.
This paper aims to derive a map of relative planet occurrence rates that can provide constraints on the overall distribution of terrestrial planets around FGK stars. Based on the planet candidates in the Kepler DR25 data release, I first generate a continuous density map of planet distribution using a Gaussian kernel model and correct the geometric factor that the discovery space of a transit event decreases along with the increase of planetary orbital distance. Then I fit two exponential decay functions of detection efficiency along with the increase of planetary orbital distance and the decrease of planetary radius. Finally, the density map of planet distribution is compensated for the fitted exponential decay functions of detection efficiency to obtain a relative occurrence rate distribution of terrestrial planets. The result shows two regions with planet abundance: one corresponds to planets with radii between 0.5 and 1.5 R_Earth within 0.2 AU, the other corresponds to planets with radii between 1.5 and 3 R_Earth beyond 0.5 AU. It also confirms the features that may be caused by atmospheric evaporation: there is a vacancy of planets of sizes between 2.0 and 4.0 R_Earth inside of ~ 0.5 AU, and a valley with relatively low occurrence rates between 0.2 and 0.5 AU for planets with radii between 1.5 and 3.0 R_Earth.