ﻻ يوجد ملخص باللغة العربية
Double machine learning (DML) is becoming an increasingly popular tool for automated model selection in high-dimensional settings. At its core, DML assumes unconfoundedness, or exogeneity of all considered controls, which might likely be violated if the covariate space is large. In this paper, we lay out a theory of bad controls building on the graph-theoretic approach to causality. We then demonstrate, based on simulation studies and an application to real-world data, that DML is very sensitive to the inclusion of bad controls and exhibits considerable bias even with only a few endogenous variables present in the conditioning set. The extent of this bias depends on the precise nature of the assumed causal model, which calls into question the ability of selecting appropriate controls for regressions in a purely data-driven way.
Resolution studies of test problems set baselines and help define minimum resolution requirements, however, resolution studies must also be performed on scientific simulations to determine the effect of resolution on the specific scientific results.
We present three different methods to estimate error bars on the predictions made using a neural network. All of them represent lower bounds for the extrapolation errors. For example, we did not include an analysis on robustness against small perturb
Analysis of cluster and field star uvby data demonstrates the existence of a previously undetected discrepancy in a widely used photometric metallicity calibration for G dwarfs. The discrepancy is systematic and strongly color-dependent, reducing the
We propose a practical and robust method for making inferences on average treatment effects estimated by synthetic controls. We develop a $K$-fold cross-fitting procedure for bias-correction. To avoid the difficult estimation of the long-run variance
The attenuation of light in star forming galaxies is correlated with a multitude of physical parameters including star formation rate, metallicity and total dust content. This variation in attenuation is even more prevalent on the kiloparsec scale, w