Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Counterexamples to The Blessings of Multiple Causes by Wang and Blei

63 0 0.0 ( 0 )

Download Cite

Added by Elizabeth Ogburn

Publication date 2020

fields Mathematical Statistics

and research's language is English

Authors Elizabeth L. Ogburn - Ilya Shpitser - Eric J. Tchetgen Tchetgen

Methodology Machine Learning

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This note has been updated (April, 2020) to respond to Towards Clarifying the Theory of the Deconfounder by Yixin Wang, David M. Blei (arXiv:2003.04948). This original note, posted in January, 2020, is meant to complement our previous comment on The Blessings of Multiple Causes by Wang and Blei (2019). We provide a more succinct and transparent explanation of the fact that the deconfounder does not control for multi-cause confounding. The argument given in Wang and Blei (2019) makes two mistakes: (1) attempting to infer independence conditional on one variable from independence conditional on a different, unrelated variable, and (2) attempting to infer joint independence from pairwise independence. We give two simple counterexamples to the deconfounder claim.

rate research

Discussion of The Blessings of Multiple Causes by Wang and Blei

68 - Kosuke Imai , Zhichao Jiang 2019

This commentary has two goals. We first critically review the deconfounder method and point out its advantages and limitations. We then briefly consider three possible ways to address some of the limitations of the deconfounder method.

Methodology Machine Learning

Comment on Blessings of Multiple Causes

74 - Elizabeth L. Ogburn , Ilya Shpitser , 2019

(This comment has been updated to respond to Wang and Bleis rejoinder [arXiv:1910.07320].) The premise of the deconfounder method proposed in Blessings of Multiple Causes by Wang and Blei [arXiv:1805.06826], namely that a variable that renders multiple causes conditionally independent also controls for unmeasured multi-cause confounding, is incorrect. This can be seen by noting that no fact about the observed data alone can be informative about ignorability, since ignorability is compatible with any observed data distribution. Methods to control for unmeasured confounding may be valid with additional assumptions in specific settings, but they cannot, in general, provide a checkable approach to causal inference, and they do not, in general, require weaker assumptions than the assumptions that are commonly used for causal inference. While this is outside the scope of this comment, we note that much recent work on applying ideas from latent variable modeling to causal inference problems suffers from similar issues.

Methodology Machine Learning

Identifiability of causal effects with multiple causes and a binary outcome

213 - Dehan Kong , Shu Yang , Linbo Wang 2019

Unobserved confounding presents a major threat to causal inference from observational studies. Recently, several authors suggest that this problem may be overcome in a shared confounding setting where multiple treatments are independent given a common latent confounder. It has been shown that under a linear Gaussian model for the treatments, the causal effect is not identifiable without parametric assumptions on the outcome model. In this paper, we show that the causal effect is indeed identifiable if we assume a general binary choice model for the outcome with a non-probit link. Our identification approach is based on the incongruence between Gaussianity of the treatments and latent confounder, and non-Gaussianity of a latent outcome variable. We further develop a two-step likelihood-based estimation procedure.

Methodology

Necessary and sufficient conditions for causal feature selection in time series with latent common causes

72 - Atalanti A. Mastakouri , Bernhard Scholkopf , Dominik Janzing 2020

We study the identification of direct and indirect causes on time series and provide conditions in the presence of latent variables, which we prove to be necessary and sufficient under some graph constraints. Our theoretical results and estimation algorithms require two conditional independence tests for each observed candidate time series to determine whether or not it is a cause of an observed target time series. We provide experimental results in simulations, as well as real data. Our results show that our method leads to very low false positives and relatively low false negative rates, outperforming the widely used Granger causality.

Methodology Machine Learning

Are Clusterings of Multiple Data Views Independent?

57 - Lucy L. Gao , Jacob Bien , Daniela Witten 2019

In the Pioneer 100 (P100) Wellness Project (Price and others, 2017), multiple types of data are collected on a single set of healthy participants at multiple timepoints in order to characterize and optimize wellness. One way to do this is to identify clusters, or subgroups, among the participants, and then to tailor personalized health recommendations to each subgroup. It is tempting to cluster the participants using all of the data types and timepoints, in order to fully exploit the available information. However, clustering the participants based on multiple data views implicitly assumes that a single underlying clustering of the participants is shared across all data views. If this assumption does not hold, then clustering the participants using multiple data views may lead to spurious results. In this paper, we seek to evaluate the assumption that there is some underlying relationship among the clusterings from the different data views, by asking the question: are the clusters within each data view dependent or independent? We develop a new test for answering this question, which we then apply to clinical, proteomic, and metabolomic data, across two distinct timepoints, from the P100 study. We find that while the subgroups of the participants defined with respect to any single data type seem to be dependent across time, the clustering among the participants based on one data type (e.g. proteomic data) appears not to be associated with the clustering based on another data type (e.g. clinical data).

Methodology Machine Learning

comments

Fetching comments

Helwan

Additional details More universities

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Counterexamples to The Blessings of Multiple Causes by Wang and Blei

Ask ChatGPT about the research

No Arabic abstract

Read More