Sciweavers

ICANN
2010
Springer

Discovery of Exogenous Variables in Data with More Variables Than Observations

14 years 1 months ago
Discovery of Exogenous Variables in Data with More Variables Than Observations
Many statistical methods have been proposed to estimate causal models in classical situations with fewer variables than observations. However, modern datasets including gene expression data increase the needs of high-dimensional causal modeling in challenging situations with orders of magnitude more variables than observations. In this paper, we propose a method to find exogenous variables in a linear nonGaussian causal model, which requires much smaller sample sizes than conventional methods and works even when orders of magnitude more variables than observations. Exogenous variables work as triggers that activate causal chains in the model, and their identification leads to more efficient experimental designs and better understanding of the causal mechanism. We present experiments with artificial data and real-world gene expression data to evaluate the method. Key words: Bayesian networks, independent component analysis, nonGaussianity, data with more variables than observations
Yasuhiro Sogawa, Shohei Shimizu, Aapo Hyvärin
Added 09 Nov 2010
Updated 09 Nov 2010
Type Conference
Year 2010
Where ICANN
Authors Yasuhiro Sogawa, Shohei Shimizu, Aapo Hyvärinen, Takashi Washio, Teppei Shimamura, Seiya Imoto
Comments (0)