Large-Sample Learning of Bayesian Networks is NP-Hard

In this paper, we provide new complexity results for algorithms that learn discrete-variable Bayesian networks from data. Our results apply whenever the learning algorithm uses a scoring criterion that favors the simplest structure for which the model is able to represent the generative distribution exactly. Our results therefore hold whenever the learning algorithm uses a consistent scoring criterion and is applied to a sufficiently large dataset. We show that identifying high-scoring structures is NP-hard, even when any combination of one or more of the following hold: the generative distribution is perfect with respect to some DAG containing hidden variables; we are given an independence oracle; we are given an inference oracle; we are given an information oracle; we restrict potential solutions to structures in which each node has at most k parents, for all k ≥ 3. Our proof relies on a new technical result that we establish in the appendices. In particular, we provide a method f...

David Maxwell Chickering, Christopher Meek, David Heckerman

Generative Distribution | Learning Algorithm | Scoring Criterion | UAI 2003 | UAI 2008 |

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2003
Where	UAI
Authors	David Maxwell Chickering, Christopher Meek, David Heckerman
