Scalable Probabilistic Databases with Factor Graphs and MCMC

14 years 3 months ago

Download www.cs.umass.edu

Incorporating probabilities into the semantics of incomplete databases has posed many challenges, forcing systems to sacrifice modeling power, scalability, or treatment of relational algebra operators. We propose an alternative approach where the underlying relational database always represents a single world, and an external factor graph encodes a distribution over possible worlds; Markov chain Monte Carlo (MCMC) inference is then used to recover this uncertainty to a desired level of fidelity. Our approach allows the efficient evaluation of arbitrary queries over probabilistic databases with arbitrary dependencies expressed by graphical models with structure that changes during inference. MCMC sampling provides efficiency by hypothesizing modifications to possible worlds rather than generating entire worlds from scratch. Queries are then run over the portions of the world that change, avoiding the onerous cost of running full queries over each sampled world. A significant innovation...

Michael L. Wick, Andrew McCallum, Gerome Miklau

Real-time Traffic

CORR 2010 | Database | Education | Possible Worlds | View Maintenance Techniques |

claim paper

Post Info
More Details (n/a)

Added	09 Dec 2010
Updated	09 Dec 2010
Type	Journal
Year	2010
Where	CORR
Authors	Michael L. Wick, Andrew McCallum, Gerome Miklau

Comments (0)

Sciweavers

Scalable Probabilistic Databases with Factor Graphs and MCMC

CORR 2010 | Database | Education | Possible Worlds | View Maintenance Techniques |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers