Hybrid in-database inference for declarative information extraction

In the database community, work on information extraction (IE) has centered on two themes: how to effectively manage IE tasks, and how to manage the uncertainties that arise in the IE process in a scalable manner. Recent work has proposed a probabilistic database (PDB) based declarative IE system that supports a leading statistical IE model, and an associated inference algorithm to answer top-k-style queries over the probabilistic IE outcome. Still, the broader problem of effectively supporting general probabilistic inference inside a PDB-based declarative IE system remains open. In this paper, we explore the in-database implementations of a wide variety of inference algorithms suited to IE, including two Markov chain Monte Carlo algorithms as well as the Viterbi and sum-product algorithms. We describe rules for choosing appropriate inference algorithms based on the model, the query, and the text, considering the trade-off between accuracy and runtime. Based on these rules, we describe a hybrid ...
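
The abstract names Viterbi and sum-product among the inference algorithms it covers. As a rough illustration of how the two differ, here is a minimal sketch in plain Python, outside any database. It assumes a linear-chain model over two hypothetical labels, and the scores in `EMIT` and `TRANS` are invented for the example. It is not the paper's in-database implementation, and it leaves out the Markov chain Monte Carlo algorithms.

```python
# Hypothetical toy model: two labels and made-up, unnormalized scores.
LABELS = ["NAME", "OTHER"]
EMIT = [  # EMIT[t][label]: score of a label at token position t
    {"NAME": 0.9, "OTHER": 0.1},
    {"NAME": 0.4, "OTHER": 0.6},
    {"NAME": 0.2, "OTHER": 0.8},
]
TRANS = {  # TRANS[prev][cur]: score of moving from one label to the next
    "NAME": {"NAME": 0.7, "OTHER": 0.3},
    "OTHER": {"NAME": 0.2, "OTHER": 0.8},
}


def viterbi(emit, trans):
    """Return the single highest-scoring label sequence (max-product)."""
    best = [dict(emit[0])]  # best[t][label]: top score of a path ending in label
    back = []               # back[t-1][label]: predecessor on that best path
    for t in range(1, len(emit)):
        scores, ptr = {}, {}
        for cur in LABELS:
            prev = max(LABELS, key=lambda p: best[-1][p] * trans[p][cur])
            scores[cur] = best[-1][prev] * trans[prev][cur] * emit[t][cur]
            ptr[cur] = prev
        best.append(scores)
        back.append(ptr)
    path = [max(LABELS, key=lambda l: best[-1][l])]
    for ptr in reversed(back):  # follow back-pointers from the last token
        path.append(ptr[path[-1]])
    return path[::-1]


def marginals(emit, trans):
    """Return per-token label marginals via forward-backward (sum-product)."""
    n = len(emit)
    fwd = [dict(emit[0])]
    for t in range(1, n):
        fwd.append({c: emit[t][c] * sum(fwd[-1][p] * trans[p][c] for p in LABELS)
                    for c in LABELS})
    bwd = [{l: 1.0 for l in LABELS}]
    for t in range(n - 1, 0, -1):  # prepend the message for position t - 1
        bwd.insert(0, {p: sum(trans[p][c] * emit[t][c] * bwd[0][c] for c in LABELS)
                       for p in LABELS})
    z = sum(fwd[-1].values())  # normalizer: total score of all sequences
    return [{l: fwd[t][l] * bwd[t][l] / z for l in LABELS} for t in range(n)]


if __name__ == "__main__":
    print(viterbi(EMIT, TRANS))
    print(marginals(EMIT, TRANS))
```

Viterbi answers a "most likely extraction" question with one label sequence, while sum-product returns a distribution over labels for each token. That difference is one reason the choice of algorithm can depend on the query being asked.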

Daisy Zhe Wang, Michael J. Franklin, Minos N. Garofalakis, Joseph M. Hellerstein, Michael L. Wick

Database | Markov Chain Monte Carlo | Monte Carlo Algorithms | SIGMOD 2011

» Declarative analysis of noisy information networks

» Hybrid body representation for integrated pose recognition localization and segmentation

» Unique Renaming of Java Using Source Transformation

» Combining Declarative and Procedural Knowledge to Automate and Represent Ontology Mapping

» Tuffy Scaling up Statistical Inference in Markov Logic Networks using an RDBMS

» Hybrid Textons Modeling Surfaces with Reflectance and Geometry

» Integrated Surface Curve and Junction Inference from Sparse 3D Data Sets

» Discovery of Frequent Distributed Event Patterns in Sensor Networks

Post Info

Added	17 Sep 2011
Updated	17 Sep 2011
Type	Journal
Year	2011
Where	SIGMOD
Authors	Daisy Zhe Wang, Michael J. Franklin, Minos N. Garofalakis, Joseph M. Hellerstein, Michael L. Wick
