Even in a massive corpus such as the Web, a substantial fraction of extractions appear infrequently. This paper shows how to assess the correctness of sparse extractions using unsupervised language models. The REALM system, which combines HMM-based and n-gram-based language models, ranks candidate extractions by the likelihood that they are correct. Our experiments show that REALM reduces extraction error by 39%, on average, when compared with previous work. Because REALM pre-computes its language models from the corpus and requires no hand-tagged seeds, it is far more scalable than approaches that learn a model for each individual relation from hand-tagged data. Thus, REALM is ideally suited for open information extraction, where the relations of interest are not specified in advance and their number is potentially vast.
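To make the core idea concrete, the following is a minimal illustrative sketch, not the REALM implementation itself: it ranks candidate extractions by their likelihood under a toy add-one-smoothed bigram language model trained on an unlabeled corpus. The function names, the toy corpus, and the hypothetical "headquartered in" relation are all assumptions introduced for illustration; REALM's actual components (HMM-based and relational n-gram models) are substantially more sophisticated.

```python
# Illustrative sketch only -- not REALM's actual algorithm.
# Rank candidate extractions by corpus likelihood under a toy bigram model.
from collections import Counter
import math

def train_bigram_model(corpus_sentences):
    """Count unigrams and bigrams over a tokenized corpus."""
    unigrams, bigrams = Counter(), Counter()
    for sent in corpus_sentences:
        tokens = ["<s>"] + sent.lower().split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams

def log_likelihood(phrase, unigrams, bigrams, vocab_size):
    """Add-one-smoothed bigram log-likelihood of a candidate phrase."""
    tokens = ["<s>"] + phrase.lower().split() + ["</s>"]
    score = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        score += math.log((bigrams[(prev, cur)] + 1)
                          / (unigrams[prev] + vocab_size))
    return score

# Usage: rank candidates for a hypothetical "headquartered in" relation.
corpus = ["google is headquartered in mountain view",
          "boeing is headquartered in chicago",
          "the sky is blue"]
uni, bi = train_bigram_model(corpus)
candidates = ["boeing is headquartered in chicago",
              "chicago is headquartered in boeing"]
ranked = sorted(candidates,
                key=lambda c: log_likelihood(c, uni, bi, len(uni)),
                reverse=True)
print(ranked)  # the plausible extraction scores higher
```

The key property this sketch shares with the paper's setting is that the model is trained once, without supervision, on the raw corpus, and can then score candidates for any relation; no per-relation hand-tagged seeds are required.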