Sciweavers

» Unsupervised Relation Extraction From Web Documents
COLING
2010
13 years 2 months ago
EM-based Hybrid Model for Bilingual Terminology Extraction from Comparable Corpora
In this paper, we present an unsupervised hybrid model which combines statistical, lexical, linguistic, contextual, and temporal features in a generic EM-based framework to harvest...
Lianhau Lee, AiTi Aw, Min Zhang, Haizhou Li
EMNLP
2004
13 years 9 months ago
Scaling Web-based Acquisition of Entailment Relations
Paraphrase recognition is a critical step for natural language interpretation. Accordingly, many NLP applications would benefit from high coverage knowledge bases of paraphrases. ...
Idan Szpektor, Hristo Tanev, Ido Dagan, Bonaventur...
ICASSP
2008
IEEE
14 years 2 months ago
An iterative unsupervised learning method for information distillation
Information distillation techniques are used to analyze and interpret large volumes of speech and text archives in multiple languages and produce structured information of interes...
Kamand Kamangar, Dilek Hakkani-Tür, Gökh...
WWW
2006
ACM
14 years 8 months ago
Robust web content extraction
We present an empirical evaluation and comparison of two content extraction methods in HTML: absolute XPath expressions and relative XPath expressions. We argue that the relative ...
Marek Kowalkiewicz, Maria E. Orlowska, Tomasz Kacz...
WWW
2006
ACM
14 years 8 months ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner