Search Sciweavers | Sciweavers

203 search results - page 19 / 41

» Conceptual-Model-Based Data Extraction from Multiple-Record ...

KDD
2007
ACM

Corroborate and learn facts from the web

The web contains lots of interesting factual information about entities, such as celebrities, movies or products. This paper describes a robust bootstrapping approach to corrobora...

Shubin Zhao, Jonathan Betz

WWW
2005
ACM

Thresher: automating the unwrapping of semantic content from the World Wide Web

We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...

Andrew Hogue, David R. Karger

PODS
2004
ACM

The Lixto Data Extraction Project - Back and Forth between Theory and Practice

We present the Lixto project, which is both a research project in database theory and a commercial enterprise that develops Web data extraction (wrapping) and Web service definiti...

Georg Gottlob, Christoph Koch, Robert Baumgartner,...

ACL
2008

Mining Parenthetical Translations from the Web by Word Alignment

Documents in languages such as Chinese, Japanese and Korean sometimes annotate terms with their translations in English inside a pair of parentheses. We present a method to extrac...

Dekang Lin, Shaojun Zhao, Benjamin Van Durme, Mari...

IJSI
2008

Towards Knowledge Acquisition from Semi-Structured Content

A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi
