Search Sciweavers | Sciweavers

971 search results - page 139 / 195

» Common Sense from the Web

130

click to vote

WWW
2007
ACM

131views Internet Technology» more WWW 2007»

U-REST: an unsupervised record extraction system

16 years 4 months ago

Download people.csail.mit.edu

In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...

Yuan Kui Shen, David R. Karger

claim paper

Read More »

140

click to vote

WWW
2004
ACM

89views Internet Technology» more WWW 2004»

Enforcing strict model-view separation in template engines

16 years 4 months ago

Download www.cs.usfca.edu

The mantra of every experienced web application developer is the same: thou shalt separate business logic from display. Ironically, almost all template engines allow violation of ...

Terence John Parr

claim paper

Read More »

135

click to vote

DOCENG
2009
ACM

166views Document Analysis» more DOCENG 2009»

Object-level document analysis of PDF files

15 years 10 months ago

Download www.dbai.tuwien.ac.at

The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...

Tamir Hassan

claim paper

Read More »

158

click to vote

ICDM
2006
IEEE

164views Data Mining» more ICDM 2006»

Unsupervised Learning of Tree Alignment Models for Information Extraction

15 years 10 months ago

Download users.soe.ucsc.edu

We propose an algorithm for extracting ﬁelds from HTML search results. The output of the algorithm is a database table– a data structure that better lends itself to high-level...

Philip Zigoris, Damian Eads, Yi Zhang

claim paper

Read More »

115

click to vote

ECAI
2004
Springer

78views Artificial Intelligence» more ECAI 2004»

Stacked Generalization for Information Extraction

15 years 9 months ago

Download cgi.di.uoa.gr

1 This paper defines a new stacked generalization framework in the context of information extraction (IE) from online sources. The proposed setting removes the constraint of apply...

Georgios Sigletos, Georgios Paliouras, Constantine...

claim paper

Read More »

« Prev « First page 139 / 195 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers