Search Sciweavers | Sciweavers

2677 search results - page 14 / 536

» Extracting Structured Data from Web Pages


ADBIS 1997, Springer (Database)

Semistructured Data: The Tsimmis Experience

15 years 11 months ago

Download www.cise.ufl.edu

In this paper we discuss the management of semi-structured data, i.e., data that has irregular or dynamically changing structure. We describe components of the Stanford Tsimmis Pr...

Joachim Hammer, Jason McHugh, Hector Garcia-Molina



VLDB 2001, ACM (Database)

RoadRunner: Towards Automatic Data Extraction from Large Web Sites

15 years 11 months ago

Download www.vldb.org

The paper investigates techniques for extracting data from HTML sites through the use of automatically generated wrappers. To automate the wrapper generation and the data extracti...

Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...



ER 2001, Springer (Database)

On the Automatic Extraction of Data from the Hidden Web

15 years 11 months ago

Download www.deg.byu.edu

An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are e...

Stephen W. Liddle, Sai Ho Yau, David W. Embley



WWW 2010, ACM (Internet Technology)

Exploiting content redundancy for web information extraction

15 years 6 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...



KDD 2007, ACM (Data Mining)

Corroborate and learn facts from the web

16 years 7 months ago

Download delivery.acm.org

The web contains lots of interesting factual information about entities, such as celebrities, movies or products. This paper describes a robust bootstrapping approach to corrobora...

Shubin Zhao, Jonathan Betz

