Search Sciweavers | Sciweavers

563 search results - page 36 / 113

» Crawling the web for structured documents

156

click to vote

WEBDB
1999
Springer

196views Database» more WEBDB 1999»

Web Ecology: Recycling HTML Pages as XML Documents Using W4F

15 years 8 months ago

Download db.cis.upenn.edu

In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...

Arnaud Sahuguet, Fabien Azavant

claim paper

Read More »

114

click to vote

WWW
2005
ACM

101views Internet Technology» more WWW 2005»

Processing link structures and linkbases on the web

16 years 4 months ago

Download www2005.org

Hyperlinks are an essential feature of the World Wide Web, highly responsible for its success. XLink improves on HTML's linking capabilities in several ways. In particular, l...

François Bry, Michael Eckert

claim paper

Read More »

125

Voted

WWW
2007
ACM

118views Internet Technology» more WWW 2007»

Integrating web directories by learning their structures

16 years 4 months ago

Download www2007.org

Documents in the Web are often organized using category trees by information providers (e.g. CNN, BBC) or search engines (e.g. Google, Yahoo!). Such category trees are commonly kn...

Christopher C. Yang, Jianfeng Lin

claim paper

Read More »

150

click to vote

CIKM
2011
Springer

218views Information Technology» more CIKM 2011»

Integrating and querying web databases and documents

14 years 4 months ago

Download www2.cs.uh.edu

There exist many interrelated information sources on the Internet that can be categorized into structured (database) and semistructured (documents). A key challenge is to integrat...

Carlos Garcia-Alvarado, Carlos Ordonez

claim paper

Read More »

200

click to vote

ISEC
2001
Springer

180views ECommerce» more ISEC 2001»

i-Cube: A Tool-Set for the Dynamic Extraction and Integration of Web Data Content

15 years 8 months ago

Download www.swen.uwaterloo.ca

Over the past decade the Internet has evolved into the largest public community in the world. It provides a wealth of data content and services in almost every field of science, t...

Frankie Poon, Kostas Kontogiannis

claim paper

Read More »

« Prev « First page 36 / 113 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers