Search Sciweavers | Sciweavers

416 search results - page 6 / 84

» Structured Web Pages Management for Efficient Data Retrieval

SIGIR 2004, ACM

Query-related data extraction of hidden web documents

Download dis.shef.ac.uk

The greater part of the information on the Web is stored in document databases and is not indexed by general-purpose search engines (e.g., Google and Yahoo). Such information is dyna...

Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...

WWW 2007, ACM

Adaptive record extraction from web pages

Download www2007.org

We describe an adaptive method for extracting records from web pages. Our algorithm combines a weighted tree matching metric with clustering for obtaining data extraction patterns...

Justin Park, Denilson Barbosa

KDD 2003, ACM

Eliminating noisy information in Web pages for data mining

Download www.cs.uic.edu

A commercial Web page typically contains many information blocks. Apart from the main content blocks, it usually has such blocks as navigation panels, copyright and privacy notice...

Lan Yi, Bing Liu, Xiaoli Li

WWW 2004, ACM

Efficient web change monitoring with page digest

Download www.iw3c2.org

The Internet and the World Wide Web have enabled a publishing explosion of useful online information, which has produced the unfortunate side effect of information overload: it is...

David Buttler, Daniel Rocco, Ling Liu

WIDM 2006, ACM

Coarse-grained classification of web sites by their structural properties

Download rvs.informatik.uni-leipzig.de

In this paper, we identify and analyze structural properties which reflect the functionality of a Web site. These structural properties consider the size, the organization, the co...

Christoph Lindemann, Lars Littig
