Search Sciweavers | Sciweavers

708 search results - page 42 / 142

» Identifying Content Blocks from Web Documents

IJCAI
2003

99 views, Artificial Intelligence

Predicting Web Information Content

15 years 4 months ago

Download maya.cs.depaul.edu

In this paper, we propose a novel method to infer the web user’s Information Content (IC), which is the information that the user must examine to complete her task. In particula...

Tingshao Zhu, Russell Greiner, Gerald Häubl, ...


WWW
2007
ACM

162 views, Internet Technology

Detecting near-duplicates for web crawling

16 years 3 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma


WWW
2008
ACM

163 views, Internet Technology

As we may perceive: finding the boundaries of compound documents on the web

16 years 3 months ago

Download www2008.org

This paper considers the problem of identifying on the Web compound documents (cDocs): groups of web pages that in aggregate constitute semantically coherent information entities...

Pavel Dmitriev


WWW
2001
ACM

141 views, Internet Technology

Towards second and third generation web-based multimedia

16 years 3 months ago

Download www10.org

First generation Web-content encodes information in handwritten (HTML) Web pages. Second generation Web content generates HTML pages on demand, e.g. by filling in templates with c...

Jacco van Ossenbruggen, Joost Geurts, Frank Cornel...


DOCENG
2007
ACM

134 views, Document Analysis

Extracting reusable document components for variable data printing

15 years 7 months ago

Download eprints.nottingham.ac.uk

Variable Data Printing (VDP) has brought new flexibility and dynamism to the printed page. Each printed instance of a specific class of document can now have different degrees of ...

Steven R. Bagley, David F. Brailsford, James A. Ol...

