Search Sciweavers | Sciweavers

139 search results - page 14 / 28

» An Approach to Identify Duplicated Web Pages

click to vote

WWW
2007
ACM

164views Internet Technology» more WWW 2007»

Csurf: a context-driven non-visual web-browser

14 years 8 months ago

Download www2007.org

Web sites are designed for graphical mode of interaction. Sighted users can "cut to the chase" and quickly identify relevant information in Web pages. On the contrary, i...

Jalal Mahmud, Yevgen Borodin, I. V. Ramakrishnan

claim paper

Read More »

click to vote

ER
2010
Springer

90views Database» more ER 2010»

W-Ray: A Strategy to Publish Deep Web Geographic Data

13 years 6 months ago

Download webscience.org.br

Abstract. This paper introduces an approach to address the problem of accessing conventional and geographic data from the Deep Web. The approach relies on describing the relevant d...

Helena Piccinini, Melissa Lemos, Marco A. Casanova...

claim paper

Read More »

click to vote

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

14 years 2 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

click to vote

WISE
2009
Springer

126views Internet Technology» more WISE 2009»

Recommending Improvements to Web Applications Using Quality-Driven Heuristic Search

14 years 2 months ago

Download www-etud.iro.umontreal.ca

Planning out maintenance tasks to increase the quality of Web applications can be diﬃcult for a manager. First, it is hard to evaluate the precise eﬀect of a task on quality. S...

Stéphane Vaucher, Samuel Boclinville, Houar...

claim paper

Read More »

click to vote

WWW
2008
ACM

163views Internet Technology» more WWW 2008»

As we may perceive: finding the boundaries of compound documents on the web

14 years 8 months ago

Download www2008.org

This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...

Pavel Dmitriev

claim paper

Read More »

« Prev « First page 14 / 28 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers