Search Sciweavers | Sciweavers

443 search results - page 15 / 89

» Recycling Course Web Pages for the Semantic Web

203

click to vote

WWW
2007
ACM

110views Internet Technology» more WWW 2007»

Random web crawls

16 years 8 months ago

Download www2007.org

This paper proposes a random Web crawl model. A Web crawl is a (biased and partial) image of the Web. This paper deals with the hyperlink structure, i.e. a Web crawl is a graph, w...

Toufik Bennouas, Fabien de Montgolfier

claim paper

Read More »

194

Voted

WWW
2004
ACM

156views Internet Technology» more WWW 2004»

What's new on the web?: the evolution of the web from a search engine perspective

16 years 8 months ago

Download www.iw3c2.org

We seek to gain improved insight into how Web search engines should cope with the evolving Web, in an attempt to provide users with the most up-to-date results possible. For this ...

Alexandros Ntoulas, Junghoo Cho, Christopher Olsto...

claim paper

Read More »

182

click to vote

WIDM
2005
ACM

125views Internet Technology» more WIDM 2005»

DirectoryRank: ordering pages in web directories

16 years 1 months ago

Download nike.psu.edu

Web Directories are repositories of Web pages organized in a hierarchy of topics and sub-topics. In this paper, we present DirectoryRank, a ranking framework that orders the pages...

Vlassis Krikos, Sofia Stamou, Pavlos Kokosis, Alex...

claim paper

Read More »

196

Voted

SIGIR
2004
ACM

168views Information Technology» more SIGIR 2004»

Block-based web search

16 years 27 days ago

Download research.microsoft.com

Multiple-topic and varying-length of web pages are two negative factors significantly affecting the performance of web search. In this paper, we explore the use of page segmentati...

Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma

claim paper

Read More »

183

click to vote

WWW
2007
ACM

162views Internet Technology» more WWW 2007»

Detecting near-duplicates for web crawling

16 years 8 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

claim paper

Read More »

« Prev « First page 15 / 89 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers