Search Sciweavers | Sciweavers

416 search results - page 17 / 84

» Structured Web Pages Management for Efficient Data Retrieval

182

click to vote

WWW
2004
ACM

89views Internet Technology» more WWW 2004»

Ranking the web frontier

16 years 8 months ago

Download www.mccurley.org

The celebrated PageRank algorithm has proved to be a very effective paradigm for ranking results of web search algorithms. In this paper we refine this basic paradigm to take into...

Nadav Eiron, Kevin S. McCurley, John A. Tomlin

claim paper

Read More »

184

Voted

WWW
2007
ACM

162views Internet Technology» more WWW 2007»

Detecting near-duplicates for web crawling

16 years 8 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

claim paper

Read More »

209

click to vote

WWW
2008
ACM

139views Internet Technology» more WWW 2008»

Sailer: an effective search engine for unified retrieval of heterogeneous xml and web documents

16 years 8 months ago

Download www2008.org

This paper studies the problem of unified ranked retrieval of heterogeneous XML documents and Web data. We propose an effective search engine called Sailer to adaptively and versa...

Guoliang Li, Jianhua Feng, Jianyong Wang, Xiaoming...

claim paper

Read More »

194

Voted

EMNLP
2010

167views Natural Language Processing» more EMNLP 2010»

Storing the Web in Memory: Space Efficient Language Models with Constant Time Retrieval

15 years 5 months ago

Download www.aclweb.org

We present three novel methods of compactly storing very large n-gram language models. These methods use substantially less space than all known approaches and allow n-gram probab...

David Guthrie, Mark Hepple

claim paper

Read More »

196

click to vote

ECIR
2008
Springer

167views Information Technology» more ECIR 2008»

The Importance of Link Evidence in Wikipedia

15 years 9 months ago

Download staff.science.uva.nl

Wikipedia is one of the most popular information sources on the Web. The free encyclopedia is densely linked. The link structure in Wikipedia differs from the Web at large: interna...

Jaap Kamps, Marijn Koolen

claim paper

Read More »

« Prev « First page 17 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers