Search Sciweavers | Sciweavers

4 search results - page 1 / 1

» Efficient Crawling Through URL Ordering

CN
1998

Efficient Crawling Through URL Ordering

15 years 6 months ago

Download ilpubs.stanford.edu

In this paper we study in what order a crawler should visit the URLs it has seen, in order to obtain more "important" pages first. Obtaining important pages rapidly can ...

Junghoo Cho, Hector Garcia-Molina, Lawrence Page

KDD
2008
ACM

Data Mining

De-duping URLs via rewrite rules

16 years 7 months ago

Download research.yahoo.com

A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...

Anirban Dasgupta, Ravi Kumar, Amit Sasturkar

IC
2009

Applied Computing

Language Based Crawling: Crawling the Arabic Content of the Web

15 years 4 months ago

Download www.salabbad.info

Crawling web pages written in Arabic or any other language with limited content in the web may, at first, seem to parallel the process of crawling the English content. However, t...

Saad H. Alabbad, Sultan Alanazi

WWW
2005
ACM

Internet Technology

The Infocious web search engine: improving web searching through linguistic analysis

16 years 7 months ago

Download oak.cs.ucla.edu

In this paper we present the Infocious Web search engine [23]. Our goal in creating Infocious is to improve the way people find information on the Web by resolving ambiguities pre...

Alexandros Ntoulas, Gerald Chao, Junghoo Cho
