Search Sciweavers | Sciweavers

288 search results - page 5 / 58

» Crawling, Indexing, and Similarity Searching Images on the W...

click to vote

WWW
2005
ACM

135views Internet Technology» more WWW 2005»

LSH forest: self-tuning indexes for similarity search

14 years 10 months ago

Download www2005.org

We consider the problem of indexing high-dimensional data for answering (approximate) similarity-search queries. Similarity indexes prove to be important in a wide variety of sett...

Mayank Bawa, Tyson Condie, Prasanna Ganesan

claim paper

Read More »

click to vote

ICAPR
2005
Springer

130views Pattern Recognition» more ICAPR 2005»

Combining Text and Link Analysis for Focused Crawling

14 years 3 months ago

Download poseidon.csd.auth.gr

The number of vertical search engines and portals has rapidly increased over the last years, making the importance of a topic-driven (focused) crawler evident. In this paper, we de...

George Almpanidis, Constantine Kotropoulos

claim paper

Read More »

click to vote

WWW
2005
ACM

151views Internet Technology» more WWW 2005»

User-centric Web crawling

14 years 10 months ago

Download www2005.org

Search engines are the primary gateways of information access on the Web today. Behind the scenes, search engines crawl the Web to populate a local indexed repository of Web pages...

Sandeep Pandey, Christopher Olston

claim paper

Read More »

click to vote

PVLDB
2008

124views more PVLDB 2008»

Google's Deep Web crawl

13 years 9 months ago

Download www.cs.cornell.edu

The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...

Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...

claim paper

Read More »

click to vote

WWW
2007
ACM

162views Internet Technology» more WWW 2007»

Detecting near-duplicates for web crawling

14 years 10 months ago

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

claim paper

Read More »

« Prev « First page 5 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers