Search Sciweavers | Sciweavers

132

JCDL
2005
ACM

100views Education» more JCDL 2005»

What's there and what's not?: focused crawling for missing documents in digital libraries

15 years 8 months ago

Some large scale topical digital libraries, such as CiteSeer, harvest online academic documents by crawling open-access archives, university and author homepages, and authors’ s...

Ziming Zhuang, Rohit Wagle, C. Lee Giles

claim paper

Read More »

116

click to vote

SIGMETRICS
2000
ACM

117views Hardware» more SIGMETRICS 2000»

Crawler-Friendly Web Servers

15 years 3 months ago

Download oak.cs.ucla.edu

In this paper we study how to make web servers (e.g., Apache) more crawler friendly. Current web servers offer the same interface to crawlers and regular web surfers, even though ...

Onn Brandman, Junghoo Cho, Hector Garcia-Molina, N...

claim paper

Read More »

138

click to vote

ERCIMDL
2005
Springer

305views Education» more ERCIMDL 2005»

Focused Crawling Using Latent Semantic Indexing - An Application for Vertical Search Engines

15 years 8 months ago

Download poseidon.csd.auth.gr

Vertical search engines and web portals are gaining ground over the general-purpose engines due to their limited size and their high precision for the domain they cover. The number...

George Almpanidis, Constantine Kotropoulos, Ioanni...

claim paper

Read More »

113

click to vote

ICAPR
2005
Springer

130views Pattern Recognition» more ICAPR 2005»

Combining Text and Link Analysis for Focused Crawling

15 years 8 months ago

Download poseidon.csd.auth.gr

The number of vertical search engines and portals has rapidly increased over the last years, making the importance of a topic-driven (focused) crawler evident. In this paper, we de...

George Almpanidis, Constantine Kotropoulos

claim paper

Read More »

136

click to vote

AAMAS
2002
Springer

136views Intelligent Agents» more AAMAS 2002»

MySpiders: Evolve Your Own Intelligent Web Crawlers

15 years 3 months ago

Download dollar.biz.uiowa.edu

The dynamic nature of the World Wide Web makes it a challenge to find information that is both relevant and recent. Intelligent agents can complement the power of search engines to...

Gautam Pant, Filippo Menczer

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers