Search Sciweavers | Sciweavers

ADC
2004
Springer

79 views, Database

Performance and Cost Tradeoffs in Web Search.

16 years 2 days ago

Web search engines crawl the web to fetch the data that they index. In this paper we re-examine that need, and evaluate the network costs associated with data acquisition, and alt...

Nick Craswell, Francis Crimmins, David Hawking, Al...

WEBI
2009
Springer

192 views, Internet Technology

Learning Deep Web Crawling with Diverse Features

16 years 1 month ago

Download 117.36.50.52

The key to Deep Web crawling is to submit promising keywords to query form and retrieve Deep Web content efficiently. To select keywords, existing methods make a decision based ...

Lu Jiang, Zhaohui Wu, Qinghua Zheng, Jun Liu

ICDE
2006
IEEE

144 views, Database

Finding Thai Web Pages in Foreign Web Spaces

16 years 21 days ago

Download www.ieice.org

While the Web has been increasingly recognized as a culturally valuable social artifact, many nations endeavor to create national Web archives for long term preservation. However, ...

Kulwadee Somboonviwat, Takayuki Tamura, Masaru Kit...

WWW
2004
ACM

161 views, Internet Technology

Small world peer networks in distributed web search

16 years 7 months ago

Download www.iw3c2.org

In ongoing research, a collaborative peer network application is being proposed to address the scalability limitations of centralized search engines. Here we introduce a local ada...

Ruj Akavipat, Le-Shin Wu, Filippo Menczer

WWW
2001
ACM

113 views, Internet Technology

Crawling the Hidden Web

16 years 7 months ago

Download www.dia.uniroma3.it

Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...

Sriram Raghavan, Hector Garcia-Molina
