Sciweavers

Search results for "Web Crawling": 295 results, page 6 of 59
WWW
2005
ACM
14 years 9 months ago
User-centric Web crawling
Search engines are the primary gateways of information access on the Web today. Behind the scenes, search engines crawl the Web to populate a local indexed repository of Web pages...
Sandeep Pandey, Christopher Olston
ICDE
2006
IEEE
14 years 2 months ago
Finding Thai Web Pages in Foreign Web Spaces
While the Web has been increasingly recognized as a culturally valuable social artifact, many nations endeavor to create national Web archives for long term preservation. However, ...
Kulwadee Somboonviwat, Takayuki Tamura, Masaru Kit...
STOC
2002
ACM
14 years 8 months ago
Crawling on web graphs
Colin Cooper, Alan M. Frieze
ICDE
2006
IEEE
14 years 10 months ago
Query Selection Techniques for Efficient Crawling of Structured Web Sources
The high quality, structured data from Web structured sources is invaluable for many applications. Hidden Web databases are not directly crawlable by Web search engines and are on...
Ping Wu, Ji-Rong Wen, Huan Liu, Wei-Ying Ma
ICDE
2007
IEEE
14 years 10 months ago
DSphere: A Source-Centric Approach to Crawling, Indexing and Searching the World Wide Web
We describe DSPHERE - a decentralized system for crawling, indexing, searching and ranking of documents in the World Wide Web. Unlike most of the existing search technologies tha...
Bhuvan Bamba, Ling Liu, James Caverlee, Vaibhav Pa...