Sciweavers

459 search results - page 19 / 92
» Random sampling from a search engine's index
Sort
View
MM
2004
ACM
178views Multimedia» more  MM 2004»
14 years 1 months ago
A bootstrapping framework for annotating and retrieving WWW images
Most current image retrieval systems and commercial search engines use mainly text annotations to index and retrieve WWW images. This research explores the use of machine learning...
HuaMin Feng, Rui Shi, Tat-Seng Chua
WWW
2007
ACM
14 years 9 months ago
Efficient search in large textual collections with redundancy
Current web search engines focus on searching only the most recent snapshot of the web. In some cases, however, it would be desirable to search over collections that include many ...
Jiangong Zhang, Torsten Suel
ITCC
2002
IEEE
14 years 1 months ago
Taxonomy-based Adaptive Web Search Method
Current crawler-based search engines usually return a long list of search results containing a lot of noise documents. By indexing collected documents on topic path in taxonomy, t...
Said Mirza Pahlevi, Hiroyuki Kitagawa
WWW
2010
ACM
14 years 3 months ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs have brought serious troubles to the whole pipeline of a search engine, from crawling, indexing, to result serving. URL normalization is to transform duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...
SEMWEB
2007
Springer
14 years 2 months ago
Web Search Personalization Via Social Bookmarking and Tagging
Abstract. In this paper, we present a new approach to web search personalization based on user collaboration and sharing of information about web documents. The proposed personaliz...
Michael G. Noll, Christoph Meinel