Sciweavers

70 search results - page 10 / 14
» Integrating Web Content Clustering into Web Log Association ...
Sort
View
WWW
2006
ACM
14 years 1 months ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
ICMCS
2008
IEEE
115views Multimedia» more  ICMCS 2008»
14 years 1 months ago
Spatial pyramid mining for logo detection in natural scenes
This work introduces a novel data mining scheme, spatial pyramid mining, to discover association rules at multiple resolutions in order to identify frequent spatial configuration...
Jim Kleban, Xing Xie, Wei-Ying Ma
WWW
2008
ACM
14 years 8 months ago
Analysis of geographic queries in a search engine log
Geography is becoming increasingly important in web search. Search engines can often return better results to users by analyzing features such as user location or geographic terms...
Qingqing Gan, Josh Attenberg, Alexander Markowetz,...
KDD
2002
ACM
166views Data Mining» more  KDD 2002»
14 years 7 months ago
Frequent term-based text clustering
Text clustering methods can be used to structure large sets of text or hypertext documents. The well-known methods of text clustering, however, do not really address the special p...
Florian Beil, Martin Ester, Xiaowei Xu
WWW
2001
ACM
14 years 8 months ago
Clustering user queries of a search engine
In order to increase retrieval precision, some new search engines provide manually verified answers to Frequently Asked Queries (FAQs). An underlying task is the identification of...
Ji-Rong Wen, Jian-Yun Nie, HongJiang Zhang