Sciweavers

638 search results - page 44 / 128
» Scalable Techniques for Clustering the Web
Sort
View
EMNLP
2011
12 years 8 months ago
Approximate Scalable Bounded Space Sketch for Large Data NLP
We exploit sketch techniques, especially the Count-Min sketch, a memory, and time efficient framework which approximates the frequency of a word pair in the corpus without explic...
Amit Goyal, Hal Daumé III
INCDM
2010
Springer
125views Data Mining» more  INCDM 2010»
13 years 11 months ago
Web-Site Boundary Detection
Defining the boundaries of a web-site, for (say) archiving or information retrieval purposes, is an important but complicated task. In this paper a web-page clustering approach to...
Ayesh Alshukri, Frans Coenen, Michele Zito
MIR
2006
ACM
120views Multimedia» more  MIR 2006»
14 years 3 months ago
Scalable search-based image annotation of personal images
With the prevalence of digital cameras, more and more people have considerable digital images on their personal devices. As a result, there are increasing needs to effectively sea...
Changhu Wang, Feng Jing, Lei Zhang, HongJiang Zhan...
MM
2004
ACM
195views Multimedia» more  MM 2004»
14 years 2 months ago
Hierarchical clustering of WWW image search results using visual, textual and link information
We consider the problem of clustering Web image search results. Generally, the image search results returned by an image search engine contain multiple topics. Organizing the resu...
Deng Cai, Xiaofei He, Zhiwei Li, Wei-Ying Ma, Ji-R...
WEBI
2007
Springer
14 years 3 months ago
Document-Centric Query Answering for the Semantic Web
In this paper, we propose document-centric query answering, a novel form of query answering for the Semantic Web. We discuss how we have built a knowledge base system to support t...
Yuanbo Guo, Jeff Heflin