Sciweavers

112 search results - page 5 / 23
» Anti-Serendipity: Finding Useless Documents and Similar Docu...
IJCAI
2007
13 years 10 months ago
Semantic Smoothing of Document Models for Agglomerative Clustering
In this paper, we argue that the agglomerative clustering with vector cosine similarity measure performs poorly due to two reasons. First, the nearest neighbors of a document belo...
Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu
SIGIR
2008
ACM
13 years 8 months ago
A comparative evaluation of different link types on enhancing document clustering
With a growing number of works utilizing link information in enhancing document clustering, it becomes necessary to make a comparative evaluation of the impacts of different link ...
Xiaodan Zhang, Xiaohua Hu, Xiaohua Zhou
ITCC
2003
IEEE
14 years 1 month ago
A Method for Calculating Term Similarity on Large Document Collections
We present an efficient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...
Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva
KSEM
2007
Springer
14 years 2 months ago
Finding Similar RSS News Articles Using Correlation-Based Phrase Matching
Traditional phrase matching approaches, which can discover documents containing exactly the same phrases, fail to detect documents including phrases that are semantically relevant,...
Maria Soledad Pera, Yiu-Kai Ng
ECIR
2003
Springer
13 years 10 months ago
Query-Based Document Skimming: A User-Centred Evaluation of Relevance Profiling
We present a user-centred, task-oriented, comparative evaluation of two query-based document skimming tools. ProfileSkim bases within-document retrieval on computing a relevance pr...
David J. Harper, Ivan Koychev, Sun Yixing