Sciweavers

TKDE
2008
175views more  TKDE 2008»
14 years 10 days ago
Efficient Phrase-Based Document Similarity for Clustering
Phrase has been considered as a more informative feature term for improving the effectiveness of document clustering. In this paper, we propose a phrase-based document similarity t...
Hung Chim, Xiaotie Deng
ALGORITHMICA
2005
129views more  ALGORITHMICA 2005»
14 years 11 days ago
A Comparison of Multicast Pull Models
We consider the setting of a web server that receives requests for documents from clients, and returns the requested documents over a multicast/broadcast channel. We compare the q...
Kirk Pruhs, Patchrawat Uthaisombut
IJDAR
2007
77views more  IJDAR 2007»
14 years 11 days ago
Genre as noise: noise in genre
Given a specific information need, documents of the wrong genre can be considered as noise. From this perspective, genre classification helps to separate relevant documents from...
Andrea Stubbe, Christoph Ringlstetter, Klaus U. Sc...
SIGIR
2008
ACM
14 years 11 days ago
Knowledge transformation from word space to document space
In most IR clustering problems, we directly cluster the documents, working in the document space, using cosine similarity between documents as the similarity measure. In many real...
Tao Li, Chris H. Q. Ding, Yi Zhang 0005, Bo Shao
SAC
2006
ACM
14 years 11 days ago
High performance XSL-FO rendering for variable data printing
High volume print jobs are getting more common due to the growing demand for personalized documents. In this context, Variable Data Printing (VDP) has become a useful tool for mar...
Fabio Giannetti, Luiz Gustavo Fernandes, Rogerio T...
SIGIR
2008
ACM
14 years 11 days ago
Re-ranking search results using document-passage graphs
We present a novel passage-based approach to re-ranking documents in an initially retrieved list so as to improve precision at top ranks. While most work on passage-based document...
Michael Bendersky, Oren Kurland
SIGIR
2008
ACM
14 years 11 days ago
A lattice-based approach to query-by-example spoken document retrieval
Recent efforts on the task of spoken document retrieval (SDR) have made use of speech lattices: speech lattices contain information about alternative speech transcription hypothes...
Tee Kiah Chia, Khe Chai Sim, Haizhou Li, Hwee Tou ...
SIGIR
2008
ACM
14 years 11 days ago
Exploiting sequential dependencies for expert finding
We propose an expert finding method based on assumption of sequential dependence between a candidate expert and the query terms in the scope of a document. We assume that the stre...
Pavel Serdyukov, Henning Rode, Djoerd Hiemstra
SIGIR
2008
ACM
14 years 11 days ago
Comments-oriented document summarization: understanding documents with readers' feedback
Comments left by readers on Web documents contain valuable information that can be utilized in different information retrieval tasks including document search, visualization, and ...
Meishan Hu, Aixin Sun, Ee-Peng Lim
SIGIR
2008
ACM
14 years 11 days ago
A comparative evaluation of different link types on enhancing document clustering
With a growing number of works utilizing link information in enhancing document clustering, it becomes necessary to make a comparative evaluation of the impacts of different link ...
Xiaodan Zhang, Xiaohua Hu, Xiaohua Zhou