Sciweavers

2340 search results - page 46 / 468
» Speculative document evaluation
Sort
View
IJCNLP
2005
Springer
14 years 2 months ago
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method differs from previous approaches to paraphrase...
Marius Pasca, Péter Dienes
ICA
2007
Springer
14 years 24 days ago
Text Clustering on Latent Thematic Spaces: Variants, Strengths and Weaknesses
Deriving a thematically meaningful partition of an unlabeled document corpus is a challenging task. In this context, the use of document representations based on latent thematic ge...
Xavier Sevillano, Germán Cobo, Francesc Al&...
CORR
1998
Springer
98views Education» more  CORR 1998»
13 years 8 months ago
Bayesian Stratified Sampling to Assess Corpus Utility
This paper describes a method for asking statistical questions about a large text corpus. We exemplify the method by addressing the question, "What percentage of Federal Regi...
Judith Hochberg, Clint Scovel, Timothy Thomas, Sam...
CHI
2006
ACM
14 years 9 months ago
PaperSpace: a system for managing digital and paper documents
Here we present PaperSpace a computer vision based document management system that allows users to combine paper and digital documents. Using PaperSpace users can locate paper cop...
Jeff Smith, Jeremy Long, Tanya Lung, Mohd M. Anwar...
SIGIR
2003
ACM
14 years 2 months ago
Document clustering based on non-negative matrix factorization
In this paper, we propose a novel document clustering method based on the non-negative factorization of the termdocument matrix of the given document corpus. In the latent semanti...
Wei Xu, Xin Liu, Yihong Gong