Sciweavers

8795 search results - page 10 / 1759
» Measuring Generality of Documents
Sort
View
PAMI
2010
113views more  PAMI 2010»
13 years 5 months ago
Hierarchical Bayesian Modeling of Topics in Time-Stamped Documents
—We consider the problem of inferring and modeling topics in a sequence of documents with known publication dates. The documents at a given time are each characterized by a topic...
Iulian Pruteanu-Malinici, Lu Ren, John William Pai...
CHI
2010
ACM
14 years 22 days ago
Estimating residual error rate in recognized handwritten documents using artificial error injection
Both handwriting recognition systems and their users are error prone. Handwriting recognizers make recognition errors, and users may miss those errors when verifying output. As a ...
Edward Lank, Ryan Stedman, Michael Terry
ERCIMDL
2007
Springer
87views Education» more  ERCIMDL 2007»
14 years 1 months ago
A Model of Uncertainty for Near-Duplicates in Document Reference Networks
We introduce a model of uncertainty where documents are not uniquely identified in a reference network, and some links may be incorrect. It generalizes the probabilistic approach ...
Claudia Hess, Michel de Rougemont
WWW
2009
ACM
14 years 8 months ago
Detecting the origin of text segments efficiently
In the origin detection problem an algorithm is given a set S of documents, ordered by creation time, and a query document D. It needs to output for every consecutive sequence of ...
Ossama Abdel Hamid, Behshad Behzadi, Stefan Christ...
CIKM
2008
Springer
13 years 9 months ago
An extension of PLSA for document clustering
In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to cocluster documents and terms simultaneously. We show on three datase...
Young-Min Kim, Jean-François Pessiot, Massi...