Sciweavers

129 search results - page 24 / 26
» Comprehensible and Accurate Cluster Labels in Text Clusterin...
Sort
View
SIGIR
2004
ACM
14 years 1 months ago
GaP: a factor model for discrete data
We present a probabilistic model for a document corpus that combines many of the desirable features of previous models. The model is called “GaP” for Gamma-Poisson, the distri...
John F. Canny
KDD
2009
ACM
141views Data Mining» more  KDD 2009»
14 years 8 months ago
Meme-tracking and the dynamics of the news cycle
Tracking new topics, ideas, and "memes" across the Web has been an issue of considerable interest. Recent work has developed methods for tracking topic shifts over long ...
Jure Leskovec, Lars Backstrom, Jon M. Kleinberg
DOCENG
2010
ACM
13 years 8 months ago
Glyph extraction from historic document images
This paper is about the reproduction of ancient texts with vectorised fonts. While for OCR only recognition rates count, a reproduction process does not necessarily require the re...
Lothar Meyer-Lerbs, Arne Schuldt, Björn Gottf...
JMLR
2010
105views more  JMLR 2010»
13 years 2 months ago
Collective Inference for Extraction MRFs Coupled with Symmetric Clique Potentials
Many structured information extraction tasks employ collective graphical models that capture interinstance associativity by coupling them with various clique potentials. We propos...
Rahul Gupta, Sunita Sarawagi, Ajit A. Diwan
IPM
2007
106views more  IPM 2007»
13 years 7 months ago
Patent document categorization based on semantic structural information
The number of patent documents is currently rising rapidly worldwide, creating the need for an automatic categorization system to replace time-consuming and labor-intensive manual...
Jae-Ho Kim, Key-Sun Choi