Sciweavers

260 search results - page 11 / 52
» Compression of Compound Documents
Sort
View
ICML
2010
IEEE
13 years 8 months ago
The IBP Compound Dirichlet Process and its Application to Focused Topic Modeling
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric mixed membership model--each data point is modeled with a collection of components of different proportions. T...
Sinead Williamson, Chong Wang, Katherine A. Heller...
WIA
2005
Springer
14 years 26 days ago
Compressing XML Documents Using Recursive Finite State Automata
Abstract. We propose a scheme for automatically generating compressors for XML documents from Document Type Definition(DTD) specifications. Our algorithm is a lossless adaptive a...
Hariharan Subramanian, Priti Shankar
ICML
2005
IEEE
14 years 8 months ago
Modeling word burstiness using the Dirichlet distribution
Multinomial distributions are often used to model text documents. However, they do not capture well the phenomenon that words in a document tend to appear in bursts: if a word app...
Rasmus Elsborg Madsen, David Kauchak, Charles Elka...
22
Voted
JUCS
2011
113views more  JUCS 2011»
13 years 2 months ago
Nabuco - Two Decades of Document Processing in Latin America
: This paper reports on the Joaquim Nabuco Project, a pioneering work in Latin America on document digitalization, enhancement, compression, indexing, retrieval and network transmi...
Rafael Dueire Lins
ICDAR
2009
IEEE
14 years 2 months ago
Author Identification Using Compression Models
Daniel Pavelec, Luiz S. Oliveira, Edson J. R. Just...