Sciweavers

114 search results - page 12 / 23
» Text Categorization Using Compression Models
Sort
View
IPM
2011
71views more  IPM 2011»
12 years 11 months ago
Improving semistatic compression via phrase-based modeling
In recent years, new semistatic word-based byte-oriented text compressors, such as Tagged Huffman and those based on Dense Codes, have shown that it is possible to perform fast d...
Nieves R. Brisaboa, Antonio Fariña, Gonzalo...
AI
2008
Springer
13 years 9 months ago
A Statistical Model for Topic Segmentation and Clustering
This paper presents a statistical model for discovering topical clusters of words in unstructured text. The model uses a hierarchical Bayesian structure and it is also able to iden...
M. Mahdi Shafiei, Evangelos E. Milios
ICANN
2005
Springer
14 years 1 months ago
A Neural Network for Text Representation
Text categorization and retrieval tasks are often based on a good representation of textual data. Departing from the classical vector space model, several probabilistic models have...
Mikaela Keller, Samy Bengio
DCC
2010
IEEE
14 years 2 months ago
Lossless Compression Based on the Sequence Memoizer
In this work we describe a sequence compression method based on combining a Bayesian nonparametric sequence model with entropy encoding. The model, a hierarchy of Pitman-Yor proce...
Jan Gasthaus, Frank Wood, Yee Whye Teh
LREC
2010
151views Education» more  LREC 2010»
13 years 9 months ago
Modeling Wikipedia Articles to Enhance Encyclopedic Search
Reflecting the rapid growth of science, technology, and culture, it has become common practice to consult tools on the World Wide Web for various terms. Existing search engines pr...
Atsushi Fujii