Sciweavers

» Methods for the semantic analysis of document markup
WWW
2008
ACM
14 years 8 months ago
Automatic web image selection with a probabilistic latent topic model
We propose a new method to select relevant images to the given keywords from images gathered from the Web based on the Probabilistic Latent Semantic Analysis (PLSA) model which is...
Keiji Yanai
EMNLP
2010
13 years 5 months ago
Translingual Document Representations from Discriminative Projections
Representing documents by vectors that are independent of language enhances machine translation and multilingual text categorization. We use discriminative training to create a pr...
John Platt, Kristina Toutanova, Wen-tau Yih
ICDAR
2007
IEEE
14 years 1 months ago
A Proposition of Retrieval Tools for Historical Document Images Libraries
In this article, we propose a method of characterization of pictures of old documents based on a texture approach. This characterization is carried out with the help of a multires...
Nicholas Journet, Jean-Yves Ramel, Rémy Mul...
WWW
2004
ACM
14 years 8 months ago
Query and content suggestion based on latent interest and topic class
To improve the process of user information retrieval, we propose the concept of a latent semantic map (LSM), along with a method of generating this map. The novel aspect of the LS...
Noriaki Kawamae, Hideaki Suzuki, Osamu Mizuno
CIKM
2008
ACM
13 years 9 months ago
Modeling hidden topics on document manifold
Topic modeling has been a key problem for document analysis. One of the canonical approaches for topic modeling is Probabilistic Latent Semantic Indexing, which maximizes the join...
Deng Cai, Qiaozhu Mei, Jiawei Han, Chengxiang Zhai