Sciweavers

162 search results - page 20 / 33
» Latent Dirichlet Allocation
Sort
View
MMM
2010
Springer
203views Multimedia» more  MMM 2010»
14 years 7 months ago
TV News Story Segmentation Based on Semantic Coherence and Content Similarity
In this paper, we introduce and evaluate two novel approaches, one using video stream and the other using close-caption text stream, for segmenting TV news into stories. The segmen...
Hemant Misra, Frank Hopfgartner, Anuj Goyal, P. Pu...
WIAMIS
2009
IEEE
14 years 5 months ago
Automatic topic detection strategy for information retrieval in spoken document
This paper suggests an alternative solution for the task of spoken document retrieval (SDR). The proposed system runs retrieval on multi-level transcriptions (word and phone) prod...
Shan Jin, Hemant Misra, Thomas Sikora, Joemon M. J...
ICASSP
2008
IEEE
14 years 5 months ago
Unsupervised language model adaptation via topic modeling based on named entity hypotheses
Language model (LM) adaptation is often achieved by combining a generic LM with a topic-specific model that is more relevant to the target document. Unlike previous work on unsup...
Yang Liu, Feifan Liu
NIPS
2003
14 years 7 days ago
Hierarchical Topic Models and the Nested Chinese Restaurant Process
We address the problem of learning topic hierarchies from data. The model selection problem in this domain is daunting—which of the large collection of possible trees to use? We...
David M. Blei, Thomas L. Griffiths, Michael I. Jor...
CIKM
2011
Springer
12 years 11 months ago
Towards noise-resilient document modeling
We introduce a generative probabilistic document model based on latent Dirichlet allocation (LDA), to deal with textual errors in the document collection. Our model is inspired by...
Tao Yang, Dongwon Lee