Sciweavers

808 search results - page 51 / 162
» Keyword-based document clustering
Sort
View
ICASSP
2009
IEEE
14 years 3 months ago
Incorporating monolingual corpora into bilingual latent semantic analysis for crosslingual LM adaptation
The major limitation in bilingual latent semantic analysis (bLSA) is the requirement of parallel training corpora. Motivated by semi-supervised learning, we propose a clusterbased...
Yik-Cheung Tam, Tanja Schultz
SIGIR
2002
ACM
13 years 8 months ago
Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering
A novel method for simultaneous keyphrase extraction and generic text summarization is proposed by modeling text documents as weighted undirected and weighted bipartite graphs. Sp...
Hongyuan Zha
SIGIR
2005
ACM
14 years 2 months ago
Relation between PLSA and NMF and implications
Non-negative Matrix Factorization (NMF, [5]) and Probabilistic Latent Semantic Analysis (PLSA, [4]) have been successfully applied to a number of text analysis tasks such as docum...
Éric Gaussier, Cyril Goutte
ECIR
2003
Springer
13 years 10 months ago
Clustering and Visualization in a Multi-lingual Multi-document Summarization System
Abstract. To measure the similarity of words, sentences, and documents is one of the major issues in multi-lingual multi-document summarization. This paper presents five strategies...
Hsin-Hsi Chen, June-Jei Kuo, Tsei-Chun Su
ACL
1997
13 years 10 months ago
Document Classification Using a Finite Mixture Model
We propose a new method of classifying documents into categories. We define for each category a finite mixture model based on soft clustering of words. We treat the problem of cla...
Hang Li, Kenji Yamanishi