Sciweavers

1125 search results - page 83 / 225
» A flocking based algorithm for document clustering analysis
Sort
View
SAC
2008
ACM
13 years 7 months ago
Discovering relationships among categories using misclassification information
Knowledge of relationships among categories is of the interest in different domains such as text classification, content analysis, and text mining. We propose and evaluate approac...
Saket S. R. Mengle, Nazli Goharian, Alana Platt
ICASSP
2008
IEEE
14 years 2 months ago
Unsupervised language model adaptation via topic modeling based on named entity hypotheses
Language model (LM) adaptation is often achieved by combining a generic LM with a topic-specific model that is more relevant to the target document. Unlike previous work on unsup...
Yang Liu, Feifan Liu
WWW
2010
ACM
14 years 3 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
KDD
2005
ACM
112views Data Mining» more  KDD 2005»
14 years 8 months ago
Model-based overlapping clustering
While the vast majority of clustering algorithms are partitional, many real world datasets have inherently overlapping clusters. Several approaches to finding overlapping clusters...
Arindam Banerjee, Chase Krumpelman, Joydeep Ghosh,...
JMLR
2010
198views more  JMLR 2010»
13 years 6 months ago
On Learning with Integral Operators
A large number of learning algorithms, for example, spectral clustering, kernel Principal Components Analysis and many manifold methods are based on estimating eigenvalues and eig...
Lorenzo Rosasco, Mikhail Belkin, Ernesto De Vito