Sciweavers

645 search results - page 79 / 129
» The Data Warehouse of Newsgroups
Sort
View
ICML
2004
IEEE
14 years 9 months ago
K-means clustering via principal component analysis
Principal component analysis (PCA) is a widely used statistical technique for unsupervised dimension reduction. K-means clustering is a commonly used data clustering for unsupervi...
Chris H. Q. Ding, Xiaofeng He
KDD
2005
ACM
112views Data Mining» more  KDD 2005»
14 years 9 months ago
Model-based overlapping clustering
While the vast majority of clustering algorithms are partitional, many real world datasets have inherently overlapping clusters. Several approaches to finding overlapping clusters...
Arindam Banerjee, Chase Krumpelman, Joydeep Ghosh,...
CIKM
2011
Springer
12 years 9 months ago
Emerging topic detection using dictionary learning
Streaming user-generated content in the form of blogs, microblogs, forums, and multimedia sharing sites, provides a rich source of data from which invaluable information and insig...
Shiva Prasad Kasiviswanathan, Prem Melville, Arind...
SIGMOD
2001
ACM
118views Database» more  SIGMOD 2001»
14 years 9 months ago
Proxy-Server Architectures for OLAP
Data warehouses have been successfully employed for assisting decision making by offering a global view of the enterprise data and providing mechanisms for On-Line Analytical proc...
Panos Kalnis, Dimitris Papadias
ICML
2006
IEEE
14 years 9 months ago
Pachinko allocation: DAG-structured mixture models of topic correlations
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not ca...
Wei Li, Andrew McCallum