Sciweavers

1211 search results - page 105 / 243
» Topics in 0--1 data
Sort
View
131
Voted
JCDL
2009
ACM
103views Education» more  JCDL 2009»
15 years 8 months ago
Query parameters for harvesting digital video and associated contextual information
Video is increasingly important to digital libraries and archives as both primary content and as context for the primary objects in collections. Services like YouTube not only off...
Gary Marchionini, Chirag Shah, Christopher A. Lee,...
SDM
2008
SIAM
140views Data Mining» more  SDM 2008»
15 years 5 months ago
Creating a Cluster Hierarchy under Constraints of a Partially Known Hierarchy
Although clustering under constraints is a current research topic, a hierarchical setting, in which a hierarchy of clusters is the goal, is usually not considered. This paper trie...
Korinna Bade, Andreas Nürnberger
NIPS
2007
15 years 5 months ago
Distributed Inference for Latent Dirichlet Allocation
We investigate the problem of learning a widely-used latent-variable model – the Latent Dirichlet Allocation (LDA) or “topic” model – using distributed computation, where ...
David Newman, Arthur Asuncion, Padhraic Smyth, Max...
SDM
2007
SIAM
177views Data Mining» more  SDM 2007»
15 years 5 months ago
Bursty Feature Representation for Clustering Text Streams
Text representation plays a crucial role in classical text mining, where the primary focus was on static text. Nevertheless, well-studied static text representations including TFI...
Qi He, Kuiyu Chang, Ee-Peng Lim, Jun Zhang
132
Voted
SDM
2003
SIAM
134views Data Mining» more  SDM 2003»
15 years 5 months ago
Hierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...
Benjamin C. M. Fung, Ke Wang, Martin Ester