Sciweavers

3233 search results - page 462 / 647
» Probabilistic Data Exchange
KDD
2008
ACM
Data Mining
14 years 8 months ago
Effective and efficient itemset pattern summarization: regression-based approaches
In this paper, we propose a set of novel regression-based approaches to effectively and efficiently summarize frequent itemset patterns. Specifically, we show that the problem of ...
Ruoming Jin, Muad Abu-Ata, Yang Xiang, Ning Ruan
KDD
2007
ACM
Data Mining
14 years 8 months ago
Automatic labeling of multinomial topic models
Multinomial distributions over words are frequently used to model topics in text collections. A common, major challenge in applying all such topic models to any text mining proble...
Qiaozhu Mei, Xuehua Shen, ChengXiang Zhai
WSDM
2010
ACM
Data Mining
14 years 5 months ago
GeoFolk: Latent spatial semantics in Web 2.0 social media
We describe an approach for multi-modal characterization of social media by combining text features (e.g. tags as a prominent example of short, unstructured text labels) with spat...
Sergej Sizov
CIKM
2009
ACM
14 years 2 months ago
Automatic link detection: a sequence labeling approach
The popularity of Wikipedia and other online knowledge bases has recently produced an interest in the machine learning community for the problem of automatic linking. Automatic hy...
James J. Gardner, Li Xiong
ICDM
2007
IEEE
Data Mining
14 years 2 months ago
Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval
Most topic models, such as latent Dirichlet allocation, rely on the bag-of-words assumption. However, word order and phrases are often critical to capturing the meaning of text in...
Xuerui Wang, Andrew McCallum, Xing Wei