Sciweavers

572 search results - page 92 / 115
» Winnowing-based text clustering
ICML
2010
IEEE
13 years 8 months ago
Distance dependent Chinese restaurant processes
We develop the distance dependent Chinese restaurant process (CRP), a flexible class of distributions over partitions that allows for nonexchangeability. This class can be used to...
David M. Blei, Peter Frazier
ICDM
2010
IEEE
226 views | Data Mining
13 years 5 months ago
Edge Weight Regularization over Multiple Graphs for Similarity Learning
The growth of the web has directly influenced the increase in the availability of relational data. One of the key problems in mining such data is computing the similarity between o...
Pradeep Muthukrishnan, Dragomir R. Radev, Qiaozhu ...
ICDAR
2009
IEEE
13 years 5 months ago
Document Content Extraction Using Automatically Discovered Features
We report an automatic feature discovery method that achieves results comparable to a manually chosen, larger feature set on a document image content extraction problem: the locat...
Sui-Yu Wang, Henry S. Baird, Chang An
JCDL
2011
ACM
244 views | Education
12 years 10 months ago
Product review summarization from a deeper perspective
With product reviews growing in depth and becoming more numerous, it is a growing challenge to acquire a comprehensive understanding of their contents, for both customers and produc...
Duy Khang Ly, Kazunari Sugiyama, Ziheng Lin, Min-Y...
SIGMOD
2008
ACM
157 views | Database
14 years 7 months ago
CRD: fast co-clustering on large datasets utilizing sampling-based matrix decomposition
The problem of simultaneously clustering columns and rows (coclustering) arises in important applications, such as text data mining, microarray analysis, and recommendation system...
Feng Pan, Xiang Zhang, Wei Wang 0010