Sciweavers

» Genetic Process Mining
KDD
2006
ACM
Very sparse random projections
There has been considerable interest in random projections, an approximate algorithm for estimating distances between pairs of points in a high-dimensional vector space. Let A ∈ R^n...
Ping Li, Trevor Hastie, Kenneth Ward Church
KDD
2005
ACM
Formulating distance functions via the kernel trick
Tasks of data mining and information retrieval depend on a good distance function for measuring similarity between data instances. The most effective distance function must be for...
Gang Wu, Edward Y. Chang, Navneet Panda
KDD
2003
ACM
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
KDD
2002
ACM
The Community of Multimedia Agents
Multimedia data mining requires the ability to automatically analyze and understand the content. The Community of Multimedia Agents project is devoted to creating a community of re...
Gang Wei, Valery A. Petrushin, Anatole Gershman
WSDM
2010
ACM
Adapting Information Bottleneck Method for Automatic Construction of Domain-oriented Sentiment Lexicon
Domain-oriented sentiment lexicons are widely used for fine-grained sentiment analysis on reviews; therefore, the automatic construction of domain-oriented sentiment lexicon is a f...
Songbo Tan, Weifu Du, Xiaochun Yun, Xueqi Cheng