Sciweavers

» A Partial-Repeatability Approach to Data Mining
DEXAW
2009
IEEE
Clustering of Short Strings in Large Databases
A novel method, CLOSS, intended for textual databases is proposed. It successfully identifies misspelled string clusters, even if the cluster border is not prominent. The method ...
Michail Kazimianec, Arturas Mazeika
SIGKDD
2000
KDD-99 Classifier Learning Contest: LLSoft's Results Overview
Kernel Miner is a new data-mining tool based on building the optimal decision forest. The tool won second place in the KDD'99 Classifier Learning Contest, August 1999. We des...
Itzhak Levin
KDD
2007
ACM
Detecting anomalous records in categorical datasets
We consider the problem of detecting anomalies in high arity categorical datasets. In most applications, anomalies are defined as data points that are 'abnormal'. Quite ...
Kaustav Das, Jeff G. Schneider
WWW
2011
ACM
Automatic construction of a context-aware sentiment lexicon: an optimization approach
The explosion of Web opinion data has made essential the need for automatic tools to analyze and understand people’s sentiments toward different topics. In most sentiment analy...
Yue Lu, Malú Castellanos, Umeshwar Dayal, C...
KDD
2000
ACM
Generating non-redundant association rules
The traditional association rule mining framework produces many redundant rules. The extent of redundancy is a lot larger than previously suspected. We present a new framework for...
Mohammed Javeed Zaki