Sciweavers

390 search results - page 10 / 78
» Virtual Attribute Subsetting
Sort
View
KDD
1999
ACM
166views Data Mining» more  KDD 1999»
14 years 2 months ago
CACTUS - Clustering Categorical Data Using Summaries
Clustering is an important data mining problem. Most of the earlier work on clustering focussed on numeric attributes which have a natural ordering on their attribute values. Rece...
Venkatesh Ganti, Johannes Gehrke, Raghu Ramakrishn...
AAAI
2004
13 years 11 months ago
Error Detection and Impact-Sensitive Instance Ranking in Noisy Datasets
Given a noisy dataset, how to locate erroneous instances and attributes and rank suspicious instances based on their impacts on the system performance is an interesting and import...
Xingquan Zhu, Xindong Wu, Ying Yang
JCP
2006
129views more  JCP 2006»
13 years 9 months ago
Cancer Classification With MicroRNA Expression Patterns Found By An Information Theory Approach
Abstract-- Some non-coding small RNAs, known as microRNAs (miRNAs), have been shown to play important roles in gene regulation and various biological processes. The abnormal expres...
Yun Zheng, Chee Keong Kwoh
KDD
2007
ACM
165views Data Mining» more  KDD 2007»
14 years 10 months ago
Finding low-entropy sets and trees from binary data
The discovery of subsets with special properties from binary data has been one of the key themes in pattern discovery. Pattern classes such as frequent itemsets stress the co-occu...
Eino Hinkkanen, Hannes Heikinheimo, Heikki Mannila...
KDD
2008
ACM
195views Data Mining» more  KDD 2008»
14 years 10 months ago
Anomaly pattern detection in categorical datasets
We propose a new method for detecting patterns of anomalies in categorical datasets. We assume that anomalies are generated by some underlying process which affects only a particu...
Kaustav Das, Jeff G. Schneider, Daniel B. Neill