Sciweavers

PAKDD
2005
ACM
124views Data Mining» more  PAKDD 2005»
14 years 5 months ago
Finding Sporadic Rules Using Apriori-Inverse
We define sporadic rules as those with low support but high confidence: for example, a rare association of two symptoms indicating a rare disease. To find such rules using the w...
Yun Sing Koh, Nathan Rountree
PAKDD
2005
ACM
63views Data Mining» more  PAKDD 2005»
14 years 5 months ago
Pruning Derivative Partial Rules During Impact Rule Discovery
Because exploratory rule discovery works with data that is only a sample of the phenomena to be investigated, some resulting rules may appear interesting only by chance. Techniques...
Shiying Huang, Geoffrey I. Webb
PAKDD
2005
ACM
100views Data Mining» more  PAKDD 2005»
14 years 5 months ago
Pushing Tougher Constraints in Frequent Pattern Mining
In this paper we extend the state-of-art of the constraints that can be pushed in a frequent pattern computation. We introduce a new class of tough constraints, namely Loose Anti-m...
Francesco Bonchi, Claudio Lucchese
PAKDD
2005
ACM
111views Data Mining» more  PAKDD 2005»
14 years 5 months ago
Training Support Vector Machines Using Greedy Stagewise Algorithm
Abstract. Hard margin support vector machines (HM-SVMs) have a risk of getting overfitting in the presence of the noise. Soft margin SVMs deal with this
Liefeng Bo, Ling Wang, Licheng Jiao
PAKDD
2005
ACM
117views Data Mining» more  PAKDD 2005»
14 years 5 months ago
Automatic View Selection: An Application to Image Mining
Abstract. In this paper we discuss an image mining application of Egeria detection. Egeria is a type of weed found in various lands and water regions over San Joaquin and Sacrament...
Manoranjan Dash, Deepak Kolippakkam
PAKDD
2005
ACM
94views Data Mining» more  PAKDD 2005»
14 years 5 months ago
Progressive Sampling for Association Rules Based on Sampling Error Estimation
We explore in this paper a progressive sampling algorithm, called Sampling Error Estimation (SEE), which aims to identify an appropriate sample size for mining association rules. S...
Kun-Ta Chuang, Ming-Syan Chen, Wen-Chieh Yang
PAKDD
2005
ACM
160views Data Mining» more  PAKDD 2005»
14 years 5 months ago
Improving Mining Quality by Exploiting Data Dependency
The usefulness of the results produced by data mining methods can be critically impaired by several factors such as (1) low quality of data, including errors due to contamination, ...
Fang Chu, Yizhou Wang, Carlo Zaniolo, Douglas Stot...
PAKDD
2005
ACM
146views Data Mining» more  PAKDD 2005»
14 years 5 months ago
An Incremental Data Stream Clustering Algorithm Based on Dense Units Detection
Abstract. The data stream model of computation is often used for analyzing huge volumes of continuously arriving data. In this paper, we present a novel algorithm called DUCstream ...
Jing Gao, Jianzhong Li, Zhaogong Zhang, Pang-Ning ...
PAKDD
2005
ACM
133views Data Mining» more  PAKDD 2005»
14 years 5 months ago
An Anomaly Detection Method for Spacecraft Using Relevance Vector Learning
This paper proposes a novel anomaly detection system for spacecrafts based on data mining techniques. It constructs a nonlinear probabilistic model w.r.t. behavior of a spacecraft ...
Ryohei Fujimaki, Takehisa Yairi, Kazuo Machida