Progressive Sampling for Association Rules Based on Sampling Error Estimation

14 years 9 months ago

Download arbor.ee.ntu.edu.tw

We explore in this paper a progressive sampling algorithm, called Sampling Error Estimation (SEE), which aims to identify an appropriate sample size for mining association rules. SEE has two advantages over previous works in the literature. First, SEE is highly eﬃcient because an appropriate sample size can be determined without the need of executing association rules. Second, the identiﬁed sample size of SEE is very accurate, meaning that association rules can be highly eﬃciently executed on a sample of this size to obtain a suﬃciently accurate result. This is attributed to the merit of SEE for being able to signiﬁcantly reduce the inﬂuence of randomness by examining several samples with the same size in one database scan. As validated by experiments on various real data and synthetic data, SEE can achieve very prominent improvement in eﬃciency and also the resulting accuracy over previous works.

Kun-Ta Chuang, Ming-Syan Chen, Wen-Chieh Yang

Real-time Traffic

Appropriate Sample Size | Association Rules | Data Mining | PAKDD 2005 | Sample Size |

claim paper

Post Info
More Details (n/a)

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	PAKDD
Authors	Kun-Ta Chuang, Ming-Syan Chen, Wen-Chieh Yang

Comments (0)

Sciweavers

Progressive Sampling for Association Rules Based on Sampling Error Estimation

Appropriate Sample Size | Association Rules | Data Mining | PAKDD 2005 | Sample Size |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers